+ All Categories
Home > Documents > NIHMS266526-supplement-Supplemental_Online_Materials.pdf

NIHMS266526-supplement-Supplemental_Online_Materials.pdf

Date post: 01-Jun-2018
Category:
Upload: dilawar-jan
View: 221 times
Download: 0 times
Share this document with a friend

of 10

Transcript
  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    1/22

    Supplementary Online Materials for:

    Formation of regulatory patterns during signal propagation in a mammalian cellular network

    Avi Ma’ayan, Sherry L. Jenkins, Susana Neves, Anthony Hasseldine, Elizabeth Grace, Benjamin

    Dubin-Thaler, Narat J. Eungdamrong, Gehzi Weng, Prahlad Ram, J. Jeremy Rice,

    Aaron Kershenbaum, Gustavo A. Stolovitzky, Robert D. Blitzer, and Ravi Iyengar

    Contents

    Methods

    Supporting text

    Supporting Figures S1- S11

    Supporting Table S1

    Supporting references

    Author contributions

    Table of supporting external files: text files, source code segments, movies and spreadsheets

    Methods

    Constructing the network:We used published research literature to identify the key components of signaling pathways

    and cellular machines, and their binary interactions. Most components (~80%) have been described

    in hippocampal neurons or related neuronal cells. Other components are from other cells, but areincluded because they are key components in processes known to occur in hippocampal neurons,

    such as translation. We then established that these interactions were both direct and functionally

    relevant. All of the connections were individually verified by at least one of the authors of this paper by reading the relevant primary paper(s). We developed a system made of 545 components (nodes)

    and 1259 links (connections). We used arbitrary but consistent rules to sort components into various

    groups. For instance, transcription factors are considered a as part of the transcriptional machinery,although it may also be equally valid to consider them as the most downstream component of thecentral signaling network. Similarly the AMPA receptor-channel (AMPAR) is considered part of the

    ion channels in the electrical response system since its activity is essential to defining the

     postsynaptic response, although it binds to and is activated by glutamate, and hence can be alsoconsidered a ligand gated receptor-channel in the plasma membrane.

    The links were specified by two criteria: function and biochemical mechanism. Three types

    of functional links were specified. This follows the rules used for representation of pathways in

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    2/22

     

    2

    Science’s STKE (S1). Links may be activating, inhibitory or neutral. Neutral links do not specify

    directionality between components, and are mostly used to represent scaffolding and anchoring

    undirected or bidirectional interactions. The biochemical specification includes defining the

    reactions as non-covalent binding interactions or enzymatic reactions. Within the enzymaticcategory, reactions were further specified as phosphorylation, dephosphorylation, hydrolysis, etc.

    These two criteria for specification are independent and were defined for all interactions. For the

    analyses in this study we only used the functional criteria: activating, inhibitory or neutralspecifications.

    We chose papers that demonstrated direct interactions that were supported by either

     biochemical or physiological effects of the interactions. From these papers we identified thecomponents and interactions that make up the system we analyzed. During this specification process

    we did not consider whether these interactions would come together to form higher order

    organizational units. Each component and interaction was validated by a reference from the primaryliterature (1202 papers were used). A list of authors who read the papers to validate the components

    and interactions is provided under authors contributions.

    Storage of the network data:The data describing the components (nodes) and their interactions (links) that compose the

    network are stored in a text file, see Text file S1, using the flat file *sig format described below.

    The *sig flat text file format:

    Source Name: cellular component that is affecting a target componentSource Human Accession: Swiss-Prot accession if available

    Source Mouse Accession: Swiss-Prot accession if available

    Source Type: the type of molecule classification this componentSource Location: cellular localization of the component

    Target Name: generalized cellular component that is affected by the source componentTarget Human Accession: Swiss-Prot accession if available

    Target Mouse Accession: Swiss-Prot accession if availableTarget Type: the type of molecule classification for this component

    Target Location: generalized cellular localization of the componentEffect: activation (+), inhibition (-), or neutral (0)

    Type of Interaction: type of chemical interaction directly linking the two components

    PubMed ID: PubMed database accession number

    Sorting cellular components (nodes) into cellular machines:Components (nodes) were separated into functional machines based on their molecule type and

    location attributes using Code segment S7.

    Visualization of the entire network:The diagram of the network in figure S1 was created with Code segment S8 that creates Text file

    S2. This text file can be loaded into the Pajek software for visualization (S2).

    Subnetworks obtained by signal propagation from ligands:

    To generate these subnetworks we used a depth-first search algorithm expanding the search solely in

    the downstream direction, to confirm with biologically specified directional transfer of information.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    3/22

     

    3

    For example, when a Gs protein is activated by an upstream ligand-receptor interaction it connects

    only to the downstream component adenylyl cyclase. The subnetworks do not include another Gs

    coupled receptor unless that receptor also interacts with the specific ligand.

    The recursive algorithm (using depth-first search) is listed as Code segment S9. It was used tocreate 15 subnetworks for each node of the type “Ligand” by changing the number of steps from the

    ligand. This concept is termed breadth-first search and is illustrated in Movie S20. The subnetworks

    were then analyzed by computing the number of nodes, links, clustering, average path length andmotifs.

    Visualization of the resulting data for connections per step for all ligands, Spreadsheet S23, was

    created with MatlabTM

    , Natick, MA (S3).Three sets of subnetworks induced by the nodes: GLUTAMATE, NE and BDNF were analyzed for

    number of nodes and links, clustering, average path-length and motifs with Code segment S10.

    These results, Spreadsheet S24, were visualized with ExcelTM

     (S4).

    Overall statistics of the network:

    The connectivity distribution data, Spreadsheet S25, was plotted with MatlabTM

    , Natick, MA (S3).

    Characteristic path-length measure was implemented by using Floyd’s algorithm (S5), with Code

    segment S11.

    Clustering Coefficient was implemented with custom code based on the concept developed by Wattsand Strogatz (S6), using Code segment S12.

    Grid Coefficient was implemented with custom code based on the concept developed by Caldarelliet al. (S7), using Code segment S13.

    The results, Spreadsheet S26, were compared to the same parameters computed for 100 shufflednetworks of one island that maintain the same connectivity distribution using an algorithm

    developed by Milo et al. (S8), using Code segment S14.

     Network Motifs:Counting motifs in the network was accomplished by using the MFinder program developed by

    Kashtan et al. (S9). Motifs of size 3, 4, 5 and 6 were visualized using VisioTM

     (S10).See Table S29 of the complete results for the motifs of sizes 3 and 4.

    Output files in text format from the MFinder program:

    Motifs size 3: Text file S3.Motifs size 4: Text file S4.

    Motifs size 5: Text file S5.

    Motifs size 6: Text file S6.

    Our method for motif search:

    In our detailed analysis of subnetworks, we only considered closed loop circular motifs. Wesearched for these motifs using a depth-first search algorithm (Code segments S17 through S19).An example of counting motifs in a toy network is given below:

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    4/22

     

    4

     A

    C D

    B

     The above toy network contain: 1 positive feedback loop of size 3, 1 negative feedback loop of size3 and 1 negative feed-forward motif of size 4. Nodes C and D contribute to 3 motifs whereas nodes

    B and C contribute to 2 motifs. This is slightly different from the method used by Kashtan et al. (S9) because, with their method, A, B, C, and D will not be considered a feedforward loop due to the link

     between A and D.

    Positive vs. negative feedback and feedforward loops:

    Positive feedback loops motifs are defined as loops where all the links in the loop are positive(activating), or there are an even number of negative (inhibitory) links. Negative feedback loops are

    loops where there is an odd number of negative links. Feedforward loops have two “arms”, each

    starting at a source node and merging into a target (sink) node. To determine if a feedforward loop is positive or negative, each “arm” was evaluated separately. Both “arms” must be positive for the

    feedforward motifs to be positive. An “arm” that has all links positive or even number of negative

    links is considered positive; whereas, an arm with an odd number of negative links is considerednegative.

    Subnetworks from source to target nodes:

    Series of subnetworks from specific source nodes (generally ligands) to specific targets

    (components within machines such as channels or transcription factors), with limited path lengths,

    were created using the following code: Code segment S15. The concept is illustrated in Movie S21. Shuffled networks, where only links that do not involved the source nodes and target nodes, were

    created for statistical control. In the control subnetworks, we preserved the biologically specified

    connectivity and directionality between ligands and their receptors and from the immediateupstream components to the AMPA channel and CREB. For all other connections we randomly

    swapped the direction of the connection while preserving the overall connectivity structure of thenetwork. Curve fitting of the data, Spreadsheet S27, was done with ExcelTM

     (S4).

    Subnets based on nodal connectivity:

    Series of sub-networks created based on nodal connectivity were created using Code segment S16.

    The concept is illustrated in Movie S22. Search for feedback loops was implemented with Codesegment S17, feed-forward loops with Code segment S18, bi-fan motifs with Code segment S19.

    The results are summarized in Spreadsheet S28  and were visualized with ExcelTM

      (S4). The

    number of islands does not consider highly connected nodes as single islands when they are notincluded. For example, PKA, a highly connected node (k=49) is not included in the subnetworks

    with lower connectivity threshold and is not considered an island by itself.

    Computing the Density of Information Processing (DIP):

    DIP is a measure of the local density of motifs and their interconnectedness within the interaction

    space of the network. DIP is defined as:

    i

    ii

    ii

    i  GC  L L

     M  M  DIP   ⋅⎟⎟

     ⎠

     ⎞⎜⎜⎝ 

    ⎛ 

    −=

    1

    1   (1) 

    Whereiiiiii

      BIFAN FFLFFLFBLFBL M    ++++= 4343

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    5/22

     

    5

     M i is the total number of feedback loops, feedforward loops and bifan motifs.  Li is the total links and  

    i  represent the step. FBL3  and FBL4  are feedback loops of size 3 and 4, FFL3  and FFL4  are

    feedforward loops of size 3 and 4 and  BIFAN  are bi-fan motifs of size 4. GC is the grid coefficient

    representing interconnectedness for the motifs, computed for the subnetwork at step i.

    Computing the Motif Location Index (MLI):

    MLI measure the concentration of specified motifs and various locations within thenetwork. MLI can vary from 0 to 1 depending on its relative distance from the extracellular ligands

    to cellular machine, where 0 indicates location at the level of machines. MLI was calculated as

    follows:

     MLI  =n

    CPLLCPLM 

    CPLM n

    i   ii

    i∑=

    ⎟⎟ ⎠

     ⎞⎜⎜⎝ 

    ⎛ 

    +1  (2)

    where n is the size of the motif, CPLM  is the characteristic path length from a node within the motif

    to all other nodes in the cellular machine and CPLL is the characteristic path length from a node to

    all extracellular ligands. If a node is an extracellular ligand then CPLL = 0; if the node is in the plasma membrane CPLL= 1. If a node belongs to a cellular machine, CPLM   = 0. The average

    shortest path length was computed using Floyd’s algorithm (S5).

    Supporting text

    Functional Organization: From Components to Information Processing Motifs

    Components within mammalian cells interact with one another to form local networks thattogether form a single large network. This organization is essential for cellular components to

    coordinate their individual activities and achieve the cohesiveness needed for cellular functions.

    Information needs to flow between components in a continuous and organized manner. Determininghow this flow of information occurs is a crucial step in understanding the functional organization of

    mammalian cells. This system of interacting cellular components based on phenotypic behavior

    allows us to analyze the flow of information between the components to identify the emergence ofregulatory motifs that are capable of processing information as it flows through the network.

    Regulatory motifs such as feedback loops and feedforward motifs allow the cell to process

    information from extracellular signals, and decide when such information persists and when it is

    transient. The presence of positive feedback loops arising from coupled biochemical reactions leadsto switching behavior that can enable state changes (S11). State change triggered by biochemical

    switches has been demonstrated in developmental systems (S12).

    The overall profile of motifs may be an initial indicator of the cell’s information processingcapability, when the quantitative and temporal characteristics of the interactions and consequent

    motif formation are taken into account. Such quantitative specification may lead to comparative

     profiles different from that observed here. Nevertheless, the identification and characterization of

    motifs allows us to move from components and interactions to the next level of organization withinthe cell.

    Distinct regulatory motifs are active in response to signals from different ligands. The balanceof the emergent positive and negative motifs may define the capability of the ligand to induce

     plasticity or maintain homeostasis. Comparison between motifs assumes that the motifs can be

    formed and function concurrently. The relationship between the interaction steps in this

     pseudodynamic analysis which considers propagation of functional connectivity over chemicalspace and time steps needs to be specified at the level of individual interactions. This is discussed in

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    6/22

     

    6

    greater detail below. The relationship between distances in chemical space in such a functional

    organization scheme and subcellular localization in terms of organelles and identified physical

    compartments require further analysis to identify where the two types of organization overlap and

    where they diverge.At the functional level, each network configuration is likely to have its own distinct pattern of

    regulatory motifs, and thus define capability of the cell for either maintaining homeostasis or

     promoting state change such as that seen in synaptic plasticity. Would a large ensemble of suchstates make the system intractably complex, such that we would not be able to either understand or

     predict its behavior? This is unlikely because many of the states may have profiles of regulatory

    motifs that produce convergent effects on cellular machines. Such convergence would provide amolecular basis for redundancy of cellular function and endow the cellular system with robustness.

    Functionally, cellular network states may fall into two broad categories: those that preserve

    homeostasis during system perturbation and those that promote reorganization of the parts of thecellular network. Such reorganization may include the synthesis of new components such as

    transcription factors as well as other components that are required for expression of phenotypic

     behavior. The notion that multiple network states may converge to yield a limited set of phenotypic

    responses is supported by recent studies of networks of neurons where similar network behavior, asdefined by firing patterns, can arise from multiple network configurations based on individual

     parameters that are quite different (S13). Perhaps such convergence represents a higher order oforganized redundancy arising from the many feedforward motifs. Thus, it is likely that cellularsystems are robust at varying levels of organization.

    From Qualitative Representation to Quantitative Behavior of MotifsQualitative representations of cellular interaction such as those analyzed here do not take

    into account individual reaction rates that define individual links. This simplifying assumption leads

    to questions such as whether the identity and function of regulatory motifs would be preserved if thereaction reactions for the links that make up the motifs vary over many orders of magnitude.

    Experimental studies have shown that some of the links are relevant only at specific times, atdifferent perturbation, and at different locations within the cell. Such conditionality can be

    manifested in many formats. Positive (also termed coherent) feedforward loops (FFL) can functionin several ways. For example, they can be coincidence detectors (AND gates), provide redundant

     pathways to the same effectors (OR gates); they can lead to long sustained activity if one arm is fastand the other arm is slower (S14, S15). These functions depend on the reaction rates of the two

    arms. To obtain a preliminary idea of the potential validity of the motifs we have identified in this

    study, we gathered a set of representative reactions from various classes (i.e., ligand-protein, protein-protein, and enzymatic reactions) and evaluated their rates by comparing on and off rates

    and K cat  values. The maximum observable difference was about 1000 fold (Table S2). We then

    constructed a toy ODE model for motif 44 (Fig. S3). A schematic of the model is shown (Fig.S11A). We varied the difference in rates between the two arms by a factor of up to 1000. Although

    there are temporal differences in the output, the overall output profiles are quite similar within the

    same time scale (Fig. S11B). These initial simulations suggest that the integrity of the motif is persevered even with widely varying rates.

    Although this toy ODE model has few components, it should be noted that this is likely to be

    a limiting case since with multiple components in each arm, reactions rates are also likely to vary

    and thus reducing the difference between the two arms. This is especially true for situations at two-three steps from the ligand, the location where the motifs become more prevalent. Thus, it is likely

    that many of the motifs identified by qualitative analysis will be operative even when interaction

    rates vary and the concentrations of components change in a regulated manner. Definitive proof that

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    7/22

     

    7

    each of these motifs is functional will only come when each reaction within a motif is parameterized

    and input/output relationships are determined, and the existence of the motifs will be experimentally

    verified.

    Configuration of Motifs and choosing between homeostasis and plasticity:

    The conversion of short term biochemical changes into long-term physiological changes including

    state changes is often driven prolonged activation of key protein kinases and/or the activation orinduction of transcription factors. The extent and duration of activity of these downstream

    components is regulated by the upstream signaling network. Since the network contains various

    types of regulatory motifs, the balance between these motifs will determine the extent and durationof the signal reaching these key components. Abundance of positive feed-forward or feedback loops

    would evoke the extended activation of these components, while the presence of negative feedback

    loops and gates could limit the duration of activation of the downstream components. Hence, theratio of the positive to negative feedback and feed-forward loops can determine whether or not

    downstream components are activated for extended periods at sufficient levels. This extent and

    duration of activation of the downstream components like the protein kinases or transcription factor

    may in turn determine if the cell can maintain homeostatic behavior or undergo long-term physiological changes. Thus, the ratio of positive to negative feedback and feed-forward motifs

    could be a key indicator of whether, in the presence of external signals, the cell is configured tomaintain homeostatic balance or undergo state change. Since the propagation of connectivity bydifferent ligands involves different components and pathways, the ratio of the positive and negative

    motifs is a function of the types of signals the cell receives and hence is conditional. Comparison of

    the ratio of these motifs recruited per step for different ligands, glutamate, NE and BDNF clearlydemonstrates that the regulatory configuration is defined by the ligand that initiates the signal

     propagation. Thus, by balancing the positive to negative motifs for different extracellular signals the

    cell may be able to tune itself to either respond or not respond to extracellular signals by undergoinglong-term physiological changes. It is noteworthy that for both glutamate and BDNF in the

     pathways to CREB the positive and negative loops are evenly balanced through nine steps. Such balance of regulatory loops provides an explanation of how homeostasis within the cell can be

    achieved even when a perturbing signal propagates through the system. The relative abundance ofthe positive feedback and feedforward loops as compared to the negative loops in the sub-network

    for NE to AMPA channels and to CREB is also noteworthy. These ratios may indicate that this sub-network can hold and transfer information across time-scales leading to persistent changes. This

    configuration of motifs might provide a systems level explanation of why the cAMP pathway is

    associated with the late phase of long-term potentiation in the CA1 neuron (S16). It is alsonoteworthy that propagation of connectivity from BDNF to AMPA channels and CREB results in

    equal numbers of positive and negative feedback loops suggesting that BDNF would not play an

    important role in changing the state of this cell. This is in agreement with experimental observationthat BDNF induces plasticity by affecting the functions of the presynaptic CA3 neuron (S17)

    although acute effects of BDNF are observed in the postsynaptic CA1 neuron (S18). Thus some of

    the characteristic responses evoked by important neurotransmitters may be understood byidentifying the configurations of the regulatory motifs they evoke within the cell.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    8/22

     

    8

    Supporting figures

    Fig. S1

    Fig. S1 Visualization of the mammalian neuronal cellular network using the Pajek software

    The network is visualized by placing nodes as triangles based in their functional compartments.Size of triangles demonstrates the level of connectivity for the node. Green arrows represent

    activation links, red arrows inhibition links, and blue arrows neutral links. The network data was

    stored in the .net file format using custom code (Code segment S8). The output was then loaded intothe Pajek program (S2) for visualization.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    9/22

     

    9

    Fig. S2

     A C

    BStatistical characteristics of the fully connected network: Average links per node: 4.62 Characteristic Path Length: 4.21 (3.81 ± 0.026)*Grid Coefficient: 0.026 (0.005 ± 0.0005)*Clustering Coefficient: 0.11 (0.03 ± 0.004)** Mean ± SD computed for 100 shuffled networks

    Fig. S2 Statistics of the fully connected network

    A) Distribution of interactions (links) into various categories. Interactions (links) in the cellular

    network were sorted using two different criteria: the biochemical reaction and the effect of the link.

    Upper pie chart shows sorting by biochemical reaction mechanism and the lower pie chart shows the

    distribution of connections that activate or inhibit downstream components. Neutral links indicateinteractions where the directionality is unspecified. B) Overall network characteristics: In

     parentheses are the mean and standard deviation of the characteristic path-lengths, clustering-

    coefficients and grid-coefficients computed for 100 shuffled “null” networks with the sameconnectivity distribution as the cellular network. These shuffled networks were created using an

    approach based on Milo et al (S8). C) Log-log plot of the connectivity distribution. The number of

    links per node (k ) is plotted against the number of nodes with the same level of connectivity.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    10/22

     

    10

    Fig. S3 

    Fig. S3 Schematic representation of the motifs identified within the cellular network

     Network Motifs identified using the MFinder program. The figure visually represents the text based

    output, produced by the MFinder program. These motifs are the most statistically significant

    identified network motifs within the network. The MFinder program searches for motifs in directed

    networks, it does not distinguish between positive and negative links. Thus, for this analysis weconsidered positive and negative links as unidirectional and neutral links as bi-directional. The

    MFinder program was developed by Alon and coworkers (S9). The network motifs of size 5 and 6were identified using the sampling methods. The various types of motifs are identified by numbers.

    The counts of the various types of motifs in the fully connected network are given in Table S1.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    11/22

     

    11

    Table S1 

    Motifs counts

    Moti f # CN* SN** Z-score

    31 16 4.8 ± 2.8 3.98

    32 22 9.3 ± 3.3 3.84

    33 14 8.1 ± 2.6 2.30

    34 36 12.5 ± 3.7 6.38

    35 32 12.2 ± 3.8 5.16

    36 25 9.8 ± 3.7 4.12

    41 1011 186.6 ± 32.6 25.31

    42 108 68.2 ± 14.8 2.69

    43 26 7.2 ± 4.8 3.91

    44 303 104.0 ± 15.5 12.88

    45 57 17.8 ± 7.3 5.39

    46 105 40.0 ± 10.9 5.97

    47 49 31.5 ± 8.3 2.12

    * CN- Cellular Network.

    ** SN- Shuffled networks.Mean ± SD computed for 100 shuffled networks.

    Using the sampling method (S8):

    Motifs counts

    Moti f # CN* SN** Z-score

    51 0.254 0.023 ± 0.025 9.27

    52 0.065 0.002 ± 0.003 11.42

    53 0.662 0.005 ± 0.011 60.05

    54 0.048 0.000 ± 0.002 26.57

    55 0.100 0.003 ± 0.007 13.92

    56 0.020 0.0 ± 0.0 NA57 0.016 0.0 ± 0.0 NA

    61 0.395 0.001 ± 0.011 36.01

    62 0.189 0.0 ± 0.0 NA

    63 0.050 0.0 ± 0.0 NA

    64 0.096 0.0 ± 0.0 NA

    65 0.070 0.0 ± 0.0 NA

    66 0.059 0.0 ± 0.0 NA

    67 0.049 0.0 ± 0.0 NA

    Table S1 Counts of the various types of motifs found in the cellular network  Motifs were counted using the MFinder program (S9). Counts for the motifs in the cellular network

    (CN) are compared to the mean for 100 control (shuffled) networks. Z-score indicates the statistical

    significance for the number found in the real network as compared to the mean of the controlnetworks. Motifs of size 5 and 6 were estimated using the sampling method described by Kashtan et

    al. (S9). Estimated values are given as fractions (count of the specified motifs / the total number of

    motifs of the same size).

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    12/22

     

    12

    Fig. S4

     A B

    C D E

    Fig. S4 Analysis of subnetworks from glutamate, NE and BDNF

    The number of links (A) and nodes (B) added per step, downstream from three ligands known to

    regulate plasticity in hippocampal neurons. C) Characteristic Path Length (CPL), D)  ClusteringCoefficient (CC) and E) Grid Coefficient (GC) computed for subnetworks emanating from

    glutamate, nor-epinephrine and BDNF.

    Fig. S5

     A B C 

    Fig. S5 Comparison of positive vs. negative feed-forward loop motifs in subnetworks

    emanating in steps from the three ligands: glutamate, nor-epinephrine and BDNFCounts were obtained using the source code described under Methods. Positive feed-forward loop

    motifs are feed-forward loops where both “arms” to the target node are positive. If at least one arm

    is negative the feed-forward is considered negative. Positive “arm” is defined as an “arm” with

    either no negative links or even number of negative links.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    13/22

     

    13

    Fig. S6 

    Fig. S6 Regulatory motifs identified in subnetworks emanating in steps from glutamate (Glu),

    norepinephrine (NE) and BDNF

    The motifs identified using our algorithms (see custom codes S17-S19) can classify motifs based on

    the different types of links and nodes. This is an enhancement to the MFinder program that onlyidentifies more abstract motifs based on directionality only.

    A) The scaffold motif is made of three nodes connected solely by neutral links.

    A-I: A schematic illustration of a scaffold motif.A-II: The counts of scaffold motifs found in subnetworks generated from the three ligands

    A-III: Representative examples of the scaffold motifs.B) The bi-fan motifB-I: Schematic representation of the bifan motif.

    B-II: The counts of bi-fan motifs found in subnetworks generated from the three ligands.

    B-III: Representative examples of bifan motifs.

    C) The feedforward loop motifs are composed of three or four nodes connected with directed linksonly where a source node feeds into a target node through two alternative routes.

    C-I: Schematic representations of the feedforward loops

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    14/22

     

    14

    C-II: The counts of feed-forward loop motifs found in the subnetworks generated from the three

    ligands

    C-III: Representative examples of feed-forward motifs.

    Positive feed-forward loop motifs are feed-forward loops where both “arms” to the target node are positive. Positive “arm” is defined as an “arm” with either no negative links or even number of

    negative links. If at least one arm is negative the feed-forward is considered negative.

    Abbreviations: PKA: protein kinase A, PKC: protein kinase C, CaM: calmodulin, CaN: calcineurin,

    GlyR: glycine receptor, NMDAR: NMDA receptor, AC2: adenylyl cyclase 2, PLCβ: phospholipaseC beta, CREB: cyclic-AMP response element binding protein. CREM: cyclic-AMP response

    element modulator. β-ARK: beta adrenergic receptor kinase.

    Fig. S7  A

    B

    Fig. S7 Size of subnetworks from extracellular ligands to specified effectors

    A) Schematic representation of the source and target nodes for the subnetworks starting from the

    three extracellular ligands: glutamate, nor-epinephrine (NE) and brain-drived neurotrophic factor(BDNF), (source nodes) to reach the effectors AMPA receptor-channels (AMPAR) or the

    transcription factor cyclic-AMP response element binding protein (CREB) (target nodes). B)  The

    changes in the number of nodes as more steps are allowed to reach the target nodes (effectors) from

    the source ligands for six sets of subnetworks.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    15/22

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    16/22

     

    16

    Fig. S9

     A

    B C D

    E F G

    Fig. S9 Subnetworks characteristics from extracellular ligands to AMPAR

    A) Schematic of the source and target nodes for the subnetworks starting from the three extracellularligands: glutamate, nor-epinephrine (NE) and brain-derived neurotrophic factor (BDNF), (source

    nodes) to reach the effector AMPA channel/receptor (AMPAR) (target node). B-D) Changes in the

    number of links as more steps are allowed to reach the effectors from each ligand are compared to

    the same analysis with shuffled networks. Only the directionality of the links that do not involve theligands or the effectors was randomly swapped while preserving the connectivity. The resultant

    graphs for both the cellular network and shuffled networks were curve fitted. For all cellular

    subnetworks the best fit function was linear (R 2 =0.99-0.96). Individual values are given as lin R 

    2.

    For all of the shuffled networks the best fit was obtained for either exponential or power law

    functions (lin and pow respectively). E-G) Counts of the number of total three and four component positive and negative feedback and feedforward loops, in subnetworks where more steps aregradually allowed between the source and target nodes.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    17/22

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    18/22

     

    18

    Fig. S11

     A

    input

     Act ivated A Act ivated B

    output

    Negative

    Regulator 

     

    B

    0 500 1000 15000.00

    0.05

    0.10

    0.15 kcat A/kcat B = 1/1

    kcat A/kcat B = 10/1

    kcat A/kcat B = 10/0.1

    kcat A/kcat B = 10/0.01

    t

          O    u     t    p    u     t

    0 500 1000 15000.00

    0.05

    0.10

    0.15

    Km A/Km B = 1/1

    Km A/Km B = 10/1

    Km A/Km B = 10/0.1

    Km A/Km B = 10/0.01

    t

          O    u     t    p    u     t

     

    Fig. S11 Quantitative evaluation of a simple feedforward motif (motif 44 in Fig. S3).

    A) A feedforward motif (Motif 44, Fig S3) was constructed and simulated using a system of coupledODEs. In this model, a simulated square pulse of input activates enzymes A and B. After 300seconds, the input was “washed out,” and the behavior of the output, which is activated by both A

    and B, was studied. To balance the activities of the activator, a negative regulator was included.

    The initial concentration of all components was 1 µM. B) To study the motif behavior, sensitivityanalyses were performed on k cat and K m. The ratio between the kinetics parameters of the two arms

    were varied from 10 (k catA = 10 s-1

    , k catB = 1 s-1

    ), to 1000 (k catA = 10 s-1

    , k catB = 0.01 s-1

    ). Similaranalysis was performed on K m. The activity of the output component is plotted as a function of

    time. The analysis shows that the motif behavior is robust. Changing the kinetics parameters over 3

    orders of magnitude had little effect on the qualitative behavior of the motif. As assessed by input-output relationships. These initial results suggests that components with widely varying reactions

    rates can come together to form functional units.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    19/22

     

    19

     

    Table S2

    Molec#1/Catalyst Molec#2/Substrate/Product Reaction typek f  

    (/µM/s)kr  (/s)

    KD/KM (µM)

    kcat (/s) Ref. Notes

    Rap1GAPII Rap1 GAP 1.4 80.5 65 11 (1)

    cAMP PKA site1 Binding 1.67 0.0167 0.010 (2)

    cAMP PKA site2 Binding 0.0093 0.00028 0.030 (2)

    β ARK β -AR Phosphorylation 0.2 0.2 (3)

    Csk Src Phosphorylation 1.8 40 22 0.4 (4)2-fold adjustment for T;KD & kr are lower limits

    calcium N-term calmodulin Binding 6 500 (5)

    calcium C-term calmodulin High Aff-Binding 6 10 (5)

    calcium C-term calmodulin Low Aff-Binding 6 100 (5)

    high and low-affinitybinding @ C-term reflects

    CaM complexing;dissociation but not

    association rate constantswere modified by CaM's

    interactions

    MKP3 ERK2 Dephosphorylation 0.022 0.2 (6)

    CDK1-cyclin Clb2 unidentified substrate/s35

    (ATP)2.2 (7)

    Used crude cell lysate assubstrate. i.e. no single

    target, but perhaps morephysiological

    PIP2 PKB (PH dom) Binding 0.5 (8)

    RasGAP Ras GAP 4.8 9.12 (9)

    Caspase 9 Caspase3 Cleavage 0.089 0.0168 (10)

    Caspase 3 PARP Cleavage 5.6 0.213 (10)

    Sos Ras GEF 0.50505 0.02 (11)

    PKA Representative substrate Phosphorylation 2.4 2.7 (12)

    PKA Representative substrate Phosphorylation 7.5 9 (13)

    CDK5 (p35isoform)

    Tau Phosphorylation 33 0.0433 (14)

    CDK5 (p25isoform)

    Tau Phosphorylation 27 0.217 (14)

    CalcineurinG-substrate, a substrate for

    PKGDephosphorylation 3.8 0.41 (15)

    Calcineurin DARPP-32, a substrate forPKA Dephosphorylation 1.6 0.2 (15)

    Calcineurin synapsin I (site 1) Dephosphorylation 7 0.053 (15)

    Calcineurin synapsin I (site 2) Dephosphorylation 4.4 0.04 (15)

    Grb2 Sos Binding 0.025 0.0168 (16)

    PKC (Ca bound) DAG Binding 0.008 8.6348 (16)

    calmodulin MLCK (non phosphorylated) Binding 28 0.031 (17)

    calmodulin MLCK (non phosphorylated) Binding 8 0.186 (17)

    MEK MAPK Phosphorylation 0.476 0.113(18,19)

    Km = 0.476 uM for 1stphosphorylation, 0.046

    uM for secondphosphorylation

    Raf MEK1 Phosphorylation 0.8 0.105 (20)

    GSK3β  β-CATENINPhosphorylation 0.1276 3.5

    (21,

    22)1.7 6.13 (23)

    STEP MAPK Dephosphorylation 38 12(24,25)

     AC 1

     AC cAMP Synthesis 1.4 7.9(26,27)

    PDE4D cAMP Cleavage

    NWASP Arp2/3 Binding 0.003 (28)

    Grb2 NWASP Binding 0.005 (28)

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    20/22

     

    20

    References for Table S2:1. Kraemer, A., Brinkmann, T., Plettner, I., Goody, R. & Wittinghofer, A. (2002) J Mol Biol 324, 763-74.2. Zorn, M., Fladmark, K. E., Ogreid, D., Jastorff, B., Doskeland, S. O. & Dostmann, W. R. (1995) FEBS Lett 362, 291-4.3. Kim, C. M., Dion, S. B. & Benovic, J. L. (1993) J Biol Chem 268, 15412-8.4. Lieser, S. A., Shindler, C., Aubol, B. E., Lee, S., Sun, G. & Adams, J. A. (2005) J Biol Chem 280, 7769-76.5. Gaertner, T. R., Putkey, J. A. & Waxham, M. N. (2004) J Biol Chem 279, 39374-82.6. Zhou, B., Wang, Z. X., Zhao, Y., Brautigan, D. L. & Zhang, Z. Y. (2002) J Biol Chem 277, 31818-25.7. Ubersax, J. A., Woodbury, E. L., Quang, P. N., Paraz, M., Blethrow, J. D., Shah, K., Shokat, K. M. & Morgan, D. O. (2003)

    Nature 425, 859-64.8. Frech, M., Andjelkovic, M., Ingley, E., Reddy, K. K., Falck, J. R. & Hemmings, B. A. (1997) J Biol Chem 272, 8474-81.

    9. Gideon, P., John, J., Frech, M., Lautwein, A., Clark, R., Scheffler, J. E. & Wittinghofer, A. (1992) Mol Cell Biol 12, 2050-6.10. Bentele, M., Lavrik, I., Ulrich, M., Stosser, S., Heermann, D. W., Kalthoff, H., Krammer, P. H. & Eils, R. (2004) J Cell Biol 166, 

    839-51.11. Orita, S., Kaibuchi, K., Kuroda, S., Shimizu, K., Nakanishi, H. & Takai, Y. (1993) J Biol Chem 268, 25542-6.12. Hemmings, H. C., Jr., Nairn, A. C. & Greengard, P. (1984) J Biol Chem 259, 14491-7.13. Bhalla, U. S. (2002) Biophys J 83, 740-52.14. Hashiguchi, M., Saito, T., Hisanaga, S. & Hashiguchi, T. (2002) J Biol Chem 277, 44525-30.15. King, M. M., Huang, C. Y., Chock, P. B., Nairn, A. C., Hemmings, H. C., Jr., Chan, K. F. & Greengard, P. (1984) J Biol Chem 

    259, 8080-3.16. Bhalla, U. S., Ram, P. T. & Iyengar, R. (2002) Science 297, 1018-23.17. Kasturi, R., Vasulka, C. & Johnson, J. D. (1993) J Biol Chem 268, 7958-64.18. Seger, R., Ahn, N. G., Posada, J., Munar, E. S., Jensen, A. M., Cooper, J. A., Cobb, M. H. & Krebs, E. G. (1992) J Biol Chem 

    267, 14373-81.19. Haystead, T. A., Dent, P., Wu, J., Haystead, C. M. & Sturgill, T. W. (1992) FEBS Lett 306, 17-22.20. Force, T., Bonventre, J. V., Heidecker, G., Rapp, U., Avruch, J. & Kyriakis, J. M. (1994) Proc Natl Acad Sci U S A 91, 1270-4.21. Ikeda, S., Kishida, S., Yamamoto, H., Murai, H., Koyama, S. & Kikuchi, A. (1998) Embo J 17, 1371-84.

    22. Lee, E., Salic, A., Kruger, R., Heinrich, R. & Kirschner, M. W. (2003) PLoS Biol 1, E10.23. Paul, S., Snyder, G. L., Yokakura, H., Picciotto, M. R., Nairn, A. C. & Lombroso, P. J. (2000) J Neurosci 20, 5630-8.24. Tang, W. J., Stanzel, M. & Gilman, A. G. (1995) Biochemistry 34, 14563-72.25. Taussig, R., Tang, W. J., Hepler, J. R. & Gilman, A. G. (1994) J Biol Chem 269, 6093-100.26. Lim, J., Pahlke, G. & Conti, M. (1999) J Biol Chem 274, 19677-85.27. Bolger, G. B., Erdogan, S., Jones, R. E., Loughney, K., Scotland, G., Hoffmann, R., Wilkinson, I., Farrell, C. & Houslay, M. D.

    (1997) Biochem J 328 ( Pt 2), 539-48.28. Carlier, M. F., Nioche, P., Broutin-L'Hermite, I., Boujemaa, R., Le Clainche, C., Egile, C., Garbay, C., Ducruix, A., Sansonetti,

    P. & Pantaloni, D. (2000) J Biol Chem 275, 21946-52.

    Table S2 Reaction rates for representative reactions underlying interactions in the network

    and a listing of the primary references from which these rates were obtained

    Rates for several classes of the reactions underlying the links in the network were collected from

     published studies. The reactions for which rates are shown include: non-covalent binding includingligand-protein and protein-protein interactions, enzymatic reactions such as cAMP synthesis,

     phosphorylation, dephosphorylation, proteolytic cleavage, and changes in catalytic activity resulting

    from protein-protein interactions such as the GEF and GAP activity for GTPases. These types ofreactions account for a large portion of the interactions within the cellular network.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    21/22

     

    21

    Table S3

    Table 3 Supporting external files

    The network database is provided in Text file S1 as a flat file where the field separator is a blank

    space. Each line in the file describes an interaction extracted from a specific journal. The tokensdescription is provided in the methods section under “Storage of the network data”. The Pajek

    software can be used to visualize the network with Text file S2 (S2). Text files S3-S6 are the raw

    output files from the MFinder program. Code segments S7-S19 contain algorithms written in C/C++used to generate, analyze and visualize the subnetworks in the study. Movies S20-S23 illustrate the

    concept we termed pseudodynamics used to create the three different types of subnetworks.

    Spreadsheets S23-S28 contain the raw data from the analysis. These tables were used to create most

    of the graphs in the study. The supporting external files can be found also athttp://amp.pharm.mssm.edu/network/sm/SM.htm and at http://www.mssm.edu/labs/iyengar .

    Other supporting online materials are available at: http://amp.pharm.mssm.edu/network/index.htm 

    File name Description

    Text file S1   Network data in flat file format.

    Text file S2 Pajek format text file describing the network data and assigns 2D location for nodes on the map.

    Text file S3 Raw output from the MFinder program searching for motifs of size 3.

    Text file S4 Raw output from the MFinder program searching for motifs of size 4.

    Text file S5 Raw output from the MFinder program searching for motifs of size 5.

    Text file S6 Raw output from the MFinder program searching for motifs of size 6.Code segment S7 Rules for sorting components into functional machines.

    Code segment S8 Converts the network into a Pajek format text file.

    Code segment S9 Creates sub-network where connectivity is propagated from ligands.

    Code segment S10 Analysis for nodes, interactions, motifs, clustering and path-lengths of sub-networks starting at

    ligands.

    Code segment S11 Implementation of the Floyd algorithm to compute the characteristic path-length.

    Code segment S12 Implementation of the clustering coefficient algorithm.

    Code segment S13 Implementation of the grid coefficient algorithm.

    Code segment S14 Algorithm used to create 100 shuffled networks with the same connectivity distribution and oneisland. 

    Code segment S15 Code segment used to create sub-network from specified source node to specified target node.

    Code segment S16 Code used to create sub-networks with gradual allowable nodal connectivity.

    Code segment S17 Algorithm used to search for feedback loops.Code segment S18 Algorithm used to search for feed-forward loops.

    Code segment S19 Algorithm used to search for bi-fan motifs.

    Movie S20 Illustration of the breadth-first concept.

    Movie S21 Illustration of the sub-networks where the source and target are specified.

    Movie S22 Illustration of the concept of gradual inclusion of nodes based on connectivity.

    Spreadsheet S23 Connectivity propagation per step from all ligands.

    Spreadsheet S24  Number of nodes, interactions, motifs, computed clustering and path-lengths for sub-networks

    starting at ligands.

    Spreadsheet S25 Connectivity distribution table.

    Spreadsheet S26 Characteristic path-lengths, clustering coefficients and grid coefficients for the real network andfor 100 shuffled networks with the same connectivity distribution.

    Spreadsheet S27 Results of the analysis of sub-networks from source to target.

    Spreadsheet S28 Analysis of sub-networks based on nodal connectivity.

  • 8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf

    22/22

     

    Supporting References:

    S1. N. R. Gough, L. B. Ray, Science STKE. 135, EG8 (2002)

    S2. http://vlado.fmf.uni-lj.si/pub/networks/pajek/ (2004)S3. http://www.mathworks.com/products/matlab/ (2004)

    S4. http://office.microsoft.com/excel/ (2004)

    S5. T. H. Cormen, et al. 2002, Introduction to Algorithms, MIT Press Cambridge, MA.S6. D. J. Watts, S. H. Strogatz, Nature. 393, 440 (1998)

    S7. G. R. Caldarelli, Pastor-Satorras et al. European Physical Journal B. 38, 183 (2004)

    S8. R. Milo, S. Shen-Orr, S. Itzkovitz, et al., Science. 298, 824 (2002)S9. N. Kashtan, S. Itzkovitz, R. Milo, Bioinformatics. 20, 1746 (2004)

    S10. http://office.microsoft.com/visio (2004)

    S11. U.S. Bhalla, R. Iyengar, Science. 283, 381 (1999)S12. W. Xiong , J. E, Ferrell Jr. Nature. 426, 460 (2003)

    S13. A.A. Prinz AA, D. Bucher, E. Marder,  Nature Neuroscience 7, 1345 (2004)

    S14. S. Mangan, U. Alon, Proc Natl Acad Sci U S A. 100, 11980 (2003)

    S15. S. Mangan, A. Zaslaver, U. Alon, J Mol Biol. 334, 197 (2003)S16 P. V. Nguyen, T. Abel, E. R. Kandel, Science. 265, 1104 (1994).

    S17. S. S. Zakharenko, S. L. Patterson, I. Dragatsis, et al., Neuron. 39, 975 (2003).S18. Y. Kovalchuk, E. Hanse, K. W. Kafitz, et al., Science. 295, 1729 (2002).

    Author Contributions 

    AM assembled the large-scale network, wrote all of the custom-code and conducted all of

    the analysis described in this study. SLJ developed the database to manage and maintain the

    components links and references programs; AM, SN, AH, EG, BD-T, NJE, SLJ, RDB and RI foundand read the original papers cited as references to verify the components and interactions. GW and

    PTR developed the original connections map of nearly 200 components on which the network usedin this study was based upon. JJR, AK and GAS provided approaches to analyze and statistically

    validate several of the key findings. NJE performed the quantitative analysis of the feedforwardmotif. AH assembled the table of rate constants. RDB provided supervision of the development of

    interactions maps and its anchoring to the hippocampal neuron. RI was responsible for overallsupervision of the project including analysis of the data and writing of the manuscript and is

    responsible for the final version of the submitted paper.


Recommended