Post on 01-Jun-2018
transcript
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
1/22
Supplementary Online Materials for:
Formation of regulatory patterns during signal propagation in a mammalian cellular network
Avi Ma’ayan, Sherry L. Jenkins, Susana Neves, Anthony Hasseldine, Elizabeth Grace, Benjamin
Dubin-Thaler, Narat J. Eungdamrong, Gehzi Weng, Prahlad Ram, J. Jeremy Rice,
Aaron Kershenbaum, Gustavo A. Stolovitzky, Robert D. Blitzer, and Ravi Iyengar
Contents
Methods
Supporting text
Supporting Figures S1- S11
Supporting Table S1
Supporting references
Author contributions
Table of supporting external files: text files, source code segments, movies and spreadsheets
Methods
Constructing the network:We used published research literature to identify the key components of signaling pathways
and cellular machines, and their binary interactions. Most components (~80%) have been described
in hippocampal neurons or related neuronal cells. Other components are from other cells, but areincluded because they are key components in processes known to occur in hippocampal neurons,
such as translation. We then established that these interactions were both direct and functionally
relevant. All of the connections were individually verified by at least one of the authors of this paper by reading the relevant primary paper(s). We developed a system made of 545 components (nodes)
and 1259 links (connections). We used arbitrary but consistent rules to sort components into various
groups. For instance, transcription factors are considered a as part of the transcriptional machinery,although it may also be equally valid to consider them as the most downstream component of thecentral signaling network. Similarly the AMPA receptor-channel (AMPAR) is considered part of the
ion channels in the electrical response system since its activity is essential to defining the
postsynaptic response, although it binds to and is activated by glutamate, and hence can be alsoconsidered a ligand gated receptor-channel in the plasma membrane.
The links were specified by two criteria: function and biochemical mechanism. Three types
of functional links were specified. This follows the rules used for representation of pathways in
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
2/22
2
Science’s STKE (S1). Links may be activating, inhibitory or neutral. Neutral links do not specify
directionality between components, and are mostly used to represent scaffolding and anchoring
undirected or bidirectional interactions. The biochemical specification includes defining the
reactions as non-covalent binding interactions or enzymatic reactions. Within the enzymaticcategory, reactions were further specified as phosphorylation, dephosphorylation, hydrolysis, etc.
These two criteria for specification are independent and were defined for all interactions. For the
analyses in this study we only used the functional criteria: activating, inhibitory or neutralspecifications.
We chose papers that demonstrated direct interactions that were supported by either
biochemical or physiological effects of the interactions. From these papers we identified thecomponents and interactions that make up the system we analyzed. During this specification process
we did not consider whether these interactions would come together to form higher order
organizational units. Each component and interaction was validated by a reference from the primaryliterature (1202 papers were used). A list of authors who read the papers to validate the components
and interactions is provided under authors contributions.
Storage of the network data:The data describing the components (nodes) and their interactions (links) that compose the
network are stored in a text file, see Text file S1, using the flat file *sig format described below.
The *sig flat text file format:
Source Name: cellular component that is affecting a target componentSource Human Accession: Swiss-Prot accession if available
Source Mouse Accession: Swiss-Prot accession if available
Source Type: the type of molecule classification this componentSource Location: cellular localization of the component
Target Name: generalized cellular component that is affected by the source componentTarget Human Accession: Swiss-Prot accession if available
Target Mouse Accession: Swiss-Prot accession if availableTarget Type: the type of molecule classification for this component
Target Location: generalized cellular localization of the componentEffect: activation (+), inhibition (-), or neutral (0)
Type of Interaction: type of chemical interaction directly linking the two components
PubMed ID: PubMed database accession number
Sorting cellular components (nodes) into cellular machines:Components (nodes) were separated into functional machines based on their molecule type and
location attributes using Code segment S7.
Visualization of the entire network:The diagram of the network in figure S1 was created with Code segment S8 that creates Text file
S2. This text file can be loaded into the Pajek software for visualization (S2).
Subnetworks obtained by signal propagation from ligands:
To generate these subnetworks we used a depth-first search algorithm expanding the search solely in
the downstream direction, to confirm with biologically specified directional transfer of information.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
3/22
3
For example, when a Gs protein is activated by an upstream ligand-receptor interaction it connects
only to the downstream component adenylyl cyclase. The subnetworks do not include another Gs
coupled receptor unless that receptor also interacts with the specific ligand.
The recursive algorithm (using depth-first search) is listed as Code segment S9. It was used tocreate 15 subnetworks for each node of the type “Ligand” by changing the number of steps from the
ligand. This concept is termed breadth-first search and is illustrated in Movie S20. The subnetworks
were then analyzed by computing the number of nodes, links, clustering, average path length andmotifs.
Visualization of the resulting data for connections per step for all ligands, Spreadsheet S23, was
created with MatlabTM
, Natick, MA (S3).Three sets of subnetworks induced by the nodes: GLUTAMATE, NE and BDNF were analyzed for
number of nodes and links, clustering, average path-length and motifs with Code segment S10.
These results, Spreadsheet S24, were visualized with ExcelTM
(S4).
Overall statistics of the network:
The connectivity distribution data, Spreadsheet S25, was plotted with MatlabTM
, Natick, MA (S3).
Characteristic path-length measure was implemented by using Floyd’s algorithm (S5), with Code
segment S11.
Clustering Coefficient was implemented with custom code based on the concept developed by Wattsand Strogatz (S6), using Code segment S12.
Grid Coefficient was implemented with custom code based on the concept developed by Caldarelliet al. (S7), using Code segment S13.
The results, Spreadsheet S26, were compared to the same parameters computed for 100 shufflednetworks of one island that maintain the same connectivity distribution using an algorithm
developed by Milo et al. (S8), using Code segment S14.
Network Motifs:Counting motifs in the network was accomplished by using the MFinder program developed by
Kashtan et al. (S9). Motifs of size 3, 4, 5 and 6 were visualized using VisioTM
(S10).See Table S29 of the complete results for the motifs of sizes 3 and 4.
Output files in text format from the MFinder program:
Motifs size 3: Text file S3.Motifs size 4: Text file S4.
Motifs size 5: Text file S5.
Motifs size 6: Text file S6.
Our method for motif search:
In our detailed analysis of subnetworks, we only considered closed loop circular motifs. Wesearched for these motifs using a depth-first search algorithm (Code segments S17 through S19).An example of counting motifs in a toy network is given below:
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
4/22
4
A
C D
B
The above toy network contain: 1 positive feedback loop of size 3, 1 negative feedback loop of size3 and 1 negative feed-forward motif of size 4. Nodes C and D contribute to 3 motifs whereas nodes
B and C contribute to 2 motifs. This is slightly different from the method used by Kashtan et al. (S9) because, with their method, A, B, C, and D will not be considered a feedforward loop due to the link
between A and D.
Positive vs. negative feedback and feedforward loops:
Positive feedback loops motifs are defined as loops where all the links in the loop are positive(activating), or there are an even number of negative (inhibitory) links. Negative feedback loops are
loops where there is an odd number of negative links. Feedforward loops have two “arms”, each
starting at a source node and merging into a target (sink) node. To determine if a feedforward loop is positive or negative, each “arm” was evaluated separately. Both “arms” must be positive for the
feedforward motifs to be positive. An “arm” that has all links positive or even number of negative
links is considered positive; whereas, an arm with an odd number of negative links is considerednegative.
Subnetworks from source to target nodes:
Series of subnetworks from specific source nodes (generally ligands) to specific targets
(components within machines such as channels or transcription factors), with limited path lengths,
were created using the following code: Code segment S15. The concept is illustrated in Movie S21. Shuffled networks, where only links that do not involved the source nodes and target nodes, were
created for statistical control. In the control subnetworks, we preserved the biologically specified
connectivity and directionality between ligands and their receptors and from the immediateupstream components to the AMPA channel and CREB. For all other connections we randomly
swapped the direction of the connection while preserving the overall connectivity structure of thenetwork. Curve fitting of the data, Spreadsheet S27, was done with ExcelTM
(S4).
Subnets based on nodal connectivity:
Series of sub-networks created based on nodal connectivity were created using Code segment S16.
The concept is illustrated in Movie S22. Search for feedback loops was implemented with Codesegment S17, feed-forward loops with Code segment S18, bi-fan motifs with Code segment S19.
The results are summarized in Spreadsheet S28 and were visualized with ExcelTM
(S4). The
number of islands does not consider highly connected nodes as single islands when they are notincluded. For example, PKA, a highly connected node (k=49) is not included in the subnetworks
with lower connectivity threshold and is not considered an island by itself.
Computing the Density of Information Processing (DIP):
DIP is a measure of the local density of motifs and their interconnectedness within the interaction
space of the network. DIP is defined as:
i
ii
ii
i GC L L
M M DIP ⋅⎟⎟
⎠
⎞⎜⎜⎝
⎛
−
−=
−
−
1
1 (1)
Whereiiiiii
BIFAN FFLFFLFBLFBL M ++++= 4343
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
5/22
5
M i is the total number of feedback loops, feedforward loops and bifan motifs. Li is the total links and
i represent the step. FBL3 and FBL4 are feedback loops of size 3 and 4, FFL3 and FFL4 are
feedforward loops of size 3 and 4 and BIFAN are bi-fan motifs of size 4. GC is the grid coefficient
representing interconnectedness for the motifs, computed for the subnetwork at step i.
Computing the Motif Location Index (MLI):
MLI measure the concentration of specified motifs and various locations within thenetwork. MLI can vary from 0 to 1 depending on its relative distance from the extracellular ligands
to cellular machine, where 0 indicates location at the level of machines. MLI was calculated as
follows:
MLI =n
CPLLCPLM
CPLM n
i ii
i∑=
⎟⎟ ⎠
⎞⎜⎜⎝
⎛
+1 (2)
where n is the size of the motif, CPLM is the characteristic path length from a node within the motif
to all other nodes in the cellular machine and CPLL is the characteristic path length from a node to
all extracellular ligands. If a node is an extracellular ligand then CPLL = 0; if the node is in the plasma membrane CPLL= 1. If a node belongs to a cellular machine, CPLM = 0. The average
shortest path length was computed using Floyd’s algorithm (S5).
Supporting text
Functional Organization: From Components to Information Processing Motifs
Components within mammalian cells interact with one another to form local networks thattogether form a single large network. This organization is essential for cellular components to
coordinate their individual activities and achieve the cohesiveness needed for cellular functions.
Information needs to flow between components in a continuous and organized manner. Determininghow this flow of information occurs is a crucial step in understanding the functional organization of
mammalian cells. This system of interacting cellular components based on phenotypic behavior
allows us to analyze the flow of information between the components to identify the emergence ofregulatory motifs that are capable of processing information as it flows through the network.
Regulatory motifs such as feedback loops and feedforward motifs allow the cell to process
information from extracellular signals, and decide when such information persists and when it is
transient. The presence of positive feedback loops arising from coupled biochemical reactions leadsto switching behavior that can enable state changes (S11). State change triggered by biochemical
switches has been demonstrated in developmental systems (S12).
The overall profile of motifs may be an initial indicator of the cell’s information processingcapability, when the quantitative and temporal characteristics of the interactions and consequent
motif formation are taken into account. Such quantitative specification may lead to comparative
profiles different from that observed here. Nevertheless, the identification and characterization of
motifs allows us to move from components and interactions to the next level of organization withinthe cell.
Distinct regulatory motifs are active in response to signals from different ligands. The balanceof the emergent positive and negative motifs may define the capability of the ligand to induce
plasticity or maintain homeostasis. Comparison between motifs assumes that the motifs can be
formed and function concurrently. The relationship between the interaction steps in this
pseudodynamic analysis which considers propagation of functional connectivity over chemicalspace and time steps needs to be specified at the level of individual interactions. This is discussed in
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
6/22
6
greater detail below. The relationship between distances in chemical space in such a functional
organization scheme and subcellular localization in terms of organelles and identified physical
compartments require further analysis to identify where the two types of organization overlap and
where they diverge.At the functional level, each network configuration is likely to have its own distinct pattern of
regulatory motifs, and thus define capability of the cell for either maintaining homeostasis or
promoting state change such as that seen in synaptic plasticity. Would a large ensemble of suchstates make the system intractably complex, such that we would not be able to either understand or
predict its behavior? This is unlikely because many of the states may have profiles of regulatory
motifs that produce convergent effects on cellular machines. Such convergence would provide amolecular basis for redundancy of cellular function and endow the cellular system with robustness.
Functionally, cellular network states may fall into two broad categories: those that preserve
homeostasis during system perturbation and those that promote reorganization of the parts of thecellular network. Such reorganization may include the synthesis of new components such as
transcription factors as well as other components that are required for expression of phenotypic
behavior. The notion that multiple network states may converge to yield a limited set of phenotypic
responses is supported by recent studies of networks of neurons where similar network behavior, asdefined by firing patterns, can arise from multiple network configurations based on individual
parameters that are quite different (S13). Perhaps such convergence represents a higher order oforganized redundancy arising from the many feedforward motifs. Thus, it is likely that cellularsystems are robust at varying levels of organization.
From Qualitative Representation to Quantitative Behavior of MotifsQualitative representations of cellular interaction such as those analyzed here do not take
into account individual reaction rates that define individual links. This simplifying assumption leads
to questions such as whether the identity and function of regulatory motifs would be preserved if thereaction reactions for the links that make up the motifs vary over many orders of magnitude.
Experimental studies have shown that some of the links are relevant only at specific times, atdifferent perturbation, and at different locations within the cell. Such conditionality can be
manifested in many formats. Positive (also termed coherent) feedforward loops (FFL) can functionin several ways. For example, they can be coincidence detectors (AND gates), provide redundant
pathways to the same effectors (OR gates); they can lead to long sustained activity if one arm is fastand the other arm is slower (S14, S15). These functions depend on the reaction rates of the two
arms. To obtain a preliminary idea of the potential validity of the motifs we have identified in this
study, we gathered a set of representative reactions from various classes (i.e., ligand-protein, protein-protein, and enzymatic reactions) and evaluated their rates by comparing on and off rates
and K cat values. The maximum observable difference was about 1000 fold (Table S2). We then
constructed a toy ODE model for motif 44 (Fig. S3). A schematic of the model is shown (Fig.S11A). We varied the difference in rates between the two arms by a factor of up to 1000. Although
there are temporal differences in the output, the overall output profiles are quite similar within the
same time scale (Fig. S11B). These initial simulations suggest that the integrity of the motif is persevered even with widely varying rates.
Although this toy ODE model has few components, it should be noted that this is likely to be
a limiting case since with multiple components in each arm, reactions rates are also likely to vary
and thus reducing the difference between the two arms. This is especially true for situations at two-three steps from the ligand, the location where the motifs become more prevalent. Thus, it is likely
that many of the motifs identified by qualitative analysis will be operative even when interaction
rates vary and the concentrations of components change in a regulated manner. Definitive proof that
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
7/22
7
each of these motifs is functional will only come when each reaction within a motif is parameterized
and input/output relationships are determined, and the existence of the motifs will be experimentally
verified.
Configuration of Motifs and choosing between homeostasis and plasticity:
The conversion of short term biochemical changes into long-term physiological changes including
state changes is often driven prolonged activation of key protein kinases and/or the activation orinduction of transcription factors. The extent and duration of activity of these downstream
components is regulated by the upstream signaling network. Since the network contains various
types of regulatory motifs, the balance between these motifs will determine the extent and durationof the signal reaching these key components. Abundance of positive feed-forward or feedback loops
would evoke the extended activation of these components, while the presence of negative feedback
loops and gates could limit the duration of activation of the downstream components. Hence, theratio of the positive to negative feedback and feed-forward loops can determine whether or not
downstream components are activated for extended periods at sufficient levels. This extent and
duration of activation of the downstream components like the protein kinases or transcription factor
may in turn determine if the cell can maintain homeostatic behavior or undergo long-term physiological changes. Thus, the ratio of positive to negative feedback and feed-forward motifs
could be a key indicator of whether, in the presence of external signals, the cell is configured tomaintain homeostatic balance or undergo state change. Since the propagation of connectivity bydifferent ligands involves different components and pathways, the ratio of the positive and negative
motifs is a function of the types of signals the cell receives and hence is conditional. Comparison of
the ratio of these motifs recruited per step for different ligands, glutamate, NE and BDNF clearlydemonstrates that the regulatory configuration is defined by the ligand that initiates the signal
propagation. Thus, by balancing the positive to negative motifs for different extracellular signals the
cell may be able to tune itself to either respond or not respond to extracellular signals by undergoinglong-term physiological changes. It is noteworthy that for both glutamate and BDNF in the
pathways to CREB the positive and negative loops are evenly balanced through nine steps. Such balance of regulatory loops provides an explanation of how homeostasis within the cell can be
achieved even when a perturbing signal propagates through the system. The relative abundance ofthe positive feedback and feedforward loops as compared to the negative loops in the sub-network
for NE to AMPA channels and to CREB is also noteworthy. These ratios may indicate that this sub-network can hold and transfer information across time-scales leading to persistent changes. This
configuration of motifs might provide a systems level explanation of why the cAMP pathway is
associated with the late phase of long-term potentiation in the CA1 neuron (S16). It is alsonoteworthy that propagation of connectivity from BDNF to AMPA channels and CREB results in
equal numbers of positive and negative feedback loops suggesting that BDNF would not play an
important role in changing the state of this cell. This is in agreement with experimental observationthat BDNF induces plasticity by affecting the functions of the presynaptic CA3 neuron (S17)
although acute effects of BDNF are observed in the postsynaptic CA1 neuron (S18). Thus some of
the characteristic responses evoked by important neurotransmitters may be understood byidentifying the configurations of the regulatory motifs they evoke within the cell.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
8/22
8
Supporting figures
Fig. S1
Fig. S1 Visualization of the mammalian neuronal cellular network using the Pajek software
The network is visualized by placing nodes as triangles based in their functional compartments.Size of triangles demonstrates the level of connectivity for the node. Green arrows represent
activation links, red arrows inhibition links, and blue arrows neutral links. The network data was
stored in the .net file format using custom code (Code segment S8). The output was then loaded intothe Pajek program (S2) for visualization.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
9/22
9
Fig. S2
A C
BStatistical characteristics of the fully connected network: Average links per node: 4.62 Characteristic Path Length: 4.21 (3.81 ± 0.026)*Grid Coefficient: 0.026 (0.005 ± 0.0005)*Clustering Coefficient: 0.11 (0.03 ± 0.004)** Mean ± SD computed for 100 shuffled networks
Fig. S2 Statistics of the fully connected network
A) Distribution of interactions (links) into various categories. Interactions (links) in the cellular
network were sorted using two different criteria: the biochemical reaction and the effect of the link.
Upper pie chart shows sorting by biochemical reaction mechanism and the lower pie chart shows the
distribution of connections that activate or inhibit downstream components. Neutral links indicateinteractions where the directionality is unspecified. B) Overall network characteristics: In
parentheses are the mean and standard deviation of the characteristic path-lengths, clustering-
coefficients and grid-coefficients computed for 100 shuffled “null” networks with the sameconnectivity distribution as the cellular network. These shuffled networks were created using an
approach based on Milo et al (S8). C) Log-log plot of the connectivity distribution. The number of
links per node (k ) is plotted against the number of nodes with the same level of connectivity.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
10/22
10
Fig. S3
Fig. S3 Schematic representation of the motifs identified within the cellular network
Network Motifs identified using the MFinder program. The figure visually represents the text based
output, produced by the MFinder program. These motifs are the most statistically significant
identified network motifs within the network. The MFinder program searches for motifs in directed
networks, it does not distinguish between positive and negative links. Thus, for this analysis weconsidered positive and negative links as unidirectional and neutral links as bi-directional. The
MFinder program was developed by Alon and coworkers (S9). The network motifs of size 5 and 6were identified using the sampling methods. The various types of motifs are identified by numbers.
The counts of the various types of motifs in the fully connected network are given in Table S1.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
11/22
11
Table S1
Motifs counts
Moti f # CN* SN** Z-score
31 16 4.8 ± 2.8 3.98
32 22 9.3 ± 3.3 3.84
33 14 8.1 ± 2.6 2.30
34 36 12.5 ± 3.7 6.38
35 32 12.2 ± 3.8 5.16
36 25 9.8 ± 3.7 4.12
41 1011 186.6 ± 32.6 25.31
42 108 68.2 ± 14.8 2.69
43 26 7.2 ± 4.8 3.91
44 303 104.0 ± 15.5 12.88
45 57 17.8 ± 7.3 5.39
46 105 40.0 ± 10.9 5.97
47 49 31.5 ± 8.3 2.12
* CN- Cellular Network.
** SN- Shuffled networks.Mean ± SD computed for 100 shuffled networks.
Using the sampling method (S8):
Motifs counts
Moti f # CN* SN** Z-score
51 0.254 0.023 ± 0.025 9.27
52 0.065 0.002 ± 0.003 11.42
53 0.662 0.005 ± 0.011 60.05
54 0.048 0.000 ± 0.002 26.57
55 0.100 0.003 ± 0.007 13.92
56 0.020 0.0 ± 0.0 NA57 0.016 0.0 ± 0.0 NA
61 0.395 0.001 ± 0.011 36.01
62 0.189 0.0 ± 0.0 NA
63 0.050 0.0 ± 0.0 NA
64 0.096 0.0 ± 0.0 NA
65 0.070 0.0 ± 0.0 NA
66 0.059 0.0 ± 0.0 NA
67 0.049 0.0 ± 0.0 NA
Table S1 Counts of the various types of motifs found in the cellular network Motifs were counted using the MFinder program (S9). Counts for the motifs in the cellular network
(CN) are compared to the mean for 100 control (shuffled) networks. Z-score indicates the statistical
significance for the number found in the real network as compared to the mean of the controlnetworks. Motifs of size 5 and 6 were estimated using the sampling method described by Kashtan et
al. (S9). Estimated values are given as fractions (count of the specified motifs / the total number of
motifs of the same size).
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
12/22
12
Fig. S4
A B
C D E
Fig. S4 Analysis of subnetworks from glutamate, NE and BDNF
The number of links (A) and nodes (B) added per step, downstream from three ligands known to
regulate plasticity in hippocampal neurons. C) Characteristic Path Length (CPL), D) ClusteringCoefficient (CC) and E) Grid Coefficient (GC) computed for subnetworks emanating from
glutamate, nor-epinephrine and BDNF.
Fig. S5
A B C
Fig. S5 Comparison of positive vs. negative feed-forward loop motifs in subnetworks
emanating in steps from the three ligands: glutamate, nor-epinephrine and BDNFCounts were obtained using the source code described under Methods. Positive feed-forward loop
motifs are feed-forward loops where both “arms” to the target node are positive. If at least one arm
is negative the feed-forward is considered negative. Positive “arm” is defined as an “arm” with
either no negative links or even number of negative links.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
13/22
13
Fig. S6
Fig. S6 Regulatory motifs identified in subnetworks emanating in steps from glutamate (Glu),
norepinephrine (NE) and BDNF
The motifs identified using our algorithms (see custom codes S17-S19) can classify motifs based on
the different types of links and nodes. This is an enhancement to the MFinder program that onlyidentifies more abstract motifs based on directionality only.
A) The scaffold motif is made of three nodes connected solely by neutral links.
A-I: A schematic illustration of a scaffold motif.A-II: The counts of scaffold motifs found in subnetworks generated from the three ligands
A-III: Representative examples of the scaffold motifs.B) The bi-fan motifB-I: Schematic representation of the bifan motif.
B-II: The counts of bi-fan motifs found in subnetworks generated from the three ligands.
B-III: Representative examples of bifan motifs.
C) The feedforward loop motifs are composed of three or four nodes connected with directed linksonly where a source node feeds into a target node through two alternative routes.
C-I: Schematic representations of the feedforward loops
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
14/22
14
C-II: The counts of feed-forward loop motifs found in the subnetworks generated from the three
ligands
C-III: Representative examples of feed-forward motifs.
Positive feed-forward loop motifs are feed-forward loops where both “arms” to the target node are positive. Positive “arm” is defined as an “arm” with either no negative links or even number of
negative links. If at least one arm is negative the feed-forward is considered negative.
Abbreviations: PKA: protein kinase A, PKC: protein kinase C, CaM: calmodulin, CaN: calcineurin,
GlyR: glycine receptor, NMDAR: NMDA receptor, AC2: adenylyl cyclase 2, PLCβ: phospholipaseC beta, CREB: cyclic-AMP response element binding protein. CREM: cyclic-AMP response
element modulator. β-ARK: beta adrenergic receptor kinase.
Fig. S7 A
B
Fig. S7 Size of subnetworks from extracellular ligands to specified effectors
A) Schematic representation of the source and target nodes for the subnetworks starting from the
three extracellular ligands: glutamate, nor-epinephrine (NE) and brain-drived neurotrophic factor(BDNF), (source nodes) to reach the effectors AMPA receptor-channels (AMPAR) or the
transcription factor cyclic-AMP response element binding protein (CREB) (target nodes). B) The
changes in the number of nodes as more steps are allowed to reach the target nodes (effectors) from
the source ligands for six sets of subnetworks.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
15/22
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
16/22
16
Fig. S9
A
B C D
E F G
Fig. S9 Subnetworks characteristics from extracellular ligands to AMPAR
A) Schematic of the source and target nodes for the subnetworks starting from the three extracellularligands: glutamate, nor-epinephrine (NE) and brain-derived neurotrophic factor (BDNF), (source
nodes) to reach the effector AMPA channel/receptor (AMPAR) (target node). B-D) Changes in the
number of links as more steps are allowed to reach the effectors from each ligand are compared to
the same analysis with shuffled networks. Only the directionality of the links that do not involve theligands or the effectors was randomly swapped while preserving the connectivity. The resultant
graphs for both the cellular network and shuffled networks were curve fitted. For all cellular
subnetworks the best fit function was linear (R 2 =0.99-0.96). Individual values are given as lin R
2.
For all of the shuffled networks the best fit was obtained for either exponential or power law
functions (lin and pow respectively). E-G) Counts of the number of total three and four component positive and negative feedback and feedforward loops, in subnetworks where more steps aregradually allowed between the source and target nodes.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
17/22
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
18/22
18
Fig. S11
A
input
Act ivated A Act ivated B
output
Negative
Regulator
B
0 500 1000 15000.00
0.05
0.10
0.15 kcat A/kcat B = 1/1
kcat A/kcat B = 10/1
kcat A/kcat B = 10/0.1
kcat A/kcat B = 10/0.01
t
O u t p u t
0 500 1000 15000.00
0.05
0.10
0.15
Km A/Km B = 1/1
Km A/Km B = 10/1
Km A/Km B = 10/0.1
Km A/Km B = 10/0.01
t
O u t p u t
Fig. S11 Quantitative evaluation of a simple feedforward motif (motif 44 in Fig. S3).
A) A feedforward motif (Motif 44, Fig S3) was constructed and simulated using a system of coupledODEs. In this model, a simulated square pulse of input activates enzymes A and B. After 300seconds, the input was “washed out,” and the behavior of the output, which is activated by both A
and B, was studied. To balance the activities of the activator, a negative regulator was included.
The initial concentration of all components was 1 µM. B) To study the motif behavior, sensitivityanalyses were performed on k cat and K m. The ratio between the kinetics parameters of the two arms
were varied from 10 (k catA = 10 s-1
, k catB = 1 s-1
), to 1000 (k catA = 10 s-1
, k catB = 0.01 s-1
). Similaranalysis was performed on K m. The activity of the output component is plotted as a function of
time. The analysis shows that the motif behavior is robust. Changing the kinetics parameters over 3
orders of magnitude had little effect on the qualitative behavior of the motif. As assessed by input-output relationships. These initial results suggests that components with widely varying reactions
rates can come together to form functional units.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
19/22
19
Table S2
Molec#1/Catalyst Molec#2/Substrate/Product Reaction typek f
(/µM/s)kr (/s)
KD/KM (µM)
kcat (/s) Ref. Notes
Rap1GAPII Rap1 GAP 1.4 80.5 65 11 (1)
cAMP PKA site1 Binding 1.67 0.0167 0.010 (2)
cAMP PKA site2 Binding 0.0093 0.00028 0.030 (2)
β ARK β -AR Phosphorylation 0.2 0.2 (3)
Csk Src Phosphorylation 1.8 40 22 0.4 (4)2-fold adjustment for T;KD & kr are lower limits
calcium N-term calmodulin Binding 6 500 (5)
calcium C-term calmodulin High Aff-Binding 6 10 (5)
calcium C-term calmodulin Low Aff-Binding 6 100 (5)
high and low-affinitybinding @ C-term reflects
CaM complexing;dissociation but not
association rate constantswere modified by CaM's
interactions
MKP3 ERK2 Dephosphorylation 0.022 0.2 (6)
CDK1-cyclin Clb2 unidentified substrate/s35
(ATP)2.2 (7)
Used crude cell lysate assubstrate. i.e. no single
target, but perhaps morephysiological
PIP2 PKB (PH dom) Binding 0.5 (8)
RasGAP Ras GAP 4.8 9.12 (9)
Caspase 9 Caspase3 Cleavage 0.089 0.0168 (10)
Caspase 3 PARP Cleavage 5.6 0.213 (10)
Sos Ras GEF 0.50505 0.02 (11)
PKA Representative substrate Phosphorylation 2.4 2.7 (12)
PKA Representative substrate Phosphorylation 7.5 9 (13)
CDK5 (p35isoform)
Tau Phosphorylation 33 0.0433 (14)
CDK5 (p25isoform)
Tau Phosphorylation 27 0.217 (14)
CalcineurinG-substrate, a substrate for
PKGDephosphorylation 3.8 0.41 (15)
Calcineurin DARPP-32, a substrate forPKA Dephosphorylation 1.6 0.2 (15)
Calcineurin synapsin I (site 1) Dephosphorylation 7 0.053 (15)
Calcineurin synapsin I (site 2) Dephosphorylation 4.4 0.04 (15)
Grb2 Sos Binding 0.025 0.0168 (16)
PKC (Ca bound) DAG Binding 0.008 8.6348 (16)
calmodulin MLCK (non phosphorylated) Binding 28 0.031 (17)
calmodulin MLCK (non phosphorylated) Binding 8 0.186 (17)
MEK MAPK Phosphorylation 0.476 0.113(18,19)
Km = 0.476 uM for 1stphosphorylation, 0.046
uM for secondphosphorylation
Raf MEK1 Phosphorylation 0.8 0.105 (20)
GSK3β β-CATENINPhosphorylation 0.1276 3.5
(21,
22)1.7 6.13 (23)
STEP MAPK Dephosphorylation 38 12(24,25)
AC 1
AC cAMP Synthesis 1.4 7.9(26,27)
PDE4D cAMP Cleavage
NWASP Arp2/3 Binding 0.003 (28)
Grb2 NWASP Binding 0.005 (28)
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
20/22
20
References for Table S2:1. Kraemer, A., Brinkmann, T., Plettner, I., Goody, R. & Wittinghofer, A. (2002) J Mol Biol 324, 763-74.2. Zorn, M., Fladmark, K. E., Ogreid, D., Jastorff, B., Doskeland, S. O. & Dostmann, W. R. (1995) FEBS Lett 362, 291-4.3. Kim, C. M., Dion, S. B. & Benovic, J. L. (1993) J Biol Chem 268, 15412-8.4. Lieser, S. A., Shindler, C., Aubol, B. E., Lee, S., Sun, G. & Adams, J. A. (2005) J Biol Chem 280, 7769-76.5. Gaertner, T. R., Putkey, J. A. & Waxham, M. N. (2004) J Biol Chem 279, 39374-82.6. Zhou, B., Wang, Z. X., Zhao, Y., Brautigan, D. L. & Zhang, Z. Y. (2002) J Biol Chem 277, 31818-25.7. Ubersax, J. A., Woodbury, E. L., Quang, P. N., Paraz, M., Blethrow, J. D., Shah, K., Shokat, K. M. & Morgan, D. O. (2003)
Nature 425, 859-64.8. Frech, M., Andjelkovic, M., Ingley, E., Reddy, K. K., Falck, J. R. & Hemmings, B. A. (1997) J Biol Chem 272, 8474-81.
9. Gideon, P., John, J., Frech, M., Lautwein, A., Clark, R., Scheffler, J. E. & Wittinghofer, A. (1992) Mol Cell Biol 12, 2050-6.10. Bentele, M., Lavrik, I., Ulrich, M., Stosser, S., Heermann, D. W., Kalthoff, H., Krammer, P. H. & Eils, R. (2004) J Cell Biol 166,
839-51.11. Orita, S., Kaibuchi, K., Kuroda, S., Shimizu, K., Nakanishi, H. & Takai, Y. (1993) J Biol Chem 268, 25542-6.12. Hemmings, H. C., Jr., Nairn, A. C. & Greengard, P. (1984) J Biol Chem 259, 14491-7.13. Bhalla, U. S. (2002) Biophys J 83, 740-52.14. Hashiguchi, M., Saito, T., Hisanaga, S. & Hashiguchi, T. (2002) J Biol Chem 277, 44525-30.15. King, M. M., Huang, C. Y., Chock, P. B., Nairn, A. C., Hemmings, H. C., Jr., Chan, K. F. & Greengard, P. (1984) J Biol Chem
259, 8080-3.16. Bhalla, U. S., Ram, P. T. & Iyengar, R. (2002) Science 297, 1018-23.17. Kasturi, R., Vasulka, C. & Johnson, J. D. (1993) J Biol Chem 268, 7958-64.18. Seger, R., Ahn, N. G., Posada, J., Munar, E. S., Jensen, A. M., Cooper, J. A., Cobb, M. H. & Krebs, E. G. (1992) J Biol Chem
267, 14373-81.19. Haystead, T. A., Dent, P., Wu, J., Haystead, C. M. & Sturgill, T. W. (1992) FEBS Lett 306, 17-22.20. Force, T., Bonventre, J. V., Heidecker, G., Rapp, U., Avruch, J. & Kyriakis, J. M. (1994) Proc Natl Acad Sci U S A 91, 1270-4.21. Ikeda, S., Kishida, S., Yamamoto, H., Murai, H., Koyama, S. & Kikuchi, A. (1998) Embo J 17, 1371-84.
22. Lee, E., Salic, A., Kruger, R., Heinrich, R. & Kirschner, M. W. (2003) PLoS Biol 1, E10.23. Paul, S., Snyder, G. L., Yokakura, H., Picciotto, M. R., Nairn, A. C. & Lombroso, P. J. (2000) J Neurosci 20, 5630-8.24. Tang, W. J., Stanzel, M. & Gilman, A. G. (1995) Biochemistry 34, 14563-72.25. Taussig, R., Tang, W. J., Hepler, J. R. & Gilman, A. G. (1994) J Biol Chem 269, 6093-100.26. Lim, J., Pahlke, G. & Conti, M. (1999) J Biol Chem 274, 19677-85.27. Bolger, G. B., Erdogan, S., Jones, R. E., Loughney, K., Scotland, G., Hoffmann, R., Wilkinson, I., Farrell, C. & Houslay, M. D.
(1997) Biochem J 328 ( Pt 2), 539-48.28. Carlier, M. F., Nioche, P., Broutin-L'Hermite, I., Boujemaa, R., Le Clainche, C., Egile, C., Garbay, C., Ducruix, A., Sansonetti,
P. & Pantaloni, D. (2000) J Biol Chem 275, 21946-52.
Table S2 Reaction rates for representative reactions underlying interactions in the network
and a listing of the primary references from which these rates were obtained
Rates for several classes of the reactions underlying the links in the network were collected from
published studies. The reactions for which rates are shown include: non-covalent binding includingligand-protein and protein-protein interactions, enzymatic reactions such as cAMP synthesis,
phosphorylation, dephosphorylation, proteolytic cleavage, and changes in catalytic activity resulting
from protein-protein interactions such as the GEF and GAP activity for GTPases. These types ofreactions account for a large portion of the interactions within the cellular network.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
21/22
21
Table S3
Table 3 Supporting external files
The network database is provided in Text file S1 as a flat file where the field separator is a blank
space. Each line in the file describes an interaction extracted from a specific journal. The tokensdescription is provided in the methods section under “Storage of the network data”. The Pajek
software can be used to visualize the network with Text file S2 (S2). Text files S3-S6 are the raw
output files from the MFinder program. Code segments S7-S19 contain algorithms written in C/C++used to generate, analyze and visualize the subnetworks in the study. Movies S20-S23 illustrate the
concept we termed pseudodynamics used to create the three different types of subnetworks.
Spreadsheets S23-S28 contain the raw data from the analysis. These tables were used to create most
of the graphs in the study. The supporting external files can be found also athttp://amp.pharm.mssm.edu/network/sm/SM.htm and at http://www.mssm.edu/labs/iyengar .
Other supporting online materials are available at: http://amp.pharm.mssm.edu/network/index.htm
File name Description
Text file S1 Network data in flat file format.
Text file S2 Pajek format text file describing the network data and assigns 2D location for nodes on the map.
Text file S3 Raw output from the MFinder program searching for motifs of size 3.
Text file S4 Raw output from the MFinder program searching for motifs of size 4.
Text file S5 Raw output from the MFinder program searching for motifs of size 5.
Text file S6 Raw output from the MFinder program searching for motifs of size 6.Code segment S7 Rules for sorting components into functional machines.
Code segment S8 Converts the network into a Pajek format text file.
Code segment S9 Creates sub-network where connectivity is propagated from ligands.
Code segment S10 Analysis for nodes, interactions, motifs, clustering and path-lengths of sub-networks starting at
ligands.
Code segment S11 Implementation of the Floyd algorithm to compute the characteristic path-length.
Code segment S12 Implementation of the clustering coefficient algorithm.
Code segment S13 Implementation of the grid coefficient algorithm.
Code segment S14 Algorithm used to create 100 shuffled networks with the same connectivity distribution and oneisland.
Code segment S15 Code segment used to create sub-network from specified source node to specified target node.
Code segment S16 Code used to create sub-networks with gradual allowable nodal connectivity.
Code segment S17 Algorithm used to search for feedback loops.Code segment S18 Algorithm used to search for feed-forward loops.
Code segment S19 Algorithm used to search for bi-fan motifs.
Movie S20 Illustration of the breadth-first concept.
Movie S21 Illustration of the sub-networks where the source and target are specified.
Movie S22 Illustration of the concept of gradual inclusion of nodes based on connectivity.
Spreadsheet S23 Connectivity propagation per step from all ligands.
Spreadsheet S24 Number of nodes, interactions, motifs, computed clustering and path-lengths for sub-networks
starting at ligands.
Spreadsheet S25 Connectivity distribution table.
Spreadsheet S26 Characteristic path-lengths, clustering coefficients and grid coefficients for the real network andfor 100 shuffled networks with the same connectivity distribution.
Spreadsheet S27 Results of the analysis of sub-networks from source to target.
Spreadsheet S28 Analysis of sub-networks based on nodal connectivity.
8/9/2019 NIHMS266526-supplement-Supplemental_Online_Materials.pdf
22/22
Supporting References:
S1. N. R. Gough, L. B. Ray, Science STKE. 135, EG8 (2002)
S2. http://vlado.fmf.uni-lj.si/pub/networks/pajek/ (2004)S3. http://www.mathworks.com/products/matlab/ (2004)
S4. http://office.microsoft.com/excel/ (2004)
S5. T. H. Cormen, et al. 2002, Introduction to Algorithms, MIT Press Cambridge, MA.S6. D. J. Watts, S. H. Strogatz, Nature. 393, 440 (1998)
S7. G. R. Caldarelli, Pastor-Satorras et al. European Physical Journal B. 38, 183 (2004)
S8. R. Milo, S. Shen-Orr, S. Itzkovitz, et al., Science. 298, 824 (2002)S9. N. Kashtan, S. Itzkovitz, R. Milo, Bioinformatics. 20, 1746 (2004)
S10. http://office.microsoft.com/visio (2004)
S11. U.S. Bhalla, R. Iyengar, Science. 283, 381 (1999)S12. W. Xiong , J. E, Ferrell Jr. Nature. 426, 460 (2003)
S13. A.A. Prinz AA, D. Bucher, E. Marder, Nature Neuroscience 7, 1345 (2004)
S14. S. Mangan, U. Alon, Proc Natl Acad Sci U S A. 100, 11980 (2003)
S15. S. Mangan, A. Zaslaver, U. Alon, J Mol Biol. 334, 197 (2003)S16 P. V. Nguyen, T. Abel, E. R. Kandel, Science. 265, 1104 (1994).
S17. S. S. Zakharenko, S. L. Patterson, I. Dragatsis, et al., Neuron. 39, 975 (2003).S18. Y. Kovalchuk, E. Hanse, K. W. Kafitz, et al., Science. 295, 1729 (2002).
Author Contributions
AM assembled the large-scale network, wrote all of the custom-code and conducted all of
the analysis described in this study. SLJ developed the database to manage and maintain the
components links and references programs; AM, SN, AH, EG, BD-T, NJE, SLJ, RDB and RI foundand read the original papers cited as references to verify the components and interactions. GW and
PTR developed the original connections map of nearly 200 components on which the network usedin this study was based upon. JJR, AK and GAS provided approaches to analyze and statistically
validate several of the key findings. NJE performed the quantitative analysis of the feedforwardmotif. AH assembled the table of rate constants. RDB provided supervision of the development of
interactions maps and its anchoring to the hippocampal neuron. RI was responsible for overallsupervision of the project including analysis of the data and writing of the manuscript and is
responsible for the final version of the submitted paper.