+ All Categories
Home > Documents > Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor...

Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor...

Date post: 24-Mar-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
50
Network Biology- part V Jun Zhu, Ph. D. Professor of Genomics and Genetic Sciences Icahn Institute of Genomics and Multi-scale Biology The Tisch Cancer Institute Icahn Medical School at Mount Sinai New York, NY @IcahnInstitute http://research.mssm.edu/integrative-network-biology/ Email: [email protected]
Transcript
Page 1: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Network Biology- part V

Jun Zhu, Ph. D.

Professor of Genomics and Genetic Sciences

Icahn Institute of Genomics and Multi-scale Biology

The Tisch Cancer Institute

Icahn Medical School at Mount Sinai

New York, NY

@IcahnInstitute

http://research.mssm.edu/integrative-network-biology/

Email: [email protected]

Page 2: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Goals of the workshop

▶ NOT to teach you how to use one method or one

program

▶ Learn from history

▶ Learn about critical thinking

– What you want to achieve?

– What you need to achieve the goal?

– How to abstract a biological problem into

mathematical problem?

– What are underlying assumptions and problems?

Page 3: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Why it is so hard to model biological systems? ▶ The more we learn, the more complicated it becomes!

Post transcriptional regulation

• Splicing (1981)

• RNA editing (1986)

• miRNA mediated regulation (1993)

Post translational regulation

• Phosphorylation

• Glycosaltion

• acetylation It is not one gene to one protein anymore!

Epigenetic regulation : heritable

changes in gene function that cannot

be explained

by changes in DNA sequence

• DNA methylation

• Chromotin structure

Junk DNA?

Page 4: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

The cost of developing a prescription drug that gains market approval

Mullin Scie. Ameri. 2014

Page 5: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

What are Bayesian networks? Association vs Causality

From Stephen Friend

Page 6: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A simple biological question: are there

causal/reactive relationships?

Page 7: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A Bayesian network approach:

Best model

Page 8: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A Bayesian network approach:

A

B C

Best models Markov Equivalent models

A

A

A

B

B

B

C

C

C

Page 9: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A Bayesian network ≠ a causal structure

Markov Equivalent models

A

A

A

B

B

B

C

C

C

|B C A

Page 10: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Why it is so hard to model biological systems? ▶ The more we learn, the more complicated it becomes!

Post transcriptional regulation

• Splicing (1981)

• RNA editing (1986)

• miRNA mediated regulation (1993)

Post translational regulation

• Phosphorylation

• Glycosaltion

• acetylation It is not one gene to one protein anymore!

Epigenetic regulation : heritable

changes in gene function that cannot

be explained

by changes in DNA sequence

• DNA methylation

• Chromotin structure

Junk DNA?

Page 11: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Complex diseases: observations to models

diseases

per

turb

ati

on

s

per

turb

ati

on

s

Page 12: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

X

F1

F2

F0

Diabetes

resistant

Diabetes

susceptible

Animal model: mouse F2 intercrosses

Bayesian network: how to break

Markov equivalent?

Page 13: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Variation in mRNA leads to

variation in protein, which in

turn can lead to disease

Causal inference: genetics

Perturbations with a causal anchor

--Natural variation in a segregating population provides the same type of

causal anchor

DNA Supporting

Gene X

Variation in DNA leads to

variation in mRNA

AA

CA

GT

T

AA

CG

GT

T

High expression, alt

splicing, codon

change, etc.

Low expression, no alt.

splicing, no codon

change, etc.

Central Dogma of Biology

Schadt et al. Nature Genetics (2005)

Page 14: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A Bayesian network approach:

Best models Markov Equivalent models

Page 15: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A framework for building causal networks

probabilistic

graphic models

Microarray data

Proteomic data

Genomics

Genetics

Medline Biocarta/Biopathway Biologists

Database

GUI Hypothesis, test

High throughput

data

knowledge

Metabolomic data

Page 16: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Structure priors based on causality

▶ Estimate confidence of causality

– Bootstrap samples for 200

times

– Factions of causal, reactive,

independent calls

▶ The pair is independent

▶ The pair is causa/reactive

Zhu et al., PLoS CompBio, 2007

Page 17: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: integrating genetic data

• Give a sense of causality to Bayesian network

• how much improvement is achieved by integrating genetic data?

Page 18: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: integrating genetics

Experimental Hsd11b1 signature : mice treated with Hsd1

inhibitor

Prediction Hsd1 signatures based on BxD data

Correlation to Hsd1 10% of predicted signature overlap with experimental one

BN without genetics 20% of predicted signature overlap with experimental one

BN with genetics 52% of predicted signature overlap with experimental one

Zhu J et al, Cytogenet Genome Res. (2004)

Page 19: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian Network: a simulation study

Zhu et al., PLoS CompBio, 2007

Page 20: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: Genetics information is critical

when sample size is small

Largest improvement in recall occurs

with smaller sample sizes

Zhu et al., PLoS CompBio, 2007

Page 21: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: integrating genetic data

L1 L2 Ln-1 Ln

G1 G2 Gn-1 Gn Gj

Lj

Cis-regulation

Genetic loci

trans-regulation Transcriptional regulation

Gene

Page 22: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

recall

pre

cis

ion

Weak signals Strong signals

300 samples 900 samples 300 samples 900 samples

Page 23: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: why samples matter?

Page 24: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A framework for data integration

probabilistic

graphic models

Microarray data

Proteomic data

Genomics

Genetics

Medline Biocarta/Biopathway Biologists

Database

GUI Hypothesis, test

High throughput

data

knowledge

Metabolomic data

Page 25: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Yeast-2-hybrid system

Limitations:

• High false positive and negative rates

• Only for soluble proteins

• not in a physiological condition

Lodish, et al., Molecular Cell Biology

Gene fusion Gene fusion

reporter gene

Page 26: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: PPI

Zhu J et al, Nature Genetics, 2008

Page 27: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: PPI

Zhu J et al, Nature Genetics, 2008

3-clique

4-clique 4-clique

3-clique

Clique community

(partial clique)

Page 28: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: PPI

Zhu J et al, Nature Genetics, 2008

Page 29: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: Transcription Factors

C B

TF

D E

Is the TF is functional?

Are genes B, C, D, and E are correlated?

Page 30: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network: Transcription Factors

Introducing scale-free priors for TF or protein

complex

)()( TwgTp

)),(log()(

Rg

cutoffi

i

rgTrTw

Zhu J et al, Nature Genetics, 2008

Page 32: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu J et al, Nature Genetics, 2008

Integration improves network qualities

BN KO data GO terms TF data

w/o any priors 125 55 26

w/ genetics

priors 139 59 34

w/ genetics, TF

and PPI

priors 152 66 52

Page 33: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu J et al, Nature Genetics, 2008

LEU2 GCN4

ILV6

GCN4

LEU2 KO gives rise to small expression signature

• LEU2 KO sig enriched (p~10E-18)

• GCN4 downregulated in LEU2 KO small signature

ILV6 gives rise to large expression signature

• ILV6 KO sig enriched (p~10E-52)

• GCN4 upregulated in ILV6 KO large signature

Prospective validation is the gold

standard

Page 34: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

How does LEU2 affect LEU3 activity?

LEU3 binding sites

LEU2

mRNA expression

LEU2 LEU3

Surrogate marker for Leu3p activity

Page 35: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

A framework for building causal networks

probabilistic

graphic models

Microarray data

Proteomic data

Genomics

Genetics

Medline Biocarta/Biopathway Biologists

Database

GUI Hypothesis, test

High throughput

data

knowledge

Metabolomic data

Page 37: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu et al, PLoS Biology, 2012

Metabolite abundance is under genetic control

Page 38: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

KEGG biochemical pathways

emdeemp ,)(

Zhu et al, PLoS Biology, 2012

Page 39: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

LEU2 mRNA is causal to 2-isopropylmalate

KEGG pathway

Zhu et al, PLoS Biology, 2012

Page 40: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

LEU3 binding site

LEU2

With metabolomic data

Page 41: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

LEU3 regulation

• The activity of Leu3p is positively regulated by alpha-isopropylmalate (IPM), the product of the first step in leucine biosynthesis

Sze JY, et al. (1992) In vitro transcriptional activation by a metabolic intermediate: activation by Leu3 depends on alpha-isopropylmalate. Science 258(5085):1143-5

• The degree of activation by Leu3p is Leu3p concentration dependent, and it has been shown that LEU3 gene expression is regulated by general amino acid control, which is mediated by the GCN4 transcription factor

Zhou K, et al. (1987) Structure of yeast regulatory gene LEU3 and evidence that LEU3 itself is under general amino acid control. Nucleic Acids Res 15(13):5261-73

Page 42: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

2-isopropylmalate: mechanism of causal

regulator LEU2

LEU2 genotype LEU2 activity 2-isopropylmalate

LEU3 activity Transcriptional response for

genes with LEU3 binding sites

Page 43: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu et al, PLoS Biology, 2012

Consistent with KEGG pathway

Page 44: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

What else can you learn from integrating

metabolomic data? Metabolite QTLs Causal candidates

Protein degradation

Metabolite

Signature

size

KO

Page 45: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu et al, PLoS Biology, 2012

Page 46: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu et al, PLoS Biology, 2012

Is the transcriptional effect real?

Page 47: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Zhu et al, PLoS Biology, 2012

PHM7-ko affects many metabolites

Page 48: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Integration of CNV blocks into Bayesian networks

Network-based model selection

Random

gene

Tran et al. BMC Sys. Biol. 2011

Page 49: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Bayesian network:

• Problems:

• Very computational intensive

• Need a large amount of data

• No positive/negative feedback controls

Page 50: Network Biology- part V - NUS · Causal inference: genetics Perturbations with a causal anchor --Natural variation in a segregating population provides the same type of causal anchor

Aknowledgements Mount Sinai

Genomics Institute

Eric Schadt

Bin Zhang

Zhidong Tu

Charles Powell

Patrizia Casaccia

Zhu lab

Seungyeul Yoo

Eunjee Lee

Li Wang

Luan Lin

Quan Long

•Icahn Institute of Genomics and Multiscale Biology,

Icahn School of Medicine at Mount Sinai

•Janssen

•Canary Foundation

•Prostate Cancer Foundation

•NIH

•NCI

Supported by:

Boston University

Avrum Spira

Joshua Campbell

U Washington

Roger Baumgarner

Berkerley

Rachel Brem

Princeton

Lenoid Kruglyak


Recommended