+ All Categories
Home > Documents > Andrey Alexeyenko [email protected] Functional analysis with pathways and networks.

Andrey Alexeyenko [email protected] Functional analysis with pathways and networks.

Date post: 01-Jan-2016
Category:
Upload: garey-eaton
View: 223 times
Download: 0 times
Share this document with a friend
Popular Tags:
21
Andrey Alexeyenko [email protected] Functional analysis with pathways and networks
Transcript
Page 1: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Andrey Alexeyenko

[email protected]

Functional analysis with pathways and networks

Page 2: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

FunCoup: on-line interactome resource

Andrey Alexeyenko and Erik L.L. Sonnhammer (2009) Global networks of functional coupling in eukaryotes from comprehensive data integration. Genome Research.

http:

//Fu

nCou

p.sb

c.su

.se

Page 3: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Data integration in predicted networks

Page 4: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.
Page 5: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Species-specific networks:evolutionary conservation

Mouse vs. human

Page 6: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Cancer-specific networks:links inferred from

expression, methylation, mutations

Functional couplingtranscription transcription transcription methylation methylation methylation mutation methylation mutation transcriptionmutation mutation

+ mutated gene

State-of-the-art method to beat:

Reverse engeneering froma single

source (usually transcriptome)

Page 7: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Do gene networks tell the story?

• Yellow diamonds: somatic mutations in prostate cancer

• Pink crosses: also mutated in glioblastome (TCGA)

State-of-the-art method to beat:Frequency analysis of somatic mutations

Page 8: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Network analysis made statistically sound:compared to a reference and quantified

N links_real = 12

N links_expected = 4.65

Standard deviation = 1.84

Z = (N links_observed – N links_expected) / SD = 3.98P-value = 0.0000344FDR < 0.1

Actual network: observed pattern A random pattern

Question:Is ANXA2 related to TGFbeta signaling?

Page 9: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Network enrichment analysis: what is the reference?

We refer to a randomized network: preserves topology, lost biology

• Link swapping (Perl): (original algorithm by Maslov & Sneppen, 2003) – work with Simon Merid.

• De novo network generation (C++): work with T. McCormack, O. Frings, E. Sonnhammer.

• Matrix permutation (R): work with Woojoo Lee, Yudi Pawitan.

Page 10: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Biological analysis of differential expression

Functional set?

Our alternative:Network enrichment analysis

Altered genes

0 50 100 150 200 250 300 350 400 450

No. of positives, random groups

0

200

400

600

800

1000

1200

1400

No.

of p

ositi

ves,

rea

l gro

ups

GEA_SIGN GEA_REST NEA_SIGN NEA_REST

State-of-the-art method to beat:Gene set enrichment analysis

Page 11: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Network enrichment analysis: applications

N links_real = 6N links_expected = 1.00

Standard deviation = 1.25Z = 3.97P-value = 0.0000356

Question:Does gene expression in MAT230414 relate to “response to tumor cell”?

Pathway characterization Detection of driver mutations Coherence of genome alterations

N links_real = 55N links_expected = 37.05

Standard deviation = 3.59Z = 3.59P-value = 0.00016

Question:Could copy number alteration in EHR in HOU501106 lead to changes of its transcriptome and proteome?

N links_real = 0N links_expected = 1.05

Standard deviation = 0.80Z = -1.31P-value = 0.905

Question: Are CNA in HOU501106 coherent?

State-of-the-art method to beat: Observational science

Page 12: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Analysis of cancer-specific wiringPathway network of normal

vs. tumor tissues

Edges connect pathways given a higher (N>9; p0<0.01; pFDR<0.20) number of gene-gene links (pfc>0.5) between them (seen as edge labels). Known pathways (circles) are classified as: • signaling,• metabolic,• cancer,• other disease.

Blue lines: evidence from mRNA co-expression under normal conditions + ALL human & mouse data.

Red lines: evidence from mRNA co-expression in expO tumor samples + ALL human data + mouse PPI.

Node size: number of pathway members in the network.

Edge opacity: p0.

Edge thickness: number of gene-gene links.

Page 13: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Arrow of time: network prospective(work with J. Meyer, Duke Univ.)

Alexeyenko et al. Dynamic zebrafish interactome reveals transcriptional mechanisms of dioxin toxicity PLoS One. 2010

Question: Routes of embryonic intoxication in fish?Answer: through cytochrome P4501A, calcium and iron metabolism,neuronal and retinal development

Page 14: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Genetic association of sequence variants near AGER/NOTCH4 and dementia.Bennet AM, Reynolds CA, Eriksson UK, Hong MG, Blennow K, Gatz M, Alexeyenko A, Pedersen NL, Prince JA.J Alzheimers Dis. 2011;24(3):475-84.

Genome-wide pathway analysis implicates intracellular transmembrane protein transport in Alzheimer disease.Hong MG, Alexeyenko A, Lambert JC, Amouyel P, Prince JA.J Hum Genet. 2010 Oct;55(10):707-9. Epub 2010 Jul 29.

Analysis of lipid pathway genes indicates association of sequence variation near SREBF1/TOM1L2/ATPAF2 with dementia risk.Reynolds CA, Hong MG, Eriksson UK, Blennow K, Wiklund F, Johansson B, Malmberg B, Berg S, Alexeyenko A, Grönberg H, Gatz M, Pedersen NL, Prince JA.Hum Mol Genet. 2010 May 15;19(10):2068-78. Epub 2010 Feb 18.

Validation of candidate disease genes(work with Jonathan Prince, MEB, KI)

Question: Is there extra evidence for GWAS-candidates to be involved?Answer: Yes, for some…

Page 15: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Mutations accumulated in somatic genomes of cancer cell lines

(work with Pelin Akan, Joakim Lundeberg)

Question: Do mutations within a cancer genome behave like a quasi-pathway?Answer: Yes.

Question: How similar are mutation patterns in different cancers?Answer: A lot, but only for drivers and at the pathway level.

Page 16: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Pathway view on the set of toxicity-associated alleles

The analysis detected more significantly enriched pathways than for the negative control gene sets of the same size (215 vs. 139; p0 < 0.001; FDR<0.05). More specifically, many thus found pathways were associated with cancer, apoptosis, cell division etc.

Red node: list of top 50 genes with most contrast allele patternsGrey node: negative control listYellow: enriched/depleted pathwaysEdge width: no. of gene-gene links in the networkEdge opacity: confidenceGreen edges: enrichmentRed edges: depletion

Edges produced by less than 3 list genes are not shown

Page 17: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Experimental perturbations of syndecan-1 in cancer cells(work with T. Szatmari, K. Dobra, KI Huddinge)

Lines:Red: depletionBlue: enrichment

Question: Second-order downstream targets of syndecan modulation?Answer: Segments of cell cycle etc.

Page 18: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Network analysis:what’s in it for you?

• Prioritization among multiple “technical positives”

• Generalization “genes => pathways”, hence new, lower, dimension

• Analysis of individual genes “as if” pathways• Testing biological hypotheses, validation• Modular part of high-throughput analytic

pipelines

Page 19: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Network analysis:how to succeed?

• Analyze prioritized candidates (from genotyping, DE, GWAS…) rather than any genes.

• Do not lean on single “interesting” network links. Employ statistics!i.e.

“concrete questions” => “testable hypotheses” => “concrete answers”

The amount of information in known gene networks is enormous.

Let’s just use it!

Page 20: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Acknowledgements:

Erik Sonnhammer’s bioinformatics group

KICancer

http://FunCoup.sbc.su.se

Erik Sonnhammer,Thomas Schmitt,Oliver Frings,Andreas Tjärnberg, Dimitri Guala, Mun-Gwan Hong, Jonathan Prince,Rochellys Diaz Heijtz, Angelo de Milito, Meng Chen, Simin Merid, Yudi Pawitan,Woojoo Lee,Erik Aurell,Joakim Lundeberg,Pelin Akan,Joel Meyer,Katalin Dobra,Tunde Szatmari, Serhiy Souchelnytskyi,Ingemar Ernberg,

Page 21: Andrey Alexeyenko andrej.alekseenko@scilifelab.se Functional analysis with pathways and networks.

Gene network discovery:getting rid of spurious links

0.7

0.5

0.4

Data processing inequality:“Direct links convey more information than indirect ones”PCIT algorithm:Partial Correlation & Information Theory


Recommended