+ All Categories
Home > Documents > Epigenomics: SomeEpigenomics: Some Statistical

Epigenomics: SomeEpigenomics: Some Statistical

Date post: 03-Feb-2022
Category:
Upload: others
View: 6 times
Download: 0 times
Share this document with a friend
42
Epigenomics: Some Epigenomics: Some Epigenomics: Some Epigenomics: Some Statistical Applications Statistical Applications Statistical Applications Statistical Applications Rafael A. Irizarry Rafael A. Irizarry Department of Biostatistics J h H ki Bl b Sh l fP bli John Hopkins Bloomberg School of Public Health
Transcript
Page 1: Epigenomics: SomeEpigenomics: Some Statistical

Epigenomics: SomeEpigenomics: SomeEpigenomics: Some Epigenomics: Some Statistical ApplicationsStatistical ApplicationsStatistical Applications Statistical Applications

Rafael A. IrizarryRafael A. IrizarryDepartment of BiostatisticsJ h H ki Bl b S h l f P bliJohn Hopkins Bloomberg School of Public

Health

Page 2: Epigenomics: SomeEpigenomics: Some Statistical

AcknowledgementsAcknowledgementsAcknowledgementsAcknowledgements

• Tom Albert Nimblegen• Tom Albert, Nimblegen• Benilton Carvalho, JHU Biostatistics• Andy Feinberg Lab, JHU Medicine• Todd Richmond, NimblegenTodd Richmond, Nimblegen• Hao Wu, JHU Biostatistics

J W B Bi t t• Jean Wu, Brown Biostat• Vasan Yegnasubramanian, JHU g

Oncology

Page 3: Epigenomics: SomeEpigenomics: Some Statistical

OutlineOutlineOutlineOutline

• Quick Introduction to Epigenetics• Quick Introduction to Epigenetics• Introduction to Methylationy• Overview of competing technologies

Review: Expression arrays lessons• Review: Expression arrays lessons• Comparisonp• Role of statisticians

Page 4: Epigenomics: SomeEpigenomics: Some Statistical

Genetics: the alphabet of lifeGenetics: the alphabet of life

• Letters of DNALetters of DNA sequence carry the informationinformation

Page 5: Epigenomics: SomeEpigenomics: Some Statistical

EpigeneticsEpigenetics

Page 6: Epigenomics: SomeEpigenomics: Some Statistical

(3.4x10-10 meters/bp) x (6x109 bp/genome) = ~2 meters/genome

Radius of the nucleus is ~ 10 µM !!!µ

Klug and Cummings, 1997

Page 7: Epigenomics: SomeEpigenomics: Some Statistical

[(6 x 109 bp/genome) / (195 bp/nucleosome)] = ~ 30.8 x 106 nucleosomes/genome~ 5 % of nuclear volume

Page 8: Epigenomics: SomeEpigenomics: Some Statistical

http://www.albany.edu/~achm110/solenoidchriomatin.html

Page 9: Epigenomics: SomeEpigenomics: Some Statistical

Epigenetics: the grammar of Epigenetics: the grammar of lifelifelifelife

Page 10: Epigenomics: SomeEpigenomics: Some Statistical

DNA methylationDNA methylationDNA methylationDNA methylation

DNA methyltransferase I (DNMT1)Obsered to expeced = Pr(CG) / { Pr(C) Pr(G) }

Page 11: Epigenomics: SomeEpigenomics: Some Statistical

DNA methylation can lead to silencing of gene expressionDNA methylation can lead to silencing of gene expression

HDACMeCP2

Sin3A HDACMeCP2

Sin3A

HDAC2

HDAC1

MBD3

RbAp46RbAp48

Mi 2

MTA268kD

66kDHDAC2

HDAC1

MBD3

RbAp46RbAp48

Mi 2

MTA268kD

66kD>2 MDalton Complex

MBD2Mi-2 MBD2Mi-2

Robertson and Wolffe, Nat Rev Genet, 2000

Page 12: Epigenomics: SomeEpigenomics: Some Statistical

ENCODE TrackENCODE TrackENCODE TrackENCODE Track

Page 13: Epigenomics: SomeEpigenomics: Some Statistical

Expression Array LessonsExpression Array Lessons

Page 14: Epigenomics: SomeEpigenomics: Some Statistical

NormalizationNormalizationNormalizationNormalization

Page 15: Epigenomics: SomeEpigenomics: Some Statistical

Probe effectProbe effectProbe effectProbe effect

I t it B k d P b Eff t Q tit EIntensity = Background + Probe Effect x Quantity x Error

Page 16: Epigenomics: SomeEpigenomics: Some Statistical

Sequence effect for BGSequence effect for BGSequence effect for BGSequence effect for BGWu et al. (2004) JASA 99(468) 909

Aff ∑ ∑ 125

jbk CGTAj

kj kAffinity =

= ∈∑ ∑= 1

1 },,,{,μ µj,k ~ smooth function

of k

Page 17: Epigenomics: SomeEpigenomics: Some Statistical

Back to MethylationBack to Methylation

High throughput of course….High throughput of course….

Page 18: Epigenomics: SomeEpigenomics: Some Statistical

Densities for three methodsDensities for three methodsDensities for three methodsDensities for three methods

HCT116 lots of methylation

DKO very little methylation

Page 19: Epigenomics: SomeEpigenomics: Some Statistical

Hunh?Hunh?Hunh?Hunh?

Page 20: Epigenomics: SomeEpigenomics: Some Statistical

MeDIP (like ChIPchip)MeDIP (like ChIPchip)

Total Reverse crosslinks Amplify Label/hybridizeAmplify Label/hybridize

Crosslink YLyse & Sonicate

IP Reverse crosslinks

Amplify Label/hybridize

Other controls for IP(e.g., no antibody, non-

specific antibody)

Page 21: Epigenomics: SomeEpigenomics: Some Statistical

Some DataSome DataSome DataSome Data

Page 22: Epigenomics: SomeEpigenomics: Some Statistical

Problem: Not specificProblem: Not specificProblem: Not specificProblem: Not specific

Page 23: Epigenomics: SomeEpigenomics: Some Statistical

HELP: Two enzymesHELP: Two enzymesHELP: Two enzymesHELP: Two enzymesCuts at CCGG Cuts at CMCGG

No Methylation

Page 24: Epigenomics: SomeEpigenomics: Some Statistical

HELP after PCRHELP after PCRHELP after PCRHELP after PCR

No Methylation

Page 25: Epigenomics: SomeEpigenomics: Some Statistical

HELPHELPHELPHELP

Methylation

Page 26: Epigenomics: SomeEpigenomics: Some Statistical

HELPHELPHELPHELP

No Methylation

Page 27: Epigenomics: SomeEpigenomics: Some Statistical

Problem with HELPProblem with HELPProblem with HELPProblem with HELPCuts at CCGG Cuts at CMCGG

No Methylation

Page 28: Epigenomics: SomeEpigenomics: Some Statistical

The ProblemThe ProblemThe ProblemThe Problem

Ob d t d P (CG) / { P (C) P (G) }Obsered to expeced = Pr(CG) / { Pr(C) Pr(G) }

Page 29: Epigenomics: SomeEpigenomics: Some Statistical

Proportion of neighboring CpG also Proportion of neighboring CpG also th l t d/ t th l t dth l t d/ t th l t dmethylated/not methylatedmethylated/not methylated

Page 30: Epigenomics: SomeEpigenomics: Some Statistical

McRBC on Tiling arrayMcRBC on Tiling arrayMcRBC on Tiling arrayMcRBC on Tiling array

Page 31: Epigenomics: SomeEpigenomics: Some Statistical

ROC nowROC nowROC nowROC now

Page 32: Epigenomics: SomeEpigenomics: Some Statistical

ENCODE TrackENCODE TrackENCODE TrackENCODE Track

Page 33: Epigenomics: SomeEpigenomics: Some Statistical

Problems for StatisticiansProblems for StatisticiansProblems for StatisticiansProblems for Statisticians

• Background Correction +• Background Correction + Normalization

• Probability Model for Segments• Use these to from null and alternative• Use these to from null and alternative

models… we need power!• Use these to create bump finding

algorithmsg• Adapt to high-throughput sequencing

Page 34: Epigenomics: SomeEpigenomics: Some Statistical

Supplemental SlidesSupplemental Slides

Page 35: Epigenomics: SomeEpigenomics: Some Statistical

McRBC: One enzymeMcRBC: One enzymeMcRBC: One enzymeMcRBC: One enzymeCuts at AmCG or GmCG Input

No Methylation

Page 36: Epigenomics: SomeEpigenomics: Some Statistical

McRBC after GelMcRBC after GelMcRBC after GelMcRBC after Gel

No Methylation

Page 37: Epigenomics: SomeEpigenomics: Some Statistical

McRBC after GelMcRBC after GelMcRBC after GelMcRBC after Gel

No Methylation

Page 38: Epigenomics: SomeEpigenomics: Some Statistical

McRBCMcRBCMcRBCMcRBC

Methylation

Page 39: Epigenomics: SomeEpigenomics: Some Statistical

McRBC after GELMcRBC after GELMcRBC after GELMcRBC after GEL

Methylation

Page 40: Epigenomics: SomeEpigenomics: Some Statistical

McRBC after GELMcRBC after GELMcRBC after GELMcRBC after GEL

Methylation

Page 41: Epigenomics: SomeEpigenomics: Some Statistical
Page 42: Epigenomics: SomeEpigenomics: Some Statistical

Recommended