+ All Categories
Home > Documents > AM Patil III Sem

AM Patil III Sem

Date post: 06-Apr-2018
Category:
Upload: ayyangouda-patil
View: 218 times
Download: 0 times
Share this document with a friend

of 70

Transcript
  • 8/3/2019 AM Patil III Sem

    1/70

    Genome Wide Detection of

    Protein-DNA Interactions

    A. M. Patil

  • 8/3/2019 AM Patil III Sem

    2/70

    Contents

    Introduction

    Detection techniques DNA-Protein

    ChIPChIP-on-chip

    ChIP-SAGE

    ChIP-PETApplication

    Conclusion

  • 8/3/2019 AM Patil III Sem

    3/70

    Genome

    Predominantly composed ofnon-coding sequences (>98%)

    Significant portion of non-CDS serve as transcriptional

    regulatory element

    Previous investigations: molecular, genetical and biochemicalapproach identified several DNA binding proteins.

    Identification and characterization of gene regulatorysequences, reveal gene regulatory networks in cells. Whichserves as a fundamental in understanding developmentalbiology and in determining the molecular basis of manybiological processes.

  • 8/3/2019 AM Patil III Sem

    4/70

    Continued

    The interaction b/n DNA & protein has an important role in

    Repairing damaged DNA.

    Recombination & replication.

    Genomic integrity.

    Controls of epigenetic changes.Differentiation & Development.

  • 8/3/2019 AM Patil III Sem

    5/70

    These proteins interact with DNA by means ofvarious structural Motifs

  • 8/3/2019 AM Patil III Sem

    6/70

    Detecting: Protein-DNA interaction

    Gel retardation assays

    DNase foot printing

    DNA modification assay

    Nitrocellulose filter binding assay

    Crystallography

  • 8/3/2019 AM Patil III Sem

    7/70

    Gel retardation assay:

    Principle: DNA Protein complexes move slowly through an electrophoretic

    gel than DNA.

    Retarded band

    DNA Protein

  • 8/3/2019 AM Patil III Sem

    8/70

    DNase foot printing

    5 10 15 20

    Principle: DNA with bound protein is protected from nuclease digestion

    20

    15

    10

    5

    Footprint

    5 10 15 20

    P

  • 8/3/2019 AM Patil III Sem

    9/70

    Modification Protection assay: Protein bound to DNA can protect specific bases from

    chemical modification

    G G

    Modification G G

    G G G Gmm m

    P

    Residue

    protected

    Missing cleavage product

    because protected residue

    notmodified

    DNA

  • 8/3/2019 AM Patil III Sem

    10/70

    Incubate labeled DNA with protein

    Filter the mixture through a filter disk

    made of nitrocellulose Proteins bind to nitrocellulose, but

    DNA does not. Any DNA that is retainedon the filter is there because it isinteracting with the protein.

    Nitrocellulose filter binding assay

  • 8/3/2019 AM Patil III Sem

    11/70

    DNA sequences interacting with proteins(TF)..contd.

    Conventionally, studied in test tubes using purifiedproteins and naked DNA fragments.

    Demerit: In vitro methods do not replicate the in vivoconditions faithfully physiological condition.

    DNA inside cell exist in a compact chromatin state withdistinct properties from that of naked DNA

    Besides chromatin organization and distributionof histone variants histone modification: acetylation,methylation.

  • 8/3/2019 AM Patil III Sem

    12/70

    ChIP

    Allows study of protein-DNA interactions directly inside cell

    under in-vivo physiological conditions.

    Protocol:

    Cross linking of proteins and DNA (Formaldehyde)

    Extraction and fragmentation of chrom DNA (mechanical

    shearing / enzym

    atic digestion)Immuno-affinity purification (using specific antibody

    against a protein)

    Assay purified DNA (southern blotting/ PCRs)

  • 8/3/2019 AM Patil III Sem

    13/70

    Preparation of cross linked chromatin-cell line, tissue or whole organism.

    Formaldehyde-common ,Dimethyapitimidate (DMA) & disuccinimedyl glutrate

    (DSG)

    O

    II

    H-C-H

    Readily permeable & generates zero length cross link.

    Formaldehyde reactive dipolar in which protein or DNA carbon acts as nucleophiliccentre.Amino & Imino gp of A.A & of DNA readily react with formaldehyde give Schiff

    base. Which further react with 2nd amino group & condense to give the DNA-Protein

    complex.

    Continued

  • 8/3/2019 AM Patil III Sem

    14/70

    Cells or tissues incubated with 1%-10-15min RT or4.c > time.

    Increase cross link time make fragmentation of chromatin difficult.

    Cell lysed isolate nuclei.

    Chromatin released from nuclei by ultrasonicsonication , solubilizes & fragments chromatin 500-2000bp.

    Sonication important step to optimize-Longfragment not immunoprecipitated efficiently

    /detected on microarray againstshort fragment

    Empirical test :time of cross linking no. of

    sonication session

    Continued ..

  • 8/3/2019 AM Patil III Sem

    15/70

    Chromatin immuno precipitationcontd.

    Immunoprecipitation : affinity & specificity crucial.

    Not all available antibody efficiently immunoprecipites the protein DNA

    complexes.Masking of epitome by formaldehyde cross link.

    Incubation with chromatin (2ng) with magnetic / agarose beads coupled

    with antibodies, followed by many wash to remove unbound fraction.

    Assaying purified DNA

    Southern blottingPCR

  • 8/3/2019 AM Patil III Sem

    16/70

    Chromatin immuno-precipitationcontd.

  • 8/3/2019 AM Patil III Sem

    17/70

    Chromatin immunoprecipitationcontd.

    Demerits

    Allows to study interaction between proteins & limitedno. of suspected DNA sequences.

    Genome wide interaction cannot be studied.

  • 8/3/2019 AM Patil III Sem

    18/70

    ChIP-on-chip

    Genomic binding location of regulatory factors can be

    determined using Chromatin immunoprecipitation

    (ChIP) followed by determination of enriched fragments

    by DNA microarray (chip) hybridisation.

  • 8/3/2019 AM Patil III Sem

    19/70

    G

    eneral features of DNAm

    icroarray

  • 8/3/2019 AM Patil III Sem

    20/70

    Microarray features

    Density of the arrayNumber of features in one array.

    ResolutionDistance between each feature on the contiguous

    genomic region.

    Level of genome representation

    Total amount of genomic sequence represented by

    the array.

  • 8/3/2019 AM Patil III Sem

    21/70

    (ChIP)-enriched targets. Probes Cy5/Cy3 ratios significantly high confidence (P < 0.001) level.

  • 8/3/2019 AM Patil III Sem

    22/70

  • 8/3/2019 AM Patil III Sem

    23/70

    Contd

    Approach to detect ChiP enriched DNA 2 color

    competitive hybridisation .ChiP DNA one color

    Control DNA other fluorescent color hybridized to

    array. Ratio b/w ChiP & control DNA correlates with

    enrichment of specific DNA sequences.

    Alternately, to detect ChiP enriched DNA is detected

    through 1 color hybridization. Test & control to

    different arrays.

  • 8/3/2019 AM Patil III Sem

    24/70

    Microarray data analysis

    Single array error model

    Rank based

    Chip & control DNA signal Ratio.

    To test specific enrichment is significant

    Single error model assumes ofnormal distribution & significance is than

    calculated by one sided probability.

    Rank based model no assumption but require the multiple replicates.

    ChIPOTle (Chromatin Immuno- Precipitation On Tiled arrays), Implemented as

    a Microsoft Excel macro written in Visual Basic, ChIPOTle uses a sliding

    window approach that yields improvements in the identification of bona fide

    sites of protein-DNA interaction.

    Michael et.al. 2005.

  • 8/3/2019 AM Patil III Sem

    25/70

    Data interpretation

    Microarray & ChIP-on-chipExpression experiments each element on microarraymeasures the abundance of RNA molecules of fixedlength.

    ChIP-on-chip each element measures the abundance ofpopulation of fragments of various lengths due toeffect of chromatin shearing.Expression experiment the data are two tailed &

    roughly symetric ,there is biological significanceassociated with both low & high ratio measurement.ChIP-on-chip measurement derived from experimentsarise as a mixture of two distribution .first correspondsto population of genomic fragments specifically enrichedby Chip, & second corresponds to remaining population ofgenomic DNA that is not Chip enriched representingbackground or noise.

  • 8/3/2019 AM Patil III Sem

    26/70

    Validating the Identified Protein-Binding Sites

    Microarray- and SAGE-coupled ChIP assaysMultiple steps to identify the protein-bindingsites in the genome, errors introduced at

    various stepsValidation :Testing the relative enrichment ofthe identified sequences in the ChIP DNA incomparison to the total genomic DNA.

    Quantitative PCR

  • 8/3/2019 AM Patil III Sem

    27/70

    Merits of CHIP on chip

    Availability of high density oligo-nucleotide arrays representing entiregenomes allows comprehensive mapping of

    protein-DNA interactions

    Technique doesnt rely upon previousknowledge of transcriptional regulatory

    elementsIs a unbiased and comprehensive approach

  • 8/3/2019 AM Patil III Sem

    28/70

    Contd..

    Major drawback

    Requirement of highly specific antibodies for the

    proteins

    Specific antibodies for all proteins are unavailable

    There is immunoprecipitation oflarge no. of non-

    specific of DNA sequences

  • 8/3/2019 AM Patil III Sem

    29/70

    Alternate ways to purify DNA binding proteins nospecific antibodies

    Using a Engineered cell systemsBy introducing epitope tag to endogenous locus (TF)

    DNA adenine methyl transferase (DamID) identification

    Method: expression of protein of interest fused with E.coli DamID,

    binding of fusion protein cause methylation of local DNA sequences,

    extract and digest DNA with methylation sensitive RE

    Demerit: Dam fusion proteins may have distinct properties fromthat of endogenous proteins

  • 8/3/2019 AM Patil III Sem

    30/70

    Chip-SAGE

  • 8/3/2019 AM Patil III Sem

    31/70

    Principles of SAGE

    A short sequence tag (10-14bp) contains scientificinformation to uniquely identify a transcript provided thatthe tag is obtained from a unique position within eachtranscript.

    Sequence tags can be linked together to form long serialmolecules that can be cloned and sequenced .

    Quantification of the number of times a particular tag isobserved , provides the expression level of thecorresponding transcript.

  • 8/3/2019 AM Patil III Sem

    32/70

  • 8/3/2019 AM Patil III Sem

    33/70

    Advantages over ChiP-chip. It does not depend on preselectedsequences & it can scan the whole genome

    Disadvantages

    Majority of ChIP DNA actually corresponds to non-specifically

    immunoprecipitated sequences and requires sequencing a large

    number of clones to distinguish the truly enriched sequences

    from background.

    Cost of sequencing is high.

  • 8/3/2019 AM Patil III Sem

    34/70

    Chip-SAGE

    Does not require priorknowledge of sequence to be

    analyzed

    Discovery of new sequencesis possible

    Quantification of sequences is

    possible.

    ChiP-Chip

    Require prior knowledge ofsequence to be analyzed

    Comparison is only possiblewith existing sequences.

    Quantification is not easy.

  • 8/3/2019 AM Patil III Sem

    35/70

    Applications

    Decoding the Transcriptional Regulatory Function ofGenome

    Sequences.

    Dissecting the Transcriptional Regulatory Circuits.

    Unravelling Epigenetic Mechanisms.

  • 8/3/2019 AM Patil III Sem

    36/70

    ChiP-SAGE

    High resolution genome wide mapping of histone modification in

    Yeast

    Distribution of hyperacetylated histones H3 & H4

    AcH3, AcH4that recognize hyperacetylated histones in yeast.

    Compared directly the acetylation levels b/w promoter & codingregion among 6040 genes.

    Tae-young Roh, 2004

  • 8/3/2019 AM Patil III Sem

    37/70

    NlaIII site appears once every 316 bp

    class II enzyme, MmeI

  • 8/3/2019 AM Patil III Sem

    38/70

    Generally believed thatpromoter region are highly acetylated.

    The analysis unexpectedly revealed that Highest histone H3acetylation observed afterATG start codon & within the first500 bp of ORFs.

    The acetylation of H4 followed similar pattern.

    Deletion of gene encoding GCN5 ,GCN5 is critical catalytic subunit

    ofADA & SAGA complexes & plays a global role in regulatinghistone H3 acetylation.

    Deletion of GCN 5 specifically eliminated the peak acetylation inpromoter , increased acetylation in 3 ORF.

    Tae-young Roh,2004

  • 8/3/2019 AM Patil III Sem

    39/70

  • 8/3/2019 AM Patil III Sem

    40/70

    These suggest that GCN 5 functions to maintain higher levels of H3

    acetylation in promoter & 5 end of coding region of transcription unit

  • 8/3/2019 AM Patil III Sem

    41/70

    Transcription regulator networks in yeastTonglee et al., 2002

    Epitome Tagging

  • 8/3/2019 AM Patil III Sem

    42/70

    Genome wide analysis for yeast transcription regulatory bind topromoter sequences across genome.

    Epitome tagging.

    All 141 transcription factorsYeast Proteome Database for study.

    Yeast strains were constructed so that each of transcription factor containedcMyc epitome tag.

    Epitope tag might be expected to affect the function of some

    transcriptional regulators; for 17 of the 141 factors, were not able toobtain viable tagged cells, despite three attempts to tag each regulator.

    106 TFonly showed immunoblot.Each strain one TF .

    Continued .

  • 8/3/2019 AM Patil III Sem

    43/70

    Calculated a confidence value (P value) for each spotfrom each array by using an error model.

    Generally describe results obtained at a P value

    threshold of 0.001 because this threshold maximizesinclusion of legitimate regulator-DNA interactions andminimizes false positives.

    Various experimental and analytical methodsindicate that the frequency of false positives

    in the genome-wide location data at the 0.001

    threshold is 6% to 10%

    Continued .

  • 8/3/2019 AM Patil III Sem

    44/70

    Nearly 4000 interactions between regulators and promoter

    regions at a P value threshold of 0.001.

    The promoter regions of2343 of 6270 yeast genes (37%) were

    bound by one or more of the 106 transcriptional regulators.

    Many yeast promoters were bound by multiple transcriptional

    regulators a feature previously associated with gene regulation in

    higher eukaryotes.

    Suggesting that yeast genes are also frequently regulated

    through combinations of regulators.

    Continued .

  • 8/3/2019 AM Patil III Sem

    45/70

    Effect ofPvalue threshold. The sum ofalregulator-promoter region

    interactions is displayed as a function of varying Pvalue thresholds applied

    to the entire location data set for the 106 regulators.More stringent P

    values reduce the numberofinteractions reported but decrease the

    likelihood of false-positive results .

    Continued .

  • 8/3/2019 AM Patil III Sem

    46/70

  • 8/3/2019 AM Patil III Sem

    47/70

    Chromosome wide analysis for differential methylation innormal and transformed human cells.

    Michael et al 2005

    Methylation of cytosine at CpG dinucleotide is a major epigenitic

    modification observed in mammalian cells.

    CpG islands are exceptions and less methylated (are associated

    with promoter regions of several house keeping genes).

    Abberant promoter hypermethylation is believed to contribute to

    sporadic cancer.

  • 8/3/2019 AM Patil III Sem

    48/70

    Relevance of DNA methylation in normaldevelopment and disease

    Immunocapturing technique combined

    with DNA microarray for enrichment of

    methylated DNA and detection

    Use of methylation sensitive RE

    (HpaII) 3.9% of all CpGs

    Antibodies specific to -5 mcytosine

    MeDIP (Methylated DNA

    immunoprecipitation)

  • 8/3/2019 AM Patil III Sem

    49/70

    Relevance of MeDIP technique

  • 8/3/2019 AM Patil III Sem

    50/70

    Assessing the abberant CpG island methylation intransformed cell

    CpG island methylation profile for both SW48 colon cancer

    and normal colon mucosa was generated

    Microarray representing ~ 12,000CpG island probes (derived

    from CpG island library, 75% unique sequences, remaining

    repeatative elements (SINES,LINES,satellites) as well as

    ribosomal and mitochondrial genes.

  • 8/3/2019 AM Patil III Sem

    51/70

    Observation

    Normal cells

    Most of CpG islands including ribos and mito. DNA showed basal/low level

    methylation

    Sequences with highest methylation included repetitive DNA, promoters of

    imprinted genes or genes residing on X-chromosomes

    Transformed cells

    Methylation profile was almost same.

    Hypermethylation was observed in 108 clones of which 82 clones

    represented ribos DNA (reported in cells at the time of aging/neoplasia but

    physiologic al role unknown), remaining 26 clones were unique

  • 8/3/2019 AM Patil III Sem

    52/70

    Ne

    wtargetsforhypermethylationi

    ncolo

    Cell matrix interactionapoptosisCell cycle progression

  • 8/3/2019 AM Patil III Sem

    53/70

    Contd

    Combining MedIP with hybridization on CpG island microarray

    could be used to detect epigenetically silenced genes in cancer

    cellsPattern ofCpG island methylation is conserved and no. of

    genes that are hypermethylated in transformed cell are

    unexpectedly low.

  • 8/3/2019 AM Patil III Sem

    54/70

    Transcriptional regulatory circuitry in human ESCLaurie et al 2005

    Mammalian development requires over 200 specialized cell types

    All derived from single totipotent zygotic cell

    Embryonic StemC

    ellsDerived from inner cell mass ofblastocyst.

    Are capable of generating any cell type of body (Pluriotent)

    Realizing their therapeutic potential, underastanding the

    transcriptional regulatory circuitry is fundamental in knowing

    molecular basis ofpluripotency and self renewal of ESC

  • 8/3/2019 AM Patil III Sem

    55/70

    Three major TF OCT4, SOX2 and Nanog

    Key stem cell regulators in self renewal ability of stem

    cells

    They regulate genes encoding other transcriptional

    regulators involved in determining the developmental

    potential of cells

    Known to interact with one another in a complex

    synergistic way

    Contd..

  • 8/3/2019 AM Patil III Sem

    56/70

    Genome wide location analysis (CHIP on chip)

    Objective:

    To identify the target sites for three key stem cellregulators

    DNA microarray design:Oligo-nucleotide covering (-8 to +2 Kb relative to trans.

    start site) for 17,917 annoatated genes

  • 8/3/2019 AM Patil III Sem

    57/70

    Examples of target sites for OCT4

  • 8/3/2019 AM Patil III Sem

    58/70

    OCT4 showed binding to 623 (3%) of protein coding genes

    SOX2 to 1271 (7%)

    Nanog to 1681 (9%)

    OCT4 and SOX2 shared half of bound proteins, while > 90% of Nanog

    bound genes were of both OCT4 and SOX2

    Regulators bind to both active and inactive/ repressed genes (that

    encode for TF involved in regulation of gene required for meso, endo

    and ectodermal diffrn)

    RESULTS

  • 8/3/2019 AM Patil III Sem

    59/70

    Examples of protein coding genes co-occupied by OCT4, SOX2and Nanog in close proximity

  • 8/3/2019 AM Patil III Sem

    60/70

  • 8/3/2019 AM Patil III Sem

    61/70

    Myogenic differentiationAlexandre et.al 2005

    Genome wide TF binding & expression profiling to assemble

    regulatory network controlling Myogenic differentiation in

    mammalian cells.

    Myoblast Myotubes (multinucleate)

    Governed by TF ,MRFs(MyoD & Myogenin) MEF2

    Myod bHLH bind to E boxes of genes.

    First step of regulatory cascade involves expression ofMyoD

    which subsequently leads to myogenin & MEF2 promoting

    myogenesis.

  • 8/3/2019 AM Patil III Sem

    62/70

    Contd..

    Beyond these first steps knowledge is somewhat fragmentary:

    Relatively few physiological targets of MRFs and MEF2 have

    been identified.

    The number of genes known to be regulated by these factors is

    considerably smaller than the number of genes induced upon

    myogenic differentiation

  • 8/3/2019 AM Patil III Sem

    63/70

    Contd..

    Recent attempts to identify MyoD-binding sites exploitedgene expression profiling of cells that ectopically expressMyoD.

    complications

    Ectopic expression of a bHLH transcription factor canlead to promiscuous binding to E boxes throughout thegenome. The fact that another E-box-binding factor,Myc, binds to an extensive portion of the genome

  • 8/3/2019 AM Patil III Sem

    64/70

    To know contribution of MRF family members

    on gene expression patterns.

    ChiP chip using promoter Chip from C2ci2

    myoblast .

  • 8/3/2019 AM Patil III Sem

    65/70

  • 8/3/2019 AM Patil III Sem

    66/70

    Confirmation for binding by MyoD, myogenin, and MEF2 to asubset of targets

  • 8/3/2019 AM Patil III Sem

    67/70

    Growing myoblasts, MyoD bound a

    Set of genes involved in synapse specification andutilization (NMj) and neuromuscular function

    Bound by MyoD in MT play a role in muscle

    development & contraction.

  • 8/3/2019 AM Patil III Sem

    68/70

    MRFs ,MEF2-Triangle

    Square TF

    Circle Non TF

  • 8/3/2019 AM Patil III Sem

    69/70

    ChiP-PETChiP-PET

    Ruan and colleagues :Pair of signature tags,paired-end tags (PET), both ends of a DNAconcatenate.

    Like SAGE, the PET method can be used to obtainlarge numbers ofsequence tags with moderatecost.

    PET additional information of start and end ofeach ChIP DNA fragment, used precisely locate the

    binding sites. To achieve this, one identifies theoverlapping PET, and the overlap between differentPET fragments would correspond to the binding sites.

  • 8/3/2019 AM Patil III Sem

    70/70


Recommended