
Measuring gene expression with DNA microarrays · 2012-01-08 · Measuring gene expression with DNA...

Date post: 09-May-2020
Measuring gene expression with DNA microarrays 02.01.2012 and 04.01.2012
Transcript
Page 1: Measuring gene expression with DNA microarrays · 2012-01-08 · Measuring gene expression with DNA microarrays 02.01.2012 and 04.01.2012 . Outline • Microarrays for the detection

Measuring gene expression with DNA microarrays

02.01.2012 and 04.01.2012


Outline

•  Microarrays for the detection of gene expression
   –  Technologies for microarrays
   –  Normalization
      •  Lowess
      •  Quantile normalization
      •  Variance stabilized normalization
   –  Exploratory data analysis
   –  Validation


Motivation

•  Monitoring gene expression
   –  Comparing different samples
      •  Tissues
      •  Strains of bacteria or yeasts
   –  Time series
•  Whole genome expression (tiling arrays)
•  Pathogen detection
•  Resequencing
•  Study protein-DNA interaction


Technologies


Common technologies

•  (Spotted) cDNA arrays
   –  Custom made
   –  Lengths up to 1000 bp
•  Oligonucleotide arrays
   –  Industrially manufactured (Affymetrix, Agilent, Nimblegen, etc.)
   –  25 bp (Affy), ~60 bp for other technologies
•  Single experiments
   –  Evaluate intensities
   –  Absolute transcript levels
•  Two dye experiments
   –  Evaluate ratio of intensities
•  Different strategies for normalization and analysis


Manufacturing oligonucleotide arrays


Oligonucleotide array design


Two colour cDNA array


Red vs green overlay


Preliminary data analysis

Plots and strategies


Typical workflow

From Bolstad


Influences

Measuring $Y_{i,k}$, the intensity of probe i on array k

•  Total RNA amount
•  Total sample amount
•  Efficiency of
   –  RNA extraction
   –  Reverse transcription
   –  cDNA amplification
   –  cRNA transcription
   –  Labeling
•  Hybridization
   –  Efficiency
   –  Specificity
•  Scanner settings


Analysis by inspection

•  Box plot
•  Scatter plot
•  QQ plot
•  MvA plot
•  SDM plot
•  MAD plot


Box plots


Scatter plot

(Figure: intensities G on the y axis against intensities R on the x axis)


QQ-plot


MvA plot

•  Comparison of two arrays (Affymetrix) or two samples (e.g. Cy3 and Cy5 labeled)
•  X axis: A, the average intensity
   $A = \tfrac{1}{2}(\log R + \log G)$
•  Y axis: M, the log ratio
   $M = \log R - \log G$
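The two definitions can be sketched in a few lines of Python. This is a minimal illustration, not the lecture's own code: the function name `mva` and the sample intensities are made up, and base-2 logarithms are an assumption (the slide only says "log").

```python
import math

def mva(red, green):
    """Return (M, A) lists for paired red/green intensities (all > 0)."""
    m_values, a_values = [], []
    for r, g in zip(red, green):
        log_r, log_g = math.log2(r), math.log2(g)  # base 2 is an assumption
        m_values.append(log_r - log_g)             # M: log ratio
        a_values.append(0.5 * (log_r + log_g))     # A: average of the two logs
    return m_values, a_values

# Hypothetical intensities for three probes.
M, A = mva([1024.0, 256.0, 64.0], [256.0, 256.0, 256.0])
```

Plotting M against A puts probes with equal red and green signal on the line M = 0, regardless of how bright they are.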


MvA plots


SDM plots

•  Standard deviation vs. mean
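The points of such a plot could be computed as follows. This sketch assumes that the standard deviation and mean are taken per probe across replicate arrays, which the slide does not state; `sd_vs_mean` and the numbers are illustrative only.

```python
import statistics

def sd_vs_mean(rows):
    """rows: one list of replicate (log) intensities per probe.

    Returns one (mean, standard deviation) pair per probe.
    """
    return [(statistics.mean(r), statistics.stdev(r)) for r in rows]

# Two hypothetical probes measured on three replicate arrays.
points = sd_vs_mean([[1.0, 2.0, 3.0], [10.0, 10.0, 10.0]])
```

If the standard deviation grows with the mean, the variance is intensity dependent and a variance-stabilizing transformation may help.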


Median absolute deviation

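•  Comparison between arrays
•  $\mathrm{MAD}_{i,j} = \operatorname{median}\{|x_{i1} - x_{j1}|, |x_{i2} - x_{j2}|, \ldots\}$, i.e. the median over all probes of the absolute difference between arrays i and j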
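A minimal sketch of this array-to-array comparison, assuming the median is taken over all probes of the absolute difference between two arrays. The function name and data are hypothetical.

```python
import statistics

def pairwise_mad(arrays):
    """arrays: list of equally long intensity lists, one per array.

    Returns a symmetric matrix whose entry [i][j] is the median absolute
    difference between array i and array j.
    """
    n = len(arrays)
    mad = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            diffs = [abs(a - b) for a, b in zip(arrays[i], arrays[j])]
            mad[i][j] = mad[j][i] = statistics.median(diffs)
    return mad

# Arrays 0 and 2 are identical, array 1 deviates.
mad = pairwise_mad([[1.0, 2.0, 3.0], [1.5, 2.0, 5.0], [1.0, 2.0, 3.0]])
```

An array with large values in its whole row of the matrix is a candidate outlier.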


Normalization


Sources of Artifacts

(Figure, slide by H. Bengston: workflow of a two-colour cDNA array experiment and the artifacts it can introduce)

•  Production: PCR product amplification, purification, printing
•  Test sample: RNA → cDNA; reference sample: RNA → cDNA
•  Hybridize
•  Scanning: excitation (red laser, green laser), emission, overlay images, data: (R, G, ...)
•  Artifacts named in the figure: plate effects (?), intensity effects (labelling efficiency), intensity effects (quenching)


Hybridization of the same sample to 2 chips/channels

•  Random and systematic measurement errors

•  Biases result in scatter plots not centered around the x-y diagonal


Normalization - two problems

I.  How to detect biases? Which genes to use for estimating biases among chips/channels?

II.  How to remove the biases?


Which genes to use for bias detection?

All genes on the chip

–  Assumption: Most of the genes are equally expressed in the compared samples; the proportion of differentially expressed genes is low (<20%).
–  Limits:
   •  Not appropriate when comparing highly heterogeneous samples (different tissues)
   •  Not appropriate for analysis of ‘dedicated chips’ (apoptosis chips, inflammation chips, etc.)


Housekeeping genes

•  Based on prior knowledge, a set of genes can be regarded as equally expressed in the compared samples
•  Affy novel chips: ‘normalization set’ of 100 genes
•  NHGRI’s cDNA microarrays: set of 70 "house-keeping" genes
•  Limits:
   –  The validity of the assumption is questionable
   –  Housekeeping genes are usually expressed at high levels, so they are not informative for the low-intensity range


Bias detection

•  Spiked-in controls from another organism, over a range of concentrations
•  Limits:
   –  Low number of controls, hence less robust
   –  Cannot detect biases due to differences in RNA extraction protocols
•  “Invariant set”
   –  Trying to identify genes that are expressed at similar levels in the compared samples without relying on any prior knowledge:
      •  Rank the genes in each chip according to their expression level
      •  Find genes with small change in ranks
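The rank-based idea behind the invariant set can be sketched as follows for two chips. The threshold `max_rank_shift`, the function names, and the intensities are assumptions made for illustration; the slide does not say how "small change in ranks" is defined.

```python
def ranks(values):
    """Rank of each value (0 = smallest); ties are broken by position."""
    order = sorted(range(len(values)), key=lambda idx: values[idx])
    result = [0] * len(values)
    for rank, idx in enumerate(order):
        result[idx] = rank
    return result

def invariant_set(chip_a, chip_b, max_rank_shift):
    """Indices of genes whose rank differs by at most max_rank_shift."""
    ra, rb = ranks(chip_a), ranks(chip_b)
    return [g for g in range(len(chip_a)) if abs(ra[g] - rb[g]) <= max_rank_shift]

# Hypothetical intensities of five genes on two chips.
chip_a = [10.0, 200.0, 50.0, 800.0, 30.0]
chip_b = [12.0, 900.0, 55.0, 210.0, 20.0]
stable = invariant_set(chip_a, chip_b, max_rank_shift=0)
```

Here genes 1 and 3 swap ranks between the chips, so only genes 0, 2 and 4 would be used to estimate the bias.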


Normalization Methods

Influence parameters


Commonly used approaches

•  Global intensity scaling
•  LOESS
•  Quantile normalization
•  Variance stabilized normalization (vsn)


Global normalization (Scaling)

•  A single normalization factor (k) is computed for balancing chips/channels:

   $X_i^{\mathrm{norm}} = k \cdot X_i$

•  Multiplying intensities by this factor equalizes the mean (or median) intensity among the compared chips
•  Found in many papers, not recommended
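As a minimal sketch of global scaling, the factor k can be chosen so that the median of one chip matches the median of a reference chip. Using the median rather than the mean, and the name `scale_to_reference`, are choices made here for illustration.

```python
import statistics

def scale_to_reference(chip, reference):
    """Scale `chip` so that its median equals the median of `reference`."""
    k = statistics.median(reference) / statistics.median(chip)
    return k, [k * x for x in chip]

# Hypothetical chips: the second is uniformly half as bright.
reference = [100.0, 200.0, 400.0]
chip = [50.0, 100.0, 200.0]
k, scaled = scale_to_reference(chip, reference)
```

Because one factor is applied to every probe, this cannot remove a bias that changes with intensity.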


Global Normalization

(Figure: before normalization and after scaling)


LOESS

•  Locally weighted scatter plot smoothing
•  Synonymous with lowess

•  Compensate for intensity-dependent biases

•  Separate the data into windows of a given size

•  Apply a regression function to the segmented data


We expect the M vs A plot to look like:

(Figure: M = log(Cy3/Cy5) on the y axis against A on the x axis)


Intensity-dependent bias

(Figure: M = log(Cy3/Cy5) on the y axis against A on the x axis)

•  Low intensities: M<0, i.e. Cy3<Cy5
•  High intensities: M>0, i.e. Cy3>Cy5


Separate data


Intensity-Dependent Normalization

Assumption: Most of the genes are equally expressed at all intensities

Lowess: fitting a local regression curve c(A)
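Once a curve c(A) has been fitted, the usual correction is to subtract it, M' = M - c(A). The sketch below is a simplified stand-in for lowess: a tricube-weighted local linear regression without the robustness iterations of the full algorithm. The function names, the `frac` window parameter, and the data are assumptions for illustration.

```python
def local_fit(a_values, m_values, frac=0.5):
    """Estimate c(A) at every point by tricube-weighted local linear regression."""
    n = len(a_values)
    k = max(2, int(round(frac * n)))           # number of points per window
    fitted = []
    for a0 in a_values:
        dists = sorted(abs(a - a0) for a in a_values)
        h = dists[k - 1] or 1e-12              # window half-width
        w = [max(0.0, 1 - (abs(a - a0) / h) ** 3) ** 3 for a in a_values]
        sw = sum(w)
        mean_a = sum(wi * a for wi, a in zip(w, a_values)) / sw
        mean_m = sum(wi * m for wi, m in zip(w, m_values)) / sw
        sxx = sum(wi * (a - mean_a) ** 2 for wi, a in zip(w, a_values))
        sxy = sum(wi * (a - mean_a) * (m - mean_m)
                  for wi, a, m in zip(w, a_values, m_values))
        slope = sxy / sxx if sxx > 0 else 0.0
        fitted.append(mean_m + slope * (a0 - mean_a))
    return fitted

def normalize(a_values, m_values, frac=0.5):
    """Subtract the fitted curve: M' = M - c(A)."""
    c = local_fit(a_values, m_values, frac)
    return [m - ci for m, ci in zip(m_values, c)]

# Hypothetical data with a purely intensity-dependent bias and no real signal.
A = [float(i) for i in range(10)]
M = [0.5 * a - 2.0 for a in A]
M_norm = normalize(A, M)
```

On this artificial example the bias is linear in A, so the corrected values are zero up to rounding. Real data would normally go through an established lowess implementation rather than this sketch.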


LOWESS normalization


Quantile Normalization

•  Sort intensities in each chip
•  Compute mean intensity in each rank across the chips
•  Replace each intensity by the mean intensity at its rank

(Figure: Chip #1, Chip #2, Chip #3, average chip)
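The three steps can be written out directly. This is a minimal sketch with made-up data; ties within a chip are broken by position rather than averaged, which is a simplification.

```python
def quantile_normalize(chips):
    """chips: list of equally long intensity lists, one per chip."""
    n_chips, n_probes = len(chips), len(chips[0])
    # Step 1: sort the intensities of each chip.
    sorted_chips = [sorted(chip) for chip in chips]
    # Step 2: mean intensity at each rank across chips (the "average chip").
    rank_means = [sum(s[r] for s in sorted_chips) / n_chips
                  for r in range(n_probes)]
    # Step 3: replace each intensity by the mean intensity at its rank.
    normalized = []
    for chip in chips:
        order = sorted(range(n_probes), key=lambda idx: chip[idx])
        out = [0.0] * n_probes
        for rank, idx in enumerate(order):
            out[idx] = rank_means[rank]
        normalized.append(out)
    return normalized

# Three hypothetical chips with three probes each.
chips = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0], [3.0, 4.0, 8.0]]
normalized = quantile_normalize(chips)
```

After normalization every chip contains exactly the same set of values, only assigned to different probes, so all chips share one intensity distribution.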


Quantile normalization


Comparison

(Figure panels: after lowess normalization; after quantile normalization)


Variance stabilized normalization

Measured intensity = offset + gain × true abundance

$Y_{ik} = \alpha_{ik} + \beta_{ik} x_k$

$\beta_{ik} = \beta_i \beta_k \exp(\eta_{ik})$

$\beta_i$: per-sample normalization factor

$\beta_k$: sequence-wise labeling efficiency

$\eta_{ik} \sim N(0, s_2^2)$: multiplicative noise


Variance stabilizing normalization

•  Powerful method incorporating
   –  Background subtraction
   –  Error model
   –  Analysis of significantly expressed genes
•  Typically employed in the analysis of ratios
   –  Many genes are lowly expressed


Additive vs. multiplicative noise

From Huber


Variance stabilizing transformation


vsn transformation


arsinh and log
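The relation between the two functions can be checked numerically. Since arsinh(x) = log(x + sqrt(x^2 + 1)), it behaves like log(2x) for large x but stays finite and nearly linear around zero, where the logarithm diverges. The sketch below only illustrates this identity; it is not the vsn implementation, and the variable names are hypothetical.

```python
import math

def arsinh(x):
    """Inverse hyperbolic sine written out as a logarithm."""
    return math.log(x + math.sqrt(x * x + 1.0))

at_zero = arsinh(0.0)                    # finite, unlike log(0)
small = arsinh(0.01)                     # close to 0.01: nearly linear near zero
large = arsinh(10000.0)                  # close to log(2 * 10000)
gap = large - math.log(2 * 10000.0)      # tiny: the two curves run in parallel
```

For high intensities the transformation therefore behaves like a logarithm shifted by log 2, while low and even negative background-corrected intensities remain usable.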


Generalized logarithm

Huber


Exploratory data analysis

•  Fold change
•  ANOVA
•  Median polish


Validation

•  Sensitivity, specificity
•  ROC curves


Receiver operating characteristic

•  A framework to compare the performance of binary classifiers
•  Plot of true positive rate (sensitivity) against false positive rate (1 − specificity)
•  TPR = TP/P
•  FPR = FP/N
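Using the definitions TPR = TP/P and FPR = FP/N, the points of a ROC curve can be computed by lowering the decision threshold one score at a time. This is a minimal sketch that assumes distinct scores and at least one positive and one negative label; the names and data are hypothetical.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs, sweeping the threshold from high to low score.

    labels: 1 for a true positive case, 0 for a true negative case.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]                 # threshold above every score
    tp = fp = 0
    for score, label in sorted(zip(scores, labels), key=lambda p: -p[0]):
        if label:
            tp += 1                       # one more true positive called
        else:
            fp += 1                       # one more false positive called
        points.append((fp / negatives, tp / positives))
    return points

# Five hypothetical genes: classifier score and whether each is truly changed.
scores = [0.9, 0.8, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]
points = roc_points(scores, labels)
```

The curve always runs from (0, 0) to (1, 1); a classifier is better the closer its curve gets to the top-left corner.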


Thank you for your attention!

