+ All Categories
Home > Documents > 8 Association Mapping

8 Association Mapping

Date post: 05-Apr-2018
Category:
Upload: joao-ricardo-bachega
View: 220 times
Download: 0 times
Share this document with a friend

of 31

Transcript
  • 7/31/2019 8 Association Mapping

    1/31

    Marcos Malosetti & Fred van EeuwijkIntroduction to mixed model QTL mapping using GenStat

    13-15th December 2010, ESALQ Piracicaba, So Paulo, Brazil

  • 7/31/2019 8 Association Mapping

    2/31

    QTL mapping

    Objective of QTL mapping studies:describe phenotypes in relation to underlying geneticfactors (called QTL)QTL = Quantitative Trait Locus

    Finding statistical association betweeninformation at the DNA level (molecular markers)and phenotypic variation

    Linkage analysis: common approach, conventional

    QTL mappingLD mapping or association mapping : more recently

  • 7/31/2019 8 Association Mapping

    3/31

    Linkage and linkage disequilibrium

    Statistical association between markers andphenotypes found when there is linkage(disequilibrium) between markers and QTLs.Linkage disequilibrium (LD): non-random

    association of alleles at two locinot necessarily on the same chromosome

    Linkage: non-random association of alleles at twoloci due to limited recombination between the loci

    Linkage necessarily involves loci on the samechromosome

  • 7/31/2019 8 Association Mapping

    4/31

    Conventional QTL mapping versus LD mapping

    Designed crosses Association panel

    Both, linkage analysis and LD mapping, rely on linkagedisequilibrium to detect QTLs

  • 7/31/2019 8 Association Mapping

    5/31

    Conventional QTL mapping versus LD mappingDesigned crosses Association panel

    LD marker-QTL is only consequence of linkage.

    LD marker-QTL can be consequence of linkage but alsoother factors can cause LD.

    Population admixture

  • 7/31/2019 8 Association Mapping

    6/31

    Genetic relatedness / population substructure

    All genotypes of a segregating population have by

    expectation equal relatedness/correlationIn association mapping panels the genotypesshow heterogeneous genetic relatedness:

    Unstructured: coefficient of coancestry ij Structured genetic relatedness (populationsubstructure)

    Not accounting for relatedness (coancestry orpopulation substructure) will cause spuriousassociationsLD mapping requires more elaborated modelling of thegenetic relatedness in the population

  • 7/31/2019 8 Association Mapping

    7/31

    Association panel: a set of interconnected genotypes

    known or unknownpopulation history

  • 7/31/2019 8 Association Mapping

    8/31

    K

    Genetic correlation between individuals: unstructured

    K = kinship, coeff. of coancestry

    pedigree

    markers

  • 7/31/2019 8 Association Mapping

    9/31

    Genetic correlation between individuals: structured

    Identify sets of more or lesshomogenous genotypes

  • 7/31/2019 8 Association Mapping

    10/31

    Quantifying genetic relatedness / Structuring VCOV(G)

    Bayesian clustering STRUCTURE (Pritchard etal. 2000)It can be computationally intensivePopulation assumptions not always compatible with

    plant populations (mainly developed to be used inhuman population genetics)Not always obvious how to define the model to use

    0%

    10%

    20%

    30%

    40%

    50%

    60%

    70%

    80%

    90%

    100%

    6 6

    6 7

    6 8

    6 9

    7 0

    7 2

    7 4

    7 5

    7 6

    7 7

    7 8

    7 9

    8 0

    8 1 1 3 5 7 8 9

    1 0

    1 1

    2 1

    2 8

    3 2

    8 3

    8 6

    8 8

    9 2

    9 3

    9 5

    1 0 8

    1 1 0

    1 1 1

    1 1 2

    1 1 7

    1 2 0

    1 2 1

    1 2 2

    1 2 6

    1 2 8

    1 3 2

    1 3 5

    1 3 6

    1 3 7

    1 4 0

    1 4 8

    1 5 2

    1 5 3

    1 5 5

    1 5 6

    1 5 7

    1 5 8

    1 5 9

    1 7 2

    1 7 4

    1 7 5

    1 7 6

    1 7 7

    1 7 9

    1 8 0

    1 8 5

    1 8 6

    1 9 2 4

    2 5

    3 1

    3 4

    5 2

    8 9

    9 0

    9 1

    9 4

    9 6

    9 9

    1 0 7

    1 0 9

    1 1 4

    1 1 8

    1 1 9

    1 2 3

    1 2 4

    1 2 5

    1 2 7

    1 2 9

    1 3 0

    1 3 1

    1 3 3

    1 3 4

    1 3 8

    1 3 9

    1 4 1

    1 5 0

    1 5 1

    1 5 4

    1 6 1

    1 6 6

    1 6 8

    1 6 9

    1 7 0

    1 7 1

    1 7 3

    1 7 8

    1 8 2

    1 8 3

    1 8 4

    1 8 8

    1 4

    1 6

    2 2

    2 3

    2 4

    2 6

    2 7

    2 9

    3 0

    3 3

    3 5

    3 6

    3 7

    3 8

    3 9

    4 0

    4 1

    4 2

    4 3

    4 4

    4 5

    4 6

    4 7

    4 8

    4 9

    5 0

    5 1

    5 3

    5 4

    5 5

    5 6

    5 7

    5 8

    5 9

    6 0

    6 1

    6 2

    6 3

    6 4

    6 5

    7 1

    8 2

    8 4

    1 0 6

    1 1 3

    1 1 5

    1 4 2

    1 4 4

    1 4 6

    1 4 7

    1 4 9

    1 8 7

    1 8 9

    1 9 1

    1 2

    1 3

    1 5

    1 7

    1 8

    1 9

    2 0

    1 0 0

    1 0 1

    1 0 2

    1 0 3

    1 0 4

    1 0 5

    1 6 0

    1 6 2

    1 6 3

    1 6 5

    1 6 7

    A n c e s t r y

    Genotype

    EMed

    Turk

    SWMed

    NMed2

    NMed6

  • 7/31/2019 8 Association Mapping

    11/31

    Quantifying genetic relatedness / Structuring VCOV(G)

    Classical multivariate approachesSimple, fastSimilar results with STRUCTUREWhere to define boundaries between groups?

    Other criteria (e.g. geographical origin)

  • 7/31/2019 8 Association Mapping

    12/31

    Eigenanalysis

  • 7/31/2019 8 Association Mapping

    13/31

    Eigenanalysis

    PCA on genotype x marker scores matrix with aformal test for the number of axes (dimensions)No discrete groups, but set of PCs be used ascovariates in marker trait association analysisWhen PCs are introduced in random part of amixed model, they will approximate the fullgenetic relationship matrix

    Straightforward, simple, and is easy to program ina conventional statistical package

  • 7/31/2019 8 Association Mapping

    14/31

    Mixed models and LD mapping in GenStat

    LD mapping models should accommodatethe complex genetic relationships in thepopulation.

    Mixed models are particularly suitable(GenStat).Suite of GenStat procedures developed to

    run different models for LD mapping.Procedures can be run from the GUI.

  • 7/31/2019 8 Association Mapping

    15/31

    A mixed model for LD mapping

    P = genotype + error

    ),0(~),0(~ 22 N error N G genotype

    P = marker + genotype* + error

    iii GP

    iiii G xP

    MMif 1

    Mmif 0

    mmif 1

    i

    i

    i

    x

    x

    x

  • 7/31/2019 8 Association Mapping

    16/31

    A naive mixed model for LD mapping

    This model assumes UNRELATED genotypes

    Standard assumption:

    ),0(~),0(~22

    N error N G genotype

    Relationship matrix K=I

    ),0(~ 2genotype I N G

    This model ignores genetic relatedness/ population structure

    1000

    10010

    1

    K

    P = marker + genotype* + error

    iiii G xP

  • 7/31/2019 8 Association Mapping

    17/31

    K should be in the model to correct for relatedness

    Now the relationship matrix (K) is in the modelK = kinship matrix derived from pedigree/marker

    information

    Change model assumption:

    ),0(~)2,0(~ 22 N error K N G g

    Relationship matrix K I

    II I I I

    K

    321

    332313

    2212

    11

    P = marker + genotype* + error

    iiii G xP

  • 7/31/2019 8 Association Mapping

    18/31

    LD mapping using eigenanalysis

    The PC scores represent relatedness / populationstructurePCs impose approximate covariance structure

    Computationally less intensive than full structuring ofVCOV(G)

    ) ,0( N ~error ) ,0( N ~G) ,0( N ~C 22genotype2scores

    P = marker + PCs + genotype* + error

    ii M

    m ,iii GC xP

  • 7/31/2019 8 Association Mapping

    19/31

    Population structure

    This model imposes a common covariance between genotypes withina groupGenotypes from different groups are still assumed unrelatedGroups from STRUCTURE or clustering

    ),0(~),0(~),0(~ 222 N error N G N C genotypegroup

    Relationship matrix K I

    Group 1 Group 2

    P = marker + group + genotype + error

    ik ik ii GC xP )(

  • 7/31/2019 8 Association Mapping

    20/31

    LD mapping in GenStat 13

  • 7/31/2019 8 Association Mapping

    21/31

  • 7/31/2019 8 Association Mapping

    22/31

    Correcting for genetic relatedness: kinship vsnull

  • 7/31/2019 8 Association Mapping

    23/31

    Correcting for genetic relatedness: PC scores vs null

  • 7/31/2019 8 Association Mapping

    24/31

    LD decay plots

  • 7/31/2019 8 Association Mapping

    25/31

    LD decay plots

    No correction Correction for population structure

    Response marker =predictor marker +error

    Response marker =PC scores / groups +predictor marker +error

  • 7/31/2019 8 Association Mapping

    26/31

    LD image plots

    No correction Correction for population structure

  • 7/31/2019 8 Association Mapping

    27/31

    Genetic relatedness in segregating populations

    Segregating populationsNo selectionNo mutation

    No genetic driftSimple model does notwork

    More complex residualstructure

    F1

    100-1000 offsprings

    Parent 2Parent 1

    QTL = Quantitative Trait Locus

  • 7/31/2019 8 Association Mapping

    28/31

    Modelling genetic relatedness for QTL detection

    yi quantitative trait responsex

    i genotypic covariable (marker information)additive marker effectResidual random variation consists of

    Genetic residual with a correlation structure

    Relationship matrix ij coefficient of coancestry between genotypes i and j

    Standard independent residual (experimental error)

    iiii G x y *

    ),0(~)2,0(~

    2

    2

    *

    *

    N K N G

    i

    Gi

    II I I I

    G

    321

    332313

    2212

    11

    2*

  • 7/31/2019 8 Association Mapping

    29/31

    Chi-square test for segregation distortions

    Allele frequencies show deviations from expectation

  • 7/31/2019 8 Association Mapping

    30/31

    Genome-wide scan: plant height

    Model includinggenetic relatedness(kinship)information)

    Model ignoringrelatedness(kinship)information)

  • 7/31/2019 8 Association Mapping

    31/31

    Conclusions

    Study of genetic relatedness crucial in LD

    mappingKinshipEigenanalysisClustering methods (including STRUCTURE)

    Need to control for genetic relatedness whenassessing:

    Marker marker association (LD decay)

    Marker trait association (LD mapping)GenStat procedures / GUI can be used to run allthese types of analyses


Recommended