+ All Categories
Home > Documents > Bioinformatics and MAS

Bioinformatics and MAS

Date post: 04-Jun-2018
Category:
Upload: brij-mohan-singh
View: 217 times
Download: 0 times
Share this document with a friend

of 79

Transcript
  • 8/13/2019 Bioinformatics and MAS

    1/79

    Bioinformatics and MAS

    - an Indian Experience

    N. K. Singh and T. R. Sharma

    NRC Plant BiotechnologyIARI, New Delhi

  • 8/13/2019 Bioinformatics and MAS

    2/79

    Outline1. Informatics in MAS2. Rice genome sequence information for mapping,

    tagging and map-based cloning of genes3. Mining novel alleles of cloned genes in the

    germplasm

    4. Synteny and colinearity- transferring rice genome

    information to wheat

    5. Databases and web-based tools to assist

    breeders

    6. High throughput genotyping to save time andcost

    7. Proposed activities for the Indo-Aus wheat MAS

    network

  • 8/13/2019 Bioinformatics and MAS

    3/79

    1. Informatics in MAS

  • 8/13/2019 Bioinformatics and MAS

    4/79

    Applications incrop improvement

    (MAS/ Transgenics)

    Identify DNA

    markers linkedwith traits(Simple/ QTL)

    Basic Resources :Germplasm, mutant lines, knock outs, mapping populations,

    GSTs, ESTs, BAC libraries, BAC-end sequences, Bioinformatics

    Functional

    genomics

    Genotype

    Transcriptome

    Proteome

    Metabolome

    Phenome

    Pilot genome

    sequencing

    High density

    molecular genetic map

    Large scale

    genome

    sequencing

    Gene based markers

    Map based cloning

    High throughput gene/ marker discovery

    Arrow

    ofTime

    Phase 2

    Phase 3

    Phase1

    Phase 4

    Different Phases of Plant Genomics Research

    Rice

    Tomato

    SoybeanSorghum

    Medicago

    Brassica

    ChickpeaPigeonpea

    Mango

    Banana

    Maize

    Wheat

    Cotton

    Sugarcane

  • 8/13/2019 Bioinformatics and MAS

    5/79

    Status of genome maps

    5 (Medicago,Lotus, Soybean,

    Sorghum, Tomato)

    3 (Arabidopsis,Rice, Poplar)

    Plants

    108 (Human, Chimp,

    Mouse, Rat, Fruitfly, Mosquito, Fogu

    fish)

    Animals

    543268Microbes

    In progressCompletedGroup

    Source: NCBI

  • 8/13/2019 Bioinformatics and MAS

    6/79

    Enormous size

    of crop genomes1pg = 1 bill ion base pairs (1000 Mbp)

    Arabidopsis

    125 Mb

    Rice

    390 Mb

    Sorghum1000 Mb

    Human3000 Mb

    Maize

    2500 Mb

    Barley

    6000 Mb

    Wheat16000 Mb

    Microbes

    5 Mb

  • 8/13/2019 Bioinformatics and MAS

    7/79

    What is needed?

    1. De novo tools- Better maps, ESTs, BACs

    (Sequencing, genotyping facility)

    2. Comparative genomics- leverage

    information from model species

    (Genome-informatics facility, human resource)

    3. Create novel genetic variation- wide

    crosses, transgenics

    (Green houses, tissue culture facility, gene constructs, IPR)

    (for Molecular Breeding in Orphan Crops)

  • 8/13/2019 Bioinformatics and MAS

    8/79

    NRCPBGenoinformatics

    Centre

  • 8/13/2019 Bioinformatics and MAS

    9/79

    2. Rice genome sequence

    information for mapping,tagging and map-based

    cloning of genes

  • 8/13/2019 Bioinformatics and MAS

    10/79

  • 8/13/2019 Bioinformatics and MAS

    11/79

    Ref: IRGSP,Nature 11 August 2005

    Maps of the 12 Sequenced Rice Chromosomes

    Size = 388.8 Mb

  • 8/13/2019 Bioinformatics and MAS

    12/79

    Mapping of QTLs/Genes for

    important traits in rice

    Basmati quality traits

    Grain number

    Salt tolerance

    Blast resistance

  • 8/13/2019 Bioinformatics and MAS

    13/79

    Gene Discovery: Efforts at NRCPB, IARI and CSSRI

    Effect of 100 mM NaCl on salt susceptible (MI 48) and

    salt tolerant (CSR 27) varieties of rice

    Fine mapping

    of QTLs, and

    expression

    profiling

    (micro array/proteomics) of

    genes for

    complex

    agronomictraits

  • 8/13/2019 Bioinformatics and MAS

    14/79

    P1 P2 RILs

    0

    5

    10

    15

    20

    25

    30

    5.51

    -5.71

    5.91

    -6.11

    6.31

    -6.51

    6.71

    -6.91

    7.11

    -7.31

    7.51

    -7.71

    7.91

    -8.11

    8.31

    -8.51

    8.71

    -8.91

    9.11

    -9.31

    Grain length (in mm)

    Frequency

    P2

    P1

    Distribution of Grain length in RILs

    Grain length

  • 8/13/2019 Bioinformatics and MAS

    15/79

    Overview of Quality QTLs in Pusa 1121

  • 8/13/2019 Bioinformatics and MAS

    16/79

    Effect of 100 mM NaCl on salt susceptible (MI 48)

    and salt tolerant (CSR 27) varieties of rice

    Salt tolerance

  • 8/13/2019 Bioinformatics and MAS

    17/79

    Graphic display of detected QTLs for 17 salt tolerance parameters

  • 8/13/2019 Bioinformatics and MAS

    18/79

    sizeBAC Acc.No

    MARKER cM

    148Kb

    40.5 AC 121327149Kb

    150Kb

    AC118340134 kb

    AC 145349

    171 Kb

    RM224 136 Kb

    AC 122143

    4.5

    1.3

    9.5

    AC 109832

    AC125780

    AC125782

    AC10484610.7

    3.6

    Pi kh0.5

    0.7TRS 26

    TRS 33

    130 Kb

    AC 145349

    TRS 2635,016-35059

    142 Kb

    CG

    TRS 3316,903-16 972

    171Kb

    AC104846

    170502-171522

    136 Kb

    size BAC Acc.No

    R

    gene

    GENETIC & PHYSICAL MAP of Pi-kh locus - Comparative Genomics Approaches

    Genetic map ofPi kh locus,Chr 11

    Physical map ofPi-kh locus

    in Nippon bare

    Candidate gene identified

    28 MB ~1 MB 142kb 1.5kb

    RM206 0

    Search for NewSSR markers inNippon bare

    RM2190

    RM6965

    CAPS100

    RM202

    RM536

    (7400 genes) (18 genes)

  • 8/13/2019 Bioinformatics and MAS

    19/79

    Cloning of Disease Resistance Gene in Rice

    R-gene like sequences (Nipponbare)

    Design PCR primers flanking to the R-gene Expected PCR product

  • 8/13/2019 Bioinformatics and MAS

    20/79

    Tetep

    ATT Poly A

    990AAcAAA

    990 bp ORF

    P2TACP1

    -343 -269 -101 -64 -47 T1WUN-1

    MotiffMeJA resp

    Element

    CAAT

    BOX

    TATA

    BOX

    GT1

    S

    - 221

    Isolation & Structure of the Candidate Gene

    Comparison of gene structure between NB & cloned gene

    PCR amplification

    ofPi-kh gene from

    Tetep and HP2216

    Gene structure

    HP2216

    ATT Poly A

    990AAcAAA

    P2TACP1

    -343 -269 -101 -64 -47 G1WUN-1

    MotiffMeJA resp

    Element

    CAAT

    BOX

    TATA

    BOX

    GT1

    S

    - 221

    Sharma et al. Mol Gen Genomics: 274:569-578

  • 8/13/2019 Bioinformatics and MAS

    21/79

  • 8/13/2019 Bioinformatics and MAS

    22/79

    Konishi et al. (2006)qSH 1 (BEL1-type homeobox)Seed

    shattering

    10

    Ma et. (2006)Lsi 1 (Silicon transporter)Lodging

    tolerance

    9

    Xu et al. (2006)Sub1 (Ethylene response factor-

    like)

    Submergence

    tolerance

    8

    Sharma et al. (2005)Pikh (NBS-LRR type protein)Blast

    resistance

    7

    Bradburry et al. (2005)BAD2 (betaine aldehyde

    dehydrogenase 2)

    Grain aroma6

    Ren et al. (2005)SKC1 (a HKT type transpor ter)Salt tolerance5Ashikari et al. (2005)OsCKX2 (cytokinin oxydase)Grain number4

    Liu et al. (2004)Sbe 3 (starch branching enzymes)Amylose

    content

    3

    Sasaki et al. (2002)Sd 1 (gibberellin-20-oxidase)Plant height2

    Song et al. (1995)Xa 21 (NBS-LRR type receptor

    kinase)

    Bacterial leaf

    blightresistance

    1

    ReferenceGeneTraitS.

    no.

    Table: Growing number of genes for important agronomic traits cloned

    recently making use of the rice genome sequence information

  • 8/13/2019 Bioinformatics and MAS

    23/79

    3. Mining novel alleles

    of cloned genes in the

    germplasm

  • 8/13/2019 Bioinformatics and MAS

    24/79

    Phenotypic analysis of O .s a t i v a

    lines and species with M . g r i s e a

    Resistant: O. punctata, O. latifolia,

    O. officinalis, O. rhizomatis,

    Coloro, Jatto, K-60,

    Fukunishiki and Tetap

    Susceptible: O. rufipogon, O. nivara,O. minuta, O. gradiglumis,

    HP2216, Co-39, Bhrigudhan

    Lesions

  • 8/13/2019 Bioinformatics and MAS

    25/79

    1 2 3 4 5 6 7 8 9 10 11 12 13 14M M

    10001500

    750

    250

    PCR amplification of Pi-Kh allele from rice

    lines and wild Oryza species

    bp

    Quantification (A) and Restriction analysis (B) ofplasmid to check presence of insert

    2027

    bp

    23224361

    pUC1 2 3 4 5 6M UC 1 2 3 4 5 6 7 8 UC M

    A B

    M

    PCR amplification of Pi-kh gene from Different Rice Lines

    F- Primer R- Primer

  • 8/13/2019 Bioinformatics and MAS

    26/79

    Consensus sequence derived from

    individual reads

  • 8/13/2019 Bioinformatics and MAS

    27/79

    Tetep

    HP2216

    Coloro

    K-60

    Jatto

    Co-39

    Bhrigudhan

    Fukunishiki

    Nipponbare

    O. nivara

    O .rufipogon

    O. latifolia

    O.officinalis

    O. rhizomatis

    O. punctata

    O. minuta

    O.gradiglumis

    1bp 1765 bp

    ORFs and number of exons predicted in P i - k hgene isolated fromdifferent O . sa t iv alines and wild species

    Mining blast resistance genes from wild species of Rice

  • 8/13/2019 Bioinformatics and MAS

    28/79

    0

    20

    40

    60

    80

    100

    Transition

    Transversion

    Indels

    Num

    berofSNPs

    Lines/Species

    HP2216

    Nipp

    onba

    re

    O.niva

    ra

    O.rufip

    ogon

    O.min

    uta

    O.rhizo

    matis

    O.latis

    folia

    O.officin

    alis

    O.gran

    diglu

    mis

    O.punc

    tata

    Co-39

    Colo

    ro

    Bhrigu

    dhan

    Jatto

    K-60

    Fuk

    unishk

    i

    Number of transitions, transversions and

    indels in P i - khallele

    Indica Japonica Wild species

  • 8/13/2019 Bioinformatics and MAS

    29/79

    Phylogenetic relationship among P i - khgenes amplified from

    different wild species of rice.

    Allele mining for disease resistance genes

  • 8/13/2019 Bioinformatics and MAS

    30/79

    An overview of the sequence contigs of the badh1 gene of 16 rice

    varieties based on sequence reads obtained using 16 pair ofprimers, assembled using Phred/Phrap/Consed software)

  • 8/13/2019 Bioinformatics and MAS

    31/79

    Screen shot of Consed window showing location of

    one of the 20 SNPs discovered by sequencing of the

    badh1 gene fragments from 16 rice varieties.

  • 8/13/2019 Bioinformatics and MAS

    32/79

    Location of PCR primers (reverse primer underlined) and 20 SNPs (highlighted)

    in the badh1 gene of rice. The gene has 15 exons (in bold) and 14 introns

  • 8/13/2019 Bioinformatics and MAS

    33/79

    Summary of the SNP alleles in 16 rice varieties and the reference

    variety Nipponbare at 20 different positions in the badh1 gene,

    starting from the ATG codon of Nipponbare..

    SNP: BADH1- S1 S2 S3 S4 S5 S6 S7 S8 S9 S10 S11 S12 S13 S14 S15 S16 S17 S18 S19 S20

  • 8/13/2019 Bioinformatics and MAS

    34/79

    4. Synteny and colinearity

    -transferring riceinformation to wheat

  • 8/13/2019 Bioinformatics and MAS

    35/79

    Rice chromosome 11 long arm:

    comparison with wheat

    0

    5

    10

    15

    20

    25

    30

    35

    40

    45

    No.

    ofGenes

    1 2 3 4 5 6 7

    Wheat Chromosome Group

    Genome wide analysis of homology between 56298 predicted

  • 8/13/2019 Bioinformatics and MAS

    36/79

    Genome wide analysis of homology between 56298 predicted

    rice gene CDS (from the IRGSP sequence) and 39,813 wheat EST

    contigs (from wheat SNP consortium, build 3), plus 3792 bin-mapped wheat EST contigs (USDA-NSF wheat genome project,

    version Aug. 03)

    Homology plot of 5840 rice genes mapped on 21 wheat chromosomes

  • 8/13/2019 Bioinformatics and MAS

    37/79

  • 8/13/2019 Bioinformatics and MAS

    38/79

  • 8/13/2019 Bioinformatics and MAS

    39/79

    Ancient duplications in the Rice GenomeAncient duplications in the Rice Genome

    Rice-Sorghum gene colinearity

  • 8/13/2019 Bioinformatics and MAS

    40/79

    Rice-Sorghum gene colinearity

    sorghum

    rice

    Rice Maize gene colinearity

  • 8/13/2019 Bioinformatics and MAS

    41/79

    Rice-Maize gene colinearity

    rice

    maize

  • 8/13/2019 Bioinformatics and MAS

    42/79

    Typical Patterns of Synteny between rice and wheat

    Distribution of Copy Number among the

  • 8/13/2019 Bioinformatics and MAS

    43/79

    Distribution of Copy Number among the

    4659 Rice Gene Homologs of Wheat

  • 8/13/2019 Bioinformatics and MAS

    44/79

    Conserved Synteny of

    Single Copy Rice Genes with Wheat

  • 8/13/2019 Bioinformatics and MAS

    45/79

  • 8/13/2019 Bioinformatics and MAS

    46/79

  • 8/13/2019 Bioinformatics and MAS

    47/79

    Conserved Synteny

    of single copy ricegenes with wheat

    A. Rice chr 1/ wheat chr 3

    B. Rice chr 2/ wheat chr 6

    C. Rice chr 6/ wheat chr 8

    Transposition ofgenes concentrated

    near wheat

    centromeres

    Wheat ChromosomesRice-Wheat Colinearity

  • 8/13/2019 Bioinformatics and MAS

    48/79

    1 2 3 4 5 6 7 8 9 10 11 12

    Rice Chromosomes

    y

    based on 1063 single

    copy rice genes

    Conclusions:1. Seven wheat chromosomes

    seem to have evolved fromthe 12 ancestral rice

    chromosomes by 3 centric

    fusion and 6 translocation

    events

    1. W1 = R5 + R10

    W2 = R7 + R4

    W3 = R1

    W4 = R3 + R11

    W5 = R12 + R9 +R3

    W6 = R2

    W7 = R6 + R8

    Predicting wheat bin location of 6178 unmapped single copy rice gene homologs

  • 8/13/2019 Bioinformatics and MAS

    49/79

    Predicting wheat bin location of 6178 unmapped rice gene homologs

  • 8/13/2019 Bioinformatics and MAS

    50/79

    V lid ti f th di t d h t bi l ti

  • 8/13/2019 Bioinformatics and MAS

    51/79

    1. Chinese Spring ; 2. CS del 6AL8; 3. CS del 6BL5; 4. CS del 6DL10

    Validation of the predicted wheat bin location

    of unmapped single copy rice genes

    Experimental Validation of the Predicted

  • 8/13/2019 Bioinformatics and MAS

    52/79

    35% of the 213

    single copyrice gene

    homologs,

    representingall 12 rice and

    all 7 wheat

    chromosomes,

    mapped totheir predicted

    bin location

    Map Location of genes in Wheat

  • 8/13/2019 Bioinformatics and MAS

    53/79

  • 8/13/2019 Bioinformatics and MAS

    54/79

    5. Databases and

    web-based tools

    to assist breeders

  • 8/13/2019 Bioinformatics and MAS

    55/79

    Containing Info on 56298 genes

  • 8/13/2019 Bioinformatics and MAS

    56/79

  • 8/13/2019 Bioinformatics and MAS

    57/79

    Distribution of R-like Genes and Defense

  • 8/13/2019 Bioinformatics and MAS

    58/79

    Response Genes in Rice Genome

    1 7378 115 1.56 28 0.38

    2 5770 84 1.46 10 0.17

    3 6544 46 0.7 28 0.434 5831 38 0.65 4 0.07

    5 4977 54 1.08 20 0.4

    6 5071 63 1.24 16 0.32

    7 4858 62 1.28 11 0.23

    8 4536 62 1.37 13 0.29

    9 3561 44 1.24 8 0.22

    10 3933 39 0.99 8 0.2

    11 4436 115 2.59 11 0.25

    12 4355 53 1.22 10 0.23

    Total 61250 775 15.38 167 3.19

    Def.

    Response

    Def.Response

    (%)

    Chromoso

    me No.

    Total

    genes

    R-

    genes

    R-genes

    (%)

    R- gene like seq.= NBS-LRR, LZ-NBS-LRR, LRR-TM, Misc. (putative, known genes)

    Defense response genes = chitinases, glucanases, thaumatin like proteins

    Different types of R-Like Genes Distributed on

  • 8/13/2019 Bioinformatics and MAS

    59/79

    Rice Chromosomes

    0

    10

    20

    30

    40

    50

    60

    70

    80

    1 2 3 4 5 6 7 8 9 10 11 12

    Rice Chrom osom e

    No.

    of

    R-genes

    NBS-LRR LZ-NBS-LRR LRR-TM

    Misc Def.Resp

    Distribution of Defense Response Genes on

  • 8/13/2019 Bioinformatics and MAS

    60/79

    p

    Rice Chromosomes

    0

    5

    10

    15

    20

    25

    D

    ef.

    R

    esp

    o

    n

    se

    G

    ene

    s

    N

    o

    .

    1 2 3 4 5 6 7 8 9 10 11 12

    Rice Chromosome

    Thaumatin Glucanases Chitinases

    Mapping of R-like Genes on Rice Chromosome 1

  • 8/13/2019 Bioinformatics and MAS

    61/79

    Chr 1 Chr 2

    Chr 3 Chr 4

    Chr 5 Chr 6

    Mapping of R-like Genes on Rice Chromosome 11

  • 8/13/2019 Bioinformatics and MAS

    62/79

    A total o f 176 R- and DR-genes clusters ident if ied

    Chr 7 Chr 8

    Chr 9 Chr 10

    Chr 12Chr11

    Mapping of R-like Genes on Rice Chromosome 11

  • 8/13/2019 Bioinformatics and MAS

    63/79

  • 8/13/2019 Bioinformatics and MAS

    64/79

  • 8/13/2019 Bioinformatics and MAS

    65/79

  • 8/13/2019 Bioinformatics and MAS

    66/79

  • 8/13/2019 Bioinformatics and MAS

    67/79

    To sum up

  • 8/13/2019 Bioinformatics and MAS

    68/79

    VansanuDhan-A Rice Genome Database created at NRCPB contains

    562898 genes info. It is being used in functional and comparative

    genome analysis in rice.

    We found 942 R-gene and Defense Response gene like sequences in

    the rice genome. The physical location and orientation of each genedelineated.

    Comparative analysis of indica- japonica sequences helped us in

    mapping and cloning of a new Rice blast gene Pi-kh.

    Extensive sequence variation was observed between the Pi-kh alleles

    amplified from wild species and land races of rice. These alleles are

    being used in functional validation experiments.

    Analysis of Pi-kh locus in indica- japonica provided an insight in the

    presence of SSR elements in this region which may play an important

    role in shuff ling of genes in the genome.

  • 8/13/2019 Bioinformatics and MAS

    69/79

    6. High throughput

    genotyping to save

    time and cost

    Multipex SNP assays using Sequenom MassARRAY system

  • 8/13/2019 Bioinformatics and MAS

    70/79

    Sequence of flanking pre-amplification primers(PCRP) and single nucleotide extension primers(UEP) for genotyping of 20 SNPs of badh1 gene

    Assaying SNPs by MALDI-ToF MS

  • 8/13/2019 Bioinformatics and MAS

    71/79

    Once a SNP is identified, 3 primers are required to enable genotyping including:-

    - Two PCR primers to amplify the region around the SNP

    5 3

    - One Extension primer which anneals directly adjacent to the SNP.

    5 3

    MALDI-TOF Mass Spectrometry

  • 8/13/2019 Bioinformatics and MAS

    72/79

    DNA samples (384 no.) are mixed with amatrix, spotted onto a MALDI plate and

    loaded into the Mass Spectrometer.

    Each spot is subjected to pulses of

    nitrogen laser (337nm) in vacuum, which

    vaporises and ionises sample. Matrixabsorbs most of the laser energy,

    preventing degradation of the sample,

    and allows ionisation of some of the DNA

    substrate.

    Application of an electric field causes

    DNA ions to enter flight tube and are

    accelerated towards Mass detector.

    All ions gain same kinetic energy, so

    larger ions take longer to reach detector.

    The variation in Time of Flight allows

    separation based on size.

    Assaying SNPs by MALDI-ToF MS

  • 8/13/2019 Bioinformatics and MAS

    73/79

    0

    20

    40

    60

    80

    100

    120

    4300 4400 4500 4600 4700 4800 4900 5000

    C = 273 Da

    T = 288 Da

    A = 297 DaG = 313 Da

    Difference in peaksize reveals the SNPallele for that rice

    variety

    UnextendedSNP primer

    Extended SNPprimer

    GenotypingGenotyping

  • 8/13/2019 Bioinformatics and MAS

    74/79

    Genotyping spectra for one well (11Genotyping spectra for one well (11 SNPsSNPs))

  • 8/13/2019 Bioinformatics and MAS

    75/79

    Genotyping of badh1_S5 SNP Using Sequenom MassARRAY

    badh1_S5 badh1_S5Another Another

  • 8/13/2019 Bioinformatics and MAS

    76/79

    Pusa

    NPT11

    Pusa

    1342

    badh1_S5

    Unextended

    primer

    Primer

    extended

    with C

    Primer

    extendedwith T

    Primer

    of theMultiplex

    Primer

    of theMultiplex

  • 8/13/2019 Bioinformatics and MAS

    77/79

    7. Proposed activities for

    the Indo-Australianwheat MAS project

    Proposed Activities

  • 8/13/2019 Bioinformatics and MAS

    78/79

    p

    Database of markers and cloned genes

    ESTs, GSS, HTGS, SSRs,, SNPs, Traits

    Allele mining for specified genesGBSS-1, Glu-1, Amylase, Lr, Sr and Yr genes etc.

    High throughput genotypingSequenom Mass Array

    Capillary sequencer fragment analysis

  • 8/13/2019 Bioinformatics and MAS

    79/79

    Thank You

    Very Much


Recommended