
A MAP OF THE RECENT POSITIVE SELECTION IN THE HUMAN …sdifazio/molececol/Nov8b.pdf · Had 800,000...

Date post: 16-Oct-2020
A MAP OF THE RECENT POSITIVE SELECTION IN THE HUMAN GENOME. Benjamin F. Voight, Sridhar Kudaravalli, Xiaoquan Wen, Jonathan K. Pritchard. Presented by: Debolina Ganguly
Transcript
Page 1: A MAP OF THE RECENT POSITIVE SELECTION IN THE HUMAN …sdifazio/molececol/Nov8b.pdf · Had 800,000 polyproject. Had 800,000 polymorphic SNPmorphic SNP’s in a total of 309 unrelated

A MAP OF THE RECENT POSITIVE SELECTION IN THE HUMAN GENOME

Benjamin F. Voight, Sridhar Kudaravalli, Xiaoquan Wen, Jonathan K. Pritchard

Presented by: Debolina Ganguly

Page 2:

Overview:
The paper identifies signals of recent positive selection, knowledge of which provides information about the adaptation of modern humans to local conditions.
Dramatic changes in the local environment resulted in powerful selection pressures favoring new genotypes that are better suited to the new environments. Examples: response to malaria, the lactase gene in response to dairy farming, etc.

Page 3:

Contd.

The best examples of recent selection until now are from studies of candidate genes.
Thus it is not yet known:
how widespread these signals are
whether these are the same genes that were important in the earlier evolution of the human lineage
whether they are geographically restricted

Page 4:

Aim of the study

Find loci where there is strong, very recent selection in favor of alleles that have not yet reached fixation.
Detect signals of selective sweeps in progress.
Create selection maps of ongoing sweeps.

Page 5:

Results
Analyzed genome-wide SNP data from Phase I of the HapMap project: 800,000 polymorphic SNPs in a total of 309 unrelated individuals.
There were 3 distinct population samples of unrelated individuals:
89 Japanese and Han Chinese individuals from Tokyo and Beijing, respectively, referred to as the East Asians.
60 individuals from northern and western Europe.
60 Yoruba from Ibadan, Nigeria.
Unless otherwise mentioned, the study focused on autosomes only.
They derived genome-wide, high-resolution, LD-based recombination maps separately for all 3 samples.
The goal of the study was to identify loci where strong selection had driven new alleles up to intermediate frequencies.

Page 6:

Test statistics used:
The data consisted of pre-ascertained SNPs on a genome-wide scale, and a new test statistic, the integrated haplotype score (iHS), was developed.
The test begins with extended haplotype homozygosity (EHH). EHH measures the decay of identity, as a function of distance, of haplotypes that carry a particular core allele at one end. The haplotype homozygosity for each allele starts at 1 and decays to 0 as the distance from the core site increases.
As an allele increases in frequency due to strong selection, it tends to have high levels of haplotype homozygosity extending further than expected under the neutral model. The integrated EHH (iHH) is denoted iHH_A or iHH_D, depending on whether it is computed for the ancestral or the derived core allele.
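To make the EHH and iHH definitions concrete, here is a minimal Python sketch on toy data. It is an illustration under stated assumptions, not the authors' implementation: haplotypes are assumed to be equal-length strings of "0" (ancestral) and "1" (derived), the function names are hypothetical, the area under the EHH curve is approximated with the trapezoid rule, and integration is assumed to stop once EHH drops below 0.05. The last function takes the log ratio of the two areas, which is our understanding of the unstandardized score; the standardization step is not shown.

```python
from collections import Counter
from math import comb, log


def ehh(haplotypes, core, allele, site):
    """EHH at `site`: chance that two randomly chosen haplotypes carrying
    `allele` at index `core` are identical from the core out to `site`."""
    lo, hi = min(core, site), max(core, site)
    carriers = [h[lo:hi + 1] for h in haplotypes if h[core] == allele]
    n = len(carriers)
    if n < 2:
        return 0.0
    counts = Counter(carriers)
    return sum(comb(c, 2) for c in counts.values()) / comb(n, 2)


def ihh(haplotypes, positions, core, allele, cutoff=0.05):
    """Trapezoid-rule area under the EHH curve on both sides of the core.
    The `cutoff` value is an assumption of this sketch."""
    total = 0.0
    for step in (-1, 1):  # walk left, then right, from the core
        prev_pos, prev_ehh = positions[core], 1.0
        site = core + step
        while 0 <= site < len(positions):
            cur = ehh(haplotypes, core, allele, site)
            total += 0.5 * (prev_ehh + cur) * abs(positions[site] - prev_pos)
            if cur < cutoff:
                break
            prev_pos, prev_ehh = positions[site], cur
            site += step
    return total


def unstandardized_ihs(haplotypes, positions, core, ancestral="0", derived="1"):
    """Log ratio of ancestral to derived iHH; negative values mean the
    derived allele sits on the longer haplotypes."""
    ihh_a = ihh(haplotypes, positions, core, ancestral)
    ihh_d = ihh(haplotypes, positions, core, derived)
    return log(ihh_a / ihh_d)


# Toy data: three identical derived haplotypes, three diverse ancestral ones.
toy_haps = ["00100", "00100", "00100", "01001", "10010", "11000"]
toy_pos = [0, 10, 20, 30, 40]  # arbitrary distance units
print(unstandardized_ihs(toy_haps, toy_pos, core=2))
```

In this toy sample the derived allele's haplotypes never break down, so its area is larger and the score comes out negative. The sketch does not guard against an iHH of zero, which real data would need.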

Page 7:

Figure 1. Decay of EHH in Simulated Data for an Allele at Frequency 0.5. (A) Decay of haplotypes in a single region in which a new selected allele (red, center column) is sweeping to fixation, replacing the ancestral allele (blue). Horizontal lines are haplotypes; SNP positions are marked below the haplotype plot using blue for SNPs with intermediate allele frequencies (minor allele > 0.2), and red otherwise. For a given SNP, adjacent haplotypes with the same color carry identical genotypes everywhere between that SNP and the central (selected) site. The left- and right-hand sides are sorted separately. Haplotypes are no longer plotted beyond the points at which they become unique. (B) Decay of haplotype homozygosity for ten replicate simulations. When the core SNP is neutral (σ = 0; left side) the haplotype homozygosity decays at similar rates for both ancestral and derived alleles. When the derived alleles are favored (σ = 2Ns = 250; right side), the haplotype homozygosity decays much slower for the derived alleles than for the ancestral alleles. The discrepancy in the overall areas spanned by these two curves forms the basis of our test for selection (iHS).

Page 8:

Figure 3. Plots of Chromosome 2 SNPs with Extreme iHS Values Indicate Discrete Clusters of Signals. SNPs with |iHS| > 2.5 (top 1%) are plotted. The bottom plot combines signals for all three populations, plotting only SNPs with derived frequency > 0.5 and iHS < -2.5. Such SNPs correspond to high-frequency-derived SNPs in the range for which our test is most powerful. The short vertical bars below each plot indicate 100-kb windows whose signals are in the top 1% of windows genome-wide.

Page 9:

Figure 6. Signals of Selection for Three Candidate Selection Regions Discussed in the Text The columns show (left) scatter plots of negative iHS scores, (center) haplotype plots, and (right) decay of haplotype homozygosity. In each case the Core SNP for the center and right-hand plots was chosen as a SNP with high negative iHS score (starred in the scatter plots); the allele marked in red is derived. For each signal, values are listed for the derived allele frequency (pd) and the local deCode recombination rate estimate.

Page 10:

Types of genes under selection
For every gene, the number of SNPs with a high iHS value in a 50-SNP window centered on the gene was determined. Genes in the top 10% are considered targets of selection.
Chemosensory perception and olfaction, as well as gametogenesis and fertilization. These might be due to sexual competition and defense against pathogens.
Genes related to the metabolism of carbohydrates, lipids, phosphates and vitamin C.
Genes in skeletal development, and hair formation and patterning in Yoruba.
Alcohol dehydrogenase cluster in East Asians; carbohydrate metabolism genes, such as those for mannose in Yoruba and for sucrose and lactose in Europeans.
2 microcephaly genes, namely CDK5RAP2 in Yoruba and CENPJ in Europeans and East Asians.
Electron transport genes in Europeans (CYP genes).
Detailed report in Table 2 of the paper.
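The gene-level ranking described on this slide can be sketched as follows. This is a rough illustration, not the paper's code: the function names are hypothetical, SNP positions are assumed to be sorted, the gene is represented by a single midpoint coordinate, and "high iHS" is assumed to mean |iHS| > 2, a threshold the slide does not state.

```python
import bisect


def high_ihs_count(snp_positions, ihs_scores, gene_mid, window=50, threshold=2.0):
    """Count SNPs with |iHS| above `threshold` in a `window`-SNP window
    centered (by SNP index) on the gene midpoint `gene_mid`."""
    i = bisect.bisect_left(snp_positions, gene_mid)
    start = max(0, i - window // 2)
    end = min(len(snp_positions), start + window)
    return sum(1 for s in ihs_scores[start:end] if abs(s) > threshold)


def top_fraction(gene_counts, fraction=0.10):
    """Return the genes whose counts rank in the top `fraction`."""
    ranked = sorted(gene_counts, key=gene_counts.get, reverse=True)
    k = max(1, int(len(ranked) * fraction))
    return ranked[:k]


# Toy data: 100 evenly spaced SNPs, with a cluster of strongly negative scores.
positions = list(range(0, 1000, 10))
scores = [0.0] * 100
for idx in range(40, 45):
    scores[idx] = -3.0

counts = {f"gene{j}": 0 for j in range(9)}  # nine hypothetical background genes
counts["geneX"] = high_ihs_count(positions, scores, gene_mid=450)
print(top_fraction(counts))
```

Windows are defined by SNP count rather than by physical length, so a window spans more sequence where SNP density is low.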

Page 11:

Figure 7. Sharing of iHS Signals between Populations. The numbers listed inside circles represent the numbers of 100-kb windows that are in the top 1% of the empirical distributions in at least one population. The numbers in the intersection regions are in the top 1% for one population, and the top 5% for one or both of the other populations. The counts that would be expected if signals were independent across populations are shown in parentheses. The number of windows not in any circle is reported in the upper-left corner.
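As a rough guide to how an "expected if independent" count can arise, the sketch below works out the two-population case only. It is an assumption-laden illustration, not necessarily how the figure's parenthesized numbers were computed: the function name and the window count are hypothetical, and window ranks in the two populations are treated as fully independent.

```python
def expected_pairwise_overlap(n_windows, top=0.01, relaxed=0.05):
    """Expected number of windows that are in the top `top` fraction in one
    population and the top `relaxed` fraction in the other, assuming the two
    populations' ranks are independent and top <= relaxed."""
    # P(A strict, B relaxed) + P(B strict, A relaxed), minus the case counted
    # twice, where both populations are in the strict top fraction.
    p = 2 * top * relaxed - top * top
    return n_windows * p


# Hypothetical genome with 10,000 windows.
print(expected_pairwise_overlap(10_000))
```

An observed intersection count well above this kind of baseline suggests that signals are shared between populations more often than chance alone would produce.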

Page 12:

How effective are the study statistics?
According to the paper, the candidate sweep regions tend to be narrower in Yoruba than in the non-African populations, indicating that sweep events are younger in the non-African populations. Why?

Discussion Points:

