Construction of reference chromosome-scale pseudomolecules for potato: Integrating the potato genome with genetic and physical maps Sanjeev Kumar Sharma*,1, Daniel Bolser§,1,2, Jan de Boer†, Mads Sønderkær‡, Walter Amoros**, Martin Federico Carboni§§, Juan Martín D'Ambrosio§§, German de la Cruz††, Alex Di Genova‡‡, David S. Douches***, Maria Eguiluz§§§, Xiao Guo†††, Frank Guzman§§§,3, Christine A. Hackett‡‡‡, John P. Hamilton****, Guangcun Li†††, Ying Li§§§§, Roberto Lozano§§§, Alejandro Maass‡‡, David Marshall††††, Diana Martinez§§§, Karen McLean*, Nilo Mejía‡‡‡‡, Linda Milne††††, Susan Munive**, Istvan Nagy*****,4, Olga Ponce§§§, Manuel Ramirez§§§, Reinhard Simon**, Susan J. Thomson§§§§§, Yerisf Torres§§§, Robbie Waugh*, Zhonghua Zhang§§§§, Sanwen Huang§§§§, Richard G. F. Visser†, Christian W. B. Bachem†, Boris Sagredo†††††, Sergio E. Feingold§§, Gisella Orjeda§§§, Richard E. Veilleux‡‡‡‡‡, Merideth Bonierbale**, Jeanne M. E. Jacobs§§§§§, Dan Milbourne*****, David Michael Alan Martin§, Glenn J. Bryan*,5 *Cell and Molecular Sciences, The James Hutton Institute, Dundee DD2 5DA, United Kingdom §Division of Biological Chemistry and Drug Discovery, College of Life Sciences, University of Dundee, Dundee DD1 5EH, United Kingdom †Laboratory of Plant Breeding, Department of Plant Sciences, Wageningen-UR, Wageningen, The Netherlands ‡Department of Biotechnology, Chemistry and Environmental Engineering, Aalborg University, Aalborg, Denmark **International Potato Center (CIP), Lima 12, Peru §§Laboratorio de Agrobiotecnología, Instituto Nacional de Tecnología Agropecuaria (INTA) cc276 (7620) Balcarce, Argentina ††Laboratorio de Genética y Biotecnología Vegetal, Universidad Nacional San Cristobal de Huamanga, Ayacucho, Perú ‡‡Mathomics, Centro de Regulación Genómica & Centro de Modelamiento Matemático, Universidad de Chile, Santiago, Chile ***Department of Crop and Soil Sciences, Michigan State University, Michigan, United States of America §§§Genomics Research Unit, Facultad de Ciencias, Universidad Peruana Cayetano Heredia, Lima 31, Peru †††Institute of Vegetables, Shandong Academy of Agricultural Sciences, Jinan 250100, China ‡‡‡Biomathematics and Statistics Scotland, Dundee DD2 5DA, United Kingdom ****Department of Plant Biology, Michigan State University, Michigan, United States of America §§§§Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China ††††Information and Computational Sciences, The James Hutton Institute, Dundee DD2 5DA, United Kingdom ‡‡‡‡INIA-La Platina, Santiago, Chile *****Crops Environment and Land Use Programme, Teagasc, Carlow, Ireland §§§§§The New Zealand Institute for Plant & Food Research Ltd., Christchurch 8120, New Zealand †††††INIA-Rayentué, Rengo, Chile ‡‡‡‡‡Department of Horticulture, Virginia Tech, Virginia, United States of America Footnotes: 1S. K. Sharma and D. Bolser contributed equally to this work 2Current Address: The EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, United Kingdom 3Current Address: Departamento de Genética, Universidade Federal do Rio Grande do Sul - UFRGS, Rio Grande do Sul, Brazil 4Current Address: Department of Molecular Biology and Genetics, Aarhus University, Slagelse DK-4200, Denmark 5Corresponding Author DOI: 10.1534/g3.113.007153
2 SI S. K. Sharma et al.
Figure S1 Genome-wide patterns of marker segregation distortion in DMDD population for 1830 STS markers from different
segregation categories plotted as a function of Chi-square value (y-axis) against marker physical position (x-axis) on each of the
12 potato chromosomes. Dotted, dashed and dotted-dashed lines represent Chi-square significance values at p = 0.01 for
marker segregation categories with two, three and four genotypic classes, respectively.
Segregation type <efxeg> <hkxhk> <lmxll> <nnxnp>
Chr I Chr II Chr III
Chr IV Chr V Chr VI
Chr VII Chr VIII Chr IX
Chr X Chr XI Chr XII
S. K. Sharma et al. 3 SI
4 SI S. K. Sharma et al.
DM RH
* *
PGSC0003DMB000000278PGSC0003DMB000000051PGSC0003DMB000000573PGSC0003DMB000001053PGSC0003DMB000001045PGSC0003DMB000000710PGSC0003DMB000001085PGSC0003DMB000001095PGSC0003DMB000000322PGSC0003DMB000001101PGSC0003DMB000000996PGSC0003DMB000000189PGSC0003DMB000000330PGSC0003DMB000000296PGSC0003DMB000000690PGSC0003DMB000000707PGSC0003DMB000000964PGSC0003DMB000000719PGSC0003DMB000001035PGSC0003DMB000000355PGSC0003DMB000001415PGSC0003DMB000000577PGSC0003DMB000000086PGSC0003DMB000000118PGSC0003DMB000001153PGSC0003DMB000000249PGSC0003DMB000001361PGSC0003DMB000001188PGSC0003DMB000000480PGSC0003DMB000000067PGSC0003DMB000001151PGSC0003DMB000000785PGSC0003DMB000001351PGSC0003DMB000000350PGSC0003DMB000001221PGSC0003DMB000002580PGSC0003DMB000000587PGSC0003DMB000004233PGSC0003DMB000001146PGSC0003DMB000000757PGSC0003DMB000000115PGSC0003DMB000000205PGSC0003DMB000000176PGSC0003DMB000000513PGSC0003DMB000000176PGSC0003DMB000000696PGSC0003DMB000000029PGSC0003DMB000001145PGSC0003DMB000000366PGSC0003DMB000000694PGSC0003DMB000000016PGSC0003DMB000000565PGSC0003DMB000000925PGSC0003DMB000000556PGSC0003DMB000000936PGSC0003DMB000001099PGSC0003DMB000000460PGSC0003DMB000000811PGSC0003DMB000000845PGSC0003DMB000000259PGSC0003DMB000001636PGSC0003DMB000000179PGSC0003DMB000000013
PGSC0003DMB000000019PGSC0003DMB000001152PGSC0003DMB000000019PGSC0003DMB000001013PGSC0003DMB000000261
PGSC0003DMB000000308PGSC0003DMB000000462PGSC0003DMB000000029
PGSC0003DMB000000668
PGSC0003DMB000000107
PGSC0003DMB000000182PGSC0003DMB000001341PGSC0003DMB000000182PGSC0003DMB000000685PGSC0003DMB000000539PGSC0003DMB000000632
PGSC0003DMB000000006
PGSC0003DMB000000018
PGSC0003DMB000000123
PGSC0003DMB000000909PGSC0003DMB000000185
PGSC0003DMB000000987PGSC0003DMB000000639PGSC0003DMB000000471
PGSC0003DMB000000181PGSC0003DMB000000419PGSC0003DMB000000704PGSC0003DMB000000928PGSC0003DMB000000830PGSC0003DMB000000997PGSC0003DMB000000777
PGSC0003DMB000000234PGSC0003DMB000000551PGSC0003DMB000000882PGSC0003DMB000000724PGSC0003DMB000000938PGSC0003DMB000000576PGSC0003DMB000000213
PGSC0003DMB000000420PGSC0003DMB000000294PGSC0003DMB000000892PGSC0003DMB000000434PGSC0003DMB000001017PGSC0003DMB000000285
PGSC0003DMB000000164
PGSC0003DMB000000517PGSC0003DMB000000913PGSC0003DMB000000404
PGSC0003DMB000000002PGSC0003DMB000000519
PGSC0003DMB000000226
PGSC0003DMB000000381PGSC0003DMB000000784
PGSC0003DMB000000184PGSC0003DMB000000633PGSC0003DMB000000754
PGSC0003DMB000000032
pPt-558903
pPt-654865
solcap_stsnp_c1_3731solcap_stsnp_c1_3745solcap_stsnp_c1_14449
pPt-472050pPt-652727solcap_stsnp_c1_8343
solcap_stsnp_c1_11135
solcap_stsnp_c1_12827
PM0146
pPt-457515solcap_stsnp_c1_5418
pPt-559222pPt-457943solcap_stsnp_c1_15760solcap_stsnp_c1_16295
solcap_stsnp_c1_15986solcap_stsnp_c1_15877
pPt-473312
solcap_stsnp_c1_10453solcap_stsnp_c1_10454pPt-657518pPt-559203solcap_stsnp_c1_11352
solcap_stsnp_c1_9742
solcap_stsnp_c1_11669
solcap_stsnp_c1_6748solcap_stsnp_c1_8332solcap_stsnp_c1_13165solcap_stsnp_c1_15844
pPt-471982solcap_stsnp_c1_3168
solcap_stsnp_c1_13085solcap_stsnp_c1_4179
solcap_stsnp_c1_10207pPt-457379pPt-651198solcap_stsnp_c1_10675toPt-438080solcap_stsnp_c1_3460
[1] 11.0
[2] 13.0[1] 14.0[5] 14.5[6] 15.5
[2] 19.0[1] 20.5
[1] 22.0
[1] 26.0[1] 26.5
[1] 28.0
[1] 30.5
[1] 33.5
[1] 36.5[1] 37.5[1] 38.5[1] 39.5[2] 40.5[1] 42.5[1] 43.0[2] 43.5[1] 45.0
[1] 49.0[1] 49.5[1] 50.5
[2] 53.0[1] 55.0[1] 55.5
[4] 58.5
[1] 62.0[1] 62.5
[1] 64.5
[1] 69.5
[1] 75.0[1] 75.5
[1] 78.5[2] 79.5
[1] 82.0[1] 83.0
[1] 83.5[1] 84.0
[2] 84.5
[1] 87.5[3] 88.5[1] 89.0
[2] 89.5[2] 90.5
[12] 91.0
[3] 92.0[1] 93.5
[1] 95.5[1] 96.0[2] 98.0[2] 98.5
[1] 99.0[1] 99.5
123456789
101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899
100101102103104105
0 1 2 4 7 10 15 20 30 50 100 500
Number of AFLP markers per bin (upper boundaries)
Chromosome 4 (72.2 Mb)
S. K. Sharma et al. 5 SI
DM RHChromosome 5 (52.1 Mb)123456789
101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778
0 1 2 4 7 10 15 20 30 50 100 500
Number of AFLP markers per bin (upper boundaries)
PGSC0003DMB000000103PGSC0003DMB000000410PGSC0003DMB000001176PGSC0003DMB000000943PGSC0003DMB000000251PGSC0003DMB000000051PGSC0003DMB000000192PGSC0003DMB000000609PGSC0003DMB000000834PGSC0003DMB000000424PGSC0003DMB000000243PGSC0003DMB000000210PGSC0003DMB000000540PGSC0003DMB000000216PGSC0003DMB000000707PGSC0003DMB000000248PGSC0003DMB000000027PGSC0003DMB000000360PGSC0003DMB000000145PGSC0003DMB000000317PGSC0003DMB000000282PGSC0003DMB000000426PGSC0003DMB000000042PGSC0003DMB000000003
PGSC0003DMB000000113
PGSC0003DMB000000206
PGSC0003DMB000000002
PGSC0003DMB000000033
PGSC0003DMB000000513PGSC0003DMB000000146
PGSC0003DMB000000175PGSC0003DMB000000260PGSC0003DMB000000178PGSC0003DMB000000275PGSC0003DMB000000416PGSC0003DMB000000507PGSC0003DMB000000236PGSC0003DMB000000328PGSC0003DMB000000822PGSC0003DMB000000069PGSC0003DMB000000323PGSC0003DMB000000619PGSC0003DMB000000204PGSC0003DMB000000649PGSC0003DMB000000543PGSC0003DMB000000202PGSC0003DMB000000106PGSC0003DMB000000512PGSC0003DMB000000827PGSC0003DMB000000452PGSC0003DMB000000580PGSC0003DMB000000072
pPt-654593pPt-655590pPt-457354PM0064
pPt-456666solcap_stsnp_c1_7354
solcap_stsnp_c1_15785toPt-440648solcap_stsnp_c1_11256pPt-473272
solcap_stsnp_c1_11075capPt-706997
pPt-651584
pPt-539560
st_19532pPt-650026
PM0467
PM0193pPt-536975solcap_stsnp_c1_6338pPt-655798
pPt-651016
toPt-439005toPt-763812
pPt-656906
solcap_stsnp_c1_3413PM0333pPt-458238solcap_stsnp_c1_12486solcap_stsnp_c1_1199toPt-438407
[2] 2.0[5] 2.5
[2] 7.0
[1] 8.0
[2] 13.5[1] 14.5
[1] 21.5
[1] 24.5[8] 25.5
[1] 29.5[2] 30.0[1] 30.5[1] 31.0
[2] 32.5
[2] 34.0[1] 34.5
[18] 35.5
[1] 38.0
[1] 40.0
[3] 41.0
[14] 43.5
[8] 44.5[2] 45.0[1] 46.0
[3] 48.0[1] 48.5[1] 49.0
[1] 55.5[2] 56.0[1] 56.5
[4] 58.0
[1] 61.5[5] 63.5[1] 64.0
S. K. Sharma et al. 6 SI
DM RHChromosome 6 (59.5 Mb)PGSC0003DMB000000208PGSC0003DMB000000411PGSC0003DMB000000665PGSC0003DMB000000584PGSC0003DMB000000504PGSC0003DMB000000967PGSC0003DMB000000930PGSC0003DMB000000862PGSC0003DMB000000052PGSC0003DMB000000266PGSC0003DMB000000596PGSC0003DMB000000337PGSC0003DMB000000890PGSC0003DMB000000681PGSC0003DMB000000816PGSC0003DMB000000625PGSC0003DMB000000422PGSC0003DMB000000614PGSC0003DMB000000654PGSC0003DMB000000044PGSC0003DMB000000247PGSC0003DMB000000301PGSC0003DMB000000598PGSC0003DMB000000223PGSC0003DMB000000881PGSC0003DMB000000698PGSC0003DMB000000655PGSC0003DMB000000075PGSC0003DMB000000885PGSC0003DMB000000088PGSC0003DMB000000763PGSC0003DMB000000395PGSC0003DMB000000370PGSC0003DMB000000121
PGSC0003DMB000000209
PGSC0003DMB000000050
PGSC0003DMB000000401
PGSC0003DMB000000005
PGSC0003DMB000000776
PGSC0003DMB000000156PGSC0003DMB000000660PGSC0003DMB000000349PGSC0003DMB000000600PGSC0003DMB000000563PGSC0003DMB000000221
PGSC0003DMB000000279PGSC0003DMB000000807PGSC0003DMB000000245PGSC0003DMB000000255
PGSC0003DMB000000019PGSC0003DMB000000432PGSC0003DMB000001055PGSC0003DMB000000315PGSC0003DMB000001158PGSC0003DMB000000758PGSC0003DMB000000125PGSC0003DMB000000227PGSC0003DMB000000873PGSC0003DMB000000984PGSC0003DMB000000579PGSC0003DMB000001107PGSC0003DMB000000875PGSC0003DMB000000982PGSC0003DMB000000874PGSC0003DMB000000558PGSC0003DMB000000945PGSC0003DMB000000461PGSC0003DMB000000676PGSC0003DMB000000772PGSC0003DMB000000726PGSC0003DMB000000346PGSC0003DMB000000783PGSC0003DMB000000593PGSC0003DMB000000158PGSC0003DMB000000222PGSC0003DMB000000246PGSC0003DMB000000215PGSC0003DMB000000578PGSC0003DMB000000550PGSC0003DMB000000180PGSC0003DMB000000087PGSC0003DMB000000345PGSC0003DMB000000091PGSC0003DMB000001083PGSC0003DMB000000575PGSC0003DMB000000801PGSC0003DMB000000128PGSC0003DMB000000686PGSC0003DMB000000150PGSC0003DMB000002047
toPt-441146 (2)solcap_stsnp_c1_9246
solcap_stsnp_c1_15546
solcap_stsnp_c1_16621
solcap_stsnp_c1_1574
solcap_stsnp_c1_3690
solcap_stsnp_c1_9854
solcap_stsnp_c1_12691pPt-473740 (2)solcap_stsnp_c1_9977
PM1285
solcap_stsnp_c1_12700
pPt-657258 (2)
toPt-438543 (2)pPt-655645 (2)PM0478
pPt-473454 (2)pPt-471807 (2)
pPt-536932 (2)solcap_stsnp_c1_14891
solcap_stsnp_c1_15368solcap_stsnp_c1_10650solcap_stsnp_c1_2059solcap_stsnp_c1_2104pPt-473107 (2)
pPt-656749 (2)pPt-472490 (2)
pPt-473709 (2)solcap_stsnp_c1_3007
solcap_stsnp_c1_2967pPt-538623 (2)
solcap_stsnp_c1_14920solcap_stsnp_c1_3074
solcap_stsnp_c1_3130pPt-654600 (2)
solcap_stsnp_c1_3091pPt-559151 (2)
[1] 1.5[1] 2.0
[1] 2.5[1] 3.5
[1] 5.5
[2] 10.0
[1] 11.5
[1] 13.0
[1] 14.0
[1] 18.5
[2] 21.5
[1] 25.5[1] 26.5
[3] 28.0
[1] 33.0[5] 34.0[8] 34.5[4] 35.0[1] 36.0
[1] 38.5
[1] 39.5
[1] 41.0[2] 41.5
[1] 45.0
[1] 47.0
[1] 49.0[5] 49.5
[1] 53.5
[4] 55.5[32] 56.5[3] 57.0
[3] 58.5[8] 59.0
[2] 62.0
[1] 64.0[5] 65.0[1] 65.5[4] 67.0[6] 67.5[2] 68.0
[1] 68.5[1] 69.0
[1] 69.5[1] 70.5
123456789
1011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374
0 1 2 4 7 10 15 20 30 50 100 500
Number of AFLP markers per bin (upper boundaries)
**
S. K. Sharma et al. 7 SI
8 SI S. K. Sharma et al.
S. K. Sharma et al. 9 SI
10 SI S. K. Sharma et al.
S. K. Sharma et al. 11 SI
12 SI S. K. Sharma et al.
S. K. Sharma et al. 13 SI
Figure S2 Illustration of the chromosome 2 - 12 pseudomolecules (PMs) integrated with the DM and RH genetic maps. STS and
AFLP markers anchor sequence locations in the chromosome PMs to the DMDD and RH genetic maps, respectively. The AFLP
marker positions in the PM were identified through sequence tag alignment of BAC clones from the RH WGP physical map.
Superscaffolds comprising the PM are shown as alternating grey and white rectangular blocks. The layout of the PM for each of
the genetic maps is shown separately but is identical with superscaffold IDs depicted in the middle. The pachytene idiogramis
adapted from the potato reference genome publication (Potato Genome Sequencing Consortium 2011). The putative
centromere region and pericentromeric/heterochromatic boundaries are demarcated by asterisks and dashed lines,
respectively. Each DMDD marker type is colour coded: blue = DArTs, yellow = SNPs, green = SSRs. Blue and magenta lines
emerging from the RH genetic map represent AFLP anchors and the intensity of green color corresponds to the AFLP marker
density per bin as reported by Van Os et al. (2006). Magenta lines represent AFLP markers with a relatively inaccurate mapping
position on the RH genetic map, covering an interval of 5 or more bins. Regions in the central heterochromatin where
superscaffold order and orientation are not completely resolved are indicated in yellow. Inversions with the tomato sequence
are indicated with red interval bars.
14 SI S. K. Sharma et al.
Tables S1-S9 Available for download at http://www.g3journal.org/lookup/suppl/doi:10.1534/g3.113.007153/-/DC1 Table S1 Details of (A) Simple sequence repeat (SSR), (B) Single nucleotide polymorphism (SNP) and (C) Amplified fragment length polymorphism (AFLP) markers employed in DMDD genotyping. Table S2 Location of sequence-tagged site (STS) markers employed in DMDD genotyping on the DM version 3 superscaffolds and DM version 4.03 pseudomolecules. STS markers include DArTs, SSRs and SNPs. Table S3 Revised annotation details for the Infinium 8.3k Potato Array SNPs (Felcher et al. 2012) on DM version 4.03 pseudomolecules. Table S4 Genetic and physical locations of STS markers (DArTs, SSRs and SNPs) mapped in DMDD and anchored in DM version 4.03 pseudomolecules. Table S5 Paracentric inversions between potato and tomato chromosomes detected by dot plot alignments between the chromosome pseudomolecules V4.03 of potato line DM and V2.40 of tomato cv. 'Heinz 1706'. Table S6 Summary of six BAC pools sequence assembly data comprising 82 DM BAC clones used for validating link peak-based orientation strategy for chromosome 4. Table S7 BAC pool assembly and validation details for chromosome 4 pseudomolecule version 4.03. Table S8 Centromere localisation in DM V3 sequence assembly. Table S9 Accessioned Golden Path (AGP) for the reference DM chromosome-scale pseudomolecules version 4.03. File also includes revised annotation details for potato genes and repeat regions (Potato Genome Sequencing Consortium 2011) and a list of chimeric superscaffolds.