+ All Categories
Home > Documents > Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal...

Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal...

Date post: 10-Sep-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
22
Supporting Information Margos et al. 10.1073/pnas.0800323105 SI Materials and Methods DNA Extraction. DNA of tick-derived samples and cultures was either prepared by alkaline hydrolysis (1) or extracted by using the DNeasy tissue and blood kit (Qiagen) as described previously (2). Purified DNA of cultured isolates was diluted to a concen- tration of 10 pg/l. Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy genes, (ii) they display no more than 5% allelic divergence as assessed by microarray analyses of clinical isolates (3), and (iii) they are, if possible, not adjacent to genes encoding outer surface proteins or hypothetical proteins. Based on the sequenced genomes of B. burgdorferi (B31) (accession no. NC001318) (4), B. garinii (PBi) (accession number NC006156), and B. afzelii (Pko) (accession number 008277) (5), primers were designed to highly conserved regions of eight housekeeping genes that fulfilled these criteria (i.e., clpA, clpX, nifS, pepX, pyrG, recG, rplB, and uvrA) by using OligoExplorer 1.2 (Gene Link). The rationale for this strategy was the future application of this MLST scheme to other species of the Lyme borreliosis group. The primers designed for clpA, clpX, pepX, recG, rplB, and uvrA were nested, and those for nifS and rplB were seminested (Table S6). PCR. HotstarTaq Mastermix (Qiagen), 25 pmol of each primer, forward and reverse, and 2.5 l of template DNA (purified DNA of isolates or tick lysates) were used for the first set of amplifi- cation cycles (25-l final reaction volume). For PCR on tick- derived material, the MgCl 2 concentration was adjusted to 2.5 mM. Bioline Immomix Red, 50 pmol of each primer and 5 l of product derived from the primary set of cycles were used for the second set of amplification cycles (50-l final reaction volume). The PCR conditions for the housekeeping genes, except for recG, were as follows: for the first set of cycles, touchdown PCR was used with annealing temperatures starting from 55°C and decreasing 1°C each cycle. Specific conditions were 95°C for 15 min, 94°C for 30 s, annealing temperature from 55°C to 48°C for 30 s, and an extension step of 72°C for 30 s. An additional 20 cycles were run at 94°C for 30 s, annealing temperature of 48°C, and extension at 72°C for 30 s. After a final extension step for 5 min at 72°C, the samples were kept at 15°C until further analysis. The conditions for the second set of 35 cycles were 95°C for 7 min, 94°C for 30 s, 50°C for 30 s, 72°C for 30 s. After a final extension step for 5 min at 72°C, the samples were kept at 15°C. For recG, the PCR conditons for the first set of cycles were 95°C for 15 min, 94°C for 30 s, 55°C for 30 s, 72°C for 30 s, 30 cycles, and extension at 72°C for 5 min. The conditions for the second set of cycles were identical, except for an initial dena- turing step at 95°C for 7 min. To put the results obtained by this MLST scheme into context with previous studies, the IGS locus and ospC were amplified by using primers and conditions as described previously (6–8). Accession numbers of ospC and IGS sequences obtained in this study and of sequences downloaded from the GenBank database (9, 10) are shown in Table S7. Sequencing. PCR products of all loci were sequenced in forward and reverse directions by using the internal primers (Table S6). Forward and reverse sequences of individual housekeeping genes were compared using the Seqman module of Lasergene version 7 (DNASTAR). For ambiguous sequences, samples were either resequenced or reamplified and resequenced. Mixed infections detected by the presence of double peaks in sequence traces were discarded from further analysis. Sequence Analyses. Version 4.10.9 of DnaSP (11) was used to determine the GC content of the genes. Values for the overall nonsynonymous and synonymous substitutions (dN/dS) of genes were determined in MEGA version 3.1 (12) by using the modified Nei–Gojobori method and the Jukes–Cantor model. For ospC, dN/dS (omega) values for individual codons were determined in Phylemon (http://phylemon.bioinfo.cipf.es/cgi- bin/tools.cgi) using the sitewise likelihood-ratio (SLR) method described by Massingham and Goldman (13). For this, one strain per major ospC group was used for sequence alignment and to construct a neighbor joining tree (1,000 bootstrap repeats) in MEGA 3.1. Table S5 shows the omega values for the individual codons; minus signs in the Results column indicate purifying selection whereas plus signs indicate positive selection. / indicates 95% confidence; / indicates 99% confidence, and / indicates corrected 99% confidence. P values and adjusted P values are given in the corresponding columns. Strains used for SLR analysis included B31, 297, B156, B500, B509, BL522, BL538, MR616, MR623, MR661, MR662, MR640, ca92–1096, IPT2, IPT23, ITP39, IPT58, NE49, Z41293, and Z41493. MLST and Phylogeny. Sequences of individual housekeeping genes that differed by one or more nucleotides were assigned allele numbers by using the nonredundant database program available at www.mlst.net, followed by defining STs for the spirochete samples. In addition to analyzing the STs, the sequences of the house- keeping genes were concatenated. All sequences were aligned by using the ClustalW algorithm of MEGA 3.1 (12), and phyloge- netic trees were constructed. Trees of the concatenated house- keeping genes, the IGS locus, and ospC were generated by using MrBayes 3.1.2 software (14). To choose an appropriate evolu- tionary model for analyses using MrBayes, MrModeltest (Ny- lander 2004, www.abc.se/nylander/) was conducted in Paup 4.0 (15). The concatenated sequences of the housekeeping genes were partitioned into first, second, and third codon positions. Partitions were unlinked for the gamma distribution to allow sites for each codon position to evolve under rates from a gamma distribution with a different shape parameter for each position/ partition. For coding genes, the General Time Reversible model with gamma-distributed rate variation across sites and a propor- tion of invariable sites was used for tree construction, whereas for the IGS sequences, the General Time Reversible model with gamma-distributed rate variation across sites was used. Analyses were continued for 2 10 7 generations or until the average standard deviation of split frequencies was 0.01. Data were sampled every 100th generation. In addition, distance matrices and neighbor joining bootstrap (1,000) trees for individual loci and the concatenated housekeeping genes were obtained by using the Kimura two-parameter model in MEGA. All trees were rooted by using the sequences of the European B. garinii strain PBi obtained from the GenBank database (accession no. NC006156) as an outgroup. Statistics. To test whether the topologies of the MLST tree and the IGS tree differed significantly, a partition homogeneity test (16) based on the incongruence-length difference test (17), Margos et al. www.pnas.org/cgi/content/short/0800323105 1 of 22
Transcript
Page 1: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Supporting InformationMargos et al. 10.1073/pnas.0800323105SI Materials and MethodsDNA Extraction. DNA of tick-derived samples and cultures waseither prepared by alkaline hydrolysis (1) or extracted by usingthe DNeasy tissue and blood kit (Qiagen) as described previously(2). Purified DNA of cultured isolates was diluted to a concen-tration of 10 pg/�l.

Gene Selection and Primer Design. Chromosomal housekeepinggenes were selected based on the following criteria: that (i) theyare single-copy genes, (ii) they display no more than 5% allelicdivergence as assessed by microarray analyses of clinical isolates(3), and (iii) they are, if possible, not adjacent to genes encodingouter surface proteins or hypothetical proteins. Based on thesequenced genomes of B. burgdorferi (B31) (accession no.NC�001318) (4), B. garinii (PBi) (accession number NC�006156),and B. afzelii (Pko) (accession number 008277) (5), primers weredesigned to highly conserved regions of eight housekeepinggenes that fulfilled these criteria (i.e., clpA, clpX, nifS, pepX,pyrG, recG, rplB, and uvrA) by using OligoExplorer 1.2 (GeneLink). The rationale for this strategy was the future applicationof this MLST scheme to other species of the Lyme borreliosisgroup. The primers designed for clpA, clpX, pepX, recG, rplB, anduvrA were nested, and those for nifS and rplB were seminested(Table S6).

PCR. HotstarTaq Mastermix (Qiagen), 25 pmol of each primer,forward and reverse, and 2.5 �l of template DNA (purified DNAof isolates or tick lysates) were used for the first set of amplifi-cation cycles (25-�l final reaction volume). For PCR on tick-derived material, the MgCl2 concentration was adjusted to 2.5mM. Bioline Immomix Red, 50 pmol of each primer and 5 �l ofproduct derived from the primary set of cycles were used for thesecond set of amplification cycles (50-�l final reaction volume).

The PCR conditions for the housekeeping genes, except forrecG, were as follows: for the first set of cycles, touchdown PCRwas used with annealing temperatures starting from 55°C anddecreasing 1°C each cycle. Specific conditions were 95°C for 15min, 94°C for 30 s, annealing temperature from 55°C to 48°C for30 s, and an extension step of 72°C for 30 s. An additional 20cycles were run at 94°C for 30 s, annealing temperature of 48°C,and extension at 72°C for 30 s. After a final extension step for 5min at 72°C, the samples were kept at 15°C until further analysis.The conditions for the second set of 35 cycles were 95°C for 7min, 94°C for 30 s, 50°C for 30 s, 72°C for 30 s. After a finalextension step for 5 min at 72°C, the samples were kept at 15°C.

For recG, the PCR conditons for the first set of cycles were95°C for 15 min, 94°C for 30 s, 55°C for 30 s, 72°C for 30 s, 30cycles, and extension at 72°C for 5 min. The conditions for thesecond set of cycles were identical, except for an initial dena-turing step at 95°C for 7 min.

To put the results obtained by this MLST scheme into contextwith previous studies, the IGS locus and ospC were amplified byusing primers and conditions as described previously (6–8).Accession numbers of ospC and IGS sequences obtained in thisstudy and of sequences downloaded from the GenBank database(9, 10) are shown in Table S7.

Sequencing. PCR products of all loci were sequenced in forwardand reverse directions by using the internal primers (Table S6).Forward and reverse sequences of individual housekeepinggenes were compared using the Seqman module of Lasergeneversion 7 (DNASTAR). For ambiguous sequences, samples were

either resequenced or reamplified and resequenced. Mixedinfections detected by the presence of double peaks in sequencetraces were discarded from further analysis.

Sequence Analyses. Version 4.10.9 of DnaSP (11) was used todetermine the GC content of the genes. Values for the overallnonsynonymous and synonymous substitutions (dN/dS) of geneswere determined in MEGA version 3.1 (12) by using themodified Nei–Gojobori method and the Jukes–Cantor model.

For ospC, dN/dS (omega) values for individual codons weredetermined in Phylemon (http://phylemon.bioinfo.cipf.es/cgi-bin/tools.cgi) using the sitewise likelihood-ratio (SLR) methoddescribed by Massingham and Goldman (13). For this, one strainper major ospC group was used for sequence alignment and toconstruct a neighbor joining tree (1,000 bootstrap repeats) inMEGA 3.1. Table S5 shows the omega values for the individualcodons; minus signs in the Results column indicate purifyingselection whereas plus signs indicate positive selection. �/�indicates 95% confidence; ��/�� indicates 99% confidence,and ���/��� indicates corrected 99% confidence. P valuesand adjusted P values are given in the corresponding columns.Strains used for SLR analysis included B31, 297, B156, B500,B509, BL522, BL538, MR616, MR623, MR661, MR662, MR640,ca92–1096, IPT2, IPT23, ITP39, IPT58, NE49, Z41293, andZ41493.

MLST and Phylogeny. Sequences of individual housekeeping genesthat differed by one or more nucleotides were assigned allelenumbers by using the nonredundant database program availableat www.mlst.net, followed by defining STs for the spirochetesamples.

In addition to analyzing the STs, the sequences of the house-keeping genes were concatenated. All sequences were aligned byusing the ClustalW algorithm of MEGA 3.1 (12), and phyloge-netic trees were constructed. Trees of the concatenated house-keeping genes, the IGS locus, and ospC were generated by usingMrBayes 3.1.2 software (14). To choose an appropriate evolu-tionary model for analyses using MrBayes, MrModeltest (Ny-lander 2004, www.abc.se/�nylander/) was conducted in Paup 4.0(15). The concatenated sequences of the housekeeping geneswere partitioned into first, second, and third codon positions.Partitions were unlinked for the gamma distribution to allowsites for each codon position to evolve under rates from a gammadistribution with a different shape parameter for each position/partition. For coding genes, the General Time Reversible modelwith gamma-distributed rate variation across sites and a propor-tion of invariable sites was used for tree construction, whereasfor the IGS sequences, the General Time Reversible model withgamma-distributed rate variation across sites was used. Analyseswere continued for 2 � 107 generations or until the averagestandard deviation of split frequencies was �0.01. Data weresampled every 100th generation. In addition, distance matricesand neighbor joining bootstrap (1,000) trees for individual lociand the concatenated housekeeping genes were obtained byusing the Kimura two-parameter model in MEGA. All treeswere rooted by using the sequences of the European B. gariniistrain PBi obtained from the GenBank database (accession no.NC�006156) as an outgroup.

Statistics. To test whether the topologies of the MLST tree andthe IGS tree differed significantly, a partition homogeneity test(16) based on the incongruence-length difference test (17),

Margos et al. www.pnas.org/cgi/content/short/0800323105 1 of 22

Page 2: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

implemented in Paup 4.0, was applied. For this, IGS sequenceswere added to MLST sequences. The data were partitioned inMLST and IGS and used for parsimony analysis (100 bootstraprepeats, heuristic search) with random addition sequences (10repeats) and tree bisection–reconnection branch swapping.

To test whether there is a different distribution of the allelicprofiles of B. burgdorferi from Europe and North America, apermutation test was performed (Table S3 and Fig. S2). For this,a dissimilarity matrix was calculated from the allelic profiles ofB. burgdorferi by counting the number of genes at which each pairof strains differ (Table S3). Only one isolate/sample of B.burgdorferi per ST was taken to control for possible effects oflocal clonal expansion. The pairwise difference between strain iand strain j was denoted by dij, where dij equals the number ofgenes at which strains i and j differ. D refers to the symmetricdistance matrix whose ith row and jth column entry is dij. Fromthis matrix, a dissimilarity test statistic was calculated as theweighted sum (¥) of the mean dissimilarities within the tworegions compared in this test (i.e., the United States andEurope), where the weights are the relative numbers of strainsin each region (n1 are strains from the United States, and n2 arestrains from Europe), i.e.,

D1 � ��i�j, with i and j in region1�

dij

n1�n1 � 1 /2

and D2 �[i�j, with i and j in region2]

dij

n2�n2 � 1 /2.

The test statistic is a weighted average of the two means

T �n1

n1 � n2D1 �

n2

n1 � n2D2,

with n1 � n2 equaling the total number of STs. Under the nullhypothesis that there is no difference between the allelic profilesof B. burgdorferi strains from the two regions, and under thealternative hypothesis that there is a difference, a permutationtest was performed using one million random permutations.That is, the strains are repeatedly randomly divided into twogroups of the same size as before, and the dissimilarity teststatistic is recalculated.

1. Guy EC, Stanek G (1991) Detection of Borrelia burgdorferi in patients with Lyme diseaseby the polymerase chain reaction. J Clin Pathol 44:610–611.

2. Beati L, Keirans JE (2001) Analysis of the systematic relationships among ticks of thegenera Rhipicephalus and Boophilus (Acari: Ixodidae) based on mitochondrial 12Sribosomal DNA gene sequences and morphological characters. J Parasitol 87:32–48.

3. Terekhova D, Iyer R, Wormser GP, Schwartz I (2006) Comparative genome hybridiza-tion reveals substantial variation among clinical isolates of Borrelia burgdorferi sensustricto with different pathogenic properties. J Bacteriol 188:6124–6134.

4. Casjens S, et al. (2000) A bacterial genome in flux: The twelve linear and nine circularextrachromosomal DNAs in an infectious isolate of the Lyme disease spirochete Bor-relia burgdorferi. Mol Microbiol 35:490–516.

5. Glockner G, et al. (2006) Comparative genome analysis: Selection pressure on theBorrelia vls cassettes is essential for infectivity. BMC Genomics 7:211.

6. Brisson D, Dykhuizen DE (2004) ospC diversity in Borrelia burgdorferi: Different hostsare different niches. Genetics 168:713–722.

7. Bunikis J, et al. (2004) Sequence typing reveals extensive strain diversity of the Lymeborreliosis agents Borrelia burgdorferi in North America and Borrelia afzelii in Europe.Microbiology 150:1741–1755.

8. Liveris D, Gazumyan A, Schwartz I (1995) Molecular typing of Borrelia burgdorferisensu lato by PCR-restriction fragment length polymorphism analysis. J Clin Microbiol33:589–595.

9. Attie O, et al. (2007) Co-evolution of the outer surface protein C gene (ospC) andintraspecific lineages of Borrelia burgdorferi sensu stricto in the northeastern UnitedStates. Infect Genet Evol 7:1–12.

10. Wang G, et al. (2002) Disease severity in a murine model of lyme borreliosis is associatedwith the genotype of the infecting Borrelia burgdorferi sensu stricto strain. J Infect Dis186:782–791.

11. Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R (2003) DnaSP, DNA polymorphismanalyses by the coalescent and other methods. Bioinformatics 19:2496–2497.

12. Kumar S, Tamura K, Nei M (2004) Integrated software for Molecular EvolutionaryGenetics Analysis and sequence alignment. Brief Bioinform 5:150–163.

13. Massingham T, Goldman N (2005) Detecting amino acid sites under positive selectionand purifying selection. Genetics 169:1753–1762.

14. Huelsenbeck JP, Ronquist F (2005) in Statistical Methods in Molecular Evolution, edNielsen R (Springer, New York), pp 183–232.

15. Swofford DL (2002) PAUP*: Phylogenetic Analysis Using Parsimony (*and OtherMethods) (Sinauer, Sunderland, MA), Version 4.

16. Farris JS, Kallersjo M, Kluge AG, Bult C (1995) Constructing a significance test forincongruence. Syst Biol 44:570–572.

17. Mickevich MF, Farris JS (1981) The implications of congruence in Menidia. Syst Zool30:351–370.

Margos et al. www.pnas.org/cgi/content/short/0800323105 2 of 22

Page 3: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

US EU

US EU

US EUUS EUUS EU

US EU

US EU

US EU US EU

clpA clpX nifS pepX

pyrG recG rplB urvA

STs

15 50 7 1 5 9 0 5 8 2 1

7 1 2 9 1 7 5 1 4 13 1 4

23 0 10

Fig. S1. Venn diagram of shared alleles and sequence types (STs) between B. burgdorferi strains. Individual circles represent strains from the United States (US)and from Europe (EU). Values inside the circles give the number of alleles/STs found exclusively in these populations, and the number in the overlap indicatesthe number of shared alleles or STs. Only seven alleles of 113 total alleles were shared between strains from the two regions.

Margos et al. www.pnas.org/cgi/content/short/0800323105 3 of 22

Page 4: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Histogram of dissimilarity statistics

Value of dissimilarity statistic

Rel

ativ

e fr

eque

ncy

6.4 6.6 6.8 7.0

01

23

45

67

Observed statistic

Fig. S2. Permutation test. The distribution of the dissimilarity statistic under the null hypothesis of no difference between B. burgdorferi populations fromEurope and the United States. The bars represent the relative frequencies of the dissimilarity statistic T, which is calculated from the pairwise allelic differencematrix and is dimensionless. The value of the observed statistic was 6.68 (indicated as a dashed line), which is distant from the main body of the histogram (highervalues of the statistic indicating larger within-group differences). This provides significant evidence to reject the null hypothesis (P 0.00285).

Margos et al. www.pnas.org/cgi/content/short/0800323105 4 of 22

Page 5: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Fig. S3. Neighbor joining trees of housekeeping gene sequences. For the concatenated housekeeping genes, both the MrBayes algorithm (Fig. 3) and theneighbor joining tree building method (A) gave rise to the same terminal clades, and bootstrap support for the terminal nodes was high. The trees based onthe two methods were highly similar, rendering the neighbor joining algorithm a convenient approach in the analysis of very large datasets. Rooted neighborjoining trees for the individual housekeeping genes are shown in B–I. As expected, trees of the individual housekeeping genes showed less resolution than thetrees based on the concatenated genes. Branch support values were obtained by a bootstrap procedure (1,000 replications). Only bootstrap values of 50% ormore are shown. Symbols refer to the major IGS genotypes as defined by Bunikis et al. (7). Non-color-coded strains are from the Northeast and Midwest UnitedStates. Blue labels indicate European strains, and yellow labels indicate Californian strains. The branch length of the outgroup B. garinii is not according to scale(indicated by slashes). (Scale bar: 1% divergence.)

Margos et al. www.pnas.org/cgi/content/short/0800323105 5 of 22

Page 6: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 6 of 22

Page 7: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 7 of 22

Page 8: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 8 of 22

Page 9: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 9 of 22

Page 10: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 10 of 22

Page 11: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 11 of 22

Page 12: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 12 of 22

Page 13: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Margos et al. www.pnas.org/cgi/content/short/0800323105 13 of 22

Page 14: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S1. B. burgdorferi strains used in this study

Strain Biological source Geographic source Collector Provided by

B31 I. scapularis Shelter Island, NY A. Barbour I.S.297 Human patient Connecticut A. Steere I.S.JD1 I. scapularis Ipswich, MA J.P. I.S.N40 I. scapularis Westchester County, NY S. Barthold I.S.BL206 Human patient Westchester County LDP I.S.B515 Human patient Westchester County LDP I.S.B504 Human patient Westchester County LDP I.S.B509 Human patient Westchester County LDP I.S.MR623 Human patient Westchester County LDP I.S.B373 Human patient Westchester County LDP I.S.B156 Human patient Westchester County LDP I.S.MR661 Human patient Westchester County LDP I.S.MR654 Human patient Westchester County LDP I.S.BL538 Human patient Westchester County LDP I.S.BL515 Human patient Westchester County LDP I.S.BL522 Human patient Westchester County LDP I.S.B356 Human patient Westchester County LDP I.S.MR616 Human patient Westchester County LDP I.S.B331 Human patient Westchester County LDP I.S.B361 Human patient Westchester County LDP I.S.B500 Human patient Westchester County LDP I.S.B485 Human patient Westchester County LDP I.S.MR607 Human patient Westchester County LDP I.S.MR662 Human patient Westchester County LDP I.S.B418 Human patient Westchester County LDP I.S.MR640 Human patient Westchester County LDP I.S.B348 Human patient Westchester County LDP I.S.47703UT* I. scapularis Cass County, MN M. Johnson A.G.G.51405UT I. scapularis Cass County, MN M. Johnson A.G.G.16812UT I. scapularis Cumberland County, ME C. Lubelczyk A.G.G.15903UT I. scapularis Cumberland County C. Lubelczyk A.G.G.15506UT I. scapularis Cumberland County C. Lubelczyk A.G.G.15912UT I. scapularis Cumberland County C. Lubelczyk A.G.G.519014UT I. scapularis Van Buren County, MI E. Dobrowolski A.G.G.48102UT I. scapularis Hubbard County, MN M. Johnson A.G.G.498801UT I. scapularis Suffolk County, NY A.G.G. A.G.G.114311UT I. scapularis Cecil County, MD C. Abadam A.G.G.Ca4 I. pacificus California R.S. Lane J.P.Ca5 I. pacificus California R.S. Lane J.P.Ca6 I. pacificus California R.S. Lane J.P.Ca92–0953 Human patient California R.S. Lane J.P.Ca92–1096 Human patient California R.S. Lane J.P.Ca92–1337 Human patient California R.S. Lane J.P.Ca�WTB27 I. pacificus California J.P. J.P.Ca�WTB32 I. pacificus California J.P. J.P.IPT2 I. ricinus Alsace, France CNRB M.C.IPT19 I. ricinus Alsace CNRB M.C.IPT23 I. ricinus Alsace CNRB M.C.IPT39 I. ricinus Alsace CNRB M.C.IPT58 I. ricinus Alsace CNRB M.C.IPT69 I. ricinus Alsace CNRB M.C.IPT135 I. ricinus Auvergne, France CNRB M.C.IPT137 I. ricinus Alsace, France CNRB M.C.IPT190 I. ricinus Normandy, France CNRB M.C.IPT191 I. ricinus Normandy CNRB M.C.IPT193 I. ricinus Normandy CNRB M.C.IPT198 I. ricinus Normandy CNRB M.C.NE49 I. ricinus Switzerland L. Gern M.C.Z41293 I. ricinus Germany M. Wittenbrink M.C.Z41493 I. ricinus Germany M. Wittenbrink M.C.21509LT* I. ricinus Riga district, Babite, Latvia A.B. M.D.20111LT I. ricinus Babite A.B. M.D.22521LT I. ricinus Babite A.B. M.D.20604LT I. ricinus Jurmala, Kemeri, Latvia A.B. M.D.

LDP, human isolates from patients presenting at the Lyme Disease Practice of the Westchester Medical Center, Valhalla, NY; CNRB, Centre National deRéférence des Borrelia.*Spirochete samples carrying the suffix �LT� (Latvia tick) or �UT� (U.S. tick) were not isolated but were amplified by PCR from DNA extracted from ticks.

Margos et al. www.pnas.org/cgi/content/short/0800323105 14 of 22

Page 15: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S2. Gene products and characteristics of the loci used in this study

Geneno.* Locus

Gene product/description

Fragment lengthanalyzed, bp % G/C

% polymorphicsites

% parsimony-informative sites

dN/dSratio

No. of alleles/major groups/types

BB0369 clpA Clp protease subunit A 579 26.7 5.7 5.2 0.32 20BB0612 clpX Clp protease subunit X 624 32.5 4.0 2.6 0.04 13BB0084 nifS Aminotransferase 564 27.4 5.0 4.6 0.19 14BB0627 pepX Dipeptidyl aminopeptidase 570 29.9 3.3 3.1 0.2 11BB0575 pyrG CTP synthase 603 31.3 2.5 2.3 0.05 10BB0581 recG DNA recombinase 651 31.5 2.9 2.3 0.08 17BB0481 rplB 50S ribosomal protein L2 624 35.3 2.7 1.7 0.06 10BB0837 uvrA Exonuclease ABC, SU A 570 35.7 5.1 4.2 0.04 18BB�B19 ospC Outer surface protein C 481–493 34.0 56.8 54.8 0.7 17

rrs-rrlA Noncoding 16S–23Sintergenic spacer

768–791 33.0 23.9 15.5 NA 12

NA, not applicable.*According to the numbering of the B. burgdorferi B31 sequence (GenBank).

Margos et al. www.pnas.org/cgi/content/short/0800323105 15 of 22

Page 16: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S3. STs and allelic profiles of B. burgdorferi

Strains ST clpA clpX nifS pepX pyrG recG rpl uvr

B31, BL206, B515, 16812UT 1 1 1 1 1 1 1 1 1Ca4, Ca5, Ca6 2 1 1 7 1 1 1 1 11297, B504, 498801UT 3 4 1 1 1 1 6 1 7B509 4 8 1 1 1 4 6 1 7Ca�WTB27 5 11 1 1 9 7 6 4 14Ca92-0953 6 11 1 8 1 1 6 4 12MR623, B373 7 6 1 5 1 1 7 1 8B156 8 5 5 4 5 5 5 1 6MR661, MR654, 15506UT 9 10 5 4 6 1 6 1 6Ca�WTB32 10 5 5 4 5 8 5 6 6JD1, BL538, BL515, 114311UT 11 5 7 5 1 6 1 4 9BL522, B356 12 3 3 2 4 3 4 4 4Ca92-1337 13 3 3 2 4 3 6 4 4MR616, 15912UT 14 9 1 1 7 1 6 1 10B500 15 2 2 4 2 2 2 2 2B331, B361 16 2 2 1 2 2 2 2 2Ca92-1096 17 12 2 9 8 5 8 2 13B485, MR607, MR662 18 7 6 6 1 1 5 5 5B418, N40, MR640, B348, 15903UT 19 4 4 3 3 3 3 3 3IPT2, IPT23, IPT69, IPT191, IPT190, 22521LT,

20604LT20 14 1 11 1 1 1 1 10

IPT19, 21509LT 21 14 1 11 1 1 10 1 10IPT135 22 14 1 11 1 1 12 1 10IPT137 23 16 1 11 1 1 1 1 10IPT39, IPT58 24 15 9 12 8 1 11 8 16IPT193, IPT198 25 13 10 13 10 9 9 7 15NE49 26 13 8 10 10 9 9 7 15Z41293 27 17 10 14 10 10 13 9 17Z41493 28 17 11 10 10 9 14 10 1847703UT 29 18 12 1 11 2 15 1 251405UT 30 19 1 5 1 2 1 1 1048102UT 31 20 4 3 3 3 3 3 7519014UT 32 8 1 1 1 4 16 1 720111LT 33 15 13 12 8 1 17 8 16

Margos et al. www.pnas.org/cgi/content/short/0800323105 16 of 22

Page 17: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S4. New IGS genotypes among European isolates of B. burgdorferi as determined according to the IGS type definitionpublished by Bunikis et al. (7)

IGS type Strain

Alignment position*

62 (16) 77 (31) � 125 (79) 137 (98) 287 (248) 289 (250) 294 (255) 309 (270)

11 Z41293 A C y (6) C T G G G12 Z41493 A C y C T G G A

Strain Z41293: the deletion at position 79 (from the start of the analyzed sequence fragment) is 6 bp instead of 7 bp. It has additional deletions at positions106–117 (from the start of the fragment) (12 bp) and 742–751 (10 bp) as well as insertions of one base at position 308 and two bases at positions 711–712.

Strain Z41493: this strain has an additional insertion at position 308 (from the start of the fragment) and additional deletions at positions 670–671 and779–782.*Positions shown in parentheses are from the start of the sequence fragment analyzed in this study.

Margos et al. www.pnas.org/cgi/content/short/0800323105 17 of 22

Page 18: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S5. Determination of dN/dS (omega) values for individual codons of ospC using the sitewise likelihood-ratio method

Site/codon no. Omega, dN/dS LRT�Stat P value Adjusted P value Result Note

1 0.0000 9.7926 1.7521e-03 1.7872e-01 �� Synonymous2 0.0000 15.2659 9.3389e-05 1.1207e-02 ��� Synonymous3 0.0698 19.5063 1.0027e-05 1.4438e-03 ����

4 0.0000 12.5986 3.8604e-04 4.3623e-02 ��� Synonymous5 0.0000 9.7926 1.7521e-03 1.7872e-01 �� Synonymous6 0.0000 24.6933 6.7217e-07 1.0687e-04 ���� Synonymous7 0.0000 20.6421 5.5364e-06 8.1938e-04 ���� Synonymous8 0.0000 6.8800 8.7163e-03 6.7987e-01 �� Synonymous9 0.0000 17.7942 2.4613e-05 3.4414e-03 ���� Synonymous

10 0.0000 27.5735 1.5124e-07 2.4350e-05 ���� Synonymous11 0.0000 8.0707 4.4987e-03 3.7789e-01 �� Synonymous12 0.3937 5.6416 1.7539e-02 8.8193e-01 �

13 0.0000 31.9621 1.5721e-08 2.5782e-06 ���� Synonymous14 0.0000 22.3942 2.2205e-06 3.4640e-04 ���� Synonymous15 0.0000 16.7150 4.3437e-05 5.8206e-03 ���� Synonymous16 0.0000 20.6421 5.5364e-06 8.1938e-04 ���� Synonymous17 0.0000 20.6421 5.5364e-06 8.1938e-04 ���� Synonymous18 0.0000 16.3012 5.4029e-05 6.9157e-03 ���� Synonymous19 0.0000 24.4776 7.5178e-07 1.1878e-04 ���� Synonymous20 0.3047 8.4894 3.5722e-03 3.2507e-01 ��

21 0.0000 9.5325 2.0186e-03 1.9984e-01 �� Synonymous22 0.0000 18.0726 2.1264e-05 2.9983e-03 ���� Synonymous23 0.1276 8.7547 3.0880e-03 2.8718e-01 ��

24 0.0787 17.2743 3.2353e-05 4.4001e-03 ����

25 0.1590 12.9512 3.1972e-04 3.6448e-02 ���

26 0.7931 0.0562 8.1255e-01 8.8193e-0127 0.0000 13.7747 2.0609e-04 2.4319e-02 ��� Synonymous28 0.0000 15.3241 9.0553e-05 1.0991e-02 ��� Synonymous29 0.0000 20.6421 5.5364e-06 8.1938e-04 ���� Synonymous30 0.0000 29.4308 5.7950e-08 9.3879e-06 ���� Synonymous31 0.2451 9.2913 2.3024e-03 2.2103e-01 ��

32 0.2005 7.7544 5.3580e-03 4.3400e-01 ��

33 0.5188 2.0236 1.5487e-01 8.8193e-0134 0.0000 1.9858 1.5878e-01 8.8193e-01 Synonymous35 0.5408 0.8827 3.4746e-01 8.8193e-0136 0.8951 0.0672 7.9553e-01 8.8193e-0137 0.0000 9.7926 1.7521e-03 1.7872e-01 �� Synonymous38 0.0000 22.3942 2.2205e-06 3.4640e-04 ���� Synonymous39 0.0743 17.7830 2.4758e-05 3.4414e-03 ����

40 0.0626 19.7462 8.8437e-06 1.3000e-03 ����

41 0.7408 0.2825 5.9507e-01 8.8193e-0142 0.1160 9.8137 1.7322e-03 1.7872e-01 ��

43 1.3129 0.2204 6.3875e-01 8.8193e-0144 0.0000 10.0581 1.5168e-03 1.6078e-01 �� Synonymous45 0.0000 16.4444 5.0099e-05 6.5630e-03 ���� Synonymous46 0.0000 16.3012 5.4029e-05 6.9157e-03 ���� Synonymous47 0.0000 17.5094 2.8589e-05 3.9453e-03 ���� Synonymous48 0.5647 1.3603 2.4349e-01 8.8193e-0149 0.3035 4.7578 2.9166e-02 8.8193e-01 �

50 0.0859 15.4218 8.5993e-05 1.0749e-02 ���

51 4.0158 5.9135 1.5025e-02 8.8193e-01 �

52 3.5638 16.9541 3.8294e-05 5.1697e-03 ����

53 0.2966 8.3696 3.8154e-03 3.4339e-01 ��

54 0.4708 1.9818 1.5920e-01 8.8193e-0155 0.9231 0.0221 8.8193e-01 8.8193e-0156 0.0000 8.5713 3.4150e-03 3.1418e-01 �� Synonymous57 1.2840 0.5251 4.6866e-01 8.8193e-0158 8.4343 20.9558 4.6999e-06 7.1909e-04 ����

59 1.6386 1.4835 2.2323e-01 8.8193e-0160 5.4857 19.6334 9.3815e-06 1.3697e-03 ����

61 1.2592 0.3952 5.2960e-01 8.8193e-0162 6.0720 13.3107 2.6390e-04 3.0612e-02 ���

63 0.0000 15.3958 8.7183e-05 1.0753e-02 ��� Synonymous64 0.9069 0.0640 8.0024e-01 8.8193e-01

Margos et al. www.pnas.org/cgi/content/short/0800323105 18 of 22

Page 19: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Site/codon no. Omega, dN/dS LRT�Stat P value Adjusted P value Result Note

65 0.1349 8.3247 3.9109e-03 3.4807e-01 ��

66 0.0000 3.0214 8.2174e-02 8.8193e-01 Synonymous67 0.4057 2.0331 1.5391e-01 8.8193e-0168 0.4106 2.9421 8.6300e-02 8.8193e-0169 0.0000 18.4546 1.7400e-05 2.4708e-03 ���� Synonymous70 0.0000 9.3436 2.2376e-03 2.1705e-01 �� Synonymous71 0.3420 3.3806 6.5968e-02 8.8193e-0172 1.4954 1.4184 2.3366e-01 8.8193e-0173 0.0000 21.3805 3.7658e-06 5.7993e-04 ���� Synonymous74 0.1373 8.1268 4.3616e-03 3.7570e-01 ��

75 4.1697 10.4770 1.2087e-03 1.3054e-01 ��

76 0.5015 1.0042 3.1629e-01 8.8193e-0177 0.0863 15.3183 9.0832e-05 1.0991e-02 ���

78 0.5547 2.3876 1.2230e-01 8.8193e-0179 0.5721 1.4113 2.3484e-01 8.8193e-0180 0.0825 13.5706 2.2976e-04 2.6882e-02 ���

81 1.0000 0.0000 1.0000e�00 1.0000e�0082 1.0000 0.0000 1.0000e�00 1.0000e�0083 1.0000 0.0000 1.0000e�00 1.0000e�0084 0.2102 3.9939 4.5665e-02 8.8193e-01 �

85 1.6565 1.9817 1.5921e-01 8.8193e-0186 1.9868 4.9500 2.6091e-02 8.8193e-01 �

87 0.3967 1.0527 3.0489e-01 8.8193e-0188 0.2020 6.4953 1.0816e-02 8.2201e-01 �

89 1.1084 0.0841 7.7177e-01 8.8193e-0190 0.5397 1.2975 2.5468e-01 8.8193e-0191 0.1167 17.4755 2.9103e-05 3.9872e-03 ����

92 0.8730 0.0964 7.5624e-01 8.8193e-0193 0.0000 8.9425 2.7862e-03 2.6190e-01 �� Synonymous94 0.1285 7.8171 5.1755e-03 4.2439e-01 ��

95 0.1895 12.4295 4.2262e-04 4.7333e-02 ���

96 0.2192 6.6122 1.0128e-02 7.7987e-01 �

97 0.1183 10.0665 1.5099e-03 1.6078e-01 ��

98 1.5551 0.7600 3.8334e-01 8.8193e-0199 2.6614 5.6210 1.7746e-02 8.8193e-01 �

100 0.2487 5.6439 1.7516e-02 8.8193e-01 �

101 0.0000 10.8798 9.7221e-04 1.0792e-01 �� Synonymous102 0.5625 1.3795 2.4019e-01 8.8193e-01103 0.0000 7.1833 7.3586e-03 5.8133e-01 �� Synonymous104 0.4951 1.1805 2.7725e-01 8.8193e-01105 0.3264 7.2208 7.2065e-03 5.7652e-01 ��

106 1.2901 0.3891 5.3278e-01 8.8193e-01107 0.0000 22.1631 2.5045e-06 3.8819e-04 ���� Synonymous108 0.2672 8.0748 4.4885e-03 3.7789e-01 ��

109 1.9872 2.5334 1.1146e-01 8.8193e-01110 0.0000 19.5793 9.6508e-06 1.3994e-03 ���� Synonymous111 0.0000 6.3513 1.1730e-02 8.7973e-01 � Synonymous112 0.8250 0.1730 6.7747e-01 8.8193e-01113 1.7702 2.5709 1.0885e-01 8.8193e-01114 1.8866 3.7326 5.3360e-02 8.8193e-01115 0.5374 1.2662 2.6049e-01 8.8193e-01116 0.3730 3.7342 5.3308e-02 8.8193e-01117 6.4759 30.4373 3.4483e-08 5.6207e-06 ����

118 0.4860 1.2708 2.5962e-01 8.8193e-01119 0.0000 16.6891 4.4034e-05 5.8566e-03 ���� Synonymous120 2.9746 2.8197 9.3113e-02 8.8193e-01121 22.7900 9.6897 1.8530e-03 1.8530e-01 ��

122 2.4667 1.1856 2.7622e-01 8.8193e-01123 0.0000 3.7335 5.3330e-02 8.8193e-01 Synonymous124 1.4814 0.5605 4.5407e-01 8.8193e-01125 1.0761 0.0407 8.4012e-01 8.8193e-01126 0.4382 4.0582 4.3958e-02 8.8193e-01 �

127 0.0000 26.9846 2.0509e-07 3.2814e-05 ���� Synonymous128 0.7686 0.5446 4.6054e-01 8.8193e-01129 0.8235 0.3334 5.6364e-01 8.8193e-01

Margos et al. www.pnas.org/cgi/content/short/0800323105 19 of 22

Page 20: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Site/codon no. Omega, dN/dS LRT�Stat P value Adjusted P value Result Note

130 0.0000 16.5631 4.7058e-05 6.2117e-03 ���� Synonymous131 0.0835 13.2729 2.6927e-04 3.0966e-02 ���

132 1.3794 0.6831 4.0852e-01 8.8193e-01133 0.0000 15.3905 8.7425e-05 1.0753e-02 ��� Synonymous134 0.0000 16.3012 5.4029e-05 6.9157e-03 ���� Synonymous135 0.0000 9.4955 2.0598e-03 2.0186e-01 �� Synonymous136 0.1726 9.2353 2.3740e-03 2.2553e-01 ��

137 0.1455 14.7727 1.2128e-04 1.4432e-02 ���

138 0.4761 2.9707 8.4784e-02 8.8193e-01139 2.5698 5.7344 1.6636e-02 8.8193e-01 �

140 0.0000 0.1887 6.6401e-01 8.8193e-01 Synonymous141 0.0000 0.8284 3.6275e-01 8.8193e-01 Synonymous142 0.4563 2.2442 1.3411e-01 8.8193e-01143 1.1801 0.2511 6.1627e-01 8.8193e-01144 0.0000 10.5902 1.1369e-03 1.2505e-01 �� Synonymous145 0.0000 9.7391 1.8039e-03 1.8219e-01 �� Synonymous146 0.2364 6.2860 1.2169e-02 8.8193e-01 �

147 0.8566 0.1445 7.0389e-01 8.8193e-01148 0.0602 19.4614 1.0265e-05 1.4680e-03 ����

149 0.0000 7.9243 4.8776e-03 4.0484e-01 �� Synonymous150 0.4595 3.5467 5.9664e-02 8.8193e-01151 1.1551 0.0984 7.5373e-01 8.8193e-01152 0.0000 8.2899 3.9865e-03 3.5082e-01 �� Synonymous153 0.6184 1.3020 2.5384e-01 8.8193e-01154 0.8435 0.2023 6.5286e-01 8.8193e-01155 0.7757 0.3191 5.7212e-01 8.8193e-01156 0.2327 10.5735 1.1472e-03 1.2505e-01 ��

157 0.2783 8.1239 4.3686e-03 3.7570e-01 ��

158 2.5457 4.8507 2.7635e-02 8.8193e-01 �

159 0.0000 2.7834 9.5244e-02 8.8193e-01 Synonymous160 2.3048 4.9942 2.5433e-02 8.8193e-01 �

161 0.0000 20.6421 5.5364e-06 8.1938e-04 ���� Synonymous162 0.0000 15.8730 6.7737e-05 8.5348e-03 ���� Synonymous163 0.0000 16.2060 5.6813e-05 7.2153e-03 ���� Synonymous164 0.7158 0.5200 4.7083e-01 8.8193e-01165 0.7474 0.3842 5.3535e-01 8.8193e-01166 1.1044 0.0688 7.9304e-01 8.8193e-01167 1.0000 0.0000 1.0000e�00 1.0000e�00168 1.0000 0.0000 1.0000e�00 1.0000e�00169 2.2127 3.2887 6.9758e-02 8.8193e-01

Margos et al. www.pnas.org/cgi/content/short/0800323105 20 of 22

Page 21: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S6. Housekeeping genes and PCR primers

Gene (number*) Primer 5 to 3 Primer name Product length, bp

nifS (BB0084) SeminestedInner forward Same as outer forwardInner reverse TCACAGCCAATTTTTTTAAC nifR680 629Outer forward ATGGATTTCAAACAAATAAAAAG nifF1Outer reverse GTTGGAGCAAGCATTTTATG nifR719

clpA (BB0369)Inner forward GACAAAGCTTTTGATATTTTAG clpAF1255Inner reverse CAAAAAAAACATCAAATTTTCTATCTC clpAR2104 706Outer forward GATAGATTTCTTCCAGACAAAG clpAF1240Outer reverse TTCATCTATTAAAAGCTTTCCC clpAR2214

rplB (BB0481) SeminestedInner forward CGCTATAAGACGACTTTATC rplF40 720Inner reverse same as outer reverseOuter forward TGGGTATTAAGACTTATAAGC rplF2Outer reverse GCTGTCCCCAAGGAGACA rplR760

pyrG (BB0575)Inner forward GATATGGAAAATATTTTATTTATTG pyrF448Inner reverse AAACCAAGACAAATTCCAAG pyrR1154 687Outer forward GATTGCAAGTTCTGAGAATA pyrF391Outer reverse CAAACATTACGAGCAAATTC pyrR1190

recG (BB0581)Inner forward CTTTAATTGAAGCTGGATATC recF917Inner reverse CAAGTTGCATTTGGACAATC recR1658 741Outer forward CCCTTGTTGCCTTGCTTTC recF890Outer reverse GAAAGTCCAAAACGCTCAG recR1694

clpX (BB0612)Inner forward AATGTGCCATTTGCAATAGC clpXF403Inner reverse TTAAGAAGACCCTCTAAAATAG clpXR1124 721Outer forward GCTGCAGAGATGAATGTGCC clpXF391Outer reverse GATTGATTTCATATAACTCTTTTG clpXR1273

pepX (BB0627)Inner forward TTATTCCAAACCTTGCAATCC pepXF449Inner reverse TGTGCCTGAAGGAACATTTG pepXR1115 666Outer forward ACAGAGACTTAAGCTTAGCAG pepXF362Outer reverse GTTCCAATGTCAATAGTTTC pepXR1172

uvrA (BB0837)Inner forward GCTTAAATTTTTAATTGATGTTGG uvrF1434Inner reverse CCTATTGGTTTTTGATTTATTTG uvrR2111 677Outer forward GAAATTTTAAAGGAAATTAAAAGTAG uvrF1408Outer reverse CAAGGAACAAAAACATCTGG uvrR2318

*According to the numbering of the B. burgdorferi B31 sequence (GenBank).

Margos et al. www.pnas.org/cgi/content/short/0800323105 21 of 22

Page 22: Supporting Information · 23/06/2008  · Gene Selection and Primer Design. Chromosomal housekeeping genes were selected based on the following criteria: that (i) they are single-copy

Table S7. GenBank accession numbers for ospC and rrs-rrlA IGS

Strain ospC accession no. Reference rrs-rrlA accession no. Reference

B31 NC�001903 4 AY275189 7297 AY275214 7 AY275192 7JD1 DQ437462.1 9 DQ437438 9N40 AY275221 7 AY275211 7BL206 AF467867 10 AF467855 10B515 AF467872 10 AF467860 10B504 EU377780 This study EU377796 This studyB509 EU377781 This study EU377797 This studyMR623 EU377778 This study EU377794 This studyB373 EU377779 This study EU377795 This studyB156 EU377776 This study EU377792 This studyMR661 EU377775 This study EU377791 This studyMR654 EU377777 This study EU377793 This studyBL538 EU377773 This study EU377789 This studyBL515 EU377774 This study EU377790 This studyBL522 EU377772 This study EU377788 This studyB356 AF467873 10 AF467861 10MR616 EU377771 This study EU377787 This studyB331 AF467874 10 AF467862 10B361 EU377770 This study EU377786 This studyB500 AF467878 10 AF467866 10B485 EU377769 This study EU377785 This studyMR607 EU377768 This study EU377784 This studyMR662 EU377767 This study EU377783 This studyB418 AF467877 10 AF467877 10MR640 EU377766 This study EU377782 This studyB348 AF467875 10 AF467863 1047703UT EU375832 This study EU375821 This study51405UT EU375825 This study EU375815 This study16812UT EU375826 This study EU375820 This study15903UT EU375828 This study EU375819 This study15506UT EU375833 This study EU375818 This study15912UT EU375827 This study EU375816 This study519014UT EU375831 This study EU375823 This study48102UT EU375829 This study EU375822 This study498801UT EU375830 This study EU375814 This study114311UT EU375824 This study EU375817 This studyCa4 EU377746 This study EU377801 This studyCa5 EU377747 This study EU377802 This studyCa6 EU377748 This study EU377803 This studyCa92-0953 EU377751 This study EU377806 This studyCa92-1096 EU377752 This study EU377807 This studyCa92-1337 EU377753 This study EU377808 This studyCa�WTB27 EU377749 This study EU377804 This studyCa�WTB32 EU377750 This study EU377805 This studyIPT2 EU377754 This study EU377809 This studyIPT19 EU377755 This study EU377810 This studyIPT23 EU377756* This study EU377811 This studyIPT39 EU377757 This study EU377812 This studyIPT58 EU377758 This study EU377813 This studyIPT69 EU377759 This study EU377814 This studyIPT135 EU377760 This study EU377815 This studyIPT137 EU377761 This study EU377816 This studyIPT190 EU377762* This study EU377817 This studyIPT191 EU377763 This study EU377818 This studyIPT193 EU377764* This study EU377819 This studyIPT198 EU377765* This study EU377820 This studyNE49 EU377743 This study EU377798 This studyZ41293 EU377744* This study EU377799 This studyZ41493 EU377745* This study EU377800 This study21509LT ND ND20111LT ND EU377821 This study22521LT ND ND20604LT ND EU377822 This study

*Alleles identical to these novel European ospC groups were found as unpublished GenBank submissions in 2007 by B. Luft and colleagues, presumably derivedfrom North American B. burgdorferi samples (GenBank accession nos.: EF537408/EF537407, EF537427, and EF537414 corresponding to EU377756/EU377762,EU377764/EU377765/EU377745, and EU377744, respectively).

Margos et al. www.pnas.org/cgi/content/short/0800323105 22 of 22


Recommended