+ All Categories
Home > Documents > Are adaptive loci transferable across genomes of related species? Outlier and environmental...

Are adaptive loci transferable across genomes of related species? Outlier and environmental...

Date post: 05-Dec-2016
Category:
Upload: felix
View: 213 times
Download: 1 times
Share this document with a friend
14
Are adaptive loci transferable across genomes of related species? Outlier and environmental association analyses in Alpine Brassicaceae species DEBORAH ZULLIGER, ELVIRA SCHNYDER and FELIX GUGERLI WSL Swiss Federal Research Institute, Zurcherstrasse 111, CH-8903 Birmensdorf, Switzerland Abstract Local adaptation is one possible response of organisms to survive in a changing environ- ment. However, the genetic basis of adaptation is not well understood, especially in non- model species. To infer recurrent patterns of local adaptation, we investigated whether the same putative adaptive loci reoccur in related species. We performed genome scans using amplified fragment length polymorphism (AFLP) markers on populations of five Alpine Brassicaceae species sampled across a wide range of environmental conditions. To identify markers potentially under directional selection, we performed outlier and environmental association analyses using a set of topo-climatic variables available as GIS layers. Several AFLP loci showed signatures of adaptation, of which one, found in Cardamine resedifolia (Cre_P1_212.5), was associated with precipitation. We sequence- characterized this candidate locus and genotyped single nucleotide polymorphisms (SNPs) found within this locus for all species. Testing for environmental associations of SNPs revealed the same association of this locus in Arabis alpina but not in other study species. Cumulative statistical evidence indicates that locus Cre_P1_212.5 is environmen- tally relevant or is linked to a gene under selection in our study range. Furthermore, the locus shows an association to the same potentially selective factor in at least one other related species. These findings help to identify trends in plant adaptation in Alpine ecosystems in response to particular environmental parameters. Keywords: amplified fragment length polymorphisms, Brassicaceae, environmental association genetics, local adaptation, single nucleotide polymorphisms Received 23 April 2012; revision received 22 November 2012; accepted 22 November 2012 Introduction Comprehending the genetic basis of local adaptation and identifying the underlying genomic regions is a fundamental challenge facing molecular ecologists (Reusch & Wood 2007). As the availability of molecular markers and methods to infer selectively important genetic polymorphism is advancing, the field of evolutionary biology is increasingly detecting genetic responses to environmental changes also in nonmodel species (reviewed by Feder & Mitchell-Olds 2003; Vasemagi & Primmer 2005; Rako et al. 2007; Reusch & Wood 2007; Hoffmann & Willi 2008; Kawecki 2008; Mitchell-Olds et al. 2008; Lavergne et al. 2010). Nonethe- less, a comprehensive theory on the mechanisms influ- encing the evolution of species ranges and ecological niches is still far from established (Kawecki 2008), par- ticularly in natural populations of nonmodel species. Plant species in alpine ecosystems are considered particularly vulnerable to climate change, as small shifts in key factors, such as duration of snow cover or tem- perature, can influence the flowering phenology, repro- duction and growth of these species within their environmentally heterogeneous habitat (Arft et al. 1999; Kudo & Hirao 2006). To model vegetation dynamics under future climate conditions, it is therefore necessary to identify which environmental factors are driving local adaptation. Several studies have been carried out with the objective to identify environmental factors or habitat types and their association with loci under directional selection or particular genotypes in alpine Correspondence: Felix Gugerli, Fax: +41 44 739 2215; E-mail: [email protected] © 2013 Blackwell Publishing Ltd Molecular Ecology (2013) 22, 1626–1639 doi: 10.1111/mec.12199
Transcript

Are adaptive loci transferable across genomes of relatedspecies? Outlier and environmental association analysesin Alpine Brassicaceae species

DEBORAH ZULLIGER, ELVIRA SCHNYDER and FELIX GUGERLI

WSL Swiss Federal Research Institute, Z€urcherstrasse 111, CH-8903 Birmensdorf, Switzerland

Abstract

Local adaptation is one possible response of organisms to survive in a changing environ-

ment. However, the genetic basis of adaptation is not well understood, especially in non-

model species. To infer recurrent patterns of local adaptation, we investigated whether

the same putative adaptive loci reoccur in related species. We performed genome scans

using amplified fragment length polymorphism (AFLP) markers on populations of five

Alpine Brassicaceae species sampled across a wide range of environmental conditions.

To identify markers potentially under directional selection, we performed outlier and

environmental association analyses using a set of topo-climatic variables available as

GIS layers. Several AFLP loci showed signatures of adaptation, of which one, found in

Cardamine resedifolia (Cre_P1_212.5), was associated with precipitation. We sequence-

characterized this candidate locus and genotyped single nucleotide polymorphisms

(SNPs) found within this locus for all species. Testing for environmental associations of

SNPs revealed the same association of this locus in Arabis alpina but not in other study

species. Cumulative statistical evidence indicates that locus Cre_P1_212.5 is environmen-

tally relevant or is linked to a gene under selection in our study range. Furthermore, the

locus shows an association to the same potentially selective factor in at least one other

related species. These findings help to identify trends in plant adaptation in Alpine

ecosystems in response to particular environmental parameters.

Keywords: amplified fragment length polymorphisms, Brassicaceae, environmental association

genetics, local adaptation, single nucleotide polymorphisms

Received 23 April 2012; revision received 22 November 2012; accepted 22 November 2012

Introduction

Comprehending the genetic basis of local adaptation

and identifying the underlying genomic regions is a

fundamental challenge facing molecular ecologists

(Reusch & Wood 2007). As the availability of molecular

markers and methods to infer selectively important

genetic polymorphism is advancing, the field of

evolutionary biology is increasingly detecting genetic

responses to environmental changes also in nonmodel

species (reviewed by Feder & Mitchell-Olds 2003;

Vasem€agi & Primmer 2005; Rako et al. 2007; Reusch &

Wood 2007; Hoffmann & Willi 2008; Kawecki 2008;

Mitchell-Olds et al. 2008; Lavergne et al. 2010). Nonethe-

less, a comprehensive theory on the mechanisms influ-

encing the evolution of species ranges and ecological

niches is still far from established (Kawecki 2008), par-

ticularly in natural populations of nonmodel species.

Plant species in alpine ecosystems are considered

particularly vulnerable to climate change, as small shifts

in key factors, such as duration of snow cover or tem-

perature, can influence the flowering phenology, repro-

duction and growth of these species within their

environmentally heterogeneous habitat (Arft et al. 1999;

Kudo & Hirao 2006). To model vegetation dynamics

under future climate conditions, it is therefore necessary

to identify which environmental factors are driving

local adaptation. Several studies have been carried out

with the objective to identify environmental factors or

habitat types and their association with loci under

directional selection or particular genotypes in alpineCorrespondence: Felix Gugerli, Fax: +41 44 739 2215;

E-mail: [email protected]

© 2013 Blackwell Publishing Ltd

Molecular Ecology (2013) 22, 1626–1639 doi: 10.1111/mec.12199

plant species (Byars et al. 2007; Parisod & Christin 2008;

Manel et al. 2010, 2012; Poncet et al. 2010). However,

few studies have gone as far as to characterize these

loci (Buehler et al. 2013) and even more so to test them

on related species.

For nonmodel species, it is therefore not known to

what extent adaptive loci are transferable in the sense

that signals of selection induced by particular environ-

mental factors are the same among related species.

Here, we therefore addressed the following question:

Are the same genomic regions responsible for the same

adaptations in different species? For this, we tested the

transferability of selective loci among five Alpine plant

species of the family Brassicaceae, which are related to

the fully sequenced model species Arabidopsis thaliana

(The Arabidopsis Genome Initiative 2000). Choosing

related species should facilitate the comparison of iden-

tified regions showing signatures of directional selection

among study species as well as with the whole-genome

sequence information of A. thaliana and should allow

the characterization of their genomic environment.

To avoid biases towards known genomic regions, as is

the case when testing candidate genes already identified

in model species, we applied the genome scan approach

(Storz 2005; Vasem€agi & Primmer 2005) using amplified

fragment length polymorphisms (AFLPs; Vos et al.

1995), a technique that genotypes dozens of markers for

each individual and is suitable for nonmodel species.

Two methods were then used to identify markers of

adaptive relevance: First, we used a population genomic

approach, which follows the principle that genetic

regions under selection, the so-called outlier loci, show

a higher genetic differentiation (FST) among populations

than neutral loci do (Beaumont & Nichols 1996; Vitalis

et al. 2001; Storz 2005). Second, we applied a landscape

genetic approach, which is based on finding associa-

tions between the occurrence or frequency of AFLP

markers and environmental gradients (Manel et al.

2012). On the basis of our results, we then characterized

one outlier locus associated with environmental vari-

ables and genotyped single nucleotide polymorphisms

(SNPs) in this locus in all five study species. Finally, we

performed an association analysis to test the relevance

of this locus in response to environmental variation

across related species.

Materials and methods

Study species and sampling

We sampled five diploid Alpine Brassicaceae species

(Arabis alpina, Arabis jacquinii, Cardamine resedifolia,

Draba aizoides and Thlaspi rotundifolium). A. alpina is a

common perennial arctic-alpine rosette herb (Bovet et al.

2006; Ehrich et al. 2007). It is widely distributed in the

European Alps and also has a large altitudinal distribu-

tion ranging from the montane region up to 3200 m

a.s.l. Arabis alpina grows on calcareous substrates in

open, moist and rocky habitats but also on nutrient-rich,

alkaline soils. Reproduction occurs sexually through

seeds or asexually via stoloniferous growth (Ansell et al.

2008). Arabis jacquinii is a less common Middle and

South European Alpine herb. It grows on creek banks

and spring meadows from 1500 to 2600 m a.s.l. and

prefers wet, calcareous soils (Landolt 1992). It repro-

duces mainly through seed, while daughter rosettes

may disintegrate and reroot. Cardamine resedifolia is a

perennial herb and is common in the Middle and South

European Alps. Its altitudinal distribution ranges from

1500 to 3200 m a.s.l. Cardamine resedifolia occurs on

rocky rubble, in rock crevices and open meadows and

grows on lime-deficient soils and silicicolous bedrock,

where the species disperses by seed, including long-dis-

tance dispersal, but also by adventitious roots (Markg-

raf 1958). The species might be predominantly selfing

but outcrossing has not been ruled out (Lihova et al.

2009). Draba aizoides is a common Middle and South

European Alpine herb and occurs on scree, ridge and

open meadows from 1500 to 3000 m a.s.l. where it

grows on rocky, calcareous substrates (Landolt 1992).

Seed dispersal is presumably the main mode of repro-

duction (Markgraf 1958), but daughter rosettes poten-

tially separate and root after being translocated.

Thlaspi rotundifolium is a less common Alpine herb

growing between 1500 and 3000 m a.s.l. on piles of rub-

ble and scree. It exclusively occurs on calcareous sub-

strates (Landolt 1992) and shows high potential for

vegetative spread.

The five species were chosen in view of their wide

ecological amplitude and range/abundance. For molec-

ular genetic analysis, a total of 10–13 populations of

each species were sampled from July to September of

2009 in the Swiss Alps and the Jura Mountains

(Table S1 and Fig. S1, Supporting information). Twenty

individuals per population were collected allowing for

frequency-based outlier analyses. Sampling sites were

chosen to represent a wide range of variety in

environmental variables, such as temperature, evapo-

transpiration and precipitation.

DNA extraction and AFLPs

DNA was extracted from 10-mg leaf tissue, dried on

silica gel immediately after sampling, using a Qiagen

DNeasy Plant Mini Kit following the manufacturer’s

instructions (Qiagen). Within each species, samples

were arranged randomly on PCR plates including three

replicates within and three replicates among plates.

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1627

These two types of duplicates served for rigorously

testing the reliability of the genotyping procedure

during AFLP marker selection. Moreover, we added

10–20% blind samples to calculate mismatch error rates

in the set of AFLP markers retained for analysis.

We produced AFLP data (Vos et al. 1995) using the

standard protocol described in Gugerli et al. (2008) in

all five species. First, 5 lL of approximately 18 ng/lLof genomic DNA was digested at 37 °C for 2 h using 2

U MseI (New England Biolabs) and 5 U PstI restriction

enzyme (New England Biolabs) in a 20-lL reaction mix.

MseI and PstI adaptors were then ligated to the restric-

tion fragments, and preselective and selective PCRs

were performed following the standard protocol in

Gugerli et al. (2008). Preselective PCRs were run on a

PCT-100 thermo cycler (MJ Research) using MseI-C and

PstI-A primers. Selective PCRs were run on a Veriti

thermo cycler (Applied Biosystems) using three primer

pair combinations: P1—MseI-CA and PstI-AAG; P3—

MseI-CT and PstI-AAC; and P9—MseI-CA and

PstI-AAC as in Herrmann et al. (2010). Using the same

primer/enzyme combinations in all study species, we

aimed at increasing the chance to obtain homologous

markers to facilitate among-species comparisons. Prod-

ucts of selective PCRs were analysed on an ABI 3730

automated sequencer (Applied Biosystems). Markers

were scored using the software GENEMAPPER 3.7 (Applied

Biosystems), and automatically set bins were adjusted

manually. We ran the script scanAFLP v. 1.0 (Herrmann

et al. 2010) in R (R Development Core Team 2010) to

evaluate markers for their reproducibility. To keep a

marker in the data set, the number of errors among

duplicated samples was set to 1, among replicated sam-

ples to 0 and among all negative controls to 1. Finally,

we only considered markers with at least three poly-

morphisms per species. Matrices of the three primer/

enzyme combinations were joined to generate one

matrix per species. Mismatch error rates were calcu-

lated as suggested by Bonin et al. (2004) using replicate

samples after discarding markers with high error rates

from the matrix.

Outlier analysis

We tested for the presence of significant association

between pairs of loci based on an exact test for linkage

disequilibrium (LD) using the program ARLEQUIN v. 3.5.

(Excoffier et al. 2005). For each species, we calculated

the percentage of loci with possible LD.

We used the program BAYESCAN v. 2.1. (Foll &

Gaggiotti 2008) to find markers subject to directional

selection in each species. The method implemented in

this program has been adapted to dominant loci, such

as AFLP markers, and identifies outlier loci as loci that

show significantly higher (or lower) FST coefficients

than can be expected under the neutral theory. Such

outlier loci are assumed to be under directional (or bal-

ancing) selection. We defined the significance threshold

as ‘decisive’ according to Jeffrey’s (1961) interpretation

(P(selected) = 99%, log10(Bayesfactor) � 2).

Environmental association analysis

To identify loci correlated with environmental variables,

we used an approach similar to the one described by

Manel et al. (2012). We extracted environmental vari-

ables by intersecting the sampling locations with GIS

data of 19 topo-climatic variables collected in 1961–1990

(25-m resolution, Zimmermann & Kienast 1999;

Table S2, Supporting information). We performed a

principal component analysis (PCA) in R (R Develop-

ment Core Team 2010) to identify those variables that

explain the most variation and to remove redundancies

among correlated variables. Groups of variables were

formed based on a correlation of |r| � 0.8 correspond-

ing to the method of Manel et al. (2012), and within the

first few principal component axes, which explained

most of the variation, we kept at the most two variables

for each axis, choosing only the main contributor of no

more than two groups of variables.

For outlier loci identified by BAYESCAN, we calculated

population-wise frequencies of AFLP band presences

within each species. Multiple linear regressions were

then performed, using the statistical software package

SPSS v. 17.0 (SPSS, Inc., Chicago IL, USA), between the

band frequencies as dependent variables and the

selected environmental variables as independent vari-

ables (see Table S1 for values of environmental vari-

ables, Supporting information). Multiple linear

regressions were also performed using second-degree

polynomials of all variables, except for topography

(TOPO) and topographic wetness index (TWI), which

are ‘index’ values, to account for nonlinear relationships

between AFLP band presence and environmental vari-

ables (Legendre & Legendre 1998). Loci with an

R2adj > 0.5 were considered associated with at least one

environmental variable (Manel et al. 2012), with the pre-

mise that the adjusted coefficient of determination gives

unbiased estimates of how much of the total variation

is accounted for by a response variable in a linear

model (Ohtani 2000). Simple linear regressions were

then performed for each variable separately to assess

their significance.

We refrained from correcting for multiple testing under

the premise that we wanted to identify loci that primarily

show high explanatory power by respective environmen-

tal factors. While taking the risk of identifying false posi-

tives, our analysis in turn included several steps aimed

© 2013 Blackwell Publishing Ltd

1628 D. ZULLIGER, E . SCHNYDER and F . GUGERLI

at reducing the rate of false positives, such as only

considering outliers that show an R2adj > 0.5 for the

multiple linear regression with all environmental

variables (both linear and polynomial) plus have at

least one environmental variable (both linear and poly-

nomial) with a significant association. In order not to be

too conservative and risk type II errors, we chose this

approach over correcting for false positives using, for

example, Bonferroni or false discovery rates.

Spatial variation

Ecological data are often to some degree spatially auto-

correlated, as the value at one site may be influenced

by the values at neighbouring sites. Autocorrelation can

therefore violate the assumption of independence and

introduce a bias in standard statistical inference meth-

ods (Legendre 1993; Dray et al. 2006). On the other

hand, spatial autocorrelation can also be used as a

proxy for important environmental clines that were not

included in the set of environmental variables used in a

study (Manel et al. 2010, 2012). To assess the degree of

spatial autocorrelation within our data, we chose an

approach based on Moran’s eigenvector maps (MEMs;

Dray et al. 2006). This method uses Moran’s I statistic to

identify the principal coordinate axes that reflect

significant spatial autocorrelation and allows to model

complex spatial patterns at different spatial scales (Dray

et al. 2006). MEMs were generated for each species

using the R package spacemakeR. Multiple linear regres-

sions were then performed in SPSS between AFLP band

frequencies of outlier loci as dependent variables and

MEMs as independent variables.

To assess the effects of population structure, we

applied the program INSTRUCT (Gao et al. 2007), which

infers population structure while considering inbreed-

ing by estimating selfing rates within populations, with

K ranging from 1 to n + 1, with n = number of popula-

tions for a given species, for 200 000 generations

(burn-in = 50 000).

Sequencing of outlier locus Cre_P1_212.5

Of the outlier loci consistently detected across all analyses

(see Results section), we selected one locus, Cre_P1_212.5,

for sequence-characterizing. To do this, we produced

AFLP genotypes using the procedures described above

for eight individuals: four samples showing the candidate

AFLP band and four lacking the band. Selective PCR was

performed using the primer combination P1—MseI-CA

and PstI-AAG. Locus Cre_P1_212.5 was then isolated

using the procedures of Roden et al. (2009). To electropho-

rese the AFLP bands, 20 lL of the selective PCR products

was loaded on Spreadex� EL 400 mini gels (Elchrom Sci-

entific) in 30 mM Tris-acetate EDTA (TAE) buffer, using

M3Marker (Elchrom Scientific) as size standard. The elec-

trophoresis chamber and gel were preheated at 55 °C, andsamples were run at 120 V for 180 min. We stained the gel

for 30 min using SYBR GOLD (Clare Chemical Research)

and viewed the bands in an Epi Chemi II Darkroom (UVP

Laboratory Products). Each amplified Cre_P1_212.5 band

was picked twice using a cylinder of 1 mm diameter. As a

control, one larger and one shorter band were also picked.

Gel cylinders were then eluted in 25 lL deionized water

at 4 °C for 24 h. We amplified the excised DNA bands in a

PCR reaction using primers designed to anneal to the

adaptor sequences (MseI + 0 and PstI + 0) as described in

Roden et al. (2009). As the PCR product still included

other bands besides Cre_P1_212.5, we purified the band

by repeating electrophoresis, band picking and cylinder

elution. The re-excised bands were then re-amplified

using the preselective PstI-A primer and the selective

MseI-CA primer. To purify the re-amplified bands, we

added 2 lL of ExoSAP IT (GE Healthcare) to 5 lL of

each PCR product and incubated the samples at 37 °Cfor 15 min and at 80 °C for 15 min. Bands were

sequenced both forward and reverse using BigDye�

Terminator Cycle Sequencing Kit v3.1 (Applied Biosys-

tems) on an ABI 3130 Genetic Analyzer (Applied

Biosystems). We aligned the sequences using the

software DNADynamo (BlueTraktorSoftware) and

BLAST-searched (Altschul et al. 1990) the obtained

consensus sequence against the nucleotide database of

GenBank for Arabidopsis thaliana.

Sequencing of flanking regions of Cre_P1_212.5 anddetecting SNPs for SNaPshots

To characterize the distinct mutation(s) underlying the

AFLP band polymorphism of locus Cre_P1_212.5, that

is, to find the mutation(s) in the restriction sites and/

or in the selective bases, we aimed at sequencing the

flanking regions of the excised band. As there was

only one putative homologue sequence detected in

the BLAST search, we searched for conserved regions

upstream and downstream of locus Cre_P1_212.5 and

designed three forward primers and three reverse

primers (Table 1) using PRIMER3 (Rozen & Skaletsky

2000). We applied standard PCR conditions for nine

primer pair combinations (Table 2) to test for the

combination(s) that amplified the region best in seven

individuals of C. resedifolia and four individuals each

of A. alpina, A. jacquinii, D. aizoides and T. rotundifolium.

Using primer combination P9 for A. alpina, A. jacquinii

and C. resedifolia and combination P7 for D. aizoides

and T. rotundifolium (Table 2), we sequenced the amp-

lified fragments and aligned the sequences as descri-

bed above. Within each species, the sequences were

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1629

compared to find SNPs for genotyping. The coding

regions of the obtained consensus sequences of each

species were translated into protein sequences and

BLASTed against the protein database of GenBank. For

each species, we checked which SNPs were synony-

mous mutations and which ones induced a change in

the protein sequence, and we compared the variation

in protein sequences within and among species.

SNaPshots

A SNP was detected in the MseI restriction site in

C. resedifolia (see Results section) presumably leading

to the observed AFLP band presence/absence pattern.

To confirm whether this polymorphism corresponded

to the observed AFLP band presence/absence pattern

in C. resedifolia, we genotyped this SNP plus two

additional SNPs identified in the seven sequenced

C. resedifolia individuals. We designed primers follow-

ing the recommendations in the SNaPshot protocol

(Applied Biosystems; Table 3) and used the software

OLIGO EXPLORER v.1.5 (Gene Linke 2010) to check for

primer dimers and secondary primer structures.

SNaPshot reactions were performed following the pro-

tocol for the ABI Prism SNaPshot� Multiplex Kit

(Applied Biosystems) but using a reduced reaction

volume (6 lL) as described in Buehler et al. (2013).

Electrophoresis of SNaPshot fragments was conducted

on an ABI 3130 Genetic Analyzer (Applied Biosys-

tems), and the results were viewed on GENEMAPPER 3.7

(Applied Biosystems).

Further, we wanted to test whether locus Cre_P1_212.5

is of adaptive relevance or is linked to a genomic region

under selection also in other species related to C. resedifo-

lia. Thus, we genotyped nucleotide variation in the

remaining study species, that is, three SNPs in D. aizoides

and four in T. rotundifolium. Only one SNP was found

and genotyped in A. alpina in spite of sequencing an

additional 34 individuals, and no SNPs were detected in

A. jacquinii after sequencing another ten individuals.

SNaPshot genotyping was performed as described above

(see Table 3 for SNaPshot primers).

Table 1 Primers tested for sequencing of the fragment Cre_P1_212.5

andflanking regions in fiveAlpine Brassicaceae species

Primer Sequence 5′-3′

Cre212_1a_R (F) GCA TGT CGA AAT GAC ATC CA

Cre212_1b_R (F) GCC AAA AGG CAA GAA TGG T

Cre212_2_R (F) GTC GAA ATG ACA TCC ACA GG

Cre212_1c_R (R) TGA CTT TGC CCA ATG TTT CA

Cre212_2b_R (R) AGC CAA CTG CCT TAA ACG AA

Cre212_3_R (R) TCC CAG TAA CCG GAT CCA TA

Table 2 Primer combinations tested for sequencing of the

fragment Cre_P1_212.5 with flanking regions in five Alpine

Brassicaceae species

Primer

combination

Forward

primer

Reverse

primer

Anticipated

fragment

length (bp)

P1 Cre212_1a_R Cre212_1c_R 587

P2 Cre212_1a_R Cre212_2b_R 560

P3 Cre212_1a_R Cre212_3_R 463

P4 Cre212_1b_R Cre212_1c_R 620

P5 Cre212_1b_R Cre212_2b_R 593

P6 Cre212_1b_R Cre212_3_R 495

P7 Cre212_2_R Cre212_1c_R 584

P8 Cre212_2_R Cre212_2b_R 557

P9 Cre212_2_R Cre212_3_R 459

Primer combinations in bold were used for further fragment

amplifications.

Table 3 SNaPshot primers used for single nucleotide polymorphism (SNP) genotyping of sequence-characterized markers identified

from a genome scan using amplified fragment length polymorphisms (AFLPs) in five Alpine Brassicaceae species

Species SNaPshot primer Sequence 5′-3′

Cardamine resedifolia Cre_SNP40 (GACT)2* TGC TCC TTT CAT GGC AGT GTA CAA

Cre_SNP239 (GACT)4 GAT ATC AAC AAA TCT GAG ATG TTG

Cre_SNP253_R (GACT)6 TTG AAG ATT ATC TGA TTC CAA TAT

Draba aizoides Dai_SNP353 (GACT)2 TCG TTA GCT ATT TGA ATC TCA TGC

Dai_SNP433 (GACT)4 TAA AAC TGA TGG ATC CGATTA CCG

Dai_SNP434_R (GACT)6 GTT GAT GCA GAT ATC ACA AGA AAT

Thlaspi rotundifolium Tro_SNP106 (GACT)2 ACT AAT GGC GCA GCT AGA GAA TCT

Tro_SNP222_R (GACT)4 AGC TTT AAT AAC ATG TCA GCT TTG

Tro_SNP307 (GACT)6 GCA GGT ACA ATA AAT GCT TTC TTT

Tro_SNP342 (GACT)8 ATG TTG ATT AGG TAC TTA CTA CTT

Arabis alpina Aal_SNP208 (GACT)2 CAT TGA TGT TTG CCT TGC AAT TGA

*(GACT)n was used to separate fragment lengths in multiplex PCRs.

© 2013 Blackwell Publishing Ltd

1630 D. ZULLIGER, E . SCHNYDER and F . GUGERLI

Population-wise SNP allele frequencies were calcu-

lated for each species, and a multiple linear regression

analysis was performed in SPSS between the allele

frequencies (Table S4, Supporting information) and the

same seven selected environmental variables as previ-

ously used with the AFLP markers. Allele frequencies

were further utilized to perform multiple linear regres-

sions with MEMs as well as second-degree polynomials

of the environmental variables.

Results

AFLP data and outlier analyses

We obtained AFLP markers for 197 (C. resedifolia) to 260

(A. alpina) individuals per species (Table 4). AFLP geno-

typing, scoring and screening for reproducible markers

with at least three polymorphisms resulted in a total of

405 reliable AFLP loci (53–124 loci per species; Table 4).

Mismatch error rates ranged from 2.4% to 4.0% per spe-

cies.

Of the pairwise tests for LD between loci, the average

percentage of loci showing significant linkage over all

populations within species ranged from 7.3% in T. rot-

undifolium to 14.8% in C. resedifolia. In the other species,

percentages of loci in LD were 11.9% in A. alpina,

14.2% in A. jacquinii and 10.1% in D. aizoides. Locus

Cre_P1_212.5, which was chosen for sequence-charac-

terizing, showed 23.8% LD in the pairwise comparisons

between loci in C. resedifolia.

Using BAYESCAN, 43 loci (10.6%) were identified as

outliers over all species and were distributed among all

species with the most in A. alpina (14) and the least in

D. aizoides (3).

PCA and association analyses

The PCA of 19 environmental variables exhibited 73%

of the variation on the first two axes. Including the

third axis, 96% of the variation was covered, and with a

fourth axis, this value increased only by 2%. The vari-

ables TWI (abbreviations given only for those variables

selected for further analysis), site water balance, slope

(SLP25) and annual number of frost days during grow-

ing season (SFROYY) showed below-threshold correla-

tions (|r| < 0.8), whereas the remaining variables were

correlated within the following groups: (i) aspect value

and annual radiation (SRADYY); (ii) degree days above

0 °C, degree days above 3 °C, degree days above 5.56 °C,mean average annual temperature, mean average sum-

mer temperature (TAVESU), and mean average winter

temperature; (iii) annual precipitation (PRECYY), pre-

cipitation in summer, precipitation days in summer,

average yearly moisture index and summer average

moisture index; (iv) topographical variables (TOPO)

and smoothed topographic variables (for abbreviations

see Table S2, Supporting information). We chose one to

two variables of the first four PC axes, and in the case

of correlated variables, only the representative of each

group with the highest variability. This left the follow-

ing seven variables for further calculations: PRECYY,

SFROYY, SLP25, SRADYY, TAVESU, TOPO and TWI.

Multiple linear regressions between outlier loci as

dependent and the seven environmental variables as

independent variables resulted in six loci with an

R2adj > 0.5, whereas using second-degree polynomials of

the environmental variables resulted in nine loci with

R2adj > 0.5 (Table 4). When performing linear regressions

using each environmental variable separately, 11 loci

were significantly (P < 0.05) associated with an environ-

mental variable and 10 when using second-degree poly-

nomials (Table 4). Loci were predominantly associated

with PRECYY, TOPO and TWI. In summary, four loci

(Aja_P1_433.7, Cre_P1_212.5, Dai_P1_121.3 and

Dai_P1_191.1) were detected as outliers by BAYESCAN,

showed an R2adj > 0.5 in the multiple linear regressions

with environmental variables (linear and second-degree

polynomials) and were significantly associated with at

least one environmental variable (linear and second-

degree polynomials) in the simple linear regressions.

Spatial variation

Multiple linear regressions between outlier loci and

MEMs resulted in an R2adj > 0.5 for 11 loci (Table 4),

indicating that these loci show some signs of spatial

autocorrelation or that there are other environmental

variables that are relevant for these loci but were not

accounted for in the association analysis.

Deviance information criteria (DICs) obtained from

the program INSTRUCT for each K were lowest for high

Ks close or equal to the number of sampled populations

(Fig. 1). Posterior distributions of selfing rates for

inferred clusters were high in all species ranging from

0.65 to 0.96 and therefore justified the use of INSTRUCT.

The results of these analyses suggest that population

structure can be considered as well represented by the

sampled locations.

Sequencing and characterization of outlier locus

Of the four loci most probably being of ecological rel-

evance, we chose locus Cre_P1_212.5 for exemplary

sequence-characterizing. Due to its high band peaks

in the electropherograms and the lack of other peaks

with similar fragment sizes, this marker appeared

suitable to be excised from the gel from a technical

point of view.

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1631

After amplifying and sequencing the excised

Cre_P1_212.5 band, we obtained a 214-bp fragment

including the two primers, that is, fragment length was

in the expected size range. The BLAST search showed

that this sequence was 96% congruent with a DNA

sequence assigned to the nucleotidyltransferase family

protein in A. thaliana located on chromosome 2 (Gen-

Bank accession no. NC_003071.7). After sequencing the

Table 4 Results of outlier and environmental association analyses of amplified fragment length polymorphism (AFLP) loci in five

Alpine Brassicaceae species

Species n AFLP loci

n of

outliers

(BAYESCAN) Outlier ID MEMs (R2adj)

Environmental

variables (R2adj)

Significant

environmental

variables*

Arabis alpina 260 88 14 Aal_P1_91 �0.067 0.466 TOPO

Aal_P1_248 �0.696 0.460 —

Aal_P1_303.4 �0.464 0.552 TAVESU, SFRO

Aal_P1_326.5 �0.458 0.250 —

Aal_P1_331.6 0.362 0.272† —Aal_P1_383 �0.038 0.156 —

Aal_P1_433.7 �0.267 0.727† TWI

Aal_P3_322 �0.027 0.316 —

Aal_P3_325.5 0.070 �0.354 —

Aal_P9_179.4 0.241 �0.640 —Aal_P9_181.4 �0.922 �0.310 —

Aal_P9_210.7 0.739 0.009 —Aal_P9_368.4 �0.170 �0.017 —

Aal_P9_467.8 �0.670 0.253 —Arabis jacquinii 235 59 10 Aja_P1_113.4 �1.064 0.433 –

Aja_P1_129.3 0.117 �0.454 TOPO

Aja_P1_130.4 0.197 �0.474 TOPO

Aja_P3_148.3 0.082 �0.217 PRECYY

Aja_P3_151 �0.758 �0.109 —

Aja_P3_239.2 �0.548 0.244 —Aja_P9_175 0.409 0.376† PRECYY, SRADYY

Aja_P9_176.7 �0.042 0.237† PRECYY

Aja_P9_259.3 0.152 0.242† PRECYY

Aja_P9_350.9 0.271 �0.111 —Cardamine resedifolia 197 53 12 Cre_P1_179.5 �0.106 0.606 —

Cre_P1_212.5 0.827 0.624† PRECYY

Cre_P1_214.2 0.822 �1.260 —

Cre_P1_216.6 0.847 �0.991 —Cre_P3_217.3 0.821 �1.057 —

Cre_P9_93.3 0.949 �1.118 —Cre_P9_148.5 0.435 0.414 —

Cre_P9_191.3 0.970 �0.854 —Cre_P9_401.2 0.917 �1.268 —

Cre_P9_413.4 �0.570 �0.258 —Cre_P9_419.6 �0.574 �0.183 —

Cre_P9_426.8 �0.118 0.620 —Draba aizoides 200 81 3 Dai_P1_121.3 0.088 0.992† PRECYY, TOPO, TWI

Dai_P1_185.9 0.459 �1.123

Dai_P1_191.1 �0.325 0.791† PRECYY, TWI

Thlaspi rotundifolium 198 124 4 Tro_P1_206 0.218 0.143 —Tro_P9_111.1 0.006 0.686† —

Tro_P9_268.3 �0.249 �0.166 —

Tro_P9_312.5 0.789 �0.951 —

P1, P3 and P9 refer to the primer/enzyme combinations detailed in the Materials and Methods section.

Numbers in bold: R2adj > 0.5 for linear environmental variables.

*Simple linear regression. Variables in italics are significant even though R2adj � 0.5 in multiple linear regression.

†R2adj > 0.5 for second-degree polynomials of environmental variables.

© 2013 Blackwell Publishing Ltd

1632 D. ZULLIGER, E . SCHNYDER and F . GUGERLI

flanking regions in C. resedifolia using primer combina-

tion P9 (Table 2), we obtained a 420-bp fragment

including the Cre_P1_212.5 region (GenBank accession

no. JN982363). The first 310 bp of this region were

highly conserved compared with the sequence of A. tha-

liana and were thus considered a putative protein-cod-

ing region with mostly synonymous mutations.

SNP genotyping

In the seven sequences of C. resedifolia, four SNPs were

found, of which one (Cre239) was located in the MseI

restriction site (Fig. 2) and one was in the noncoding

region of the sequence. Using the SNaPshot protocol,

we genotyped the three SNPs located in the putative

coding region of this sequence. For 100% of all C. resedi-

folia samples, the presence of the mutation in the MseI

restriction site in both alleles corresponded to the

absence of the Cre_P1_212.5 band in the AFLP band

profile. In the other species, DNA sequences of locus

Cre_P1_212.5 showed exactly the same length of 214 bp

as in C. resedifolia. It can therefore be assumed that

homologous bands in the AFLP electropherograms are

equivalent to locus Cre_P1_212.5. In A. jacquinii and

D. aizoides, this band was always present, as there were

no mutations in the restriction sites or selective bases.

In A. alpina and T. rotundifolium, the band was always

absent (Fig. 2). In A. alpina, the absence was due to the

same mutation as found in C. resedifolia in the MseI

restriction site. In T. rotundifolium, the band was absent

because of a mutation in the selective bases of the MseI

(pre-)selective primers (Fig. 2).

In A. alpina, we genotyped one SNP located within

the coding region, whereas three SNPs, all located in

the noncoding region, were genotyped in D. aizoides. In

T. rotundifolium, we genotyped four SNPs, of which two

were located in the coding and two in the noncoding

region (GenBank accession nos. JN982363–JN982367).

No ambiguities in SNP genotyping were found in any

of the species, as replicated samples were to 100%

compliant. Population-wise allele frequencies for each

genotyped SNP are listed in Table S4 (Supporting

information).

SNP association analyses

Multiple linear regressions using allele frequencies of

SNPs and the seven environmental variables showed an

R2adj > 0.5 in the A. alpina SNP, in two out of three SNPs

in C. resedifolia, in one out of three SNPs in D. aizoides

and in two out of four SNPs in T. rotundifolium

(Table 5). Significant associations to environmental vari-

ables were found in most of these SNPs, whereas the

same association with PRECYY as originally found in

C. resedifolia was only confirmed in A. alpina (Table 5;

Fig. 3a,b). This association was even stronger in A. alp-

ina than in C. resedifolia, and, opposed to C. resedifolia,

no significant spatial autocorrelation was found in

A. alpina in the multiple linear regressions with MEMs.

Using second-degree polynomials of environmental

variables in multiple linear regressions produced very

similar results for all species (data not shown).

Protein sequences

The comparison of protein sequences within species

revealed only few minor changes (Fig. 4). In A. alpina

and C. resedifolia, there was one change each from

aspartic acid (D) to glutamic acid (E), which are both

negatively charged amino acids. In T. rotundifolium,

there was a change from asparagine (N) to serine (S).

These amino acids are both polar and uncharged.

Among species, changes were less conservative as

several mutations involved changes between different

0

2000

4000

6000

8000

10 000

12 000

14 000

16 000

DIC

0 5 10 15

K

Arabis alpina

A. jacquinii

Cardamine resedifolia

Draba aizoides

Thlaspi rotundifolium

Fig. 1 Deviance information criteria (DIC) vs. K as inferred by

the program INSTRUCT (Gao et al. 2007) for five Alpine species

of Brassicaceae. Full symbols represent the K values with the

lowest DIC for each species.

Band presence 5‘ PstI-RS SB MseI RS 3‘

Cardamine resedifolia 1 CTGCAG AG ... ... TG TTAA 0 CTGCAG AG ... ... TG CTAA

Arabis alpina 0 CTGCAG AG ... ... TG CTAAArabis jacquinii 1 CTGCAG AG ... ... TG TTAADraba aizoides 1 CTGCAG AG ... ... TG TTAAThlaspi rotundifolium 0 CTGCAG AG ... ... TA TTAA

SB

Fig. 2 Restriction sites (RS), selective bases (SB) and single

nucleotide polymorphisms (SNPs) for locus Cre_P1_212.5 in

five Alpine Brassicaceae species. Rectangles highlight mutation

sites and small triangles indicate where the restriction enzymes

cut.

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1633

‘categories’ of amino acids, for example, between posi-

tively charged and hydrophobic ones (Fig. 4).

Discussion

In the present study, we identified loci showing signals

of directional selection in Alpine Brassicaceae species at

the molecular level and characterized a new candidate

locus (Cre_P1_212.5). By genotyping SNPs within the

coding region of this locus, we could confirm an associ-

ation with mean annual precipitation (PRECYY) in the

two species C. resedifolia and A. alpina. Although the

marker was originally identified in C. resedifolia, as this

was the only species in which it was polymorphic, this

association was even stronger in the only SNP identi-

fied in A. alpina, reinforcing the initially presumed

adaptive relevance of this locus. In two other species,

D. aizoides and T. rotundifolium, this locus showed asso-

ciations with other environmental variables, that is,

average annual number of frost days during growing

season (SFROYY) and to TWI, respectively. In a fifth

species, A. jacquinii, we could not test for associations

as we did not find any SNPs for genotyping in the

sequences obtained.

Multiple linear regressions between outlier loci/SNPs

and environmental variables were generally not charac-

terized by a strong statistical power because there were

limitations to the number of sampled populations. As

we carried out sampling for five species, and we were

interested in frequency-based association analysis,

certain confinements had to be accepted, which also

concerned the number of independent variables that

could be included in the regression analysis.

Nevertheless, cumulative evidence from different

methods used implied that this marker is an outlier

locus showing association with annual precipitation in

C. resedifolia. Moreover, and functioning as a type of

replicate, we have confirmation of the same association

in A. alpina, even though this species was sampled in

different locations than C. resedifolia. We are therefore

confident that this genetic region is either linked to a

locus or is itself under directional selection in at least

these two plant species. We acknowledge, however, that

an alternative factor may as well underlie this environ-

mental association, as we removed those factors from

the analysis which showed the highest correlations with

annual precipitation (PRECYY). Nevertheless, the

respective factors that showed the highest correlations,

namely precipitation in summer, precipitation days in

summer, average yearly moisture index and summer

average moisture index, were all related to precipita-

tion. This finding supports our interpretation that the

adaptive signature detected most likely relates to water

availability.

Previous studies have already identified precipitation

as a major environmental factor driving plant adapta-

tion (Richardson et al. 2009; Manel et al. 2010, 2012;

Poncet et al. 2010). Additional predominant factors are

temperature (Hamilton et al. 2002; St Clair et al. 2005;

Richardson et al. 2009; Manel et al. 2010, 2012; Poncet

et al. 2010) and unknown variables recognized by

accounting for spatial autocorrelation among sampling

sites using the approach of MEMs (Dray et al. 2006).

MEMs were the best predictors of allele distribution in

the previous studies on A. alpina (Manel et al. 2010,

2012), suggesting either that the outlier character of the

Table 5 Results of environmental association analyses for single nucleotide polymorphism (SNP) in five Alpine Brassicaceae species

Species SNP locus Allele

R2adj > 0.5

All

environmental

variables

Single environmental variables

MEMsPRECYY SFROYY SLP25 SRADYY TAVESU TOPO TWI

Arabis alpina Aal208 A/T 0.80* 0.69***

Cardamine

resedifolia

Cre40 G/A

Cre239 C/T 0.55 0.40* 0.84*

Cre253 C/A 0.51 0.40* 0.87*

Draba aizoides Dai353† C/T

Dai433† G/A 0.87 0.50* 0.55

Dai434† C/T

Thlaspi

rotundifolium

Tro106 A/C

Tro222 T/C

Tro307† C/G 0.83

Tro342† C/T 0.91 0.44* 0.75

*P < 0.05; ***P < 0.001.†SNPs not in putative coding regions.

© 2013 Blackwell Publishing Ltd

1634 D. ZULLIGER, E . SCHNYDER and F . GUGERLI

markers used in this study is mostly influenced by

spatial autocorrelation or that there are other environ-

mental factors driving local adaptation that were not

accounted for in these studies.

On the other hand, eigenvector methods, such as the

MEM approach, suffer the problem of inflating the vari-

ation explained by a given causal process (Gilbert &

Bennett 2010). This can result in R2adj values that are up

to 0.5 higher than actual R2adj values and is to a certain

extent due to eigenvector axes accounting for random

noise. For this reason, we did not put too much weight

on R2adj values >0.5 as exclusion criterion for sequence-

characterizing. Nevertheless, we included this informa-

tion, as there are a number of desirable aspects of

eigenvector methods (Borcard et al. 2004), such as the

decomposition of spatial scales for analyzing scale-

dependent community structuring. Moreover, it is

possible that outlier detection resulted in false positives,

for example, because of population structure (Excoffier

et al. 2009). To compensate for such type II error,

post hoc environmental association analyses substantiated

our candidate loci showing signatures of local adaptation.

Our marker Cre_P1_212.5 also showed a significant

R2adj for MEMs in C. resedifolia, implying that there

might be autocorrelation between the sampling sites for

this locus, or another relevant, co-varying environmen-

tal variable that we did not consider is driving local

adaptation besides precipitation. For the genotyped

SNP in A. alpina, however, values of R2adj for MEMs

were not significant or even above 0.5. This outcome

substantiates that precipitation is indeed the major pre-

dictor of genetic variation at this locus or a genomic

region in LD in A. alpina. It has to be noted though that

in several studies (Manel et al. 2010, 2012; Poncet et al.

2010), and also including the present one, the same set

of topo-climatic variables were used. Thus, a certain

bias towards the same variables as predictors cannot be

ruled out. Moreover, the climate data used are interpo-

lated from meteorological recordings (Zimmermann &

Kienast 1999). In future studies, it would therefore be

interesting to include additional variables, such as, for

example, microclimatic characteristics, soil composition

0

1

2

3

4

5

0

5000

10 000

15 000

20 000

25 000

0 0.2 0.4 0.6 0.8 1

PR

EC

YY

pol

yn. (

x 10

8 )P

RE

CY

Y p

olyn

. (x

108 )

PR

EC

YY

Cre_SNP239_C allele frequency

PRECYY PRECYY polyn.

(a)

(b)

PR

EC

YY

Aal_SNP208_A allele frequency

0

2

3

5

7

8

0

5000

10 000

15 000

20 000

25 000

30 000

0 0.2 0.4 0.6 0.8 1

4

1

6

Fig. 3 Single nucleotide polymorphism (SNP) allele frequencies

in locus Cre_P1_212.5 in two Alpine Brassicaceae species vs.

annual precipitation (PRECYY; sum of 1/10 mm/month) and

second-degree polynomials of PRECYY (PRECYY polyn.). (a)

SNP locus Cre_SNP239_C in Cardamine resedifolia, (b) SNP

locus Aal_SNP208_A in Arabis alpina.

Cre IHRHDAPFMAVYKSLIPAEEELEKQKQLMAQLENLVAKEWPHAKLYLYGSCANSFGFPKSDIDVCLAIEDDDINKSEMLLKLADILESDNLQNVQVKI H M Y K A H D SE E ES K

Aal I Y M Y R A H E D SE D QS L I Y M Y R A H D D SE D QS L

Aja ? Y M Y R G Q D SE D QS D

Dai H Y I F R A H D SE D QS Q

Tro I Y M Y K A H E N AD E EA Q I Y M Y K A H E S AD E EA Q

Fig. 4 Protein sequences of locus Cre_P1_212.5 in the five Brassicaceae species studied, indicating the changes of amino acids within

and among species. Changes within species are highlighted in grey. Cre—Cardamine resedifolia; Aal—Arabis alpina; Aja—A. jaquinii;

Dai—Draba aizoides; Tro—Thlaspi rotundifolium.

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1635

and nutrient availability, as well as data collected at the

sampling sites, although these probably comprise

shorter time spans than the 30-year records used here.

The linearity of the relationship between allele fre-

quencies vs. environmental gradients is occasionally

debated. In fact, many (if not most) environmental

association studies published in the literature assume

and test for a linear relationship between environmen-

tal variables and allele frequencies at potentially selec-

tive loci, particularly so if the underlying response is

considered quantitative (Coop et al. 2010; Manel et al.

2010, 2012; Poncet et al. 2010). There is theoretical as

well as empirical support for a linear relationship

between environmental factors along ecological gradi-

ents and allele frequencies (Schmidt et al. 2008). In a

recent article, Kooyers & Olsen (2012) used linear

regressions to correlate allele frequencies with environ-

mental factors for one of the classical textbook exam-

ples of plant adaptation to the environment, namely

cyanogenesis in Trifolium repens. The association of

allele frequencies at cyanogenesis genes and minimum

winter temperature in T. repens was found to be linear.

Even though it remains speculative what type of

response curve other than linear is ecologically most

appropriate (e.g. exponential, logarithmic), we are con-

vinced that the assumption of a linear association

between allele frequencies at loci of adaptive relevance

and environmental factors is well justified and reason-

ably assumed in whole-genome searches for environ-

mental associations of unknown type.

Currently, we cannot conclusively explain why the

association of Cre_P1_212.5 to annual precipitation is

transferable from C. resedifolia to A. alpina but not to

the other species. In our study, the same sequencing

primers were suitable for A. alpina, A. jacquinii and

C. resedifolia, but alternative primers had to be devel-

oped for D. aizoides and T. rotundifolium, indicating

molecular differences between the species in this par-

ticular genomic region. This circumstance, however,

contrasts with the most recently accepted phylogeny

of Brassicaceae (Franzke et al. 2011), which suggests

that the tribes Thlaspideae (T. rotundifolium) and Ara-

bideae (A. alpina, A. jacquinii) are more closely related

to each other than they are to Cardamineae (C. resedi-

folia). Another contradiction is that species of the

genus Draba are also considered to belong to the tribe

Arabideae. One possibility is that we did not detect

and genotype all relevant SNPs or that the association

with some environmental variables is not reasonably

comparable because sampling among species was not

identical. Finally, these five Brassicaceae species all

have different ecological preferences and niches. For

example, of the two species for which we found a

common locus of ecological relevance, C. resedifolia is

restricted to silicicolous substrate, whereas A. alpina is

largely confined to calcareous soils. It is therefore

likely that even in related species the same genes

react differently to the various environmental factors.

Moreover, we lack functional proof for the SNPs asso-

ciated with environmental variation, so that the

genetic variation actually relevant for the signature of

selection may stem for a genomic region linked to the

SNPs studied here. Linkage must be given due dili-

gence considering that the average percentage of LD

estimated between pairs of loci was quite high in all

five Brassicaceae species, possibly a consequence of

substantial inbreeding, and particularly so for locus

Cre_P1_212.5. Accordingly, we cannot exclude that a

genomic region in LD with Cre_P1_212.5 is in fact

responsible for the association found in C. resedifolia

and A. alpina.

Within species, differences in the DNA sequences at

locus Cre_P1_212.5 lead to only one mutation each in

the protein sequences of A. alpina, C. resedifolia and

T. rotundifolium (Fig. 4). These are only minor changes

among amino acids with similar characteristics, that

is, with both negatively charged side chains (aspartic

and glutamic acid) or with both polar and uncharged

side chains (serine and asparagine). Such small differ-

ences most probably do not affect the folding domain

of the protein (Chakraborty et al. 2000); however, they

can nonetheless alter its functional specificity (Smoo-

ker et al. 2000; He et al. 2011). Locus Cre_P1_212.5

might therefore not only be linked to a locus under

directional selection but could itself have some adap-

tive relevance. Whether this gene is at all expressed

and under which environmental conditions could be

examined in a future study by sampling over an

environmental gradient and performing expression

studies, for example, by real-time PCRs on cDNA or

microarray technology. This way we could verify the

correlation between the environmental variables and

the amount of the expressed gene. Finally, selection

experiments, such as common garden trials and reci-

procal transplantations, are an indispensable tool for

testing and predicting evolutionary responses to

projected future conditions (Reusch & Wood 2007;

Holderegger et al. 2010).

The approach presented in this study is simple and,

even though AFLP-based, allows for subsequent

sequence-characterizing and SNP genotyping; it is

therefore suitable for any species and can be performed

by small laboratories. Although more up-to-date meth-

ods, such as next generation sequencing, exist, they cur-

rently remain very costly and therefore restrict the

number of populations and genomes sampled as well

as the target species, that is, those for which ample

genomic information is at hand (cf. Turner et al. 2010).

© 2013 Blackwell Publishing Ltd

1636 D. ZULLIGER, E . SCHNYDER and F . GUGERLI

Conclusions

Identifying and characterizing key markers under direc-

tional selection is a first step towards exploring the

underlying mechanisms of local adaptation. Here, we

applied different methods to reliably detect genomic

regions with adaptive relevance in natural populations

of Alpine plant species. In a second step, we character-

ized and genotyped one of these candidate loci to test

whether the same association can be found in different

species. To the best of our knowledge, this is the first

study investigating the transferability of a new candi-

date locus derived from a nonmodel Alpine plant

species to another, closely related species. Detecting

genomic patterns of environmental adaptation among

species is considered useful for modelling the evolu-

tionary potential of Alpine plants under predicted

future climate conditions and sheds light on ecological

niche evolution. While it has been shown that precipita-

tion is a key environmental factor associated with signa-

tures of selection in a broad spectrum of Alpine plants

(Manel et al. 2012), we here go one step further and

show that respective outlier loci can be transferred

among closely related taxa, that is, that environmental

associations may be the same. Even though we lack

experimental proof of the functionality of the adaptive

signature detected here, which is common place for

studies using genome scans, the results of this study

deepen our understanding and represent a valuable

contribution to model scenarios of the fate of Alpine

species in a changing environment.

Acknowledgments

We thank Sabine Brodbeck, Linda Feichtinger, Christoph

Schw€orer and Jolanda Zimmermann for assistance in the labora-

tory and for sampling, St�ephanie Manel and Thomas Sattler for

useful advice on statistical analysis, Niklaus Zimmermann for

providing the GIS data and Gy€orgy Sipos for his help with inter-

preting the protein sequences. For valuable comments improving

the manuscript, we thank Daniela Csencsics, Silke Werth and the

anonymous referees. AFLP data analysed in this study were

generated at the Genetic Diversity Center (GDC) of the ETH Zur-

ich. For his involvement in the conception of this study, including

the valuable hint regarding the issue of linear vs. nonlinear

responses of allele frequencies to environmental factors, we

thank Rolf Holderegger. Funding was provided by the EC-sup-

ported Integrated Project EcoChange (FP6-036866) and by the

Swiss National Science Foundation (SNF) Project Sinergia-AVE

(CRSI33_127155), which were both spoken to Rolf Holderegger.

References

Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990)

Basic local alignment search tool. Journal of Molecular Biology,

215, 403–410.

Ansell SW, Grundmann M, Russell SJ, Schneider H, Vogel

JC (2008) Genetic discontinuity, breeding-system change

and population history of Arabis alpina in the Italian

Peninsula and adjacent Alps. Molecular Ecology, 17, 2245–2257.

Arft AM, Walker MD, Gurevitch J et al. (1999) Responses of

tundra plants to experimental warming: meta-analysis of the

international tundra experiment. Ecological Monographs, 69,

491–511.

Beaumont MA, Nichols RA (1996) Evaluating loci for use in

the genetic analysis of population structure. Proceedings of the

Royal Society of London, Series B-Biological Sciences, 263, 1619–1626.

Bonin A, Bellemain E, Eidesen PB, Pompanon F, Brochmann C,

Taberlet P (2004) How to track and assess genotyping errors

in population genetics studies. Molecular Ecology, 13, 3261–3273.

Borcard D, Legendre P, Avois-Jacquet C, Tuomisto H (2004)

Dissecting the spatial structure of ecological data at multiple

scales. Ecology, 85, 1826–1832.Bovet L, Kammer PM, Meyla-Bettex M, Guadagnuolo R,

Matera V (2006) Cadmium accumulation capacities of Arabis

alpina under environmental conditions. Environmental and

Experimental Botany, 57, 80–88.Buehler D, Poncet BN, Manel S, Taberlet P, Holderegger R,

Gugerli F (2013) An outlier locus relevant in habitat-medi-

ated selection across independent regional replicates. Evolu-

tionary Ecology, in press, doi 10.1007/s10682-012-9597-8.

Byars SG, Papst W, Hoffmann AA (2007) Local adaptation

and cogradient selection in the alpine plant, Poa hiemata,

along a narrow altitudinal gradient. Evolution, 61, 2925–2941.

Chakraborty S, Bhattacharya S, Ghosh S et al. (2000) Structural

and interactional homology of clinically potential typsin

inhibitors: molecular modelling of Cucurbitaceae family

peptides using the x-ray structure of MCTI-II. Protein Engi-

neering, 13, 551–555.Coop G, Witonsky D, Di Rienzo A, Pritchard JK (2010) Using

environmental correlations to identify loci underlying local

adaptation. Genetics, 185, 1411–1423.

Dray S, Legendre P, Peres-Neto PR (2006) Spatial modelling: a

comprehensive framework for principal coordinate analysis of

neighbour matrices (PCNM). Ecological Modelling, 196, 483–493.

Ehrich D, Gaudeul M, Assefa A et al. (2007) Genetic conse-

quences of Pleistocene range shifts: contrast between the

Arctic, the Alps and the East African mountains. Molecular

Ecology, 16, 2542–2559.

Excoffier L, Laval G, Schneider S (2005) Arlequin ver. 3.0: An

integrated software package for population genetics data

analysis. Evolutionary Bioinformatics Online, 1, 47–50.Excoffier L, Hofer T, Foll M (2009) Detecting loci under selec-

tion in a hierarchically structured population. Heredity, 103,

285–298.

Feder ME, Mitchell-Olds T (2003) Evolutionary and ecologi-

cal functional genomics. Nature Reviews Genetics, 4, 651–

657.

Foll M, Gaggiotti O (2008) A genome-scan method to iden-

tify selected loci appropriate for both dominant and

codominant markers: a Bayesian perspective. Genetics, 180,

977–993.

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1637

Franzke A, Lysak MA, Al-Shehbaz IA, Koch MA, Mummen-

hoff K (2011) Cabbage family affairs: the evolutionary history

of Brassicaceae. Trends in Plant Science, 16, 108–116.

Gao H, Williamson S, Bustamante CD (2007) An MCMC

approach for joint inference of population structure and

inbreeding rates from multi-locus genotype data. Genetics,

176, 1635–1651.

Gilbert B, Bennett JR (2010) Partitioning variation in ecological

communities: do the numbers add up? Journal of Applied

Ecology, 47, 1071–1082.Gugerli F, Englisch T, Niklfeld H et al. (2008) Relationships

among levels of biodiversity and the relevance of intraspe-

cific diversity in conservation—a project synopsis.

Perspectives in Plant Ecology, Evolution and Systematics, 10,

259–281.

Hamilton NRS, Skot L, Chorlton KH, Thomas ID, Mizen S

(2002) Molecular genecology of temperature response in

Lolium perenne: 1. Preliminary analysis to reduce false posi-

tives. Molecular Ecology, 11, 1855–1863.

He Y, Chen LQ, Zhou YA et al. (2011) Functional characteriza-

tion of Arabidopsis thaliana isopropylmalate dehydrogenases

reveals their important roles in gametophyte development.

New Phytologist, 189, 160–175.

Herrmann D, Poncet BN, Manel S et al. (2010) Selection criteria

for scoring amplified fragment length polymorphisms (AF-

LPs) positively affect the reliability of population genetic

parameter estimates. Genome, 53, 302–310.

Hoffmann AA, Willi Y (2008) Detecting genetic responses to

environmental change. Nature Reviews Genetics, 9, 421–432.Holderegger R, Buehler D, Gugerli F, Manel S (2010)

Landscape genetics of plants. Trends in Plant Science, 15,

675–683.

Jeffreys H (1961) Theory of Probability. Clarendon Press, Oxford.

Kawecki TJ (2008) Adaptation to marginal habitats. Annual

Review of Ecology, Evolution and Systematics, 39, 321–342.Kooyers NJ, Olsen KM (2012) Rapid evolution of an adaptive

cyanogenesis cline in introduced North American white

clover (Trifolium repens L.). Molecular Ecology, 21, 2455–

2468.

Kudo G, Hirao AS (2006) Habitat-specific responses in the

flowering phenology and seed set of alpine plants to climate

variation: implications for global-change impacts. Population

Ecology, 48, 49–58.Landolt E (1992) Unsere Alpenflora, 318 pp. Verlag Schweizer

Alpen-Club, Brugg, Switzerland.

Lavergne S, Mouquet N, Thuiller W, Ronce O (2010) Biodiver-

sity and climate change: integrating evolutionary and ecolog-

ical responses of species and communities. Annual Review of

Ecology, Evolution, and Systematics, 41, 321–350.Legendre P (1993) Spatial autocorrelation—trouble or new

paradigm. Ecology, 74, 1659–1673.Legendre P, Legendre L (1998) Numerical Ecology. Elsevier Sci-

ence BV, Amsterdam.

Lihova J, Carlsen T, Brochmann C, Marhold K (2009) Contrast-

ing phylogeographies inferred for the two alpine sister

species Cardamine resedifolia and C. alpina (Brassicaceae). Jour-

nal of Biogeography, 36, 104–120.Manel S, Poncet BN, Legendre P, Gugerli F, Holderegger R

(2010) Common factors drive adaptive genetic variation at

different spatial scales in Arabis alpina. Molecular Ecology, 19,

3824–3835.

Manel S, Gugerli F, Thuiller W et al. (2012) Broad-scale

adaptive genetic variation in alpine plants is driven by

temperature and precipitation. Molecular Ecology, 21, 3729–

3738.

Markgraf F (1958) Cruciferae. In: Illustrierte Flora von Mitteleuro-

pa (ed. Hegi G), pp. 73–529. Hanser, M€unchen.

Mitchell-Olds T, Feder M, Wray G (2008) Evolutionary and

ecological functional genomics. Heredity, 100, 101–102.Ohtani K (2000) Bootstrapping R2 and adjusted R2 in regression

analysis. Economic Modelling, 17, 473–483.Parisod C, Christin PA (2008) Genome-wide association to fine-

scale ecological heterogeneity within a continuous popula-

tion of Biscutella laevigata (Brassicaceae). New Phytologist, 178,

436–447.Poncet BN, Herrmann D, Gugerli F et al. (2010) Tracking genes

of ecological relevance using a genome scan in two indepen-

dent regional population samples of Arabis alpina. Molecular

Ecology, 19, 2896–2907.R Development Core Team (2010) R: A Language and Environ-

ment for Statistical Computing. R Foundation for Statistical

Computing, Vienna. http://www.r-project.org.

Rako L, Blacket MJ, McKechnie SW, Hoffmann AA (2007)

Candidate genes and thermal phenotypes: identifying eco-

logically important genetic variation for thermotolerance in

the Australian Drosophila melanogaster cline. Molecular Ecol-

ogy, 16, 2948–2957.Reusch TBH, Wood TE (2007) Molecular ecology of global

change. Molecular Ecology, 16, 3973–3992.

Richardson BA, Rehfeldt GE, Kim MS (2009) Congruent

climate-related genecological responses from molecular

markers and quantitative traits for western white pine (Pinus

monticola). International Journal of Plant Sciences, 170,

1120–1131.Roden SE, Dutton PH, Morin PA (2009) AFLP fragment

isolation technique as a method to produce random

sequences for single nucleotide polymorphism discovery in

the green turtle, Chelonia mydas. Journal of Heredity, 100,

390–393.

Rozen S, Skaletsky HJ (2000) Primer3 on the www for gen-

eral users and for biologist programmers. In: Bioinformatics

Methods and Protocols: Methods in Molecular Biology (eds

Krawetz S & Misener S), pp. 365–386. Humana Press,

Totowa.

Schmidt PS, Serrao EA, Pearson GA et al. (2008) Ecological

genetics in the North Atlantic: environmental gradients and

adaptation at specific loci. Ecology, 89, S91–S107.

Smooker PM, Whisstock JC, Irving JA, Siyaguna S, Spithill TW,

Pike RN (2000) A single amino acid substitution affects sub-

strate specificity in cysteine proteinases from Fasciola hepatica.

Protein Science, 9, 2567–2572.

St Clair JB, Mandel NL, Vance-Boland KW (2005) Genecology

of Douglas fir in western Oregon and Washington. Annals of

Botany, 96, 1199–1214.Storz JF (2005) Using genome scans of DNA polymorphism to

infer adaptive population divergence. Molecular Ecology, 14,

671–688.

The Arabidopsis Genome Initiative (2000) Analysis of the gen-

ome sequence of the flowering plant Arabidopsis thaliana.

Nature, 408, 796–815.Turner TL, Bourne EC, Von Wettberg EJ, Hu TT, Nuzhdin SV

(2010) Population resequencing reveals local adaptation of

© 2013 Blackwell Publishing Ltd

1638 D. ZULLIGER, E . SCHNYDER and F . GUGERLI

Arabidopsis lyrata to serpentine soils. Nature Genetics, 42,

260–263.Vasem€agi A, Primmer CR (2005) Challenges for identifying

functionally important genetic variation: the promise of com-

bining complementary research strategies. Molecular Ecology,

14, 3623–3642.Vitalis R, Dawson K, Boursot P (2001) Interpretation of varia-

tion across marker loci as evidence of selection. Genetics, 158,

1811–1823.

Vos P, Hogers R, Bleeker M et al. (1995) AFLP—a new tech-

nique for DNA-fingerprinting. Nucleic Acids Research, 23,

4407–4414.Zimmermann NE, Kienast F (1999) Predictive mapping of

alpine grasslands in Switzerland: species versus community

approach. Journal of Vegetation Science, 10, 469–482.

During her postdoc, D.Z. genotyped and scored the the

AFLP data for all five study species, performed the out-

lier and association analyses and the spatial variation

analysis. D.Z. also sequenced and characterized the out-

lier and developed SNPs for SNaPshots and produced

SNP data for three out of four study species. D.Z. ana-

lyzed the data and compiled the manuscript. E. S. In

his research group, F.G. studies population processes

like adaptation in the context of environmental varia-

tion or landscape configuration.

Data accessibility

DNA sequences: GenBank accession nos JN982363–

JN982367.

Sampling locations and environmental data: Table S1

(Supporting information).

AFLP and SNP data sets: DRYAD entry doi:10.5061/

dryad.mm155.

Supporting information

Additional supporting information may be found in the online

version of this article.

Table S1 Sampling locations and respective values of seven

uncorrelated environmental variables for five Alpine Brassica-

ceae species.

Table S2 Nineteen topo-climatic variables available as GIS

layers (Zimmermann & Kienast 1999) used for the principal

component analysis (PCA).

Table S3 Mismatch error rates and number of loci for ampli-

fied fragment length polymorphisms in five Alpine Brassica-

ceae species.

Table S4 Population-wise allele frequencies of single nucleo-

tide polymorphisms (SNPs) in five Alpine Brassicaceae species.

Fig. S1 Sampling locations of five Brassicaceae species sampled

in Switzerland.

© 2013 Blackwell Publishing Ltd

ADAPTIVE LOCI IN ALPINE BRASSICACEAE 1639


Recommended