+ All Categories
Home > Documents > Interrelationships of chromalveolates within a broadly sampled tree of photosynthetic protists

Interrelationships of chromalveolates within a broadly sampled tree of photosynthetic protists

Date post: 10-Dec-2023
Category:
Upload: independent
View: 0 times
Download: 0 times
Share this document with a friend
10
Interrelationships of chromalveolates within a broadly sampled tree of photosynthetic protists Valérie C. Reeb a , Michael T. Peglar a , Hwan Su Yoon b , Jennifer Ruoyu Bai a , Min Wu a , Philip Shiu a , Jessie L. Grafenberg a , Adrian Reyes-Prieto a , Susanne E. Rümmele a , Jeferson Gross a , Debashish Bhattacharya a, * a Department of Biology and Roy J. Carver Center for Comparative Genomics, University of Iowa, Iowa City, IA 52242, USA b Bigelow Laboratory for Ocean Sciences, West Boothbay Harbor, Maine, USA article info Article history: Received 25 January 2009 Revised 14 April 2009 Accepted 20 April 2009 Available online 3 May 2009 Keywords: Chromalveolata Multigene phylogenetics Plastidophila Rhizaria Supergroup Tree of life abstract The Chromalveolata ‘‘supergroup” is a massive assemblage of single-celled and multicellular protists such as ciliates and kelps that remains to be substantiated in molecular trees. Recent multigene analyses place chromalveolates into two major clades, the SAR (Stramenopiles, Alveolata, and Rhizaria) and the Crypto- phyta + Haptophyta. Here we determined 69 new sequences from different chromalveolates to study the interrelationships of its constituent phyla. We included in our trees, the novel groups Telonemia and Kat- ablepharidophyta that have previously been described as chromalvoleate allies. The best phylogenetic resolution resulted from a 6-protein (actin, a-tubulin, b-tubulin, cytosolic HSP70, BIP HSP70, HSP90) and a 5-protein (lacking HSP90) alignment that validated the SAR and cryptophyte + haptophyte clades with the inclusion of telonemids in the former and katablepharids in the latter. We assessed the Plasti- dophila hypothesis that is based on EF2 data and suggest this grouping may be explained by horizontal gene transfers involving the EF2 gene rather than indicating host relationships. Ó 2009 Elsevier Inc. All rights reserved. 1. Introduction In recent years, significant advances have been made in molec- ular phylogenetics driven by next-generation sequencing technol- ogy, more powerful analytical methods, and the availability of a greater diversity of organisms through culture collections and environmental sampling. These inroads have significantly im- proved our understanding of deep-level evolutionary relationships among the major eukaryotic groups and led to the discovery of new phyla (e.g., Not et al., 2007a; Shalchian-Tabrizi et al., 2006). Currently, eukaryote diversity is classified into six putative ‘‘super- groups”: Opisthokonta, Amoebozoa, Rhizaria, Excavata, Chromal- veolata, and Plantae (Adl et al., 2005; Cavalier-Smith, 2004). However phylogenetic support for these supergroups is highly var- iable (see Parfrey et al., 2006), depending greatly on taxon sam- pling, genetic markers considered, and phylogenetic reconstruction methods. What has however become abundantly clear is that a single-gene approach is insufficient to robustly re- solve basal splits in the eukaryote tree of life (ToL). Single gene ap- proaches are also prone to gross misinterpretation if the chosen sequence has undergone highly variable rates of change in differ- ent lineages or more seriously, horizontal gene transfer (HGT) or endosymbiotic gene transfer (EGT). The latter two processes can result in well-supported, conflicting relationships among taxa when different genes are used to infer trees. Two types of approaches currently exist with regard to infer- ence of the ToL. The first is to use short sequences (e.g., DNA bar- codes) to ‘place’ extant diversity within a tree and study population or species-level phenomena. This approach can also in- clude the rDNA genes that generally are excellent phylogenetic markers and available from a broad diversity of taxa due to their long use in systematics and application to metagenomic and other exploratory studies (e.g., Edgcomb et al., 2002; Huse et al., 2008). Barcodes and rDNA trees cannot however resolve the eukaryotic tree in its entirety due to their limited phylogenetic resolution (e.g., Parfrey et al., 2006). The second approach has been to rely on multi-gene frameworks that incorporate from several to over a 100 genes to study supergroup relationships (e.g., Burki et al., 2008; Hackett et al., 2007; Hampl et al., 2009; Rodriguez-Ezpeleta et al., 2007). This latter approach has played a central role in cur- rent views of eukaryote evolution but until now has the major dis- advantage of limited taxon sampling. This is because complete genome data, that drives phylogenomic methods, are only now 1055-7903/$ - see front matter Ó 2009 Elsevier Inc. All rights reserved. doi:10.1016/j.ympev.2009.04.012 * Corresponding author. Fax: +1 319 335 1069. E-mail addresses: [email protected] (V.C. Reeb), [email protected] (M.T. Peglar), [email protected] (H.S. Yoon), [email protected] (J.R. Bai), [email protected] (M. Wu), [email protected] (P. Shiu), jessie-grafenber- [email protected] (J.L. Grafenberg), [email protected] (A. Reyes-Prieto), susanne- [email protected] (S.E. Rümmele), [email protected] (J. Gross), debas- [email protected] (D. Bhattacharya). Molecular Phylogenetics and Evolution 53 (2009) 202–211 Contents lists available at ScienceDirect Molecular Phylogenetics and Evolution journal homepage: www.elsevier.com/locate/ympev
Transcript

Molecular Phylogenetics and Evolution 53 (2009) 202–211

Contents lists available at ScienceDirect

Molecular Phylogenetics and Evolution

journal homepage: www.elsevier .com/locate /ympev

Interrelationships of chromalveolates within a broadly sampled treeof photosynthetic protists

Valérie C. Reeb a, Michael T. Peglar a, Hwan Su Yoon b, Jennifer Ruoyu Bai a, Min Wu a, Philip Shiu a,Jessie L. Grafenberg a, Adrian Reyes-Prieto a, Susanne E. Rümmele a, Jeferson Gross a,Debashish Bhattacharya a,*

a Department of Biology and Roy J. Carver Center for Comparative Genomics, University of Iowa, Iowa City, IA 52242, USAb Bigelow Laboratory for Ocean Sciences, West Boothbay Harbor, Maine, USA

a r t i c l e i n f o

Article history:Received 25 January 2009Revised 14 April 2009Accepted 20 April 2009Available online 3 May 2009

Keywords:ChromalveolataMultigene phylogeneticsPlastidophilaRhizariaSupergroupTree of life

1055-7903/$ - see front matter � 2009 Elsevier Inc. Adoi:10.1016/j.ympev.2009.04.012

* Corresponding author. Fax: +1 319 335 1069.E-mail addresses: [email protected] (V.C.

(M.T. Peglar), [email protected] (H.S. Yoon), [email protected] (M. Wu), [email protected]@uiowa.edu (J.L. Grafenberg), [email protected]@uiowa.edu (S.E. Rümmele), [email protected] (D. Bhattacharya).

a b s t r a c t

The Chromalveolata ‘‘supergroup” is a massive assemblage of single-celled and multicellular protists suchas ciliates and kelps that remains to be substantiated in molecular trees. Recent multigene analyses placechromalveolates into two major clades, the SAR (Stramenopiles, Alveolata, and Rhizaria) and the Crypto-phyta + Haptophyta. Here we determined 69 new sequences from different chromalveolates to study theinterrelationships of its constituent phyla. We included in our trees, the novel groups Telonemia and Kat-ablepharidophyta that have previously been described as chromalvoleate allies. The best phylogeneticresolution resulted from a 6-protein (actin, a-tubulin, b-tubulin, cytosolic HSP70, BIP HSP70, HSP90)and a 5-protein (lacking HSP90) alignment that validated the SAR and cryptophyte + haptophyte cladeswith the inclusion of telonemids in the former and katablepharids in the latter. We assessed the Plasti-dophila hypothesis that is based on EF2 data and suggest this grouping may be explained by horizontalgene transfers involving the EF2 gene rather than indicating host relationships.

� 2009 Elsevier Inc. All rights reserved.

1. Introduction

In recent years, significant advances have been made in molec-ular phylogenetics driven by next-generation sequencing technol-ogy, more powerful analytical methods, and the availability of agreater diversity of organisms through culture collections andenvironmental sampling. These inroads have significantly im-proved our understanding of deep-level evolutionary relationshipsamong the major eukaryotic groups and led to the discovery ofnew phyla (e.g., Not et al., 2007a; Shalchian-Tabrizi et al., 2006).Currently, eukaryote diversity is classified into six putative ‘‘super-groups”: Opisthokonta, Amoebozoa, Rhizaria, Excavata, Chromal-veolata, and Plantae (Adl et al., 2005; Cavalier-Smith, 2004).However phylogenetic support for these supergroups is highly var-iable (see Parfrey et al., 2006), depending greatly on taxon sam-pling, genetic markers considered, and phylogeneticreconstruction methods. What has however become abundantlyclear is that a single-gene approach is insufficient to robustly re-

ll rights reserved.

Reeb), [email protected]@gmail.com (J.R. Bai),

(P. Shiu), jessie-grafenber-u (A. Reyes-Prieto), susanne-

@uiowa.edu (J. Gross), debas-

solve basal splits in the eukaryote tree of life (ToL). Single gene ap-proaches are also prone to gross misinterpretation if the chosensequence has undergone highly variable rates of change in differ-ent lineages or more seriously, horizontal gene transfer (HGT) orendosymbiotic gene transfer (EGT). The latter two processes canresult in well-supported, conflicting relationships among taxawhen different genes are used to infer trees.

Two types of approaches currently exist with regard to infer-ence of the ToL. The first is to use short sequences (e.g., DNA bar-codes) to ‘place’ extant diversity within a tree and studypopulation or species-level phenomena. This approach can also in-clude the rDNA genes that generally are excellent phylogeneticmarkers and available from a broad diversity of taxa due to theirlong use in systematics and application to metagenomic and otherexploratory studies (e.g., Edgcomb et al., 2002; Huse et al., 2008).Barcodes and rDNA trees cannot however resolve the eukaryotictree in its entirety due to their limited phylogenetic resolution(e.g., Parfrey et al., 2006). The second approach has been to relyon multi-gene frameworks that incorporate from several to overa 100 genes to study supergroup relationships (e.g., Burki et al.,2008; Hackett et al., 2007; Hampl et al., 2009; Rodriguez-Ezpeletaet al., 2007). This latter approach has played a central role in cur-rent views of eukaryote evolution but until now has the major dis-advantage of limited taxon sampling. This is because completegenome data, that drives phylogenomic methods, are only now

V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211 203

slowly appearing for under-studied eukaryote groups that occupykey nodes in the ToL. Once many more of these genomes have beensequenced, then the gulf between taxon diversity and depth ofgene sampling will be adequately bridged and a robust well-sam-pled ToL will likely emerge. However genome data will offer itsown set of concerns such as identifying the ‘right’ markers to inferthe ToL that have not undergone HGT or EGT during their historyand to have available the computational power to analyze rapidlyhundreds of genes from potentially hundreds of species.

Here we take another step towards resolving the ToL, with a fo-cus on photosynthetic eukaryotes, by combining the multigene (3-gene, 5-gene, 6-gene) approach with a broad diversity of taxa. Ourfocus is on understanding the relationship between the Chromal-veolata and other eukaryotes. Plantae, the other major photosyn-thetic supergroup, and chromalveolates are defined by primaryand secondary endosymbiosis, respectively, that gave rise to theirplastids (e.g., Lane and Archibald, 2008; Reyes-Prieto et al.,2007). Understanding the phylogeny of photosynthetic groupswith respect to their non-photosynthetic and plastid-lacking coun-terparts is essential to elucidate the frequency of plastid origins.These data can be used to critically assess whether plastid endo-symbiosis comprises a useful basis for defining supergroups orother taxonomic entities.

Until recently, the chromalveolates included six phyla, dividedinto 2 subgroups: Alveolata and Chromista. The alveolates includesdinoflagellates, apicomplexans, and ciliates and most single- andmulti-gene phylogenetic analyses support their monophyly andsister group relationship to Stramenopiles (e.g., Baldauf et al.,2000; Gajadhar et al., 1991; Yoon et al., 2008). The kingdom Chro-mista (haptophytes, cryptophytes, and Stramenopiles) was pro-posed by Cavalier-Smith based on the presence of chlorophyll cand the position of the plastid in the rough endoplasmic reticulumin these taxa (Cavalier-Smith, 1981). The Chromista hypothesiswas initially supported by trees inferred from plastid and nuclearencoded plastid targeted proteins (Fast et al., 2001; Yoon et al.,2002a, 2004). However, nuclear non-plastid targeted proteinsshow it to be paraphyletic, whereby Stramenopiles groups withAlveolata and the cryptophytes and haptophytes form a monophy-letic group that is evolutionarily distantly related to the strameno-pile + alveolate clade (e.g., Hackett et al., 2007; Patron et al., 2007).Furthermore, monophyly of the Chromalveolata in toto, that is usu-ally well supported by plastid data (Bachvaroff et al., 2005; Harperand Keeling, 2003; Inagaki et al., 2004; Patron et al., 2004; Yoonet al., 2002b, 2005), is not found using alignments of non-plastidlocalized proteins (Cavalier-Smith, 2002, 2004; Harper and Keel-ing, 2003; Harper et al., 2005; Nishi et al., 2005; Stechmann andCavalier-Smith, 2003). Recently, multigene phylogenies haveshown surprisingly that Rhizaria taxa are nested within theChromalveolata (Burki et al., 2007; Hackett et al. 2007). This mas-sive assemblage of protists that is based solely on molecular datawith no unifying morphological synapomorphies has been referredto as the SAR clade (i.e., Stramenopiles + Aveolata + Rhizaria; Burkiet al., 2007). Until now, only Hackett et al., (2007), using a 16-pro-tein alignment, have shown moderate bootstrap support for theunion of the SAR clade with cryptophytes + haptophytes.

Several new phyla have been provisionally placed as sister toexisting chromalveolate members. The Katablepharidophyta(Kathablepharida) are freshwater and marine heterotrophic flagel-lates with ca. 6 described species (Clay and Kugrens, 1999). Theirmorphological characters ally them with either cryptophytes oralveolates, and small subunit (SSU) rDNA data support a sistergroup relationship to Cryptophyta (Okamoto and Inouye, 2005).The genus Telonema (Griessmann, 1913) represents a group of mar-ine heterotrophic protists that are likely related to the paraphyletic‘‘chromist” lineages (Shalchian-Tabrizi et al., 2006). Phylogeneticdata demonstrate the uniqueness of the group which led to its

transfer to the new phylum Telonemia with an apparent relation-ship to the cryptophytes + haptophytes (Shalchian-Tabrizi et al.,2006) based on HSP90 and SSU rDNA data. The phylum Chromeri-da (Moore et al., 2008), including the monotypic genus and speciesChromera velia, was recently uncovered in Australia in associationwith the coral Plesiastrea versipora (Moore et al., 2008). Both mor-phological and molecular characters suggest a sister relationshipwith apicomplexans. If confirmed, the position of the Chromeridais of high interest because it may represent a photosynthetic sisterto the Apicomplexa, thereby supporting an algal origin of theseimportant pathogens. And finally, a novel photosynthetic group,the picobiliphytes (Not et al., 2007b), apparently possesses a nucle-omorph-like structure (as in cryptophytes; Not et al., 2007b; butsee Cuvelier et al., 2008) and is related to cryptophytes and kat-ablepharids in 18S rDNA trees (Cuvelier et al., 2008). Here, we in-clude these new lineages (except picobiliphytes) in our trees to testtheir positions relative to chromalveolates.

2. Materials and methods

2.1. Taxon sampling

A total of 129 taxa representing all six eukaryotic supergroupshave been used in this study in order to better assess the mono-phyly of the Chromalveolata, Plantae, and Rhizaria and to deter-mine the phylogenetic positions of the newly described phylaChromerida, Telonemia, and Katablepharidophyta. Our sequencingeffort however focused on members of the Chromaveolata, Plantae,and Rhizaria for which 31 strains were gathered from different cul-ture collections (Supplementary Table S1): American Type CultureCollection (ATCC), The Provasoli-Guillard National Center for Cul-ture of Marine Phytoplankton (CCMP), Scottish Association forMarine Science Culture Collection of Algae and Protozoa (CCAP),Roscoff Culture Collection (RCC), Microbial Culture Collection atthe National Institute for Environmental Studies, Japan (NIES), Cul-ture Collection of Algae at the University of Göttingen (SAG), Cul-ture Collection of Algae at the University of Texas at Austin(UTEX). The Paulinella chromatophora FK01 and MO88/a isolateswere obtained from private collections.

2.2. DNA preparation and sequencing

Cells were frozen in liquid nitrogen and thawed prior to totalgenomic DNA extraction using the DNeasy kit (Qiagen, Santa Clari-ta, CA, USA). Some of the DNA samples were obtained directly fromthe ATCC. Seven nuclear genes were targeted for this study (Sup-plementary Table S1). Both the small and large ribosomal subunitgenes (SSU and LSU rDNA) and five protein coding genes were tar-geted for analysis: actin, a-tubulin (aTub), b-tubulin (bTub), andtwo Hsp70 paralogs, the luminal binding protein (Hsp70BIP) andcytosolic Hsp70 (Hsp70CYT). The sequences of heat shock protein90 (Hsp90) and translation elongation factor 2 (EF2) were obtainedfrom NCBI. Primers used for polymerase chain reaction (PCR)amplification of SSU rDNA, actin, aTub and bTub are as in Yoonet al. (2008). Primers for PCR amplification of the LSU are fromMoreira et al. (2007). Degenerate PCR primers were designed toamplify conserved regions of Hsp70BIP and Hsp70CYT from cryp-tophytes and haptophytes. The sequences of these oligos are asfollows: hsp70F2 50-ggcatcgatctnggnacnac-30 and hsp70R2 50-cgtcatggcackytcnccyt-30; CrypBIPF 50-acctactcItgcgtiggtgtgtacaa-30

and CrypBIPR 50-gtcatggctgiytciccytcraaiacctg-30; CrypCYT201F50-gagcgtctcatcggigatgc-30 and CryptCYT1581R 50-grttgtcggcgt-argtggagaag-30. Herculase� II Fusion DNA polymerase from Strata-gene (Cat. #600677) was used to amplify the different genes usingPCR. PCR fragments for the protein coding genes were cloned using

204 V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211

the pCR-blunt II Topo vector (Invitrogen catalog No. 45-0245). Upto 8 clones per gene and per taxon were sequenced in order to de-tect potential paralogs. Sequencing of cloned plasmid DNA wasdone using vector or gene-specific primers. PCR products of rRNAcoding regions were sequenced directly using gene-specific prim-ers. Sequencing reactions were done using the BigDyeTM terminatorkit (PE-Applied Biosystems, Norwalk, CT, USA) and run on an ABI3730 automated DNA sequencer at the Roy J. Carver Center forComparative Genomics at the University of Iowa. Because katable-pharids and telonemids are predators that feed on haptophyte al-gae, we also sequenced the respective algal prey provided by theculture collections to insure that we obtained sequences of the tar-get taxa.

2.3. Phylogenetic analyses

Phylogenetic analyses were carried out on several datasetsincluding sequences generated from this study (Supplementary Ta-ble S1) and sequences obtained from different public databases(e.g., NCBI, Joint Genome Institute). The datasets are as follows:(1) 108 taxa for 3 protein coding genes, actin, aTub, and bTub(108taxa_3protComb); (2) 112 taxa for 5 genes, SSU, LSU, actin,aTub, and bTub (112taxa_5genesComb); (3) 75 taxa for 6 proteincoding genes, actin, aTub, bTub, Hsp70BIP, Hsp70CYT, and Hsp90(75taxa_6protComb); (4) 64 taxa for EF2 (64taxa_EF2); (5) 75 taxafor 7 proteins coding genes actin, aTub, bTub, Hsp70BIP, Hsp70CYT,Hsp90, and EF2 (75taxa_7protComb); (6) 66 taxa for Hsp90 (66tax-a_Hsp90); and (7) 75 taxa for 5 protein coding genes, actin, aTub,bTub, Hsp70BIP and Hsp70CYT. Before combining each dataset,individual genes were analyzed and checked for incongruencyusing a RAxML bootstrap proportion >70% for conflicting groupingsas the criterion.

Ribosomal RNA sequences were aligned as nucleotides and pro-tein coding genes were aligned as amino acids in MacClade v4.07(Maddison and Maddison, 2005). Ambiguously aligned regions thatviolated positional homology were assessed manually and ex-cluded from further analysis. Genealogies were inferred using theparallel version of MrBayes 3.1.1 (Ronquist and Huelsenbeck,2003) and RAxML v 7.0.4 (Stamatakis et al., 2008) on the Cyberin-frastructure for Phylogenetic Research (CIPRES) portal v 1.13 athttp://www.phylo.org/. Models of sequence evolution for the dif-ferent datasets were determined using Modeltest 3.7 (Posada andCrandall, 1998) for nucleotides sequences and ProtTest 1.4 (Aba-scal et al., 2005) for amino acid sequences. For Bayesian analyses,the WAG + I + C model was applied to all partitions that were com-bined in the 75taxa_6protComb, 64taxa_EF2, 75taxa_7protComb,66taxa_Hsp90, and 75taxa_5protComb datasets. One model wasapplied per partition for the 108taxa_3protComb (WAG + I + Cfor actin and bTub, RtREV + C for aTub) and 112taxa_5genesComb(GTR + I + C for SSU and LSU, WAG + I + C for actin and bTub,RtREV + C for aTub) datasets, using the unlinked option. Two timesfour MCMCMC chains were run simultaneously for 2 million gen-erations sampling trees every 100 generations for each of the data-sets under investigation. Stationarity in likelihood scores wasdetermined by plotting the tree �lnL values against generationsunder Tracer v1.3 (Rambaut and Drummond, 2005). All trees belowthe stationarity level were discarded (i.e., as ‘burnin’). A majorityrule consensus tree was generated from the post-burnin trees.RAxML bootstrap analysis was done with each dataset (exceptthe 112taxa_5genesComb) for 100 iterations using the WAG sub-stitution matrix, empirical base frequencies, and estimated propor-tion of invariable sites. A majority rule consensus tree wascalculated to determine the support values for each node.

Because of their high rate of evolution, and resulting long-branch attraction, foraminifers were excluded from all analyses,thus only Cercozoa remain in our dataset to represent Rhizaria.

Nevertheless, foraminifers showed a sister relationship to the coreRhizaria clade in the actin gene analysis (e.g., Nikolaev et al., 2004),whereas in other single-gene analyses they grouped with the rho-dophytes which also have long branches (see also Yoon et al.,2008). For the same reasons, SSU and LSU rRNA from excavateswere excluded from the 112taxa_5genesComb analyses.

2.4. Testing tree topologies

We implemented the approximately unbiased (AU-) test (Shi-modaira, 2002) to assess the likelihoods of tree topologies thatsupport alternative positions of Telonemia or those that enforcethe monophyly of the supergroup Plastidophila. For testing theplacement of Telonemia, we generated different backbone phylog-enies that were identical to the best RAxML topologies using the108taxa_3protComb, 75taxa_5protComb (excluding Hsp90),75taxa_6protComb, and 66taxa_Hsp90 datasets, but we re-posi-tioned Telonemia at the base and at the first split within every phy-lum, as well as at every branch defining higher taxonomic clades.To test Plastidophila monophyly on phylogenies that did not re-cover this group, we forced monophyly of Viridiplantae, Rhodo-phyta, Cryptophyta, Haptophyta and Katablepharidophyta on theRAxML best trees for the 75taxa_6protComb, 70taxa_actin, 72tax-a_aTub, 72taxa_bTub, 45taxa_Hsp70CYT, and 66taxa_Hsp90 data-sets. RAxML analysis of the Hsp70 (BIP) dataset recoveredmonophyly of Plastidophila (without bootstrap support) thus wasnot tested further. We repeated the same set of tests with theinclusion of Telonemia in the Plastidophila. In addition, we as-sessed alternative positions for each of the Plastidophila phyla(i.e., Viridiplantae, Rhodophyta + cryptophytes nucleomorphs,Cryptophyta, and Katablepharidophyta + Haptophyta) on the EF2best RAxML topology. For this purpose, we placed each of thesephyla at the base and at the first split within other phyla, as wellas at every branch defining higher taxonomic clades. The site-by-site likelihoods for trees in the different analyses were calculatedusing the respective dataset and TREEPUZZLE v5.2 (Schmidtet al., 2002) with the WAG + C + F evolutionary model (the alphavalue for the gamma distribution was obtained from RAxML).The AU-test was implemented using CONSEL v 0.1i (Shimodairaand Hasegawa, 2001) to identify the pool of probable trees in eachtest and to assign their probabilities.

3. Results and discussion

3.1. Results of multi-gene phylogenetic analyses

The major goal of our multi-gene analysis was to determine theinterrelationships of known and putative chromalveolate phylawithin a broadly sampled eukaryotic ToL. To this end, we gener-ated 69 sequences from 31 taxa. Our novel data comprised 16 ac-tin, 16 aTub, 10 bTub, 3 Hsp70BIP, 2 Hsp70CYT, 5 SSU rDNA, and 17LSU rDNA coding regions (Supplementary Table S1). The 3-protein(108taxa_3protComb; 1088 aa) RAxML tree inferred from thesedata is shown in Fig. 1. This phylogeny was rooted on the branchleading to the Opisthokonta. The only supergroups that receive sig-nificant bootstrap and Bayesian support in this tree are the unikontlineages (i.e., Opisthokonta and Amoebozoa; Stechmann and Cava-lier-Smith, 2003; see also Yoon et al., 2008) with strong support forthe separation of unikont and bikont eukaryotes (RAxML bootstrapproportions [RBP] = 94%; Bayesian posterior probability [BPP] =1.0). In general, among bikonts the ‘deepest’ nodes that are consis-tently supported are at the level of phylum (e.g., Alveolata[RBP = 77%; BPP = 1.0]; Viridiplantae [RBP = 97%; BPP = 1.0]; Cryp-tophyta [RBP = 100%; BPP = 1.0]). Two interesting results of notewith respective to chromalveolates are the association in the

Alexandrium tamarense Lingulodinium polyedrum Heterocapsa triquetra

Amphidinium carterae Karlodinium veneficum

Karenia brevis Oxyrrhis marina

Perkinsus marinus Plasmodium berghei Plasmodium yoelii

Plasmodium falciparum Toxoplasma gondii

Theileria parva Cryptosporidium hominis Cryptosporidium parvum

Tetrahymena pyriformis Tetrahymena thermophila

Paramecium tetraurelia Thalassiosira pseudonana

Phaeodactylum tricornutum Mallomonas rasilis

Heterosigma akashiwo Ochromonas danica

Ankylochrysis lutea Aureococcus anophagefferens

Pinguiococcus pyrenoidosus Bolidomonas pacifica

Cafeteria roenbergensis Plectospira myriandra

Thraustotheca clavata Phytophthora palmivora

Apodachlya brachynema Phytophthora ramorum Phytophthora sojae

Telonema subtile Telonema antarcticum

Trypanosoma brucei Neobodo designis

Euglena gracilis Jakoba libera

Naegleria gruberi Acrasis rosea

Streblomastix strix Trichomonas vaginalis

Bodomorpha minima Heteromita globosa

Cercomonas sp. Paulinella chromatophora FK01

Paulinella chromatophora MO88 Thaumatomonas sp.

Cholamonas cyrtodiopsidis Bigelowiella natans

Bigelowiella sp. Gymnophrys sp.

Dunaliella sp. Chlamydomonas reinhardtii Volvox carteri Scenedesmus obliquus

Ostreococcus tauri Ostreococcus ‘lucimarinus’

Micromonas pusilla Micromonas pusilla

Arabidopsis thaliana Populus trichocarpa

Glycine max Zea mays

Oryza sativa Physcomitrella patens

Mesostigma viride Chlorella sp.

Helicosporidium sp. Glaucocystis nostochinearum

Cyanophora paradoxa Chrysochromulina polylepis Prymnesium parvum Imantonia rotunda

Pleurochrysis carterae Chrysochromulina sp.

Isochrysis galbana Emiliania huxleyi

Diacronema sp. Pavlova gyrans

Pavlova lutheri Katablepharis japonica Leucocryptos marina

Chroomonas mesostigmatica Storeatula sp. Guillardia theta Cryptomonas paramaecium Rhodomonas sp.

Rhodomonas salina Goniomonas truncata

Flintiella sanguinaria Porphyra yezoensis

Chondrus crispus Cyanidioschyzon merolae

Rhodochaete parvula Galdieria sulphuraria

Dictyostelium purpureum Dictyostelium discoideum

Physarum polycephalum Danio rerio

Mus musculus Drosophila melanogaster

Strongylocentrotus purpuratus Sclerotinia sclerotiorum

Neurospora crassa Ustilago maydis

86

100

100

80

77

97 100 75

90 80

100

100 100

100

100

100

100

100

100

100 100

100

100

99

99

74 100

91 98

100

100 92

97

100

100 100

100

74

88

100 100

100

100

100

100 90

100

87 83 76

100 100

99 100

100 100

100

100

Dinoflagellata

Apicomplexa

Ciliophora

Stramenopiles

Rhizaria

Telonemia

Excavata

Viridiplantae

Glaucophyta

Rhodophyta

Cryptophyta

Haptophyta

Katablepharidophyta

Amoebozoa

Ophistokonta

75

86

94

0.1 substitutions/site

Fig. 1. Phylogeny of the major eukaryotic groups inferred from maximum likelihood (RAxML) analysis of the combined amino acid sequences of actin, a-tubulin, and b-tubulin from 108 taxa (108taxa_3protComb data set). The phylogram shows the RAxML tree with the highest likelihood. Numbers above the branches are RAxML bootstrapproportions when P50%. Thick branches indicate Bayesian posterior probabilities P95%. Branch lengths are proportional to the number of substitutions per site (see scalebar).

V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211 205

Bayesian analysis of katablepharids with haptophytes (BPP = 1.0)and the positioning of telonemids as sister to Stramenopiles(BPP = 1.0) that are monophyletic with alveolates (BPP = 1.0).

These results support expansion of the Chromalveolata to includethese taxa although this awaits definitive proof of overall chromal-veolate monophyly (see below) that is not supported here because

206 V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211

neither the SAR nor the cryptophyte + haptophyte clades are recov-ered in the tree. Given these results, it is clear that even at this rel-atively broad sampling of taxon diversity, these 3 conservedeukaryotic proteins do not encode adequate phylogenetic signalto resolve basal chromalveolate relationships because most ofthese nodes receive little or no bootstrap or Bayesian support. Thissuggests that future analyses must either use other, more powerful(e.g., longer) phylogenetic markers for this purpose or includeadditional gene data to infer the ToL in its entirety. Given ouruse of a relatively broad sample of taxa, it is unclear whether addedtaxon diversity will alleviate this paucity in deep phylogeneticsignal.

3.2. Does the addition of rDNA data help the situation?

To test the effect of adding two conserved gene sequences toour 3-protein data, we generated a 5-gene, 112 taxon alignmentthat included SSU and LSU rDNA (112taxa_5genesComb; 3109nt + 1088 aa). These data were analyzed using a mixed-model(DNA, amino acids; see Methods) approach under Bayesian infer-ence. The topology of this tree (Supplementary Fig. S1) is similarto the 3-protein phylogeny with some important differences. Thekatablepharids are associated with cryptophytes (BPP = 1.0) inthe 5-gene tree rather than with Haptophyta as in the 3-proteinphylogeny. Association of Katablepharidophyta with the crypto-phytes is generally recovered from rDNA data (Cuvelier et al.,2008; Not et al., 2007a; Okamoto and Inouye, 2005; Slapetaet al., 2006), but not with protein data (this study, Kim and Gra-ham, 2008; Kim et al., 2006). In addition, it should be noted thatunlike cryptophytes that have flat mitochondrial cristae, hapto-phytes and katablepharids both contain tubular cristae.

We included Chromera velia rDNA sequences in this tree andshow it to be affiliated with dinoflagellates (BPP = 0.99) rather thanwith apicomplexans as previously reported (Moore et al., 2008),and there is Bayesian support (BPP = 0.98) for the union of two ofthe three Plantae phyla (Viridiplantae and Rhodophyta). RegardingC. velia, according to (Moore et al., 2008) this species is clearly analveolate because it contains three of the diagnostic features:micropore, subsurface alveoli supported by microtubules, andmitochondria with ampulliform and tubular cristae. The Chromeri-da possess a single large cone-shaped plastid bound by four mem-branes. This group also has unique and novel features: e.g., itcontains a photosynthetic (likely red algal-derived as in most otherchromalveolates) plastid containing chlorophyll a, but not chloro-phyll c (Moore et al., 2008); the taxon uses UGA codons to encodetryptophan in the psbA plastid gene (also found in apicoplasts ofcoccidians and in various mitochondria), whereas other algae useUGG. Phylogenetic analyses show a close relationship of C. veliato apicomplexans (LSU rDNA), or to colpodellids (SSU rDNA) withrejection using various topology tests of the alternative placement(found here) as sister to dinoflagellates (Moore et al., 2008). Inter-estingly, the psbA phylogeny (i.e., in absence of apicomplexans thatlack photosynthesis) places C. velia close to the peridinin dinoflag-ellates. A close relationship of the C. velia plastid to that of dino-flagellates is also supported by the arrangement of thylakoidlamellae in stacks of three, but this may be the ancestral conditionfor the putative photosynthetic ancestor of apicomplexans anddinoflagellates. Given these considerations, it is likely that the affil-iation of C. velia with dinoflagellates found here reflects the ab-sence of protein data in our alignment from Chromerida.

With regard to the Plantae, this putative supergroup has untilnow been recovered only in some multigene analyses (Burkiet al., 2008; Hackett et al., 2007; Rodriguez-Ezpeleta et al., 2005).However in the 5-gene tree, the third Plantae phylum, Glauco-phyta, is associated with cryptophytes and katablepharids(BPP = 1.0). The monophyly of glaucophytes and cryptophytes has

long been noted in rDNA trees (e.g., Bhattacharya et al., 1995; Shal-chian-Tabrizi et al., 2007; Van de Peer and De Wachter, 1997) andis generally not supported (but see Rensing et al., 1997) when pro-tein data are used to infer phylogeny. The rDNA sequences are pre-sumably responsible for the 5-gene result in this case, although thebasis for the divergent phylogenetic signal is currently unknown.Taken together, the rDNA data, although, adding significant lengthto the alignment fail to improve appreciably our ability to resolvebasal splits in the eukaryotic tree.

3.3. A 6-protein data set provides a reasonably well-resolved ToL

Given the results described above, we added 3 conserved pro-teins (Hsp70CYT, Hsp70BIP, and Hsp90]) to the existing 3-proteinalignment (75taxa_6protComb; 2302 aa) with the goal of gainingadded phylogenetic resolution. The extent of missing data for thisalignment is shown in Supplementary Table S2. This tree (Fig. 2),despite extensive missing data for some protists (e.g., ca. 50% inthe dinoflagellate Alexandrium tamarense), provides robust supportfor two important existing hypotheses based on larger, relativelytaxon-depauperate data sets (e.g., Burki et al., 2008; Hackettet al., 2007). First, the SAR clade with Rhizaria as sister to Stra-menopiles + Alveolata (RBP = 84%; BPP = 1.0 [as in Hackett et al.(2007)] is well supported (RBP = 88%; BPP = 1.0) and second, thecryptophyte + haptophyte clade, now with the addition of the kat-ablapharids (RBP = 65%; BPP = 1.0) is also resolved in our tree. As inthe 3-protein tree, katablepharids are monophyletic with hapto-phytes, but without strong support (RBP = 64%). In contrast how-ever to the 3-protein and 5-gene trees, the telonemids (ca. 45%missing data) form an independent split at the base of the tree withno obvious affiliation to Stramenopiles or alveolates. The Glauco-phyta and Rhodophyta are sisters in the RAxML tree but with nobootstrap or Bayesian support and the Viridiplantae do not receivesupport for their phylogenetic position within the eukaryoticradiation.

The telonemid result is worthy of further consideration. The ini-tial description of this protist phylum used Hsp90 alone and incombination with SSU rDNA to suggest monophyly with crypto-phytes with this clade being sister to haptophytes (Shalchian-Tab-rizi et al., 2006). The sequences of aTub and bTub were generatedby this study but the addition of these genes to the Hsp90 datadid not provide added resolution of the placement of Telonemiaamong eukaryotes. Subsequent analyses using rDNA either pro-vided no resolution (Shalchian-Tabrizi et al., 2007) or suggestedan affiliation with the SAR clade (Lefevre et al., 2008; BPP = 1.0).In our analyses, although the individual actin, aTub, and bTub RAx-ML trees do not place Telonemia in the tree with support (resultsnot shown; see also Shalchian-Tabrizi et al., 2006), the concate-nated data tree supports the affiliation of telonemids with Stra-menopiles (Fig. 1), and the 5-gene tree that includes rDNA placesthem within the Stramenopiles + Alveolata clade (Fig. S1). It istherefore very surprising that the 6-protein alignment providesno resolution at all with respect to this clade (Fig. 2) given the in-creased phylogenetic support that is found for other groups in thetree. The 6-protein phylogeny shows telonemids to comprise anindependent split in the ToL. The reason for this odd result be-comes apparent when one inspects the Hsp90 tree that (as in Shal-chian-Tabrizi et al., 2006) places telonemids sister to cryptophytes(RBP = 50%, Supplementary Fig. S2). The effect of the removal ofHsp90 from the 6-protein alignment is predictable, with telone-mids again being monophyletic with Stramenopiles (RBP = 51%;BPP = 1.0) and the remainder of the tree remaining relatively un-changed (Supplementary Fig. S3). What this suggests to us is thatthe telonemid genes that have been studied thus far provide (atleast) two types of phylogenetic signal with Hsp90 as the outlier,suggesting a specific relationship with cryptophytes that is not

Dinoflagellata

Apicomplexa

Ciliophora

Stramenopiles

Rhizaria

Telonemia

Excavata

Viridiplantae

Glaucophyta

Rhodophyta

Cryptophyta

Haptophyta

Katablepharidophyta

Amoebozoa

Ophistokonta

100

100

88

84

100

100

100

100 91

100 78

86 72 Heterocapsa triquetra

Crypthecodinium cohnii Alexandrium tamarense Amphidinium carterae 100

Karlodinium veneficum Karenia brevis

Oxyrrhis marina Perkinsus marinus

100 73

100 100 Plasmodium berghei Plasmodium falciparum

100 Babesia sp. Theileria parva

Toxoplasma gondii Tetrahymena thermophila Paramecium tetraurelia

100 51

97 100 Thalassiosira pseudonana

Phaeodactylum tricornutum 52 Aureococcus anophagefferens

Heterosigma akashiwo 100 Phytophthora ramorum

Phytophthora sojae Spumella uniguttata

100 98 Thaumatomonas sp.

Cercomonas sp. 100 Bigelowiella sp.

Bigelowiella natans

95 87

100 100

100 Trypanosoma cruzi Trypanosoma brucei

Leishmania major Euglena gracilis

Jakoba libera Naegleria gruberi

100

100 76

93 70

50 Glycine max Arabidopsis thaliana

Nicotiana tabacum Zea mays

Physcomitrella patens Selaginella moellendorfii

65 100

100 Ostreococcus ‘lucimarinus’ Ostreococcus tauris

Micromonas pusilla 100 Chlamydomonas reinhardtii

Volvox carteri

99 94

82 Porphyra yezoensis Chondrus crispus

Flintiella sanguinaria 78 Galdieria sulphuraria

Cyanidioschyzon merolae 100 Glaucocystis nostochinearum

Cyanophora paradoxa

65

64 100

100 100 Emiliania huxleyi Isochrysis galbana

Prymnesium parvum Pavlova lutheri

100 Katablepharis japonica Leucocryptos marina

100 100

75 65 Chroomonas mesostigmatica

Storeatula sp. Guillardia theta

Cryptomonas paramaecium Rhodomonas salina

Goniomonas truncata 100 Telonema subtile

Telonema antarcticum 100 Dictyostelium purpureum

Physarum polycephalum

100

100 100

100 Xenopus laevis Danio rerio

100 Drosophila melanogaster Apis mellifera

Monosiga brevicollis

100 100 Neurospora crassa

Magnaporthe grisea Debaryomyces hansenii

Plasmodium yoelii

0.1 substitutions/site

Fig. 2. Phylogeny of the major eukaryotic groups inferred from maximum likelihood (RAxML) analysis of the combined amino acid sequences of 6 nuclear proteins (actin,aTub, bTub, Hsp70BIP, Hsp70CYT, and Hsp90) from 75 taxa (75taxa_6protComb data set). The phylogram shows the RAxML tree with the highest likelihood. Numbers abovethe branches correspond to RAxML bootstrap proportions when P50%. Thick branches indicate Bayesian posterior probabilities P95%. Branch lengths are proportional to thenumber of substitutions per site (see scale bar).

V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211 207

readily apparent with the other 5 genes. Genome data from telone-mids is required to determine whether this result is explained byHGT events in these taxa (e.g., hsp90 and other genes come from

a cryptophyte source) as an explanation for the topological conflict.With respect to cell ultrastructure, some telonemid characters (i.e.,cortical alveoli-like peripheral vacuoles and tubular mitochondrial

208 V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211

cristae) indicate a relationship to alveolates and some chromists(Shalchian-Tabrizi et al., 2006), whereas the presence in Telonemaantarcticum of tripartite tubular flagellar hairs on the immature fla-gellum (Klaveness et al., 2005) provides a character that is shared

76

60

79

98

58

100

64 62

98 94

90

68 GlycA

NicoZea m

Physco

Mes100 Volvo

Chlam100

100 OstreococOstreococ

Micromon

99

57

97

100 86 Ch

BrotryocladiHildenbrandia

100 PorphyPorphyr

84 Galdieria sulphCyanidio

100 GHemise

58 100

100 Emi

PavlovaLeucocryptos m

Guillardia tGon

80 100

100 100

56 TrypanosLeish

TrypanosomKaren

Euglena gracilis 62 Nae

Jakoba libera

74

100

82

96 69

100 Theileria parva Babesia bovis

Toxoplas100 Plasmodium b

Plasmodium f

99 99

73

AlexandrPerkinsus

55

99 86

100 PhaeoThal

Aureococcus ano100 Phytophthora sojae

Phytophthora ramorum71

100 Dictyostelium purpureuPhysarum polycephalum

Cyanophora paradoxa

91

100 87

100 Xenopus laevis Danio rerio

100 Drosophila melanogaster Apis mellifera Monosiga brevicollis

100 100 Neurospora crassa

Magnaporthe grisea Debaryomyces hansenii

91

0.1 substitutions/site

Fig. 3. Phylogeny of the major eukaryotic groups inferred from maximum likelihood (RARAxML tree with the highest likelihood. Numbers above the branches correspond to RAxprobabilities P95%. Branch lengths are proportional to the number of substitutions per

with Stramenopiles (Andersen, 2004). Both of these observationsare in accord with our results and may indicate that Telonemia isan independent lineage within the SAR clade. Given these data, ifone were to discount the Hsp90 data for telonemids, then all of

Ophistokonta

Glaucophyta Amoebozoa

Stramenopiles

Rhizaria

Ciliophora

ine max rabidopsis thaliana

tiana tabacum ays mitrella patens

Selaginella moellendorfii ostigma viride x carteri ydomonas reinhardtii cus ‘lucimarinus’ cus tauris

as pusilla ondrus crispus

Gelidium canariensis a uvarioides rubra ra yezoensis a miniata uraria schyzon merolae uillardia theta

lmis andersenii Isochrysis galbana

liania huxleyi lutheri arina heta iomonas truncata oma cruzi mania major a brucei ia brevis ( Dinoflagellata)

gleria gruberi

ma gondii erghei alciparum

Karlodinium micrum Amphidinium carterae ium tamarense marinus

Paramecium tetraurelia Tetrahymena thermophila

Thaumatomonas sp. Bigelowiella natans

dactylum tricornutum assiosira pseudonana phagefferens

m

Viridiplantae

Rhodophyta

Haptophyta

Katablepharidophyta Cryptophyta

Excavata

Apicomplexa

Dinoflagellata

Cryptophyta nucleomorph

xML) analysis of EF2 from 64 taxa (64taxa_EF2 data set). The phylogram shows theML bootstrap proportions when P50%. Thick branches indicate Bayesian posteriorsite (see scale bar).

V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211 209

the chromalveolates in our trees can be grouped into two majorclades, the SAR clade that putatively includes telonemids and thecryptophyte + haptophyte clade that includes katablepharids. Onlythe picobiliphytes remain to be placed in such a multigene frame-work. Therefore, our results that are based on relatively sparse se-quence data but reasonable taxon sampling significantly simplifiesviews of chromalveolate evolution. The potential union of the twomajor clades, that comprises a vastly different view than formu-lated in the original chromalveolate hypothesis (Cavalier-Smith,1999), remains as a possible future development that may arisefrom the use of genome-level data to address this challengingissue.

To test the phylogenetic position of Telonemia we used the AU-test to assess the likelihoods of tree topologies that placed Telon-emia in alternate positions in the 108taxa_3protComb,75taxa_5protComb, 75taxa_6protComb, and 66taxa_Hsp90 trees.This analysis shows that many alternate positions for telonemidsare not rejected at a significance value of P < 0.05: 13 for108taxa_3protComb, 6 for 75taxa_5protComb, 3 for 75taxa_6prot-Comb, and 21 for 66taxa_Hsp90 trees. It is therefore clear thatthese datasets are unable to robustly resolve the phylogenetic po-sition of Telonemia in the ToL. Nevertheless, the AU-test using thetwo datasets that excluded Hsp90 and showed a Telonemia–Stra-menopiles relationship (i.e., 108taxa_3protComb and75taxa_5protComb) do reject sisterhood of telonemids with cryp-tophytes (P = 0.026 and P = 0.021), haptophytes (P = 0.037 and

Table 1Bayesian posterior probabilities and RAxML bootstrap proportions for major clades in the

Groups 108taxa_3protComb 112taxa_5genesComb 75taxa_6pr

BPP RBP BPP RBP BPP

Dinoflagellata 100 100 100 NA 100Dinoflagellata +

ChromeridaNA NA 99 NA NA

Apicomplexa 100 90 100 NA 100Dinoflagellata +

Apicomplexa100 80 100a NA 100

Ciliophora 100 100 100 NA 100Alveolata 100 77 100a NA 100Stramenopiles / / 100 NA 100Telonemia / 100 100 NA 100Stramenopiles +

Telonemia100 / / NA /

Stramenopiles +Alveolata

100b /b 97b NA 100

Rhizaria 100 74 100 NA 100SAR (Stramenopiles +

Alveolata + Rhizaria)/ / / NA 100

Viridiplantae 100 97 100 NA 100Rhodophyta 100 88 100 NA 100Viridiplantae +

Rhodophyta/ / 95 NA /

Glaucophyta 100 75 100 NA 100Plantae / / / NA /Cryptophyta 100 100 100 NA 100Katablepharidophyta 100 100 100 NA 100Cryptophyta +

Katablepharidophyta/ / 100 NA /

Haptophyta 100 100 100 NA 100Haptophyta +

Katablepharidophyta97 / / NA /

Cryptophyta +Katablepharidophyta +Haptophyta

/ / / NA 100

Chromalveolata / / / / /Plastidophila / / / / /

/ = RPB < 60%; BPP < 95%.NA, not applicable

a Including Chromerida.b Including Telonemia.c Including Cryptophyta nucleomorph.

P = 0.005), and marginally fails to reject their alliance with Kat-ablepharidophyta in the 108taxa_3protComb tree (P = 0.051). Theplacement of Telonemia at the base of the Cryptophyta was not re-jected (P = 0.099) using the 75taxa_6protComb (including Hsp90)but its placement with Stramenopiles was rejected (P = 0.01). A sis-ter group relationship between Telonemia and Stramenopiles isalso rejected in the 66taxa_Hsp90 tree (P = 0.011). Taken together,the results of the AU-tests do not support a specific Telonemia–Stramenopiles sister group relationship because several alternateplacements of telonemids are not significantly rejected. Howeverdatasets containing Hsp90 always reject Telonemia + Strameno-piles suggesting this gene may provide conflicting phylogeneticsignal when compared to the other genes used in our study.

3.4. EF2 and birth of the Plastidophila

An intriguing result that was recently published postulates thepossible existence of a novel supergroup, the ‘‘Plastidophila” (Kimand Graham, 2008) that includes cryptophytes, katablepharids,haptophytes, rhodophytes, and Viridiplantae. This supergroup isbased primarily on analysis of a single gene, the conserved trans-lation elongation factor EF2. The affiliation of these Plantae andchromalveolate lineages is also supported by a unique two aminoacid signature sequence shared by these EF2 proteins. Thereforethe gene tree result appears to be robust. We had previously sug-gested (Hackett et al., 2007; Nosenko and Bhattacharya, 2007)

different phylogenetic analyses.

otComb 64taxa_EF2 75taxa_7protComb 66taxa_Hsp90 75taxa_5protComb

RBP BPP RBP BPP RBP BPP RBP BPP RBP

100 100 99 100 100 100 99 100 100NA NA NA NA NA NA NA NA NA

100 100 96 100 100 99 65 100 100100 100 82 100 100 100 88 100 97

100 100 100 100 100 100 100 100 100100 100 74 100 100 100 100 100 97100 100 99 100 100 100 100 100 99100 NA NA 100 100 100 100 100 100/ NA NA / / / / 100 /

84 / / 100 97 100 82 100b /

100 100 / 100 100 100 97 100 9788 100 / 100 99 99 / 100 /

100 100 100 100 100 100 95 100 9799 100c 99c 100 100 100 99 100 99/ 100 / / / / / / /

100 NA NA 100 100 100 99 100 75/ / / / / / / / /100 100 / 100 100 100 93 100 100100 NA / 100 100 NA NA 100 100/ / / / / / / / /

100 100 100 100 100 100 100 100 10064 100 / 100 87 / / 96 /

65 / / 100 86 / / 100 /

/ / / / / / / / // 100 98 / / / / / /

210 V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211

that the EF2 topology is best explained by HGT from a cell that isancestral to red and green algae into the haptophyte ancestor(cryptophyte and katablepharid data were not available at thattime), rather than HGT or alternatively EGT from the red algalendosymbiont to the nucleus of chromalveolates. This latter pointis consistent with the positioning of red algal nucleomorph EF2sequences with red algae rather than at the base of these chrom-alveolates. In addition, the putative HGT origin of EF2 is not un-ique in our analyses because the dinoflagellate Karenia brevisalso appears to have gained its EF2 gene from an excavate source(Nosenko and Bhattacharya, 2007). Therefore how do we inter-pret the Plastidophila hypothesis? In our opinion, this idea ismost likely be explained by HGT because no other single genetree in our analyses (results not shown) reproduces robustly thisresult. This is a common feature of HGT (i.e., strong support for aconflicting topology).

To study this issue further, we generated a well-sampled EF2tree to validate the results of Kim and Graham (2008) and to assessthe effects of EF2 inclusion on our 6-protein alignment. Our predic-tion for the latter analysis was that it would disrupt existing cladesin the tree (i.e., Fig. 2) and skew the tree towards Plastidophilamonophyly, but much more weakly than when EF2 was analyzedalone. This is another potential indication of HGT origin; i.e., a sin-gle gene more strongly supports a result than when it is combinedwith other conserved phylogenetic markers. The EF2 tree is shownin Fig. 3. Here, we also recover robustly the Plastidophila(RBP = 98%; BPP = 1.0; see Table 1) using our taxonomically morebroadly sampled data set than that used in Kim and Graham(2008). Many other groupings agree with accepted ideas abouteukaryote phylogeny (e.g., opisthokont monophyly [RBP = 91%;BPP = 1.0], alveolate monophyly [RBP = 74%; BPP = 1.0]) and areconsistent with our 3-protein, 5-gene, and 6-protein analyses.Essentially, only the Plastidophila conflicts strongly with these re-sults. When Kim and Graham (2008) combined EF2 with 5 otherproteins (actin, Hsp70CYT, Hsp90, aTub, bTub), they again recov-ered the Plastidophila but with lower bootstrap support(RBP = 88% versus 98%) for nodes and no longer found monophylyof Viridiplantae and Rhodophyta. When we add EF2 to our 6-pro-tein alignment, maximum likelihood support for the SAR cladeand Excavata rise significantly (RBP = 99%, 100%, respectively),but support for Plastidophila falls to 57% and now includes Telon-emia (Supplementary Fig. S4). These results are consistent with ourprediction described above for an HGT origin of EF2 in Crypto-phyta/Haptophyta/Katablepharidophyta and similar to Hsp90,seems to suggest that caution needs to be used when interpretingsingle gene trees that argue strongly for or against a particularview of ancient (e.g., supergroup) eukaryotic relationships.

To test these results, we again used the AU-test to assess Plast-idophila monophyly in two ways. First, we forced the monophylyof the 5 Plastidophila lineages and also the monophyly of the 5phyla + Telonemia on the 75taxa_6protComb, 70taxa_actin, 72tax-a_aTub, 72taxa_bTub, and 66taxa_Hsp90 trees and assessed thelikelihoods of these competing trees. All of these topologies wererejected at P < 0.05, except for Plastidophila monophyly includingTelonemia using the 66taxa_Hsp90 data set. For the second ap-proach that tested alternative positions of each of the Plastidophilaphyla on the 64taxa_EF2 topology, only two positions were not re-jected in each of the 4 analyses, but these represent rearrange-ments within the Plastidophila. In summary, our resultsdemonstrate that Plastidophila monophyly arises from the EF2dataset, and that alternative topologies that force Plastidophilapolyphyly with this data set are significantly rejected. Other genes,whether single or combined significantly reject the Plastidophilahypothesis, thereby supporting the idea that monophyly of this no-vel supergroup when inferred using EF2 is most likely explained bya gene-specific phenomenon such as HGT.

Acknowledgments

This research was funded by a Grant from the National ScienceFoundation Assembling the Tree of Life program (EF 04-31117). Weare grateful for the constructive comments of three anonymousreviewers.

Appendix A. Supplementary data

Supplementary data associated with this article can be found, inthe online version, at doi:10.1016/j.ympev.2009.04.012.

References

Abascal, F., Zardoya, R., Posada, D., 2005. ProtTest: selection of best-fit models ofprotein evolution. Bioinformatics 21, 2104–2105.

Adl, S.M., Simpson, A.G., Farmer, M.A., Andersen, R.A., Anderson, O.R., Barta, J.R.,Bowser, S.S., Brugerolle, G., Fensome, R.A., Fredericq, S., James, T.Y., Karpov, S.,Kugrens, P., Krug, J., Lane, C.E., Lewis, L.A., Lodge, J., Lynn, D.H., Mann, D.G.,McCourt, R.M., Mendoza, L., Moestrup, O., Mozley-Standridge, S.E., Nerad, T.A.,Shearer, C.A., Smirnov, A.V., Spiegel, F.W., Taylor, M.F., 2005. The new higherlevel classification of eukaryotes with emphasis on the taxonomy of protists. J.Eukaryot. Microbiol. 52, 399–451.

Andersen, R.A., 2004. Biology and systematics of heterokont and haptophyte algae.Am. J. Bot. 91, 1508–1522.

Bachvaroff, T.R., Sanchez Puerta, M.V., Delwiche, C.F., 2005. Chlorophyll c-containing plastid relationships based on analyses of a multigene data setwith all four chromalveolate lineages. Mol. Biol. Evol. 22, 1772–1782.

Baldauf, S.L., Roger, A.J., Wenk-Siefert, I., Doolittle, W.F., 2000. A kingdom-levelphylogeny of eukaryotes based on combined protein data. Science 290, 972–977.

Bhattacharya, D., Helmchen, T., Bibeau, C., Melkonian, M., 1995. Comparisons ofnuclear-encoded small-subunit ribosomal RNAs reveal the evolutionaryposition of the Glaucocystophyta. Mol. Biol. Evol. 12, 415–420.

Burki, F., Shalchian-Tabrizi, K., Minge, M., Skjaeveland, A., Nikolaev, S.I., Jakobsen,K.S., Pawlowski, J., 2007. Phylogenomics reshuffles the eukaryotic supergroups.PLoS ONE 2, e790.

Burki, F., Shalchian-Tabrizi, K., Pawlowski, J., 2008. Phylogenomics reveals a new‘megagroup’ including most photosynthetic eukaryotes. Biol. Lett. 4, 366–369.

Cavalier-Smith, T., 1981. Eukaryote kingdoms: seven or nine? Biosystems 14, 461–481.

Cavalier-Smith, T., 1999. Principles of protein and lipid targeting in secondarysymbiogenesis: euglenoid, dinoflagellate, and sporozoan plastid origins and theeukaryote family tree. J. Eukaryot. Microbiol. 46, 347–366.

Cavalier-Smith, T., 2002. The phagotrophic origin of eukaryotes and phylogeneticclassification of Protozoa. Int. J. Syst. Evol. Microbiol. 52, 297–354.

Cavalier-Smith, T., 2004. Only six kingdoms of life. Proc. Biol. Sci. 271, 1251–1262.Clay, B., Kugrens, P., 1999. Systematics of the enigmatic kathablepharids, including

EM characterization of the type species, Kathablepharis phoenikoston, and newobservations on K. remigera comb.nov.. Protist 150, 43–59.

Cuvelier, M.L., Ortiz, A., Kim, E., Moehlig, H., Richardson, D.E., Heidelberg, J.F.,Archibald, J.M., Worden, A.Z., 2008. Widespread distribution of a unique marineprotistan lineage. Environ. Microbiol. 10, 1621–1634.

Edgcomb, V.P., Kysela, D.T., Teske, A., de Vera Gomez, A., Sogin, M.L., 2002. Benthiceukaryotic diversity in the Guaymas Basin hydrothermal vent environment.Proc. Natl. Acad. Sci. USA 99, 7658–7662.

Fast, N.M., Kissinger, J.C., Roos, D.S., Keeling, P.J., 2001. Nuclear-encoded, plastid-targeted genes suggest a single common origin for apicomplexan anddinoflagellate plastids. Mol. Biol. Evol. 18, 418–426.

Gajadhar, A.A., Marquardt, W.C., Hall, R., Gunderson, J., Ariztia-Carmona, E.V., Sogin,M.L., 1991. Ribosomal RNA sequences of Sarcocystis muris, Theileria annulata andCrypthecodinium cohnii reveal evolutionary relationships among apicomplexans,dinoflagellates, and ciliates. Mol. Biochem. Parasitol. 45, 147–154.

Griessmann, K., 1913. Über marine Flagellaten. Arch. Protistenk. 32, 1–78.Hackett, J.D., Yoon, H.S., Li, S., Reyes-Prieto, A., Rummele, S.E., Bhattacharya, D.,

2007. Phylogenomic analysis supports the monophyly of cryptophytes andhaptophytes and the association of rhizaria with chromalveolates. Mol. Biol.Evol. 24, 1702–1713.

Hampl, V., Hug, L., Leigh, J.W., Dacks, J.B., Lang, B.F., Simpson, A.G., Roger, A.J., 2009.Phylogenomic analyses support the monophyly of Excavata and resolverelationships among eukaryotic ‘‘supergroups”. Proc. Natl. Acad. Sci. USA 106,3859–3864.

Harper, J.T., Keeling, P.J., 2003. Nucleus-encoded, plastid-targeted glyceraldehyde-3-phosphate dehydrogenase (GAPDH) indicates a single origin forchromalveolate plastids. Mol. Biol. Evol. 20, 1730–1735.

Harper, J.T., Waanders, E., Keeling, P.J., 2005. On the monophyly of chromalveolatesusing a 6-protein phylogeny of eukaryotes. Int. J. Syst. Evol. Microbiol. 55, 487–496.

Huse, S.M., Dethlefsen, L., Huber, J.A., Welch, D.M., Relman, D.A., Sogin, M.L., 2008.Exploring microbial diversity and taxonomy using SSU rRNA hypervariable tagsequencing. PLoS Genet. 4, e1000255.

V.C. Reeb et al. / Molecular Phylogenetics and Evolution 53 (2009) 202–211 211

Inagaki, Y., Simpson, A., Dacks, J., Roger, A., 2004. Phylogenetic artifacts can becaused by leucine, serine, and arginine codon usage heterogeneity:dinoflagellate plastid origins as a case study. Syst. Biol. 53, 582–593.

Kim, E., Graham, L.E., 2008. EEF2 analysis challenges the monophyly ofArchaeplastida and Chromalveolata. PLoS ONE 3, e2621.

Kim, E., Simpson, A.G., Graham, L.E., 2006. Evolutionary relationships ofapusomonads inferred from taxon-rich analyses of 6 nuclear encoded genes.Mol. Biol. Evol. 23, 2455–2466.

Klaveness, D., Shalchian-Tabrizi, K., Thomsen, H.A., Eikrem, W., Jakobsen, K.S., 2005.Telonema antarcticum sp. nov., a common marine phagotrophic flagellate. Int. J.Syst. Evol. Microbiol. 55, 2595–2604.

Lane, C.E., Archibald, J.M., 2008. The eukaryotic tree of life: endosymbiosis takes itsTOL. Trends Ecol. Evol. 23, 268–275.

Lefevre, E., Roussel, B., Amblard, C., Sime-Ngando, T., 2008. The molecular diversityof freshwater picoeukaryotes reveals high occurrence of putative parasitoids inthe plankton. PLoS ONE 3, e2324.

Maddison, D.R., Maddison, W.P., 2005. MacClade 4.05. Sinauer, Sunderland.Moore, R.B., Obornik, M., Janouskovec, J., Chrudimsky, T., Vancova, M., Green, D.H.,

Wright, S.W., Davies, N.W., Bolch, C.J., Heimann, K., Slapeta, J., Hoegh-Guldberg,O., Logsdon, J.M., Carter, D.A., 2008. A photosynthetic alveolate closely related toapicomplexan parasites. Nature 451, 959–963.

Moreira, D., von der Heyden, S., Bass, D., Lopez-Garcia, P., Chao, E., Cavalier-Smith,T., 2007. Global eukaryote phylogeny: Combined small- and large-subunitribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata. Mol.Phylogenet. Evol. 44, 255–266.

Nikolaev, S.I., Berney, C., Fahrni, J.F., Bolivar, I., Polet, S., Mylnikov, A.P., Aleshin, V.V.,Petrov, N.B., Pawlowski, J., 2004. The twilight of Heliozoa and rise of Rhizaria, anemerging supergroup of amoeboid eukaryotes. Proc. Natl. Acad. Sci. USA 101,8066–8071.

Nishi, A., Ishida, K., Endoh, H., 2005. Reevaluation of the evolutionary position ofopalinids based on 18S rDNA, and a- and b-tubulin gene phylogenies. J. Mol.Evol. 60, 695–705.

Nosenko, T., Bhattacharya, D., 2007. Horizontal gene transfer in chromalveolates.BMC Evol. Biol. 7, 173.

Not, F., Gausling, R., Azam, F., Heidelberg, J.F., Worden, A.Z., 2007a. Verticaldistribution of picoeukaryotic diversity in the Sargasso Sea. Environ. Microbiol.9, 1233–1252.

Not, F., Valentin, K., Romari, K., Lovejoy, C., Massana, R., Tobe, K., Vaulot, D., Medlin,L.K., 2007b. Picobiliphytes: a marine picoplanktonic algal group with unknownaffinities to other eukaryotes. Science 315, 253–255.

Okamoto, N., Inouye, I., 2005. The katablepharids are a distant sister group of theCryptophyta: A proposal for Katablepharidophyta divisio nova/Kathablepharida phylum novum based on SSU rDNA and b-tubulinphylogeny. Protist 156, 163–179.

Parfrey, L.W., Barbero, E., Lasser, E., Dunthorn, M., Bhattacharya, D., Patterson, D.J.,Katz, L.A., 2006. Evaluating support for the current classification of eukaryoticdiversity. PLoS Genet. 2, e220.

Patron, N.J., Inagaki, Y., Keeling, P.J., 2007. Multiple gene phylogenies support themonophyly of cryptomonad and haptophyte host lineages. Curr. Biol. 17, 887–891.

Patron, N.J., Rogers, M.B., Keeling, P.J., 2004. Gene replacement of fructose-1, 6-bisphosphate aldolase supports the hypothesis of a single photosyntheticancestor of chromalveolates. Eukaryot. Cell 3, 1169–1175.

Posada, D., Crandall, K.A., 1998. MODELTEST: testing the model of DNA substitution.Bioinformatics 14, 817–818.

Rambaut, A., Drummond, A., 2005. Tracer, MCMC Trace Analysis Tool Version 1.3.Rensing, S., Obrdlik, P., Rober-Kleber, N., Müller, S., Hofmann, C., Van de Peer, Y.,

Maier, U.-G., 1997. Molecular phylogeny of the stress-70 protein family withreference to algal relationships. Eur. J. Phycol. 32, 279–285.

Reyes-Prieto, A., Weber, A.P., Bhattacharya, D., 2007. The origin and establishmentof the plastid in algae and plants. Annu. Rev. Genet. 41, 147–168.

Rodriguez-Ezpeleta, N., Brinkmann, H., Burey, S.C., Roure, B., Burger, G., Loffelhardt,W., Bohnert, H.J., Philippe, H., Lang, B.F., 2005. Monophyly of primaryphotosynthetic eukaryotes: green plants, red algae, and glaucophytes. Curr.Biol. 15, 1325–1330.

Rodriguez-Ezpeleta, N., Brinkmann, H., Burger, G., Roger, A.J., Gray, M.W., Philippe,H., Lang, B.F., 2007. Toward resolving the eukaryotic tree: the phylogeneticpositions of jakobids and cercozoans. Curr. Biol. 17, 1420–1425.

Ronquist, F., Huelsenbeck, J.P., 2003. MrBayes 3: Bayesian phylogenetic inferenceunder mixed models. Bioinformatics 19, 1572–1574.

Schmidt, H.A., Strimmer, K., Vingron, M., von Haeseler, A., 2002. TREE-PUZZLE:maximum likelihood phylogenetic analysis using quartets and parallelcomputing. Bioinformatics 18, 502–504.

Shalchian-Tabrizi, K., Eikrem, W., Klaveness, D., Vaulot, D., Minge, M.A., Le Gall, F.,Romari, K., Throndsen, J., Botnen, A., Massana, R., Thomsen, H.A., Jakobsen, K.S.,2006. Telonemia, a new protist phylum with affinity to chromist lineages. Proc.Biol. Sci. 273, 1833–1842.

Shalchian-Tabrizi, K., Kauserud, H., Massana, R., Klaveness, D., Jakobsen, K.S., 2007.Analysis of environmental 18S ribosomal RNA sequences reveals unknowndiversity of the cosmopolitan phylum Telonemia. Protist 158, 173–180.

Shimodaira, H., 2002. An approximately unbiased test of phylogenetic treeselection. Syst. Biol. 51, 492–508.

Shimodaira, H., Hasegawa, M., 2001. CONSEL: for assessing the confidence ofphylogenetic tree selection. Bioinformatics 17, 1246–1247.

Slapeta, J., Lopez-Garcia, P., Moreira, D., 2006. Present status of the molecularecology of kathablepharids. Protist 157, 7–11.

Stamatakis, A., Hoover, P., Rougemont, J., 2008. A rapid bootstrap algorithm for theRAxML Web servers. Syst. Biol. 57, 758–771.

Stechmann, A., Cavalier-Smith, T., 2003. Phylogenetic analysis of eukaryotes usingheat shock protein Hsp90. J. Mol. Evol. 57, 408–419.

Van de Peer, Y., De Wachter, R., 1997. Evolutionary relationships among theeukaryotic crown taxa taking into account site-to-site rate variation in 18SrRNA. J. Mol. Evol. 45, 619–630.

Yoon, H.S., Grant, J., Tekle, Y.I., Wu, M., Chaon, B.C., Cole, J.C., Logsdon Jr., J.M.,Patterson, D.J., Bhattacharya, D., Katz, L.A., 2008. Broadly sampled multigenetrees of eukaryotes. BMC Evol. Biol. 8, 14.

Yoon, H.S., Hackett, J.D., Bhattacharya, D., 2002a. A single origin of the peridinin-and fucoxanthin-containing plastids in dinoflagellates through tertiaryendosymbiosis. Proc. Natl. Acad. Sci. USA 99, 11724–11729.

Yoon, H.S., Hackett, J.D., Ciniglia, C., Pinto, G., Bhattacharya, D., 2004. A moleculartimeline for the origin of photosynthetic eukaryotes. Mol. Biol. Evol. 21, 809–818.

Yoon, H.S., Hackett, J.D., Pinto, G., Bhattacharya, D., 2002b. The single, ancient originof chromist plastids. Proc. Natl. Acad. Sci. USA 99, 15507–15512.

Yoon, H.S., Hackett, J.D., Van Dolah, F.M., Nosenko, T., Lidie, K.L., Bhattacharya, D.,2005. Tertiary endosymbiosis driven genome evolution in dinoflagellate algae.Mol. Biol. Evol. 22, 1299–1308.


Recommended