+ All Categories
Home > Documents > 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among...

2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among...

Date post: 11-Sep-2021
Category:
Upload: others
View: 4 times
Download: 0 times
Share this document with a friend
12
RESEARCH Open Access Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper respiratory tract infections in Kuala Lumpur, Malaysia Maryam Nabiel Al-Khannaq 1 , Kim Tien Ng 1 , Xiang Yong Oong 1 , Yong Kek Pang 1 , Yutaka Takebe 1,2,3 , Jack Bee Chook 1 , Nik Sherina Hanafi 4 , Adeeba Kamarulzaman 1 and Kok Keng Tee 5* Abstract Background: Despite the worldwide circulation of human coronavirus OC43 (HCoV-OC43) and HKU1 (HCoV-HKU1), data on their molecular epidemiology and evolutionary dynamics in the tropical Southeast Asia region is lacking. Methods: The study aimed to investigate the genetic diversity, temporal distribution, population history and clinical symptoms of betacoronavirus infections in Kuala Lumpur, Malaysia between 2012 and 2013. A total of 2,060 adults presented with acute respiratory symptoms were screened for the presence of betacoronaviruses using multiplex PCR. The spike glycoprotein, nucleocapsid and 1a genes were sequenced for phylogenetic reconstruction and Bayesian coalescent inference. Results: A total of 48/2060 (2.4 %) specimens were tested positive for HCoV-OC43 (1.3 %) and HCoV-HKU1 (1.1 %). Both HCoV-OC43 and HCoV-HKU1 were co-circulating throughout the year, with the lowest detection rates reported in the October-January period. Phylogenetic analysis of the spike gene showed that the majority of HCoV-OC43 isolates were grouped into two previously undefined genotypes, provisionally assigned as novel lineage 1 and novel lineage 2. Sign of natural recombination was observed in these potentially novel lineages. Location mapping showed that the novel lineage 1 is currently circulating in Malaysia, Thailand, Japan and China, while novel lineage 2 can be found in Malaysia and China. Molecular dating showed the origin of HCoV-OC43 around late 1950s, before it diverged into genotypes A (1960s), B (1990s), and other genotypes (2000s). Phylogenetic analysis revealed that 27.3 % of the HCoV-HKU1 strains belong to genotype A while 72.7 % belongs to genotype B. The tree root of HCoV-HKU1 was similar to that of HCoV-OC43, with the tMRCA of genotypes A and B estimated around the 1990s and 2000s, respectively. Correlation of HCoV-OC43 and HCoV-HKU1 with the severity of respiratory symptoms was not observed. Conclusions: The present study reported the molecular complexity and evolutionary dynamics of human betacoronaviruses among adults with acute respiratory symptoms in a tropical country. Two novel HCoV-OC43 genetic lineages were identified, warranting further investigation on their genotypic and phenotypic characteristics. Keywords: Coronaviruses, Molecular epidemiology, Phylogenetics, Upper respiratory infection, Virus evolution * Correspondence: [email protected] 5 Department of Medical Microbiology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia Full list of author information is available at the end of the article © 2016 Al-Khannaq et al. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Al-Khannaq et al. Virology Journal (2016) 13:33 DOI 10.1186/s12985-016-0488-4
Transcript
Page 1: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

RESEARCH Open Access

Molecular epidemiology and evolutionaryhistories of human coronavirus OC43 andHKU1 among patients with upperrespiratory tract infections in Kuala Lumpur,MalaysiaMaryam Nabiel Al-Khannaq1, Kim Tien Ng1, Xiang Yong Oong1, Yong Kek Pang1, Yutaka Takebe1,2,3,Jack Bee Chook1, Nik Sherina Hanafi4, Adeeba Kamarulzaman1 and Kok Keng Tee5*

Abstract

Background: Despite the worldwide circulation of human coronavirus OC43 (HCoV-OC43) and HKU1 (HCoV-HKU1),data on their molecular epidemiology and evolutionary dynamics in the tropical Southeast Asia region is lacking.

Methods: The study aimed to investigate the genetic diversity, temporal distribution, population history and clinicalsymptoms of betacoronavirus infections in Kuala Lumpur, Malaysia between 2012 and 2013. A total of 2,060 adultspresented with acute respiratory symptoms were screened for the presence of betacoronaviruses using multiplexPCR. The spike glycoprotein, nucleocapsid and 1a genes were sequenced for phylogenetic reconstruction andBayesian coalescent inference.

Results: A total of 48/2060 (2.4 %) specimens were tested positive for HCoV-OC43 (1.3 %) and HCoV-HKU1 (1.1 %).Both HCoV-OC43 and HCoV-HKU1 were co-circulating throughout the year, with the lowest detection rates reported inthe October-January period. Phylogenetic analysis of the spike gene showed that the majority of HCoV-OC43 isolateswere grouped into two previously undefined genotypes, provisionally assigned as novel lineage 1 and novel lineage 2.Sign of natural recombination was observed in these potentially novel lineages. Location mapping showed that thenovel lineage 1 is currently circulating in Malaysia, Thailand, Japan and China, while novel lineage 2 can be foundin Malaysia and China. Molecular dating showed the origin of HCoV-OC43 around late 1950s, before it divergedinto genotypes A (1960s), B (1990s), and other genotypes (2000s). Phylogenetic analysis revealed that 27.3 % ofthe HCoV-HKU1 strains belong to genotype A while 72.7 % belongs to genotype B. The tree root of HCoV-HKU1was similar to that of HCoV-OC43, with the tMRCA of genotypes A and B estimated around the 1990s and 2000s,respectively. Correlation of HCoV-OC43 and HCoV-HKU1 with the severity of respiratory symptoms was not observed.

Conclusions: The present study reported the molecular complexity and evolutionary dynamics of humanbetacoronaviruses among adults with acute respiratory symptoms in a tropical country. Two novel HCoV-OC43genetic lineages were identified, warranting further investigation on their genotypic and phenotypic characteristics.

Keywords: Coronaviruses, Molecular epidemiology, Phylogenetics, Upper respiratory infection, Virus evolution

* Correspondence: [email protected] of Medical Microbiology, Faculty of Medicine, University ofMalaya, Kuala Lumpur, MalaysiaFull list of author information is available at the end of the article

© 2016 Al-Khannaq et al. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, andreproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link tothe Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Al-Khannaq et al. Virology Journal (2016) 13:33 DOI 10.1186/s12985-016-0488-4

Page 2: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

BackgroundHuman coronaviruses are common cold viruses that arefrequently found to be associated with acute upper respira-tory tract infections (URTIs) [1]. According to the Inter-national Committee for Taxonomy of Viruses (ICTV),human coronavirus OC43 (HCoV-OC43) and HKU1(HCoV-HKU1) belong to the betacoronavirus genus, amember of the Coronaviridae family. Coronaviruses con-tain the largest RNA genomes and have been establishedas one of the rapidly evolving viruses [2]. In addition to thehigh nucleotide substitution rates across the genome [3],the coronavirus genome is subjected to homologousrecombination during viral replication, which is caused byRNA template switching mediated by the copy-choicemechanism [4, 5]. The genetic recombination of coro-naviruses had possibly led to the emergence of lethalpathogens such as severe acute respiratory syndromecoronavirus (SARS-CoV) and Middle East respiratorysyndrome coronavirus (MERS-CoV), which caused upto 50 % mortality in infected individuals [6–9]. Recom-bination events in the spike (S), nucleocapsid (N) andthe RNA dependent RNA polymerase (RdRp) withinthe 1a gene of HCoV-OC43 and HCoV-HKU1 leadingto the emergence of unique recombinant genotypeshave been reported [10, 11].Studies have shown that HCoV-OC43 is often associ-

ated with approximately 5 % of acute respiratory infec-tions while the more recent HCoV-HKU1 is less prevalent[12, 13]. In humans, acute upper respiratory symptomssuch as nasal congestion and rhinorrhea are relativelycommon in HCoV infections while sore throat andhoarseness of voice are less common, with cough usuallyassociated with HCoV-OC43 infection [14]. In tropicalcountries, annual shift in the predominant genotype hasbeen documented, with more cases of HCoV-OC43 andHCoV-HKU1 infections reported during the early monthsof the year [15]. Despite the clinical importance and socio-economic impact of HCoV infections [16, 17], the preva-lence, seasonality, clinical and phylogenetic characteristicsof HCoVs remain largely unreported in the tropical regionof Southeast Asia. Based on the S, N and 1a genes ofHCoV-OC43 and HCoV-HKU1 isolated from Malaysiaand also globally, we attempted to delineate the genetichistory and the phylodynamic profiles of human betacoro-naviruses HCoV-OC43 and HCoV-HKU1 using a suite ofBayesian phylogenetic tools. We also reported the emer-gence of two novel HCoV-OC43 lineages, in a cross-sectional study of patients presented with acute URTI inMalaysia.

MethodsClinical specimensA total of 2,060 consenting outpatient adults presentedwith symptoms of acute URTI were recruited at the

Primary Care Clinics of University Malaya MedicalCentre in Kuala Lumpur, Malaysia between March 2012and February 2013. Prior to collection of nasopharyngealswabs, demographic data such as age, gender and ethnicitywere obtained. In addition, the severities of symptoms(sneezing, nasal discharge, nasal congestion, headache,sore throat, voice hoarseness, muscle ache and cough)were graded based on previously reported criteria [18–21].The scoring scheme used had been validated earlier on theadult populations with common cold [19]. The nasopha-ryngeal swabs were transferred to the laboratory in univer-sal transport media and stored in −80 °C.

Molecular detection of HCoV-OC43 and HCoV-HKU1Total nucleic acids were extracted from nasopharyngealswabs using the magnetic beads-based protocols imple-mented in the NucliSENS easyMAG automated nucleicacid extraction system (BioMérieux, USA) [22, 23]. Speci-mens were screened for the presence of respiratory virusesusing the xTAG Respiratory Virus Panel FAST multiplexRT-PCR assay (Luminex Molecular Diagnostics, USA)which can detect HCoV-OC43, HCoV-HKU1 and otherrespiratory viruses and subtypes [24].

Genetic analysis of HCoV-OC43 and HCoV-HKU1RNA from nasopharyngeal swabs positive for HCoV-OC43 and HCoV-HKU1 was reverse transcribed intocDNA using SuperScript III kit (Invitrogen, USA) withrandom hexamers (Applied Biosystems, USA). The par-tial S gene (S1 domain) [HCoV-OC43; 848 bp (24,030-24,865) and HCoV-HKU1; 897 bp (23,300-24,196)],complete N gene [HCoV-OC43; 1,482 bp (28,997-30,478) and HCoV-HKU1; 1,458 bp (28,241-29,688)] andpartial 1a (nsp3) gene [HCoV-OC43; 1,161 bp (6,168-7,328) and HCoV-HKU1; 1,115 bp (6,472-7,586)] wereamplified either by single or nested PCR, using 10 μM ofthe newly designed or previously described primerslisted in Table 1. The PCR mixture (25 μl) containedcDNA, PCR buffer (10 mM Tris–HCl, 50 mM KCl,3 mM MgCl, 0.01 % gelatin), 100 μM (each) deoxynu-cleoside triphosphates, Hi-Spec Additive and 4u/μl BIO-X-ACT Short DNA polymerase (BioLine, USA). Thecycling conditions were as follows: initial denaturation at95 °C for 5 min followed by 40 cycles of 94 °C for 1 min,54.5 °C for 1 min, 72 °C for 1 min and a final extensionat 72 °C for 10 min, performed in a C1000 Touch auto-mated thermal cycler (Bio-Rad, USA). Nested/semi-nested PCR was conducted for each genetic region ifnecessary, under the same cycling conditions at 30 cy-cles. Purified PCR products were sequenced using theABI PRISM 3730XL DNA Analyzer (Applied Biosys-tems, USA). The nucleotide sequences were codon-aligned with previously described complete and partial

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 2 of 12

Page 3: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

HCoV-OC43 and HCoV-HKU1 reference sequences re-trieved from GenBank [11, 25–32].Maximum clade credibility (MCC) trees for the partial

S (S1 domain), complete N and partial 1a (nsp3) geneswere reconstructed in BEAST (version 1.7) [27, 33, 34].MCC trees were generated using a relaxed molecularclock, assuming uncorrelated lognormal distributionunder the general time-reversible nucleotide substitutionmodel with a proportion of invariant sites (GTR + I) anda constant coalescent tree model. The Markov chainMonte Carlo (MCMC) run was set at 3 × 106 steps longsampled every 10,000 state. The trees were annotatedusing Tree Annotator program included in the BEASTpackage, after a 10 % burn-in, and visualized in FigureTree(http://tree.bio.ed.ac.uk/software/Figuretree/). Neighborjoining (NJ) trees for the partial S (S1 domain), completeN and partial 1a (nsp3) genes were also reconstructed,using Kimura 2-parameter model in MEGA 5.1 [35]. Thereliability of the branching order was evaluated by boot-strap analysis of 1000 replicates. In addition, to explorethe genetic relatedness between HCoV-OC43 and HCoV-HKU1 genotypes, the pairwise genetic distances amongsequences of the S gene were estimated. Inter- and intra-genotype nucleotide distances were estimated by the boot-strap analysis with 1000 replicates using MEGA 5.1. Suchanalysis has not been done for the N and the 1a genesbecause those regions were highly conserved acrossgenotypes [10, 11, 32]. To test for the presence of recom-bination in HCoV-OC43, the S gene was subjected to pair-wise distance-based bootscanning analysis using SimPlot

version 3.5 [10, 36]. Established reference genomes forHCoV-OC43 genotype A (ATCC VR-759), B (87309Belgium 2003), and C (HK04-01) were used as putativeparental lineages, with a sliding window and step size of160 bp and 20 bp, respectively. In addition, MaxChirecombination test [37] was performed in the Recombin-ation Detection Program (RDP) version 4.0 [38]. In RDPthe highest acceptable p value (the probability thatsequences could share high identities in potentiallyrecombinant regions by chance alone) was set at 0.05,with the standard multiple comparisons corrected usingthe sequential Bonferroni method with 1,000 permuta-tions [39].

Estimation of divergence timeThe origin and divergence time (in calendar year) ofHCoV-OC43 and HCoV-HKU1 genotypes were estimatedusing the MCMC approach as implemented in BEAST.Analyses were performed under the relaxed molecularclock with GTR + I nucleotide substitution models andconstant-size and exponential demographic models. TheMCMC analysis was computed at 3 × 106 states sampledevery 10,000 steps. The mean divergence time and the95 % highest posterior density (HPD) regions were esti-mated, with the best-fitting models were selected by Bayesfactor inference using marginal likelihood analysis imple-mented in Tracer (version 1.5) [33]. The evolutionary ratefor S gene of betacoronaviruses (6.1 × 10−4 substitutions/site/year) reported previously was used for analysis [36].

Table 1 PCR primers of HCoV-OC43 and HCoV-HKU1

Target gene HCoV Primer Locationa Sequence (5'-3') Reference

Spike (S) OC43 LPW 1261 24010-24029 Forward: CTRCTATARYTATAGGTAGT [11]

LPW 2094 24866-24887 Reverse: GCCCAAATTACCCAATTGTAGG [11]

HKU1 LPW 1832 23275-23299 Forward: TATGTTAATAAWACTTTGTATAGTG [40]

LPW 1866 24197-24218 Reverse: TACAATTGACAAGAACTAGAAG [40]

Nucleocapsid (N) OC43 & HKU1 βN-F OC43: 28974-28996 Forward: GCTGTTTWTGTTAAGTCYAAAGT this study

HKU1: 28218-28240

βN-R OC43: 30479-30501 Reverse: CATTCTGATAGAGAGTGCYTATY this study

HKU1: 29699-29721

βN-Fn OC43: 29046-29069 Forward (nested): GCMTTGTTRAGARMTWAWATCTAA this study

HKU1: 28287-28310

βN-Rn OC43: 30447-30466 Reverse (nested): GCGAGGGGTTACCACCWRRT this study

HKU1: 29671-29690

1a OC43 OC43-1aF 6145-6167 Forward: CTTTTGGTAAACCTGTTATATGG this study

OC43-1aR 7329-7351 Reverse: AGCTTAATAAAAGAGGCAATAAT this study

OC43-1aFn 6183-6199 Forward (semi-nested): GCTTCYCTCAATTCTTTAACAT this study

HKU1 HKU1-1aF 6448-6471 Forward: TTCTCTTACTTATTTTAATAAACC this study

HKU1-1aR 7587-7610 Reverse: CTTTATACATAGCAGTAACAACTA this studyaNucleotide location was determined based on the HCoV-OC43 ATCC VR-759 (AY585228) and HCoV-HKU1 (NC_06577) reference sequences

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 3 of 12

Page 4: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

Statistical analysisThe association of HCoV-OC43 and HCoV-HKU1 infec-tions with specific acute URTI symptoms and its severity(none, moderate and severe) as well as demographic datawere evaluated using the Fisher’s exact test/Chi-squaretest carried out in the statistical package for the socialsciences (SPSS, version 16; IBM Corp).

ResultsDetection of HCoV-OC43 and HCoV-HKU1 in nasopharyngealswabsDuring the 12-month study period (March 2012 toFebruary 2013), all nasopharyngeal swab specimensfrom 2,060 patients collected from Kuala Lumpur,Malaysia were screened for the presence of HCoV-OC43 and HCoV-HKU1 using multiplex RT-PCRmethod, in which a total of 48 (2.4 %) subjects werefound positive for betacoronavirus. HCoV-OC43 andHCoV-HKU1 was detected in 26/2060 (1.3 %) and 22/2060 (1.1 %) patients, respectively, while no HCoV-OC43/HCoV-HKU1 co-infection was observed. Age,gender and ethnicity of the patients were summarizedin Table 2. The median age of subjects infected withHCoV-OC43 and HCoV-HKU1 was 53.0 and 48.5,

respectively. Both HCoV-OC43 and HCoV-HKU1 wereco-circulating throughout the year, although lowernumbers of HCoV-OC43 were detected between Octo-ber 2012 and January 2013 while no HCoV-HKU1 wasdetected during these months (Fig. 1).

Phylogenetic analysis of the S, N and 1a genesThe partial S (S1 domain), complete N and partial 1a(nsp3) genes of 23 HCoV-OC43 isolates were success-fully sequenced, while another three xTAG-positiveHCoV-OC43 isolates could not be amplified, probablydue to low viral copy number in these specimens. Basedon the phylogenetic analysis of the S gene, one subject(1/23, 4.3 %) was grouped with HCoV-OC43 genotype Breference sequences while another subject (1/23, 4.3 %)was grouped with HCoV-OC43 genotype D sequences.The remaining 21 isolates formed two phylogeneticallydiscrete clades that were distinct from other previouslyestablished genotypes A, B, C, D (genotype D is a re-combinant lineage that is not readily distinguished fromgenotype C in the S and N phylogenetic trees) and E[11, 32] (Fig. 2 and Additional file 1: Figure S1). Of the21 isolates, ten isolates have formed a cluster with otherrecently reported isolates from Japan, Thailand andChina [31, 32] supported by the posterior probabilityvalue of 1.0 and bootstrap value of 36 % at the internaltree node of the MCC and NJ trees, respectively withintra-group pairwise genetic distance of 0.003 ± 0.001.These isolates were provisionally designated as novellineage 1. Spatial structure was observed within novellineage 1, with an isolate from China sampled in year 2008located at the base of the phylogeny. Moreover, anothereleven HCoV-OC43 isolates have formed a second distinctcluster supported by significant posterior probability andbootstrap values at the internal tree node (1.0 and 98 %, re-spectively) and intra-group pairwise genetic distance of0.004 ± 0.001. The cluster contained Malaysian and Chineseisolates [32] only, and was denoted as novel lineage 2.Based on the phylogenetic inference of the conserved Ngene, only one subject was grouped with the genotype Breference in concordance with the S gene (Additional file 2:Figure S2). Unlike the phylogenetic inference of the S gene,the remaining 22 isolates were seen intermingled with eachother forming a single cluster together with isolates indi-cated as novel lineages 1 and 2 in the S gene, in addition toone genotype D strain. It is however important to note thatthe tree resolution was poor, due primarily to the lack ofthe N gene reference sequences in the public database.On the other hand, phylogenetic analysis of the 1a(nsp3) gene (Additional file 3: Figure S3) revealed thatall except genotype A could not be differentiated clearlywithin this region, due mainly to the low genetic diver-sity between genotypes. The limited number of 1a ref-erence sequences available in the public database could

Table 2 Demographic data on 48 outpatients infected withhuman betacoronavirus in Kuala-Lumpur, Malaysia, 2012-2013

HCoV-OC43(n = 26)

HCoV-HKU1(n = 22)

P-Value

Gender

Male 11(42.3 %) 8(36.4 %) 0.77

Female 15(57.7 %) 14(63.6 %)

Age

<40 9(34.6 %) 10(45.4 %) 0.33

40–60 10(38.5 %) 4(18.2 %)

>60 7(26.9 %) 8(36.4 %)

Symptoms

Sneezing 21(80.8 %) 14(63.6 %) 0.99

Nasal discharge 20(76.9 %) 19(86.4 %)

Nasal congestion 19(73.1 %) 14(63.6 %)

Headache 18(69.2 %) 16(72.7 %)

Sore throat 16(61.5 %) 14(63.6 %)

Hoarseness of voice 20(76.9 %) 18(81.8 %)

Muscle ache 17(65.4 %) 14(63.6 %)

Cough 23(88.5 %) 19(86.4 %)

Ethnicity

Malay 10(38.5 %) 10(45.4 %) 0.19

Chinese 3(11.5 %) 6(27.3 %)

Indian 13(50.0 %) 6(27.3 %)

Others 0(0.0 %) 0(0.0 %)

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 4 of 12

Page 5: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

have also resulted in a poor 1a tree topology. Inaddition, phylogenetic trees of previously describedcomplete and partial S gene sequences as well as partial1a (nsp3) and complete RdRp gene sequences were re-constructed to further confirm the reliability of the par-tial S1 and nsp3 for identification of HCoV-OC43genotypes (Additional file 4: Figure S4 and Additionalfile 5: Figure S5).To assess the diversity between HCoV-OC43 genotypes,

inter-genotype pairwise genetic distance was estimated forthe S gene, listed in Table 3. Using the oldest genotype asreference i.e. genotype A, genetic variation between geno-type A and genotypes B to E was 2.2–2.7 %. Geneticdistance between novel lineages 1 and 2 compared to geno-type A was 3.2 % and 3.1 %, respectively, higher than thatof other established genotypes. Taken together, the distinctinter-genotype genetic variations of the two novel lineages1 and 2 against other previously established genotypes cor-roborated with the MCC inference (Fig. 2) in which bothlineages formed distinct phylogenetic topologies.On the other hand, phylogenetic analysis of 22

HCoV-HKU1 S and N genes indicated the predomin-ance of HCoV-HKU1 genotype B (72.7 %, 16/22),followed by HCoV-HKU1 genotype A (27.3 %, 6/22)(Fig. 3, Additional file 6: Figure S6 and Additional file7: Figure S7). Interestingly, the S and N genes of HCoV-HKU1 were equally informative for genotype assignment,while genotypes A, B and C were less distinctive based onthe 1a gene phylogenetic analysis due to the high geneticconservation within this region (Additional file 8: FigureS8). Inter-genotype genetic diversity among HCoV-HKU1genotypes showed that genotype A was more geneticallydiverse than genotypes B and C based on the genetic dataof the S gene (Table 3). The difference in genetic distance

between genotype A and genotypes B and C was 15.2–15.7 %, while the difference in genetic distance betweengenotypes B and C was 1.3 %.Evidence of possible recombination was observed in the

S gene of novel lineage 1, involving genotypes B and C(Fig. 4). All isolates within novel lineage 1 showed simi-lar recombination structures (representative isolatesfrom Malaysia (12MYKL0208), Japan (Niigata.JPN/11-764), Thailand (CU-H967_2009) and China (892A/08)were shown). Similarly, sign of possible recombinationwas noticed within novel lineage 2 (Fig. 4). All Malay-sian and Chinese isolates showed similar recombinationstructures in the S gene involving genotypes A and B(12MYKL0002, 12MYKL0760 and 12689/12 representa-tive sequences were shown). Moreover, using the afore-mentioned putative parental and representative strains,MaxChi analysis of the novel lineages 1 and 2 isolatessupported the hypothesis of recombination in the Sgene (p < 0.05) (Additional file 9: Figure S9). Taken to-gether, the emergence of novel lineage 1 and novellineage 2 in these Asian countries was likely to bedriven by natural recombination events.

Estimation of divergence timesThe divergence times of HCoV-OC43 and HCoV-HKU1were estimated using the coalescent-based Bayesian re-laxed molecular clock under the constant and exponentialtree models (Fig. 2 and Fig. 3; Table 4). The newly esti-mated mean evolutionary rate for the S gene of HCoV-OC43 was 7.2 (5.0 – 9.3) × 10−4 substitutions/site/year.On the other hand, the evolutionary rate for the S gene ofHCoV-HKU1 was newly estimated at 6.2 (4.2–7.8) × 10−4

substitutions/site/year. These estimates were comparable

Fig. 1 Annual distribution of HCoV-OC43 and HCoV-HKU1 among adults with acute in Malaysia. The monthly detection of HCoV-OC43 and HCoV-HKU1(right axis, in bars) and the total number of nasopharyngeal swabs screened (left axis, in solid line) between March 2012 and February2013 were shown

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 5 of 12

Page 6: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

13MYKL1758 (KR055643)

LY270 (KF512652)

LY341 (KF512660)

12MYKL1140 (KR055639)

LY160 (KF512631)

LY245 (KF512642)

LY227 (KF512634)

XH1470 (KF512623)

12MYKL0781 (KR055633)

12MYKL0732 (KR055630)

XH1466 (KF512622)

LY254 (KF512648)

LY319 (KF512659)

LY261 (KF512651)LY244 (KF512641)

12MYKL0523 (KR055628)

12MYKL0710 (KR055629)

HK04-01 (JN129834)

12MYKL1612 (KR055642)

LY360 (KF512663)

LY235 (KF512635)

12MYKL0413 (KR055626)

12MYKL1024 (KR055636)

12MYKL0945 (KR055635)

LY243 (KF512640)

LY306 (KF512655)

12MYKL1057 (KR055637)

12MYKL0208 (KR055624)

13MYKL1975 (KR055644)

XH1377 (KF512617)

12MYKL0774 (KR055632)

LY357 (KF512662)

12MYKL0464 (KR055627)

LY318 (KF512658)

LY241 (KF512638)

LY257 (KF512650)

12MYKL1130 (KR055638)

12MYKL0043 (KR055623)

12MYKL0002 (KR055622)

12MYKL0868 (KR055634)

12MYKL0236 (KR055625)

LY252 (KF512646)

XH1340 (KF512616)

12MYKL0760 (KR055631)12MYKL1381 (KR055640)

LY342 (KF512661)LY307 (KF512656)

12MYKL1484 (KR055641)

OC43 (AY391777)LY82 (KF512628)OC43 (L14643)ATCC VR759 (DQ355401)LY82 (KF512628)LY84 (KF512629)XH1444 (KF512621)XH1400 (KF512618)LY1 (KF512624)XH1316 (KF512614)XH1327 (KF512615)LY253 (KF512647)LY155 (KF512630)LY162 (KF512632)LY250 (KF512645)LY53 (KF512626)LY48 (KF512625)LY246 (KF512643)LY308 (KF512657)LY256 (KF512649)XH1440 (KF512620)LY65 (KF512627)LY277 (KF512654)LY236 (KF512636)LY242 (KF512639)XH1415 (KF512619)LY274 (KF512653)LY165 (KF512633)LY238 (KF512637)

1997A/10 (KF572808)

2145A/10 (KF572810)3184A/12 (KF572813)2058A/10 (KF572809)2941A/11 (2941A/11)3194A/12 (KF572814)3074A/12 (KF572812)1783A/10 (KF572804)

Niigata.JPN/11-400 (AB695081)Niigata.JPN/11-335 (AB695079)

Niigata.JPN/11-286 (AB695078)Niigata.JPN/11-343 (AB695080)

1908A/10 (KF572805)1919A/10 (KF572806)1926/06 (KF572807)CaenTHS (DQ355402)CaenVAC (DQ355408)89996 Belgium 2003 (AY903454) 37767 Belgium 2003 (AY903457)84020 Belgium 2003 (AY903456)CaenVUO (DQ355406)CaenEYM (DQ355405)87309 Belgium 2003 (AY903459)

3647/06 (KF572818)3582/06 (KF572817)34364 Belgium 2004 (AY903455)36638 Belgium 2004 (AY903458)19572 Belgium 2004 (AY903460)HK04-02 (JN129835)229/05 (KF572816)CaenBEL (DQ355403)039A/07 (KF572819)4954/07 (KF572843)5331/07 (KF572845)5508/07 (KF572856)5352/07 (KF572847)69A/07 (KF572864)5485/07 (KF572855)5517/07 (KF572857)5484/07 (KF572854)5472/07 (KF572852)5345/07 (KF572846)5566/07 (KF572859)5445/07 (KF572851)5479/07 (KF572853)5519/07 (KF572858)5625/07 (KF572862)5656/07 (KF572863)4795/07 (KF572842)5240/07 (KF572844)079A/07 (KF572820)1357A/09 (KF572834)1591A/09 (KF572836)9001/09 (KF572870)1157A/08 (KF572828)978A/08 (KF572872)1081A/08 (KF572826)1216A/08 (KF572830)10290/10 (KF572823)10285/10 (KF572822)8099/09 (KF572865)2151A/10 (KF572839)1382A/09 (KF572835)1034A/08 (KF572824)1135A/08 (KF572827)8164/09 (KF572866)1593A/09 (KF572837)8375/09 (KF572867)9138/09 (KF572871)Niigata.JPN/11-564 (AB695082)

10108/10 (KF572821)CU-H1187/2010 (JX513260)CU-H1772/2010 (JX513268)2134A/10 (KF572838)10574/10 (KF572825)5442/07 (KF572850)5617/07 (KF572857)5370/07 (KF572848)5595/07 (KF572860)5414/07 (KF572849)

3269A/12 (KF572841)12691/12 (KF572832)

Niigata.JPN/11-769 (AB695085)

Niigata.JPN/11-833 (AB695086)

CU-H967/2009 (JX513258)

CU-H1367/2010 (JX513261)CU-H1444/2010 (JX513262)

Niigata.JPN/11-768 (AB695084)

Niigata.JPN/11-764 (AB695083)

892A/08 (KF572868)

Niigata.JPN/11-981 (AB695087)

12689/12 (KF572831)

12694/12 (KF572833)

A

B

C/D

D

D-like

c

c

D

E

2013200319931983197319631953

1951.0 (1929.6-1965.0)

1961.5 (1956.0-1965.8)

1991.2 (1981.5-1999.0)

2000.8 (1998.5-2002.6)

2001.7 (1999.3-2010.0)

2001.7 (2000.1-2002.9)

2004.6 (2003.2-2005.8)

2007.5 (2006.6-2008.0)

2010.6 (2009.6-2011.4)

2009.4 (2008.4-2010)

1.0

0.6

1.0

1.0

1.0

1.0

1.0

1.0

1.0

Spike gene (S1 domain) (848bp)

HCoV-OC43

Novel L

ineag

e 1N

ovel Lin

eage 2

Fig. 2 (See legend on next page.)

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 6 of 12

Page 7: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

to previous findings of 6.1–6.7 × 10−4 substitutions/site/year for the S gene reported elsewhere [11].Based on these evolutionary estimates of the S gene,

the common ancestor of HCoV-OC43 was dated back tothe 1950s. Divergence time of genotype A was datedback to early 1960s, followed by genotype B around1990s. Interestingly, genotypes C, D, E, and novel line-ages 1 and 2 were all traced back to the 2000s (Fig. 2).Moreover, the common ancestor of HCoV-HKU1 wastraced back to early 1950s, as estimated from the S gene.Subsequently, HCoV-HKU1 continued to diverge furtherinto distinctive genotypes (A-C). Genotype A was datedto the late 1990 and genotypes B and C were both tracedback to early 2000s (Fig. 3). Bayes factor analysis showedinsignificant differences (Bayes factor <3.0) between theconstant and exponential coalescent models of demo-graphic analysis. Divergence times generated using theexponential tree model were slightly (but not signifi-cantly) different from those estimated using the constantcoalescent model (Table 4). Of note, HCoV-OC43 andHCoV-HKU1 genotype assignments were less distinctivewithin the N and 1a genes (as compared to the S gene);these regions were therefore deemed unsuitable fordivergence time estimations in this study.

Clinical symptoms assessmentThe type of URTI symptoms (sneezing, nasal discharge,nasal congestion, headache, sore throat, hoarseness ofvoice, muscle ache and cough) and their severities during

HCoV-OC43 and HCoV-HKU1 infections were analyzed.Fisher’s exact test analysis suggested that the severity ofsymptoms was not significantly associated with HCoV-OC43 and HCoV-HKU1 infections (p values > 0.05), thisis due to the fact that the majority (61 % and 55 %) of thepatients infected with HCoV-OC43 and HCoV-HKU1 re-spectively were presented with at least one respiratorysymptom at moderate level of symptom severity. Inaddition, no significant association between HCoV-OC43and HCoV-HKU1 genotypes with disease severity wasobserved.

DiscussionIn the present cohort, over 2000 patients with URTIsymptoms were recruited and screened, of whom 1.3 %(26/2060) and 1.1 % (22/2060) of the subjects wereinfected with HCoV-OC43 and HCoV-HKU1, respect-ively. These estimates corroborate with the previouslyreported average incidence of HCoV-OC43 and HCoV-HKU1 at 0.2–4.3 % and 0.3–4.4 %, respectively [12, 15,40–45]. Although HCoV-OC43 and HCoV-HKU1 arenot as common as other respiratory viruses, severalstudies have reported an elevated incidence of HCoV-OC43 (up to 67 %) due to sporadic outbreaks with fatal-ity rate up to 8 % [46, 47]. This 12-month study showedthat HCoV-OC43 and HCoV-HKU1 infections were fre-quently detected during March 2012 to September 2012and decreased thereafter, in line with findings reportedfrom other tropical Southeast Asian country [15]. However,

(See figure on previous page.)Fig. 2 Maximum clade credibility (MCC) tree of HCoV-OC43 genotypes. Estimation of the time of the most recent common ancestors (tMRCA)with 95 % highest posterior density (95 % HPD) of HCoV-OC43 genotypes based on the spike gene (S1 domain) (848 bp). Data were analyzedunder relaxed molecular clock with GTR + I substitution model and a constant size coalescent model implemented in BEAST. The Malaysian HCoV-OC43isolates obtained in this study were color-coded and the HCoV-OC43 genotypes (a) to (e) as well as novel lineages 1 and 2 were indicated. TheMCC posterior probability values were indicated on the nodes of each genotype

Table 3 Genetic distance among HCoV-OC43 and HCoV-HKU1 genotypes in the spike gene

HCoV Genetic distance

OC43 genotype A genotype B genotype C genotype D genotype E Novel lineage 1 Novel lineage 2

genotype A -

genotype B 2.7 -

genotype C 2.2 1.5 -

genotype D 2.7 1.8 0.8 -

genotype E 2.5 0.9 1.2 1.6 -

Novel lineage 1 3.2 2.0 1.3 0.7 1.9 -

Novel lineage 2 3.1 2.9 1.8 1.4 2.6 1.7 -

HKU1 genotype A genotype B genotype C

genotype A -

genotype B 15.7 -

genotype C 15.2 1.3 -

Pairwise genetic distances are expressed in percentage (%) difference

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 7 of 12

Page 8: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

such patterns differ from that in temperate areas where theprevalence peaks during winter seasons, but few or nodetections in the summer [43]. It is also important to notethat the study was performed in a relatively short duration,therefore limiting the epidemiological and disease trendcomparison with reports from other countries.Phylogenetic inference based on the S gene of HCoV-

OC43 suggested the emergence of two potentially novelgenotypes (designated as novel lineage 1 and novel lineage2), supported by phylogenetic evidence and shared recom-bination structures. The relatively low mean intra-cluster

genetic variation reflects the high intra-genotype genetichomogeneity of each novel lineage. Inter-genotype geneticdistances between HCoV-OC43 genotypes further sup-ported that the novel lineages 1 and 2 are distinct fromthe previously described genotypes [11, 17, 32] in whichthe genetic distances between each of these two genotypesand the others were notably high (up to 3.2 %) (Table 3).Phylogenetic analysis also revealed that novel lineage 1includes isolates from Malaysia, Thailand, China andJapan while novel lineage 2 isolates are all fromMalaysia and China. Spatiotemporal characteristic

HKU1-N3-A (DQ415903)

HKU1-N13-A (DQ415909)

HKU1-N9-A (DQ415906)

HKU1-N14-A (DQ415910)

HKU1-N6-A (DQ415904)HKU1-N7-A (DQ415905)

HKU1-A (AY597011)

HKU1-BRA-21 (KF430198)HKU1-BRA-23 (KF430203)

HKU1-N24-A (DQ437619)

HKU1-N23-A (DQ415900)

HKU1- Caen1 (HM034837)

HKU1-N11-A (DQ415908)

HKU1-Reference (NC006577)

HKU1-N19-A (DQ415896)HKU1-N18-A (DQ415914)

HKU1-USA-15/2009 (KF686344)

HKU1-N10-A (DQ415907)

HKU1-USA-10/2010 (KF686341)

HKU1-USA-16/2010 (KF430200)

HKU1-USA-5/2009 (KF686340)

HKU1-USA-11/2009 (KF686341)

HKU1-USA-14/2009 (KF430199)

HKU1-USA-12/2010 (KF686346)

HKU1-USA-13/2010 (KF686343)

HKU1-USA-7/2010 (KF430202)HKU1-USA-18/2010 (KF430201)

HKU1-B (AY884001)

HKU1-N15-B (DQ415911)HKU1-N25-B (DQ415902)

HKU1-N21-C (DQ415898)HKU1-N5p8-A/B (DQ339101)

HKU1-N22-C (DQ415899)HKU1-N17-C (DQ415913)

HKU1-N20-C (DQ415897)HKU1-N16-C (DQ415912)

1951.2 (1906.3-1985.4)

1999.5 (1994.9-2002.5)

2001.1 (1997.6-2003.8)

2002.3 (1999.9-2003.8)

A

B

C

HKU1-USA-1/2005 (KF686338)

1.00

1.00

1.00

1.00

2013200319931943 1953

12MYKL1214 (KR055571)

12MYKL0447 (KR055558)12MYKL0790 (KR055564)12MYKL0163 (KR055553)12MYKL0737 (KR055561)12MYKL1061 (KR055567)12MYKL0759 (KR055562)12MYKL1217 (KR055572)12MYKL0407 (KR055557)12MYKL0841 (KR055565)12MYKL0777 (KR055563)12MYKL0529 (KR055559)12MYKL1058 (KR055566)12MYKL0181 (KR055555)13MYKL1997 (KR055575)12MYKL0323 (KR055556)12MYKL0624 (KR055560)

12MYKL1132 (KR055569)12MYKL1075 (KR055568)

13MYKL1898 (KR055574)12MYKL 1781 (KR055573)

12MYKL1153 (KR055570)

HCoV-HKU1Spike gene (S1 domain)(897bp)

Fig. 3 Maximum clade credibility (MCC) tree of HCoV-HKU1 genotypes. Estimation of the time of the most recent common ancestors (tMRCA)with 95 % highest posterior density (95 % HPD) of HCoV-HKU1 genotypes based on the spike gene (S1 domain) (897 bp). Data were analyzedunder relaxed molecular clock with GTR + I substitution model and a constant size coalescent model implemented in BEAST. The MalaysianHCoV-HKU1isolates obtained in this study were color-coded and the HCoV-HKU1 genotypes (a) to (c) were indicated. The MCC posterior probabilityvalues were indicated on the nodes of each genotype

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 8 of 12

Page 9: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

observed within the novel lineage 1 phylogeny (Fig. 2)may suggest the origin of this lineage in China, before itspread to other regions in the East and Southeast Asia.In order to clearly define the genetic characteristic of

the putative novel lineages 1 and 2 (and also anyother isolates with discordant phylogenetic patterns),complete genome sequencing and phylogenetic analysisneed to be carried out.

Fig. 4 Recombination analyses of HCoV-OC43 novel lineages 1 and 2. Reference strains of HCoV-OC43 genotype A (ATCC VR-759), B (87309 Belgium2003), and C (HK04-01) were used as the putative parental strains. The bootstrap values were plotted for a window of 160 bp moving in increments of20 bp along the alignment. Samples 12MYKL0208, Niigata.JPN/11-764, CU-H967_2009, 892A/08 were used as representative sequences for novel lineage1 in addition to 12MYKL0002, 12MYKL0760 and 12689/12 isolates as representatives for novel lineage 2

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 9 of 12

Page 10: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

Based on the newly estimated substitution rates, thedivergence times for HCoV-OC43 and HCoV-HKU1were phylogenetically inferred. Interestingly, althoughHCoV-OC43 was the first human coronavirus discov-ered in 1965 [48, 49], and the HCoV-HKU1 was firstdescribed much later in 2005 [50], the S gene analysisof HCoV-OC43 and HCoV-HKU1 revealed that therespective common ancestors of both viruses haveemerged since 1950s. Furthermore, the divergencetimes of HCoV-OC43 genotypes predicted in thisstudy are comparable to those described in previousstudies [11, 27]. Phylogenetic, recombination and mo-lecular clock analysis suggest the emergence of novellineages 1 and 2 around the mid-2000s and late 2000s,respectively, probably by natural recombination eventsinvolving genotypes B and C (for lineage 1) and geno-types A and B (for lineage 2).Human coronaviruses are progressively recognized as

respiratory pathogens associated with an increasingrange of clinical outcomes. Our results indicated thatmost patients infected with HCoV-OC43 and HCoV-

HKU1 were presented with moderate respiratory symp-toms (data not shown) in accordance with previouslyreported clinical results [16, 51–53] where they wererecognized as common cold viruses associated withURTI symptoms.

ConclusionsIn conclusion, epidemiological and evolutionary dynamicsinvestigation revealed the genetic complexity of humanbetacoronaviruses HCoV-OC43 and HCoV-HKU1 infec-tions in Malaysia, identifying two potentially novel HCoV-OC43 lineages among adults with acute respiratory tractinfections. The reported findings warrant continuousmolecular surveillance in the region, and detailed geno-typic and phenotypic characterization of the novel beta-coronavirus lineages.

DeclarationsEthics statementThe study was approved by the University of Malaya Med-ical Ethics Committee (MEC890.1). Standard, multilingualconsent forms allowed by the Medical Ethics Committeewere used. Written consents were obtained from all studyparticipants.

Consent for publicationNot applicable.

Availability of data and materialsHCoV-OC43 and HCoV-HKU1 nucleotide sequencesgenerated in the study are available in GenBank underthe accession numbers KR055512-KR055644.

Additional files

Additional file 1: Figure S1. Phylogenetic analysis of the HCoV-OC43spike gene (S1 domain). Trees were reconstructed using neighbor-joiningmethod. Bootstrap values were calculated from 1,000 trees. The scale barof individual tree was indicated in substitutions per site, using Kimura 2-parameter model in MEGA (version 5.1) to estimate pair-wise evolutionarydistance. The Malaysian isolates obtained in this study were color-codedand the HCoV-OC43 genotypes A to E as well as novel lineages 1 and 2were indicated. (PDF 242 kb)

Additional file 2: Figure S2. Phylogenetic analysis of the HCoV-OC43nucleocapsid gene. Trees were reconstructed using neighbor-joiningmethod. Bootstrap values were calculated from 1,000 trees. Bootstrapvalues of greater than 70 % were indicated on the branch nodes. Thescale bar of individual tree was indicated in substitutions per site, usingKimura 2-parameter model in MEGA (version 5.1) to estimate pair-wiseevolutionary distance. The Malaysian isolates obtained in this study werecolor-coded and the HCoV-OC43 genotypes A to E as well as novel lineages1 and 2 were indicated. Each HCoV-OC43 sequence was assigned to itsproper genotype based on the S1 phylogenetic analysis. NL1= novel lineage1. (PDF 253 kb)

Additional file 3: Figure S3. Phylogenetic analysis of the HCoV-OC431a gene (nsp3). Tree was reconstructed using neighbor-joining method.Bootstrap values were calculated from 1,000 trees. Bootstrap values ofgreater than 70% were indicated on the branch nodes. The scale bar of

Table 4 Evolutionary characteristics of HCoV-OC43 and HCoV-HKU1 genotypes

Subtype-gene evolutionaryratea

Genotype tMRCAb

OC43-Spike 7.2 (5.2–9.4)

all genotypes 1952.2 (1931.0–1965.2)

genotype A 1961.8 (1955.1–1966.0)

genotype B 1991.0 (1981.4–1999.0)

genotype C/D 2001.7 (2000.1–2002.9)

genotype D 2004.5 (2003.3–2005.8)

genotype E 2009.3 (2008.3–2010.0)

novel lineage 1 2007.5 (2006.6–2008.0)

novel lineage 2 2010.5 (2009.5–2011.4)

HKU1-Spike 6.2 (4.5–8.0)

all genotypes 1957.2 (1920.3–1987.5)

genotype A 1999.4 (1994.8–2002.5)

genotype B 2001.2 (1997.6–2003.6)

genotype C 2002.3 (1999.8–2003.8)

HKU1-Nucleocapsid 4.3 (2.8–5.8)

all genotypes 1962.0 (1915.1–1994.8)

genotype A 1986.8 (1970.8–1999.0)

genotype B 2002.2 (1999.4–2002.2)

genotype C 2002.3 (2000.1–2003.8)aEstimated mean rates of evolution expressed as 10−4 nucleotide substitutions/site/year under a relaxed molecular clock with GTR + I substitution model and anExponential tree model. The 95 % highest posterior density (HPD) confidenceintervals are included in parenthesesbMean time of the most common ancestor (tMRCA, in calendar year). The95 % highest posterior density (HPD) confidence intervals are indicated

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 10 of 12

Page 11: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

individual tree was indicated in substitutions per site, using Kimura 2-parameter model in MEGA (version 5.1) to estimate pair-wise evolutionarydistance. The Malaysian isolates obtained in this study were color-coded.Each HCoV-OC43 sequence was assigned to its proper genotype basedon the S1 phylogenetic analysis. (PDF 284 kb)

Additional file 4: Figure S4. Phylogenetic analysis of the HCoV-OC43complete and partial S gene. Trees were reconstructed using neighbor-joiningmethod. Bootstrap values were calculated from 1,000 trees. Bootstrap valuesof greater than 70 % were indicated on the branch nodes. The scale bar ofindividual tree was indicated in substitutions per site, using Kimura 2-parameter model in MEGA (version 5.1) to estimate pair-wise evolutionarydistance. NL1= novel lineage 1, NL2= novel lineage 2. (PDF 208 kb)

Additional file 5: Figure S5. Phylogenetic analysis of the HCoV-OC431a (nsp3) and RdRp gene. Trees were reconstructed using neighbor-joiningmethod. Bootstrap values were calculated from 1,000 trees. Bootstrap valuesof greater than 70 % were indicated on the branch nodes. The scale bar ofindividual tree was indicated in substitutions per site, using Kimura 2-parameter model in MEGA (version 5.1) to estimate pair-wise evolutionarydistance. (PDF 119 kb)

Additional file 6: Figure S6. Phylogenetic analysis of the HCoV-HKU1spike gene (S1 domain). Tree was reconstructed using neighbor-joiningmethod. Bootstrap values were calculated from 1,000 trees. Bootstrapvalues of greater than 70% were indicated on the branch nodes. Thescale bar of individual tree was indicated in substitutions per site, usingKimura 2-parameter model in MEGA (version 5.1) to estimate pair-wiseevolutionary distance. The Malaysian isolates obtained in this study werecolor-coded. (PDF 360 kb)

Additional file 7: Figure S7. Phylogenetic analysis of the HCoV-HKU1nucleocapsid gene. Tree was reconstructed using neighbor-joiningmethod. Bootstrap values were calculated from 1,000 trees. Bootstrapvalues of greater than 70 % were indicated on the branch nodes. Thescale bar of individual tree was indicated in substitutions per site, usingKimura 2-parameter model in MEGA (version 5.1) to estimate pair-wiseevolutionary distance. The Malaysian isolates obtained in this study werecolor-coded. (PDF 347 kb)

Additional file 8: Figure S8. Phylogenetic analysis of the HCoV-HKU11a gene (nsp3). Trees was reconstructed using neighbor-joining method.Bootstrap values were calculated from 1,000 trees. Bootstrap values ofgreater than 70% were indicated on the branch nodes. The scale bar ofindividual tree was indicated in substitutions per site, using Kimura 2-parameter model in MEGA (version 5.1) to estimate pair-wise evolutionarydistance. The Malaysian isolates obtained in this study were color-coded.(PDF 133 kb)

Additional file 9: Figure S9. Recombination analysis in HCoV-OC43novel lineages 1 and 2. Analysis of the partial S gene was carried out usingthe MaxChi method in RDP. The x-axis gives the nucleotide positions of thealignment, whereas the y-axis presents the particular test statistics. Peaks in thelog P of χ2 values in the MaxChi test marks potential points of recombination.Dashed lines represent p value cut-offs: uncorrected (lower line) and correctedfor multiple comparisons (upper line) at the 0.05 level. (PDF 330 kb)

AbbreviationsGTR + I: general time-reversible nucleotide substitution model with invariantsites; HCoV-HKU1: human coronavirus HKU1; HCoV-OC43: human coronavirusOC43; HPD: highest posterior density; ICTV: International Committee forTaxonomy of Viruses; MCC: maximum clade credibility; MCMC: Markov chainMonte Carlo; MERS-CoV: Middle East respiratory syndrome coronavirus;NJ: neighbor joining; RdRp: RNA dependent RNA polymerase; SARS-CoV: severe acute respiratory syndrome coronavirus; tMRCA: time of themost recent common ancestors; URTI: upper respiratory tract infection.

Competing interestsCo-author Kok Keng Tee is an Associate Editor for Virology Journal. This doesnot alter the authors’ adherence to all the Virology Journal policies on sharingdata and materials.Other authors do not have any competing interests in the manuscript.

Authors' contributionsConceived and designed the experiments: MNA and KKT. Performed theexperiments: MNA, KTN, XYO, and KKT. Analyzed the data: MNA, KTN, XYO,and KKT. Contributed reagents/materials/analysis tools: MNA, KTN, XYO, YKP,YT, JBC, NSH, AK, and KKT. Wrote the paper: MNA, KTN, and KKT. All authorsread and approved the final version of the manuscript.

AcknowledgementsWe would like to thank Nyoke Pin Wong, Nur Ezreen Syafina, Farhat A. Avin,Chor Yau Ooi, Sujarita Ramanujah, Nirmala K. Sambandam, NagammaiThiagarajan and See Wie Teoh for assistance and support.

FundingThis work was supported by grants from the Ministry of Education, Malaysia:High Impact Research High Impact Research UM.C/625/1/HIR/MOE/CHAN/02/02 to KKT. The funders were not involved in study design or datacollection and analysis.

Author details1Department of Medicine, Faculty of Medicine, University of Malaya, KualaLumpur, Malaysia. 2AIDS Research Center, National Institute of InfectiousDiseases, Toyama, Shinjuku-ku, Tokyo, Japan. 3School of Medicine, YokohamaCity University, Yokohama, Kanagawa, Japan. 4Department of Primary CareMedicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia.5Department of Medical Microbiology, Faculty of Medicine, University ofMalaya, Kuala Lumpur, Malaysia.

Received: 26 November 2015 Accepted: 11 February 2016

References1. Arden KE, McErlean P, Nissen MD, Sloots TP, Mackay IM. Frequent detection

of human rhinoviruses, paramyxoviruses, coronaviruses, and bocavirusduring acute respiratory tract infections. J Med Virol. 2006;78(9):1232–40.

2. Lai MM, Cavanagh D. The molecular biology of coronaviruses. Adv Virus Res.1997;48:01–0.

3. Vijgen L, Keyaerts E, Moes E, Maes P, Duson G, Van Ranst M. Developmentof one-step, real-time, quantitative reverse transcriptase PCR assays forabsolute quantitation of human coronaviruses OC43 and 229E. J ClinMicrobiol. 2005;43(11):5452–6.

4. Makino S, Keck JG, Stohlman SA, Lai M. High-frequency RNA recombinationof murine coronaviruses. J Virol. 1986;57(3):729–37.

5. Brian DA, Spaan WJ. Recombination and coronavirus defective interferingRNAs. Seminars in Virology. Academic Press. 1997;8(2):101–11.

6. Zhong N. Management and prevention of SARS in China. Philos Trans PhysSci Eng. 2004;359(1447):1115–6.

7. Peiris J, Guan Y, Yuen K. Severe acute respiratory syndrome. Nat Med. 2004;10:88–97.

8. Graham RL, Donaldson EF, Baric RS. A decade after SARS: strategies forcontrolling emerging coronaviruses. Nat Rev Microbiol. 2013;11(12):836–48.

9. Stavrinides J, Guttman DS. Mosaic evolution of the severe acute respiratorysyndrome coronavirus. J Virol. 2004;78(1):76–82.

10. Woo PC, Lau SK, Yip CC, Huang Y, Tsoi HW, Chan KH, et al. Comparative analysisof 22 coronavirus HKU1 genomes reveals a novel genotype and evidence ofnatural recombination in coronavirus HKU1. J Virol. 2006;80(14):7136–45.

11. Lau SK, Lee P, Tsang AK, Yip CC, Tse H, Lee RA, et al. Molecular epidemiologyof human coronavirus OC43 reveals evolution of different genotypes over timeand recent emergence of a novel genotype due to natural recombination.J Virol. 2011;85(21):11325–37.

12. Esper F, Weibel C, Ferguson D, Landry ML, Kahn JS. Coronavirus HKU1infection in the United States. Emerg Infect Dis. 2006;12(5):775–9.

13. Woo PC, Lau SK, Huang Y, Tsoi HW, Chan KH, Yuen KY. Phylogenetic andrecombination analysis of coronavirus HKU1, a novel coronavirus frompatients with pneumonia. Arch Virol. 2005;150(11):2299–11.

14. Walsh EE, Shin JH, Falsey AR. Clinical impact of human coronaviruses 229Eand OC43 infection in diverse adult populations. J Infect Dis. 2013;208(10):1634–42.

15. Dare RK, Fry AM, Chittaganpitch M, Sawanpanyalert P, Olsen SJ, Erdman DD.Human coronavirus infections in rural Thailand: a comprehensive studyusing real-time reverse-transcription polymerase chain reaction assays. JInfect Dis. 2007;196(9):1321–8.

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 11 of 12

Page 12: 2016 Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper resp

16. Pyrc K, Berkhout B, van der Hoek L. The novel human coronaviruses NL63and HKU1. J Virol. 2007;81(7):3051–7.

17. Hu Q, Lu R, Peng K, Duan X, Wang Y, Zhao Y, et al. Prevalence andgenetic diversity analysis of human coronavirus OC43 among adultpatients with acute respiratory infections in Beijing, 2012. PLoS ONE.2014;9(7):e100781.

18. Jackson GG, Dowling HF, Spiesman IG, Boand AV. Transmission of thecommon cold to volunteers under controlled conditions. I. The commoncold as a clinical entity. AMA Arch Intern Med. 1958;101(2):267–78.

19. Turner RB, Wecker MT, Pohl G, Witek TJ, McNally E, St George R, et al.Efficacy of tremacamra, a soluble intercellular adhesion molecule 1, forexperimental rhinovirus infection: a randomized clinical trial. JAMA. 1999;281(19):1797–04.

20. Yale SH, Liu K. Echinacea purpurea therapy for the treatment of thecommon cold: a randomized, double-blind, placebo-controlled clinical trial.Arch Intern Med. 2004;164(11):1237–41.

21. Zitter JN, Mazonson PD, Miller DP, Hulley SB, Balmes JR. Aircraft cabinair recirculation and symptoms of the common cold. JAMA. 2002;288(4):483–6.

22. Boom R, Sol C, Salimans M, Jansen C, Wertheim-van Dillen P, Van derNoordaa J. Rapid and simple method for purification of nucleic acids. J ClinMicrobiol. 1990;28(3):495–03.

23. Chan KH, Yam WC, Pang CM, Chan KM, Lam SY, Lo KF, et al. Comparison ofthe NucliSens easyMAG and Qiagen BioRobot 9604 nucleic acid extractionsystems for detection of RNA and DNA respiratory viruses innasopharyngeal aspirate samples. J Clin Microbiol. 2008;46(7):2195–9.

24. Jokela P, Piiparinen H, Mannonen L, Auvinen E, Lappalainen M. Performanceof the Luminex xTAG Respiratory Viral Panel Fast in a clinical laboratorysetting. J Virol Methods. 2012;182(2):82–6.

25. St-Jean JR, Jacomy H, Desforges M, Vabret A, Freymuth F, Talbot PJ. Humanrespiratory coronavirus OC43: genetic stability and neuroinvasion. J Virol.2004;78(16):8824–34.

26. Vabret A, Dina J, Mourez T, Gouarin S, Petitjean J, van der Werf S, et al. Inter-and intra-variant genetic heterogeneity of human coronavirus OC43 strainsin France. J Gen Virol. 2006;87(11):3349–53.

27. Vijgen L, Keyaerts E, Moes E, Thoelen I, Wollants E, Lemey P, et al. Completegenomic sequence of human coronavirus OC43: molecular clock analysissuggests a relatively recent zoonotic coronavirus transmission event. J Virol.2005;79(3):1595–04.

28. Mounir S, Talbot PJ. Molecular characterization of the S protein gene ofhuman coronavirus OC43. J Gen Virol. 1993;74:1981–81.

29. Vijgen L, Keyaerts E, Lemey P, Moes E, Li S, Vandamme AM, et al. Circulationof genetically distinct contemporary human coronavirus OC43 strains. VirolJ. 2005;337(1):85–92.

30. Kon M, Watanabe K, Tazawa T, Watanabe K, Tamura T, Tsukagoshi H, et al.Detection of human coronavirus NL63 and OC43 in children with acuterespiratory infections in Niigata, Japan, between 2010 and 2011. Jpn J InfectDis. 2012;65(3):270–2.

31. Suwannakarn K, Chieochansin T, Vichiwattana P, Korkong S, TheamboonlersA, Poovorawan Y. Prevalence and genetic characterization of humancoronaviruses in southern thailand from july 2009 to january 2011.Southeast Asian J Trop Med Public Health. 2014;45(2):326–36.

32. Zhang Y, Li J, Xiao Y, Zhang J, Wang Y, Chen L, et al. Genotype shift inhuman coronavirus OC43 and emergence of a novel genotype by naturalrecombination. J Infect. 2014;70:641–50.

33. Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by samplingtrees. BMC Evol Biol. 2007;7:214–9.

34. Lau SK, Li KS, Huang Y, Shek C-T, Tse H, Wang M, et al. Ecoepidemiologyand complete genome comparison of different strains of severe acuterespiratory syndrome-related Rhinolophus bat coronavirus in China revealbats as a reservoir for acute, self-limiting infection that allows recombinationevents. J Virol. 2010;84(6):2808–19.

35. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5:molecular evolutionary genetics analysis using maximum likelihood,evolutionary distance, and maximum parsimony methods. Mol Biol Evol.2011;28(10):2731–9.

36. Vijgen L, Keyaerts E, Lemey P, Maes P, Van Reeth K, Nauwynck H, et al.Evolutionary history of the closely related group 2 coronaviruses: Porcinehemagglutinating encephalomyelitis virus, bovine coronavirus, and humancoronavirus OC43. J Virol. 2006;80(14):7270–4.

37. Smith JM. Analyzing the mosaic structure of genes. J Mol Evol. 1992;34(2):126–9.

38. Martin DP, Murrell B, Golden M, Khoosal A, Muhire B. RDP4: Detection andanalysis of recombination patterns in virus genomes. Virus Evol. 2015;1(1):vev003.

39. Martin D, Rybicki E. RDP: detection of recombination amongst alignedsequences. J Bioinform. 2000;16(6):562–3.

40. Lau SK, Woo PC, Yip CC, Tse H, Tsoi HW, Cheng VC, et al. Coronavirus HKU1and other coronavirus infections in Hong Kong. J Clin Microbiol. 2006;44(6):2063–71.

41. S-d T, Salez N, Benkouiten S, Badiaga S, Charrel R, Brouqui P. Respiratoryviruses within homeless shelters in Marseille, France. BMC Res Notes.2014;7(1):81–9.

42. Garbino J, Crespo S, Aubert J-D, Rochat T, Ninet B, Deffernez C, et al. Aprospective hospital-based study of the clinical impact of non–severe acuterespiratory syndrome (non-SARS)–related human coronavirus infection. ClinInfect Dis. 2006;43(8):1009–15.

43. Furuse Y, Suzuki A, Kishi M, Galang HO, Lupisan SP, Olveda RM, et al.Detection of novel respiratory viruses from influenza-like illness in thePhilippines. J Med Virol. 2010;82(6):1071–4.

44. Lee WJ, Chung YS, Yoon HS, Kang C, Kim K. Prevalence and molecularepidemiology of human coronavirus HKU1 in patients with acuterespiratory illness. J Med Virol. 2013;85(2):309–14.

45. Dominguez SR, Shrivastava S, Berglund A, Qian Z, Goes LG, Halpin RA, et al.Isolation, propagation, genome analysis and epidemiology of HKU1betacoronaviruses. J Gen Virol. 2014;95(4):836–48.

46. Vabret A, Mourez T, Gouarin S, Petitjean J, Freymuth F. An outbreak ofcoronavirus OC43 respiratory infection in Normandy, France. Clin Infect Dis.2003;36(8):985–9.

47. Patrick DM, Petric M, Skowronski DM, Guasparini R, Booth TF, Krajden M, etal. An outbreak of human coronavirus OC43 infection and serological cross-reactivity with SARS Coronavirus. Can J Infect Dis Med Microbiol. 2006;17(6):330–6.

48. Tyrrell D, Bynoe M. Cultivation of viruses from a high proportion of patientswith colds. Lancet. 1966;287(7428):76–7.

49. Kahn JS, McIntosh K. History and recent advances in coronavirus discovery.J Pediatric Infect Dis Soc. 2005;24(11):223–7.

50. Woo PC, Lau SK, Chu CM, Chan KH, Tsoi HW, Huang Y, et al.Characterization and complete genome sequence of a novel coronavirus,coronavirus HKU1, from patients with pneumonia. J Virol. 2005;79(2):884–95.

51. Lu R, Yu X, Wang W, Duan X, Zhang L, Zhou W, et al. Characterization ofhuman coronavirus etiology in Chinese adults with acute upper respiratorytract infection by real-time RT-PCR assays. PLoS ONE. 2012;7(6):e38638.

52. van Elden LJ, van Alphen F, Hendriksen KA, Hoepelman AI, van Kraaij MG,Oosterheert J-J, et al. Frequent detection of human coronaviruses in clinicalspecimens from patients with respiratory tract infection by use of a novelreal-time reverse-transcriptase polymerase chain reaction. J Infect Dis.2004;189(4):652–7.

53. Gerna G, Percivalle E, Sarasini A, Campanini G, Piralla A, Rovida F, et al.Human respiratory coronavirus HKU1 versus other coronavirus infections inItalian hospitalised patients. J Clin Virol. 2007;38(3):244–50.

• We accept pre-submission inquiries

• Our selector tool helps you to find the most relevant journal

• We provide round the clock customer support

• Convenient online submission

• Thorough peer review

• Inclusion in PubMed and all major indexing services

• Maximum visibility for your research

Submit your manuscript atwww.biomedcentral.com/submit

Submit your next manuscript to BioMed Central and we will help you at every step:

Al-Khannaq et al. Virology Journal (2016) 13:33 Page 12 of 12


Recommended