+ All Categories
Home > Documents > Nucleotide sequence analysis and genomic organization of the NY-RPV isolate of barley yellow dwarf...

Nucleotide sequence analysis and genomic organization of the NY-RPV isolate of barley yellow dwarf...

Date post: 17-Nov-2023
Category:
Upload: un-lincoln
View: 0 times
Download: 0 times
Share this document with a friend
9
Downloaded from www.microbiologyresearch.org by IP: 93.91.26.109 On: Fri, 25 Dec 2015 18:43:06 Journal of General Virology (1991), 72, 2347-2355. Printedin Great Britain 2347 Nucleotide sequence analysis and genomic organization of the NY-RPV isolate of barley yellow dwarf virus Jeffrey R. Vincent,* Richard M. Lister and Brian A. Larkinst Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907-1155, U.S.A. cDNA clones representing the ssRNA genome of the NY-RPV isolate of barley yellow dwarf luteovirus (BYDV) were sequenced and 5600 nucleotides of the genome were determined. The deduced genome organ- ization has limited similarity to that of another BYDV isolate, Vic-PAV, but is identical to that of beet western yellows (BWYV) and potato leafroll (PLRV) luteo- viruses. NY-RPV has six major positive-sense open reading frames (ORFs) and, by comparison with RNA-dependent RNA polymerase and nucleic acid helicase consensus sequence motifs, it is postulated that NY-RPV ORF2 and ORF3 encode the viral replicase, which is expressed by a translational frame- shift mechanism. The region of the NY-RPV genome containing the 22K coat protein ORF, the apparently associated internal apparent VPg ORF and the ORF immediately 3'-proximal (ORF6) to the coat protein ORF are organized as reported for other luteoviruses. Evidence is presented showing that ORF6 is expressed by readthrough of the coat protein gene termination codon, and that this protein is associated with the intact virus as a 65K protein. Although NY-RPV infects graminaceous rather than dicotyledonous plants, the taxonomic relationships between BYDV isolates and other luteoviruses deduced from the genome organiz- ation and sequence data strongly suggest that NY-RPV is distinct from the PAV-like isolates of BYDV and is more closely related to BWYV and PLRV. Introduction Luteoviruses, such as barley yellow dwarf virus (BYDV), beet western yellows virus (BWYV) and potato leafroll virus (PLRV), cause yellowing diseases in a wide range of host plants (Matthews, 1982), are not mechanically transmitted and are limited to the phloem tissue of the host plant. The physical properties of luteoviruses are similar in that they are 24 to 30 nm diameter isometric particles containing a positive-sense ssRNA genome of 5.6 to 6.0 kb (Mayo et al., 1989; Miller et al., 1988 a; Veidt et al., 1988; Waterhouse et al., 1988). The genomic RNA has a small protein (VPg) covalently attached to its 5' end (Murphy et al., 1989) but does not have a poly(A) tail (Mayo et al., 1982, 1989; Miller et al., 1988 a; Veidt et al., 1988). Unlike BWYV and PLRV, which typically infect dicotyledonous hosts, BYDV consists of a group of serologically related viruses that infect barley, oats, wheat, rice, maize and other graminaceous hosts (Rochow, 1970). Like other luteoviruses, they are t Present address: Department of Plant Science, University of Arizona, Tucson, Arizona 85721, U.S.A. The nucleotidesequencedata reported in this paper will appear in the DDJB, EMBLand GenBanknucleotidesequence databasesunder the accession number D01013. transmitted only by aphid vectors in a persistent, circulative manner. Isolates of BYDV, originally dis- tinguished and named for their predominant aphid vectors, have been separated into two groups based on their serological relationships (Aapola & Rochow, 1971 ; Rochow, 1970; Rochow & Carmichael, 1979; Rochow & Duffus, 1981), the cytopathological ultrastructure of infected cells (Gill & Chung, 1970) and the dsRNA profiles identified in infected tissues (Gildow et al., 1983). Based on these criteria, BYDV group 1 includes serotypes MAV, transmitted by Macrosiphon (=Sitobion) avenae Fabr., PAV, transmitted by M. avenae and Rhopalosiphum padi L., and SGV, trans- mitted by Schizaphis graminum Rond. (Rochow, 1970). Group 2 includes serotypes RMV, transmitted by R. maidis Fitch., and RPV, transmitted by R. padi (Rochow, 1970). To help understand the molecular biology of luteo- viruses, the nucleotide sequence and genome organiz- ation of a PAV serotype of BYDV (Miller et al., 1988a), and BWYV (Veidt et al., 1988) and PLRV (Mayo et al., 1989; Keese et al., 1990) have been determined. The coat protein genes of these luteoviruses (Miller et al., 1988b; Kawchuk et al., 1989; Prill et al., 1989; Veidt et al., 1988), and of isolates representing the two BYDV groups described have also been identified and characterized 0001-0233 © 1991 SGM
Transcript

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

Journal of General Virology (1991), 72, 2347-2355. Printed in Great Britain 2347

Nucleotide sequence analysis and genomic organization of the NY-RPV isolate of barley yellow dwarf virus

Jeffrey R. Vincent,* Richard M. Lister and Brian A. Larkinst

Department o f Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907-1155, U.S.A.

cDNA clones representing the ssRNA genome of the NY-RPV isolate of barley yellow dwarf luteovirus (BYDV) were sequenced and 5600 nucleotides of the genome were determined. The deduced genome organ- ization has limited similarity to that of another BYDV isolate, Vic-PAV, but is identical to that of beet western yellows (BWYV) and potato leafroll (PLRV) luteo- viruses. NY-RPV has six major positive-sense open reading frames (ORFs) and, by comparison with RNA-dependent RNA polymerase and nucleic acid helicase consensus sequence motifs, it is postulated that NY-RPV ORF2 and ORF3 encode the viral replicase, which is expressed by a translational frame- shift mechanism. The region of the NY-RPV genome

containing the 22K coat protein ORF, the apparently associated internal apparent VPg ORF and the ORF immediately 3'-proximal (ORF6) to the coat protein ORF are organized as reported for other luteoviruses. Evidence is presented showing that ORF6 is expressed by readthrough of the coat protein gene termination codon, and that this protein is associated with the intact virus as a 65K protein. Although NY-RPV infects graminaceous rather than dicotyledonous plants, the taxonomic relationships between BYDV isolates and other luteoviruses deduced from the genome organiz- ation and sequence data strongly suggest that NY-RPV is distinct from the PAV-like isolates of BYDV and is more closely related to BWYV and PLRV.

Introduction

Luteoviruses, such as barley yellow dwarf virus (BYDV), beet western yellows virus (BWYV) and potato leafroll virus (PLRV), cause yellowing diseases in a wide range of host plants (Matthews, 1982), are not mechanically transmitted and are limited to the phloem tissue of the host plant. The physical properties of luteoviruses are similar in that they are 24 to 30 nm diameter isometric particles containing a positive-sense ssRNA genome of 5.6 to 6.0 kb (Mayo et al., 1989; Miller et al., 1988 a; Veidt et al., 1988; Waterhouse et al., 1988). The genomic RNA has a small protein (VPg) covalently attached to its 5' end (Murphy et al., 1989) but does not have a poly(A) tail (Mayo et al., 1982, 1989; Miller et al., 1988 a; Veidt et al., 1988).

Unlike BWYV and PLRV, which typically infect dicotyledonous hosts, BYDV consists of a group of serologically related viruses that infect barley, oats, wheat, rice, maize and other graminaceous hosts (Rochow, 1970). Like other luteoviruses, they are

t Present address: Department of Plant Science, University of Arizona, Tucson, Arizona 85721, U.S.A.

The nucleotide sequence data reported in this paper will appear in the DDJB, EMBL and GenBank nucleotide sequence databases under the accession number D01013.

transmitted only by aphid vectors in a persistent, circulative manner. Isolates of BYDV, originally dis- tinguished and named for their predominant aphid vectors, have been separated into two groups based on their serological relationships (Aapola & Rochow, 1971 ; Rochow, 1970; Rochow & Carmichael, 1979; Rochow & Duffus, 1981), the cytopathological ultrastructure of infected cells (Gill & Chung, 1970) and the dsRNA profiles identified in infected tissues (Gildow et al., 1983). Based on these criteria, BYDV group 1 includes serotypes MAV, transmitted by Macrosiphon (=Sitobion) avenae Fabr., PAV, transmitted by M. avenae and Rhopalosiphum padi L., and SGV, trans- mitted by Schizaphis graminum Rond. (Rochow, 1970). Group 2 includes serotypes RMV, transmitted by R. maidis Fitch., and RPV, transmitted by R. padi (Rochow, 1970).

To help understand the molecular biology of luteo- viruses, the nucleotide sequence and genome organiz- ation of a PAV serotype of BYDV (Miller et al., 1988a), and BWYV (Veidt et al., 1988) and PLRV (Mayo et al., 1989; Keese et al., 1990) have been determined. The coat protein genes of these luteoviruses (Miller et al., 1988b; Kawchuk et al., 1989; Prill et al., 1989; Veidt et al., 1988), and of isolates representing the two BYDV groups described have also been identified and characterized

0001-0233 © 1991 SGM

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

2348 J. R. Vincent, R. M. Lister and B. A. Lark ins

(Vincent et al., 1990). Here , we presen t the nuc leo t ide sequence and genome o rgan iza t ion of the R P V sero type o f BYDV. This in fo rma t ion is c o m p a r e d to tha t for a P A V sero type o f BYDV, and also to tha t pub l i shed for B W Y V and P L R V . Ana lyses of sequence s imi la r i ty and genome re la t ionsh ips are p resen ted which ind ica te tha t the R P V serotype, a l though it infects and causes yel lowing disease in g r aminaceous hosts, is more closely re la ted to B W Y V and P L R V , which typ ica l ly infect d ico ty ledonous hosts, t han to P A V serotypes o f BYDV.

Methods

Virus and RNA isolation. The BYDV isolate used in this work was the New York RPV isolate of Rochow (NY-RPV) (Rochow, 1970), named according to the suggestion of Rochow (1984) that isolates originating in New York State be prefixed by 'NY'. This isolate was maintained at Purdue University in oat plants (Arena sativa L. cv. Clintland 64) by mass transfer of viruliferous R. padi; the absence of cross-contami- nation was confirmed by ELISA as described previously (Barbara et al., 1987). Virus was purified and the RNA extracted as described for the MAV-PS1 isolate of BYDV (Barbara et aL, 1987).

cDNA. A cDNA library representing the NY-RPV genome was constructed in 2gtll (Young & Davis, 1983), pUCI8 (Vieira & Messing, 1982) and pGEM-3Z (Promega) (Barbara et al., 1987; Vincent et al., 1990). A restriction map representing the viral genome was generated by single and double restriction enzyme digests of the cloned cDNA, and by Southern blot hybridization between different restriction fragments (Maniatis et al., 1982).

c-DNA sequencing. Plasmid DNA isolated by an alkaline lysis technique (Birnboim & Doly, 1979) was sequenced by the dideoxy- nucleotide chain termination method (Sanger et al., 1977) with a modified T7 DNA potymerase (Sequenase; U.S. Biochemical). The cDNA clones representing the viral genome were sequenced by one or more of the following strategies. (i) Restriction fragments representing overlapping regions of the genome were subcloned; (ii) exonuclease III/ mung bean nuclease-generated nested deletions were identified in existing cDNA clones (Putney et al., 1981); (iii) specific synthetic sequencing primers were prepared (Laboratory for Macromolecular Structure, Department of Biochemistry, Purdue University). Sequence analyses were performed using Microgenie version 4.0 (Beckman Instruments) and the Genetics Computer Group sequence analysis software, version 6.1 (Devereux et al., 1984).

Western blot analysis. For Western blot analysis of NY-RPV, purified virus particles were dissociated by the addition of an equal volume of 2 × SDS-PAGE sample buffer (Laemmli, 1970), incubated overnight at 4 °C and electrophoresed on a 10~ polyacrylamide gel. Viral proteins were detected immunologicaUy with 0-15% Tween 20 used to prevent non-specific binding (Barbara et al., 1987).

Results and Discussion

Sequencing

O v e r l a p p i n g c D N A clones represen t ing N Y - R P V genomic R N A were ident i f ied by res t r ic t ion enzyme analys is and hybr id i za t ion (Fig. 1). F r o m them, 5600 nucleot ides o f the N Y - R P V genome were d e t e r m i n e d

H X Hc I I

V E V H S Ss B g S s A Bg E V B E V S s X

] I I I i i L 11 I I I I I I i I I[ t l l l L ~

H E A V H B B

29 24 3-2 - -

23 130

I I A

95 1 0 4 - -

10 I - 8 - -

6-4

I kb T

Fig. 1. Map of inserts from cDNA clones used for sequencing the BYDV NY-RPV isolate. The location of representative restriction sites within the genome is indicated, pRP cDNA clones used to generate the BYDV NY-RPV sequence are shown below the restriction map. The arrows indicate the length and direction of sequencing of these cDNA clones. A, Accl; B, BamHI; Bg, BglII; E, EcoRI; H, HindlII; Hc, HinclI; S, Smal; Ss, SstI; V, EcoRV; X, Xbal.

(Fig. 2), of which 100% was d e t e r m i n e d f rom sequencing o f bo th s t rands and 74% was d e t e r m i n e d f rom more than one i ndependen t ly de r ived c D N A clone. The c D N A inserts in clones p R P l l A , pRP23 , pRP29 , pRP95 , pRP104 , pRP3-2 and pRP1-8 were sequenced in the i r ent i re ty , whereas only a pa r t i a l sequence for the r ema in ing clones shown in Fig. 1 was de te rmined . Clones p R P 2 9 and pRP1-8 represen t the ex t reme 5' and 3' ends of the N Y - R P V sequence ob ta ined , respec t ive ly ; no o ther c lones con ta in ing sequences co te rmina l wi th those ob t a ined f rom e i ther p R P 2 9 or pRP1-8 were identif ied. Therefore , only p R P 2 9 was used to de t e rmine the Y-prox imal 366 nuc leo t ides and only pRP1-8 was used to de t e rmine the Y-prox imal 107 nucleot ides .

Sequencing of more than one i ndependen t ly de r ived c D N A clone represen t ing severa l regions of the genome ident i f ied six va ry ing nuc leo t ides (Fig. 2). O f these, ha l f were C to U t rans i t ions and ha l f were t ransvers ions , and three result in a m i n o ac id changes (Fig. 2); the o the r three changes , occurr ing in the ' w o b b l e ' pos i t ion , would not result in any a m i n o ac id change. W e do not have any ind ica t ion whe the r the obse rved nuc leo t ide differences are c loning a r te fac ts or whe the r they represen t t rue va r i ab i l i t y assoc ia ted wi th the vi ra l R N A popu la t ion f rom which the c D N A l ib ra ry was const ructed .

Open reading f r a m e s ( O R F s )

A c o m p u t e r analys is o f the N Y - R P V nucleo t ide sequence ident i f ied six O R F s on the pos i t ive-sense s t r and c a pa b l e o f encod ing po lypep t ides grea te r t han

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

The N Y - R P V isolate o f B Y D V 2349

10K, as well as two on the negative-sense strand (Fig. 3). The amino acid sequences encoded by the six major positive-sense ORFs are indicated within the nucleotide sequence in Fig. 2 and are referred to as ORF1 to ORF6 in order of their proximity to the 5' end of the sequence.

ORF1 begins at nucleotide 115, which also begins the first AUG in the genome, and potentially encodes 256 amino acids of Mr 29 263. ORF2 begins with the second AUG in the genome at nucleotide 266 and potentially encodes a polypeptide of 654 amino acids (M r 70575). Thus, ORF2 overlaps the 3' end of ORF1 by 620 nucleotides. The first AUG of ORF3 begins at nucleo- tide 1813, which is 213 nucleotides downstream of the previous in-frame termination codon, and could encode a polypeptide of 633 amino acids (Mr 71785). The GDD consensus sequence for RNA-dependent RNA polymer- ase (Kamer & Argos, 1984) has been identified within ORF3 (see below) and thus ORF3 encodes part of the NY-RPV viral replicase.

The coat protein of NY-RPV is encoded by ORF4 (Vincent et al., 1990). Beginning at nucleotide 3687, this gene is 612 bases in length, terminates with an amber termination codon and encodes a protein of 22190. Located completely within ORF4, but in a different reading frame, is 9459 base ORF which is capable of encoding a protein of Mr 17211 (ORF5). Recently the Mr of the VPg of a different RPV isolate has been determined to be 17K (Murphy et al., 1989). As only ORF5 is capable of encoding a protein of Mr 17K, it is probable that it encodes the NY-RPV VPg. ORF6 is a 1197 nucleotide ORF beginning at nucleotide 4302, immediately after the coat protein gene termination codon and in the same reading frame. The first AUG of ORF6 does not occur until nucleotide 4572.

Western blot analysis of purified NY-RPV virions identified three proteins; Fig. 4 shows a typical analysis. Two smaller proteins were identified with estimated Mrs of 19K and 23K; the largest protein had an Mr of approximately 58K to 63K, and often was not detectable (Fig. 4). Both the ability to detect the largest protein and its size were dependent on how often the purified virus preparation had been thawed and refrozen before analysis. When an NY-RPV preparation had been thawed and frozen five times, the 63K protein could not be detected and a faint protein band with an apparent Mr of 58K was identified instead (Fig, 4, lane 1). When this preparation was frozen and thawed once more, the 58K band could no longer be detected.

Of the smaller proteins identified by Western blot analysis, the 23K protein was predominant and migrated in SDS-PAGE as the authentic NY-RPV coat protein (Vincent et al., 1990); the band migrating as a 19K protein was presumably the VPg. No separate ORF is apparent that would account for the largest protein

observed by Western blot analysis of NY-RPV virions. However, readthrough of the coat protein gene termin- ation codon could produce a protein of Mr 65K. The implication is that ORF4 and ORF6 together can encode a 65K protein by readthrough of the ORF4 stop codon, and that this protein is associated with the viral capsid. Similarly, readthrough of the PLRV coat protein gene termination codon recently has been shown (Bahner et al., 1990).

Non-coding regions

The NY-RPV genome contains 401 nucleotides that are not contained within the six major positive-sense coding regions. These non-coding regions are located at the extreme 5' and 3' ends of the genome, and 5' to the coat protein gene (ORF4). We have no evidence that these 5'- and T-terminal sequences represent the actual termini of NY-RPV. We identified 114 nucleotides in the 5'-proximal non-coding region and 102 nucleotides in the T-proximal non-coding region. Computer analysis of the T-terminal non-coding region did not identify any potential tRNA-like structures. For PLRV and BWYV, luteoviruses with genomic organization similar to that of NY-RPV, there are 141, and between 146 and 197 T-terminal nucleotides, respectively (Keese et al., 1990; Mayo et al., 1989; Veidt et al., 1988). A comparison between the 5' non-coding regions of these luteoviruses showed that although 114 nucleotides were identified at the 5' end of NY-RPV, the length of this region is variable; isolates of PLRV have between 32 and 174 (Keese et al., 1990; Mayo et al., 1989), and BWYV has 31 nucleotides at the 5' terminus (Veidt et al., 1988). However, for these luteoviruses, and also for Vic-PAV (Miller et al., 1988a), a consensus hexanucleotide sequence, AC/UAAAA/C, has been identified in these 5' non-coding regions. This sequence is also found in southern bean mosaic virus (SMBV) (Keese et al., 1990; Wu et al., 1987).

Nearly half of all the non-coding nucleotides in NY-RPV are located between ORF3 and ORF4, i.e. 5' to the coat protein gene. This region is A-U-rich, with 58 of the 185 nucleotides being A or U residues. Uridine forms 42~o of the 113 nucleotides immediately upstream of the coat protein gene initiation codon, and A and U residues together make up 62~ of the nucleotides in this region. A comparison of similar regions from other luteoviruses indicates that there is a core region consisting of 113 nucleotides upstream of the coat protein gene initiation codon (Fig. 5). This core region contains the identifying features of subgenomic promoters defined for brome mosaic virus (BMV) (Marsh et al., 1988). There is extensive nucleotide identity between the core regions of NY-RPV, BWYV

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

2350 J. R. Vincent, R. M. Lister and B. A. Larkins

1 CGGCUCCAAAGCL.JAAGCUCACUGWOCCCGCGCUGHUUCGACGCUUCCAGCUCACAGCC CAUAAAAC CCCCUCCUC CCG CUCUAGC GAUCAACUC GCGUGUUC GUUAUUAACGCA.UGUUC

ORFI

121 AUCGCCCAACCUUGCGGGCGAGUUCUU GUGUUCGACGUCGC CUCCCACAC GCCAUC GUUCUUCACUC GUUAUAGUGUUGAACUCUC GC UCCGUGC UC UAGAUC CACUCUUCACGCGAGCA ORFI I A Q P C G R V L V F D V A S H T P S F F T R Y S V E L S L R A L D P L F T R A

ORF2

241 GUAACAGAUUUCCGAUACACCCAAAAUGAAAUCGAUUHAUUUHGU GU GUCUCUUGGCUUUCUGUUGCCAAUUCU CCUCACAGGAGAAUCUUACUCUUGGC GCGGUCACCUUAACCUCC C C

ORFI V T D F R Y T Q N E I D L F C V S L G F L L P I L L T G E S Y S W R G H L N L P ORF2 * K S I Y F V C L L A F C C Q F S S Q E N L T L G A V T L T S P

361 CUUUCUUACAC CGAACUACUUGUUCGAUGGGGGCUCGCAGUGGGGUACUUC CCUGCCUU CUC CGCUGAUGGUGACAUUC GACAGAACC CAGAACUCC GCAUCGACCUGUCCAC CAUGUCA ORiel L S Y T E L L V R W G L A V G Y F P A F S A D G D I R Q N P E L R I D L S T M S

ORF2 F L T P N Y L F D G G S Q W G T S L P S P L M V T F D R T Q N S A S T C P P C Q

481 ACCCGCUCUUUCUACGAGCAGUUCCUACUCAGAUAUAAUACAAGUGGGUUGGCAAAAGCUAUC•UUGGACAGCAAGAGUGCUUUCAAAGCGGCAUGGAGUCUUUUAAAAGAUUCCUGCAC

ORFI T R S F Y E Q F L L R Y N T S G L A K A I V G Q Q £ C F Q S G M E S F K R F L H

ORF2 P A L S T S S S Y S D I I Q V G W Q K L S L D S K S A F K A A W S L L K D S C T

601 UAC CGCCUCACGUGCUUUGAAAGCUGCCUUCCACGACCUCGUUGGGAAAGCCCUUU GGC UCCUGGUC CUUAUC U GGACAG GGC UUUUGAGGCAACUCUUCUCG GCCGUAUGGUCGGCCAU

ORiel Y R L T C F E S C L P R P R W E S P L A P G P Y L D R A F E A T L L G R M V G H

ORF2 T A S R A L K A A F H D L V G K A L W L L V L I W T G L L R Q L F S A V W $ A I

721 AAC CAACUACUCUUUACC GGUUUGUCUUCUGAUAUCACUAGGUAUUAUAAC GAGUU GGUUGUGGAAG GCGUGCC GG UGGCUUUUUGGGACGCUGCCGGCAUUAC UUU GCAUCACGC UGGU

ORFI N Q L L F T G L S S D I T R Y Y N E L V V E G V P V A F W D A A G I T L H H A G ORF2 T N Y S L P V C L L I S L G I I T S W L W K A C R W L F G T L P A L L C I T L V

841 GAAGAAUAUUUU CC GAAUUCUUACAUU CAAAAGAUU CUU CAAUGAGA~CUGU CU CAGGUUAUGACAGUUAUU CGAUU C CCAGCACC CCUC CGAAGCGGAGC GUCAUCAUGAUGC GACG

ORFI E E Y F P N S Y I Q K I L Q + ORF2 K N I F R I L T F K R F F N K K T V S G Y D S Y S I P S T P P K R S V I M M R R

(C) 961 CCAAAAUAAGGAC, CACAUUGGAUAUGC GAACUGUAUUCGUCUUUUCGACG GAC GGAACG CAAUUGUCACAGUU GCUCACAAUAUAGAGGAGGGUU GCUCUUUCUAUUCCUCAC GGACUAG

ORF2 Q N K E H I G Y A N C I R L F D G R N A I V T V A H N I E E G C S F Y S S R T S

(P)

1081 UGGUUCCAUCCCAAUCACUGAAUUUCGCGUUAUCUUUGAAAGCAAAACCAUGGAUAUCGCCAUUCUGGUGGGACCCAUCAAU UGGGAAUCGAUUCUCGGAUGCAAAGGAGUCCACUUUAC

ORF2 G S I P I T E F R V I F E S K T M D I A I L V G P I N W E S I L G C K G V H F T

1201 CAC CGCUGACCGCUUAGCAGAAUGUC CAGC GGCUCUUUAUCUUCUUGAUAGCGAUGGUCAGUGGCGC U CAAACUCGG CGAAGAUUUGUG GUCAUUUC GACAAC UUC GCCCAAGUCUUGUC

ORE2 T A D R L A E C P A A L Y L L D S D G Q W R S N S A K I C G }4 F D N F A Q V L S

(U) 1321 CAACACUAAAGUCGGUCACUCCGGUGC CGGUUACUUCUAUG GCAAGACUCUCG UUG GCCUCCAUAAAGGC CAUCCUGGCAAAGAUUUCAAUUUUAAUCUGAUGGCUCCUUUGCCAGGAAU

ORF2 N T K V G H S G A G Y F Y G K T L V G L H K G H P G K D F N F N L M A P L P G I

1441 ACC CGGCUUGAC CUCCCC GCAGUACGUCGU GGAAUCCGACC CCC C CCAGGGAUU GGUUUUCCCCGAAGAAGUCACUGAAUCAAUC GAAGCCGCGAUUAAGGAGGCAACAAUGUACAAGAA

ORF2 P G L T S P Q Y V V E S D P P Q G L V F P E E V T E S I E A ;% • K E A T M Y K N

1561 UGUCUUCGCCAAUAGAGGACGUGGCGCCUUUAAAUCUAAAAGCGGCAUCAACUGGGAAGACAUCGAAGACGAGUCGGGAAACGGGAAGGCGGCGGCGUCCGCCGUAACAAACGCAGCAGC

ORF2 V F A N R G R G A F K S K S G I N W E D I E D E S G N G K A A A S A V T N A A A ORF3 K R }4 Q L G R H R R R V G K R E G G G V R R N K R S S

1681 GGCAAACAAGGUUAUCGCCAC CC C CGGCGU CG C CAAGUCCCAAAAGAAGAC UGC UGUUC C UUCUU CG C CGAAG GCAC CGCAG CCC CCCGCUGCAUCC CAAC CCACCAGCACAACUG GGAG

ORF2 A N K V I A T P G V A K S Q K K T A V P S S P K A P Q P P A A S Q P T S T T G S

ORF3 G K Q G Y R H P R R R Q V F K E D C C S F F A E G T A A P R C I P T H Q H N W E

1801 CAGAACUCC UAUAUGUC CAAU UGCUAC UGCAACAUC CC C GGACACGGC UGUGC C UACUG GAC CAU CG CAGCAAGAAAUCAUGAACAACAUAAUGAAUC UG C UGGUC C AGAGGAUC G AUAU

ORF2 R T 9 I C P I A T A T S P D T A V P T G P S Q Q E I M N N I M N L L V Q R I D M

ORE3 Q N S Y M S N C Y C N I P G H G C A Y W T I A A R N H E Q H N E S A G P E D R Y

1921 GUC GAAGAU CGAGAAAUC GAUAGUGGACCAAGUCGC GAAUCAAGCUCUGAAGAAACCAC GCGGCAAGCGUGGCUCAAAGAAAC GGC CCG CAGCU GGCAAAAGU U CUUCGC CGACAUCUAC

ORF2 S K I E K S I V D Q V A N Q A L K K P R G K R G S K K R P A A G K S S S P T S T

ORF3 V E D R E I D S G P S R E S S S E E T T R Q A W L K E T A R S W Q K F F A D I Y

2041 ACCUG GGACGUAUCAACAUC CAAACAAGAAGUC CCAGGCUUCGAACAGGUU GGGAAAUUCUC CC C CCAGUACUACC C GC GCCC CC GCAG CGAAU CAGAAU GGGGGAGAAAACUCU GUGC C

ORF2 P G T Y Q H P N K K S Q A S N R L G N S P P S T T R A P A A N Q N G G E N S V P

ORF3 T W D V S T S K Q E V P G F E Q V G K F S P Q Y Y P R P R S E S E W G R K L C A

2161 GAACACCCUCUCUUGGGUC~AGAAAACUGCCGGUUUCGGGUGGCCCCAAGUCGGGGCAUCCG~UGAACUGACAUCUCUUCGGCUGCAA~CAGCUCGCUGGCUCGAAC~UUCCGAGUCAGCU

ORF2 N T L S W V R K L P V S G G P K S G H P L N + ORF3 E H P L L G E K T A G F G W P Q V G A S A E L T S L R L Q A A R W L E R S E S A

(U)

2281 AAAAUUCCAUCAGAUGCUGCUAGAGAGAAC GUGAU CAAUCGCAC C GUACAG GCGUAC UC GAAUUGCAAAAC CAAUACGCC GAGGUGCACAC GAG GUGAGUUAAGUU GGGAAAC GUU CAAA

ORE3 K I P S D A A R E N V I N R T V Q A Y S N C K T N T P R C T R G E L S W E T F K

2401 AUUGACUU C CUUGAAGC CAUCAAAUC GCUC CAACU UGAC GCUGGC GU C GGAUUACC GAUGAU CAC UGCGGG GC UCCCCACCCAUCGUGGGUGGGUUGAAGACCCAGAUCGUUUGCCAGUG

ORF3 I D F L E A I K S L Q L D A G V G L P M I T A G L P T }4 R G W V E D P D R L P V

2521 UUAGCUCAG CUUAC CUUUGACCGCCUACUUAC GAUGUCAAAGGCAAG CUU GGAAGC U CGCUC CC CGGAGCAACU CG U GAAGGAGAAUC UCU GUGAUC CUAUUC GUCUCUUUGUAAAACAA

ORF3 L A Q L T F D R L L T M S K A S L E A R S P E Q L V K E N L C D P I R L F V K Q

(U) 2641 GAGCCACAUAAACAGAGUAAGCUUGAUGAGGGACGUUAC CGC CUCAUCAU GUC AGUC UC CUUGAUUGAUCAAUU GG UAGC CC GGG UUU U GU UUCAAAGACAAAACAAAUCCGAAAUCG C G

ORF3 E P H K Q S K L D E G R Y R L I M S V S L I D Q L V A R V L F Q R Q N K S E I A

2761 UUGUGGAGC GCGAUUCCGUCAAAACC C GGUUUUGGAUUAUC CAC CGA GGAC CAAGUUUCAAAAUUCAUGGAUGU CCU CG CCGGGAACGUUGGUGC GUCUC CCGAAGAGGUCUGCGACAAU

ORF3 L W S A I P S K P G F G L S T E D Q V S K F M D V L A G N V G A S P E E V C D N

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

The NY-RPV isolate of BYDV 2351

(c) 2881 UGGCGC GACCUGUUG GUUCCUAC GGACUGUUC CGGUUUUGACUGGUCAGUCUCC GAUUGGAUGCUCGCGGAUGACAUGGAGGUGAGAAAUC GCCUAACAAUCGACUGCAACGAGCUCAC C

ORF3 W R D L L V P T D C S G F D W S V S D W M L A D D M E V R N R L T I D C N E L T

(R)

3001 AGACACCUUAGGGCUGUUUGGCUUCAGGGCAUAUCAAACUCGGUUUUAUGC CU GUCG GAUGGAAC CAU GUUAGCUCAAC GAGUACC CGGCGUGCAGAAAAGUGGUUC GUACAACACUUC G

ORF3 R H L R A V W L Q G I S N S V L C L S D G T M L A Q R V P G V Q K S G S Y N T S

3121 UC CACAAACUCACGC GUUCGGGUCAUGGCUGC CUAU CACUGUGGC GCCUCGUGG GCUAUC GCUAU GGGUGAUGAUG C CCUGGAAGCUCCAGACACAGACUUGUCAAAAUAUAAAGAUCUG

ORF3 S T N S R V R V M A A Y H C G A S W A I A M G D D A L E A P D T D L S K Y K D L

3241 GGCUUCAAAGUC GAGGUUAGCGGAGAGUUGGAAUUCUGCUCGCGCAUUUUCAAGACC CC UAACCUCGC CAU UC C GG UUAAUGAAAACAAAAUGUU GUACCGCUU GAUCCACGGGUACAAU

ORF3 G F K V E V S G E L E F C S R I F K T P N L A I P V N E N K M L Y R L I H G Y N

3361 CC GGAAUGUGGCAAUUUUGAGGUGGUUCAAAACUAUCUCAAUGCAGCUGUCUCAGUGCUG CAUGAGC UCC GACAUGAUC GGGAGCUUUGCCUAAAACUUCAGGAGUGGUUGAUUUCUGAC

ORF3 P E C G N F E V V Q N Y L N A A V S V L H E L R H D R E L C L K L Q E W L I S D

3481 GU CACCACAAAACUAAACUGAGCACAAAACUAG CC GGACAAACGUAAGUUGCAAGUG CCGGAAG UCAAGUCUUACACACAAGC CCAAC AUUGAUUUUCAUUUGUUAG CGG GAUUUGCC C U

ORF3 V T T K L N +

3601 UGGAUUUAUAUC CU CAAUCCCCAUUUCAGUUGUUGGUGUUUAUCUAGUCUACCUUAAGGU trOCGACCCACA UUCGUGAAAUU GUUAAUGAGUACGGU CGUCCUUAGAUCCAAUGGCAAUG

ORF4 * S T V V L R S N G N

O~LF5 * A M

3721 GUUCGC GCAGACGCAGACAGAGAGUCGCUCGGCGAAGGCCUGCUGCAAGAACGCAGCCAGUGGUUGUGGUCGCUUCCAACGGCCCAGCCAGGCGCGGAAGACGCCGACGACCAGU•GGUC

ORF4 G S R R R R Q R V A R R R P A A R T Q P V V V V A S N G P A R R G R R R R P V G

ORF5 V R A D A D R E S L G E G L L Q E R S Q W L W S L P T A Q P G A E D A D D Q L V

3841 CUCGGCGAGGAAGAACUCCAAGAUCUGGAGGAGGAAGCCGUGGCGAGACAUUCGUUUUCUCAAAGGAUUCACUCGCGGGCAACUCCUCUGGAAGUAUCACCUUCGGGCCGUCUCUAUCAG

ORF4 P R R G R T P R S G G G S R G E T F V F S K D S L A G N S S G S I T F G P S L S

ORF5 L G E E E L Q D L E E E A V A R H S F S Q R I H S R A T P L E V S P S G R L Y Q

3961 AGUAUC CG GCAUUC CAGAAU GGAGUACUC AAGG CCUAC CAU GAAUAUAAGAUCACAAAUU GU GU CUUACAGUUCGUCAGCGAG~CCUCUUCCACAGCAGCCGGCUCCAUCUCUUACGAGU

ORF4 E Y P A F Q N G V L K A Y H E Y K I T N C V L Q F V S E A S S T A A G S I S Y E

ORF5 S I R H S R M E Y S R P T M N I R S Q I V S Y $ S S A R P L P Q Q P A P S L T S

4081 UGGAC CCCCAUUGCAAAGCAU CUUCACUC GCAUCAACAAUCAAUAAGUUCACAAUCACCAAAACU GG UGC GCGGAG CUU C CCAGC GAAGAUGAU CAAC GG GUUAGAGUG GCAC CCCUCAG

ORF4 L D P H C K A S S L A S T I N K F T I T K T G A R S F P A K M I N G L E W H P S

ORF5 W T P I A K H L H S H Q Q S I S S Q S P K L V R G A S Q R R +

4201 AUGAGGAUCAGUUC C GAAUUC UUUACAAAGGGAAC GGUGCUU CUU CUGUG GCUGGGU CUUUUAAGAU CACUCUUAGAGUCCAG CUACAA~C CCAAAAUAGGUAGAC GC GGAACC C GGUC

ORF4/6 D E D Q F R I L Y K G N G A S S V A G S F K I T L R V Q L Q N P K + V D A E P G

4321 CCAGUCCAGGACCAUCACCCGACCCCCCCCCCCCACCUUCACCGUCCCCAGAACCUGCUCCUGCUAAGGAGGAACGAUUUAUCGUCUAUUCAGGGGUGGCUCACACAAUCAUCAGUGC C C

ORF6 P S P G P S P D P P P P P S P S P E P A P A K E E R F I V Y S G V A H T I I S A

4441 AAAGCACU GAUGAUUCUAUCAUAGUUAGAGAUAUCC CAGAC CAAC GGUUUAGAUAC GUUGAGAAC GAAAAC UU C UAC UGGUUU CAGAUAGCUGCUCAAUGGUAC UCAAAUAC GAACACAA

ORF6 Q S T D D S I I V R D I P D Q R F R Y V E N E N F Y W F Q I A A Q W Y S N T N T

4561 AAGCAG UUCCGAUGUUUGUCUUUC CAGUCC CCAUC GGGGAAU GGUCAGUC GAAAUAU CAACUGAG GGAUAUCAAGCUAC GUCUAGUAC CAC GGAUCCUAACAAG GG C CGCAUU GAC GG GC

ORF6 K A V P M F V F P V P I G E W S V E I S T E G Y Q A T S S T T D P N K G R I D G

4681 UCAUAGCUUAUGAUAACUCAAGU GAAG GGUGGAAUAUC GGGG CU G GAAGUAAC GUUACCAUCAC GAACAACAAAGCU GACAAUAGUUGGAAAUAC GG U CAUCC U GAUCU GGAAAUCAACU

ORF6 L I A Y D N S S E G W N I G A G S N V T I T N N K A D N S W K Y G H P D L E I N

(A) 4801 CUU GU CACUUCAAC CAGAAUCAAGUUUUGGAGAAAGAUGGUAUUAUUUCUUUCCAU GUCAAGGCUACAGAGAAG GAAGC UAAUUU CUU C UUAGUC GC C CC U CCAGUC CA GAAAAC C UC GA

OR/'6 S C H F N Q N Q V L E K D G I I S F H V K A T E K E A N F F L V A P P V Q K T S

(K)

4921 AGUAUAACUAC G CU GUCUCAUAUGGAGCGUGGACGGACAGAGACAUG GAAUUCGGAUUGAUUACAGUCACG UUAGAU GAGAAACGU GG CUC G GGUUC CCC CACUCGCAA~GCUUACG C G

ORF6 K Y N Y A V S Y G A W T D R D M E F G L I T V T L D E K R G S G S P T R K S L R

5041 CUGGACAC GCGG GAGUAACGACCACCACUGAUUUGGUGGCGUUACCGGAGAUGGAAAACUCC GGUAUAGAAACAUC UGAAACC CC GUCAGC U CCG GU CAC CAG C UCAAAA GCA CC GUUG C

ORF6 A G H A G V T T T T D L V A L P E M E N S G I E T S E T P S A P V T S S K A P L

5161 CAACUGUCUCGGAUUCGGAGUCAGAAGAC GAUCCGUUA UCAG CC GCAC CAGAUG UAGGC UUUGG C GGAACU CGU CUUUU GAUU GAUAC UGAUAU CAAAACAAUACC C GAU CC G GAC GU UG

ORF6 P T V S D S E S E D D P L S A A P D V G F G G T R L L I D T D I K T I P D P D V

5281 CUGAUGCCUUCGUGAACAGCGCUCACGUGGGUGUGGAUCCAUGGGCAGAAGUUCGAGCGUUUAAGAGAGCUCAGCGUCCUCCCCGUGGACCGUCAUCGGUUGCUAGCAGCAGCCUCUCUG

ORF6 A D A F V N S A H V G V D P W A E V R A F K R A Q R P P R G P S S V A S S S L S

5401 GAGGUU CG C UGC GU GGAAGCCUCAGAC CGAAAACUGAGGAUC CGAAAGAUU CAU CCAAAU CU AAA UC UAG AAAAUG GUC UCUC GC~AAGUCUG CGAUGACUAUAAAC UAAG GC GAAG GAAA

OR~6 G G S L R G S L R P K T E D P K D S S K S K S R K W S L G S L B +

5521 AUGCCUUAAAUCACCCGUCAAGCGACGUUAAAAGGAGGAUUAAGACCCUCCCCCCUUGCUGGUUCAACAACCUUGUUCC•

Fig. 2. Nucleotide sequence (RNA form) of the BYDV NY-RPV isolate. The initiation codons for the six major positive-sense ORFs are indicated by an asterisk and the termination codons by + . The deduced amino acid sequence is indicated below the nucleotide sequence. Nucleotides which varied when identified from the sequencing of different clones are indicated in parentheses above the nucleotide sequence; associated changes in the deduced amino acid sequence are indicated in parentheses below the amino acid sequence.

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

2352 J. R. Vincent, R . M . L is ter and B. A . Lark ins

. . . . . I , , , . . . . i . . . . i . . . . i . . . . i

1 3 5

2 5' i t Ii q h l h q ~ ~ 3'

4 6

y ~ r~ rdl ~ ,~ 00~ ~ 5'

bp 1000 2000 3000 4000 5000

Fig, 3, ORFs determined from the positive- and negative-sense orientation of the BYDV NY-RPV nucleotide sequence. Vertical ticks above a line indicate the location of initiation codons; those below a line indicate the location of termination codons. The open boxes represent ORFs and the six major positive-sense ORFs are numbered according to their proximity to the 5' end of the genome. All ORFs are represented as beginning with an AUG initiation codon.

1 2

- - 1 0 7 K

- - 69K

63K 5 8 K .-------~

23K ~ t~

- - 46K

- - 29K

: : ' - - 18K

- - - 1 5 K

Fig. 4. Immunological detection of the virion-associated proteins from the BYDV NY-RPV isolate. Purified NY-RPV virions were dis- sociated and electrophoresed on a 10% polyacrylamide gel. After electroblotting onto nitrocellulose, virus-associated proteins were detected immunologically using a polyclonal anti-NY-RPV primary antiserum and an alkaline phosphatase-conjugated secondary anti- serum. Sizes of proteins identified from purified NY-RPV prep- arations are shown to the left, those of protein size standards to the right. Lane 1, purified NY-RPV thawed and frozen five times; lane 2, purified NY-RPV thawed only once.

-200 <90 ~,8o • ivo -~6o -~o ~4~

R PV GCAC~CUAGCCGGAC~CGU~GUUGCAAGUGCCGG~GUC~GUCU B ~ y v AACAAGAAG~AC, UCAGC~JIIACAUUGAAAUDUU AAAGAG~DUU CUGC~CAGU AAGAGACDD AAGCAAA P L R V AGGAGCUCACU~CUAGCC~GCAUACACGAGUUGCAAGCAUUGGAAGUUC~GCCUCGUU

-laO IZ0 110 lO0 -~0 t~o -;r~

B Y D V M A V U'SACCACAUUU UAGCUGGGI?tlUG G~.UA GGGUUUAuAcCcuCuAUA B Y D V P A V UUACCAG~UCU 6OUUa ~aU~GCCUt,UAUAC~;UaCUAUA

' { {{{{ {{ [ : : {i , !

R P V UACACACaaGCCC~CaUUGaUVUU CAWJUGU UAGCeC~,UUUa ] CCCUUCCAUUU,~UaUCCUC~UC B W Y V CCC~GAUAC~CGGAUUAC~UUCC UAGCAGGCUUCG CCGCAGGCUUCGUUUCAUCGAUA PLRV ACAUC~CCGGACU~UAGAUUAU~UUCU UAGCGGGAUUUG CUUUAGGAUUCUCAUCCGCAAUC

- 6 0 -50 - 4 0 - 3 0 20 ~0 i

BYDV MAV CCCUGUACAL~AGCUCUCACG[UaUOWJAO~JAC~U~a~UUUC AGACACUACUACAaaGCUAGUGA BYDV PAV CCCUGUACAUUAGCUCUCGCG [UACUUUAUUUACAAUAAAGUUUC AGACACCACUAGAGAGGUGGUGA

R P V / / I I i / / I I I I I I ' ~1 i ] ; / i cccauuucA~uu~uuc~tm UAUCU~e,UCU~CCUUAAG~UUUC G~CCC~C~UUC~O6~UU~UUA

BWYV CC~UAUCCGUGAUCAGUAUC UA~CAUCUACCU~G~UCUC] C~CACGUAQGCG~UCGIIU} PLRV CCAUUUUCAGUAGCCGGUUUA UAUUUUGUUUACCUAAAGAUUUC CUCCCACGUGCGAUCAAUUGUUA

Fig. 5. Comparison of luteovirus non-coding regions located 5' to the coat protein coding region. Asterisks represent nucleotides identical in the sequences of BYDV NY-RPV isolate, BWYV and PLRV. A vertical bar represents nucleotides identical in all the luteoviruses examined. Numbering is in a direction 5' to the putative coat protein gene initiation codon. Boxed regions represent regions conserved in luteoviruses. RPV, BYDV NY-RPV isolate; BWYV, BWYV FL1 (Veidt et al., 1988); PLRV (Mayo et al. 1989); BYDV MAV (P. P. Ueng & R. M. Lister, unpublished results); BYDV PAV (Miller et al., 1988a; P. P. Ueng & R. M. Lister, unpublished results).

and PLRV, and two highly conserved regions found in all luteoviruses sequenced. The high degree of nucleot ide conservat ion, and the presence of A - U - r i c h tracts and A U C A and A A G A sequence blocks similar to those of

BMV subgenomic promoter sequences, suggest that this region may be involved in the format ion of subgenomic

d s R N A s and/or the regulat ion of specific prote in messages. In fact, the t ranscr ip t ional start site of a 2.6 kb subgenomic R N A from P L R V has recently been mapped to 34 bp 5' to the coat prote in gene t rans la t ion

in i t ia t ion codon (Tacke et al., 1990), which corresponds to the larger of the two conserved regions. The size of the d s R N A s identified in oat plants infected with N Y - R P V

(Gildow et al., 1983) is consis tent with this region having

a role in subgenomic R N A formation.

Comparison with other luteoviruses

The N Y - R P V sequence is the second BYDV genome sequence to be reported. The genome of a BYDV PAV

serotype from Victoria, Austral ia , Vic-PAV, has been de te rmined to consist of 5677 nucleotides (Miller et al., 1988a) and encodes six posit ive-sense ORFs. The genome organizat ions of N Y - R P V and Vic-PAV show significant differences (Fig. 6). Al though the R N A s of both conta in approximate ly 5600 nucleotides, there is no

similari ty in genome organiza t ion and little sequence similari ty in regions of the genomes 5' to the coat prote in genes. The coat prote in gene itself is located T-proximal in N Y - R P V , whereas it is near the centre of the genome

of Vic-PAV.

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

The N Y - R P V isolate o f B Y D V 2353

29.3K

ORF2

BYDV (NY-RPV) 71.8K 17.2K

OkF3 ] [ ~ 1

I 22-2K 43.0K

BYDV (Vic-PAV) -~ ORW I 22,0K 49.8K

38-7K [oR~3 I o,F5 I

I ORF2 ] OR~ ~j ORF6

60,4K 17.1K 6.7K

Fig. 6. Genome organization of BYDV NY-RPV and Vic-PAV isolates. The location of major ORFs of NY-RPV and Vic-PAV (Miller et al., 1988a) and the reading frames in which they occur are represented by boxes. The sizes of the polypeptides potentially encoded by each ORF are indicated.

However, similarities between Vic-PAV and NY-RPV are found in the 22K ORFs, in the internal 17K ORFs and in the immediately 3'-proximal ORFs (i.e. NY-RPV ORF6 and Vic-PAV ORF5). The 22K ORFs are the coat protein genes and contain the presumptive 17K VPg gene as an internal ORF. This organization has been found in all luteoviruses examined to date (Vincent et al., 1990). Nucleotide and deduced amino acid sequence identities between NY-RPV and Vic-PAV ORFs determined using Microgenie sequence alignment software were 55-2~ and 48.8~ for the coat protein coding region, 51-7~ and 34-7~ for the VPg coding region, and 44.6~ and 32.2~ for the full ORFs located immediately 3' to the coat protein coding region. The ORF 3'-proximal to the NY-RPV coat protein ORF is proposed to be expressed by readthrough of the coat protein gene termination codon, as with a similar ORF of Vic-PAV and, indeed, with luteoviruses in general. A conserved 14 nucleotide sequence surrounding the luteovirus 22K coat protein gene termination codon has been identified previously, and a role for this sequence in the expression of the readthrough ORF postulated (Smith & Harris, 1990; Veidt et al., 1988). An examin- ation of this region from NY-RPV indicates that this conserved region is actually the 12 nucleotide sequence A A A U A G G U A G A C , with the coat protein gene termination codon indicated by the underlined nucleotides.

The genome organization of NY-RPV, although it differs from that of Vic-PAV, is very similar to that reported for both BWYV (Veidt et al., 1988) and PLRV (Keese et al., 1990; Mayo et al., 1989). Furthermore, the sizes of the putative protein products encoded by each ORF of the latter viruses are similar to those encoded by the ORFs of NY-RPV.

There is considerable interest in evolutionary relation- ships between viruses based on amino acid sequence motifs associated with the RNA-dependent RNA polymerases (viral replicases) (Habili & Symons, 1989; Kamer & Argos, 1984; Poch et al., 1989) and nucleic acid helicases (Gorbalenya et al., 1988; Gorbalenya & Koonin, 1989; Habili & Symons, 1989; Hodgman, 1988a, b). These relationships are based both on sequence similarity within motifs and on the distance separating motifs. Using such a classification scheme, positive-sense RNA plant viruses have been placed into three supergroups, with the luteoviruses in supergroup B (Habili & Symons, 1989).

The Vic-PAV replicase shows substantial similarity to the replicase of carnation mottle virus (Miller et al., 1988a), and the replicases of BWYV and PLRV show homology to that of SBMV (Mayo et al., i989; Veidt et al., 1988), all members of supergroup B. A BESTFIT comparative analysis (Devereux et al., 1984) of the amino acid sequence deduced from NY-RPV ORF3 with that of the putative replicase-encoding region of SBMV (Wu et al., 1987) has identified a region containing the GDD amino acid motif for RNA-depen- dent RNA polymerases (Kamer & Argos, 1984). This region is also similar to regions of both PLRV and BWYV (Fig. 7). The four amino acid sequence motifs of supergroup B positive-sense RNA viral replicases (Habili & Symons, 1989) (VPTDCSGFDWS-53- GVQKSGSYNTSSTNSRV-11-WAIAMGDDALEA- 22-FCSRIF) were also identified within this region (Fig. 7), although the last motif was not represented in SBMV. The NY-RPV replicase motifs showed greater similarity in both sequence and gap distances to comparable regions in BWYV and PLRV, than to those in Vic-PAV.

Seven amino acid sequence motifs have been identi- fied for proteins that have helicase functions (Habili & Symons, 1989; Hodgman, 1988 a, b), and have been used to evaluate the taxonomic relationships between luteo- viruses and other positive-sense RNA plant viruses (Habili & Symons, 1989). The NY-RPV sequence contains three of the seven helicase motifs. These regions resemble the 'A' and 'B' sites of the catalytic site of nucleoside triphosphate-utilizing enzymes (Gorbalenya et al., 1988; Gorbalenya & Koonin, 1989) and super- group B motif VI (Habili & Symons, 1989). The potential 'A' and 'B' sites are located in ORF2 (356-AGYFY- GKT-41-LVFPEE-240), and motif VI has been identi- fied in ORF3 (351-QSKLDEGRYRLIMSVS-267). The NY-RPV helicase motifs were compared with those from other luteoviruses (Habili & Symons, 1989). Motif VI has been found in all luteoviruses, but although all other luteoviruses contain motif IV, no such region was identified in NY-RPV. Regions comparable to the 'A' and 'B' motifs were not identified in PLRV or Vic-PAV,

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

2354 J. R. Vincent, R. M. Lister and B. A. Larkins

RPV 326 II I 1 : I l l l k F l l l l l I I I I

ARSPEQLVKENLCDPIRLFVKQEPHK QSKLDEGRYRLIMSVS ] LIDQLV l I 41 I llIillllll llll ]illlllllllllili|l:Iltl

PtJ{V 3118 DMSAEELVQEGLC[gPIRLFVKREPHK QSKLDEGRYRLIMSVS | LVDQLV I ~III lllllklllll ~III I:IIIIIIIIIIIIII|IILIIJ

BWYV 276 DMGPEELVRNGLCDPIR FVKGEPHK QAKLDEGRYRLIMSVS | LVDQLV I:II[IIIIIIIIIII| l:llll

RPV 326 ARSPEQLVKENLCDPIRLFVKQEPHK QSKLDEGRYRLIMSVS | LIDQLV I II :1 I f l l : I I I I I l l l II I I I I l l I I I | : : 1 1 1 1

SBMV 600 ALSPTEMVEMGLCDPVELFVKQEPHP SRKLKEFRYRLISSVS | IVDQLV I i I : :

Vic-PAV 121 SAGKRAMYHKAIASLKTVPYHQKDAN VQAFLKKEKHWMTKDI APRLIC

RPV 374

PLRV 356

BWYV 324

RPV 374

SBMV 648

Vic-PAV 169

ARVLFQRQNKSEIALWSAIPSKPGFGLSTEVQVSKFMDVLAGNVGASPEE I ] 1 1 1 1 I I I I 1 : 1 1 : : 1 1 1 1 1 1 1 1 1 1 I : I: ] II I l l ARVLFQNQNKREISLWRSVPSKPGFGLSTDTQTAEFLECLQKVSGA.PEE I I I I I I I I I I ] l i : l l l : : [ l l [ l l l { l l [ I I:1 I : [ : I ARVLFQNQNLREIALWRAIPSKPGFGLSTDEQVLDFVESLARQVGTTTTE I l l l l l 11 I I l l l I I I I I I l I I l l l II I: II } 1 : : I ARVLFQRQNKSEIALWSAIPSKPGFGLSTEVQVSKFMDVLAGNVGASPEE I [I II III I :IIIIII lit I I I

ERMLFGAQNELEIAEWQSIPSKPGMGLSVIHQADAIFRDLRVKHTVCPAA l : :

PRSKRYNIILGTRLKFNEKKIMHAIDSVFGSPTVLSGYDNFKQGRIIAKK

RPV 424

PLRV 406

BWYV 374

RPV 424

SBMV 698

Vic-PAV 219

VCDNWRDLL VPTDCSGFDWS VSDWMLADDMEVRNRLTIDCNELTRHLR I I : I I I I I I I I 1 [ I I I I I I I ] l l l l l l I 1 : : 1 1

LCANHKEYT RPTDCSGFDWS VRyWMLEDDMEVRNRLTFNNTQLTKRLR : I] II lliIllllll I III ]II 111111 [ III VVAI~YL TPTDCSGFDWS VADWMLHDDMIVRNRLTIDLNPATERLR I 11: I I I I I I l l l [ I I : l l l l 111 I l l l l l l [ I I :11 VCDNWRDLL VPTDCSGFDWS VSDWMLADDMEVRNRLTIDCNELTRHLR

:1 I I I I I I I II I ] : 1 : 1 I :1 I . . . . . . . . . . EADISGFDWS VQDWELWADVEMRIVLGSFPPb95ARAAR

i I II I I I WQKFACPVA IGVDASRFDQH VSEQALKWEHGIYNGIFGDSEMALALEH

RPV 472

PLRV 454

BWYV 422

RPV 472

S B M V 736

Vic-PAV 267

AVWLQGISNSVLCLSDGTMLAQRVP GVQKSGSYNTSSTNSRV RVMAAY I li I llilllllll:lll It ilii!IIIII11:111:]II1111 AAWLKCIGNSVLCLSDGTLLAQTVP GVQKSGSYNTSSSNSRI RVMAAY : I 1 : 1 1 I [ l l l l l l [ l l l l l I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : SCWLRCISNSVLCLSDGTLLAQIHP GVQKSGSYNTSSSNSRI RVIV'jLAF : il lilllillllil:Ill I iilllllllIll:lil:Illlll: AVWLQGISNSVLCLSDGTMLAQRV~ GVQKSGSYNTSSTNSRV ~ RkrMAAy

• I I I I II I :~ I :1 1: l l l l l I I I i t 1 1 : 1 1 : I N. RFSCFMNSVLQLSNGQLLQQELP I GIMKSGSYCTSSTNSRI ] RCLMAE

I : : : I : " ' QITNNIKMFVEDKMLRFKVRGHRMS GDINTSMGNKLIMCGMM HAYLKK

RPV 520 HCGAS I WAIAMGDDALEA

tlil i rr:Irlllll]i PLRV 502 HCGAD WAI~AMGDDALEA i dP iiIlillftil:

BWYV 470 HTGAI WAM2H"IGDDALES ! Ii ll:rillirlr:

Pd:'v 520 HCGAS WAIAMGDDALEA i : } l l i i 1 1 : : i

SB~P,/ 783 LIGSP WCIAMGDDSVEG I IlJ

V~-PAV 315

.PDTDLSKYKDLGFKVEVSGELE FCS~FCSRIF F :PI Ir IrlEllr lll llllr]r

.PNSDLEEYKTLGFKVEVGRELE ~FCSHIFFCSHIF : l l I] l i l i l l l J l l / ' / l / l

.NPADLAAYKKLGFKVEVSGQLE] FCSHIF :if: if FlllIrtll iIIlit II

.PDTDLSKYKDLGFKVEVSGELEiFCSRIF Jl II

FVEGAREKYAGLGHLCKDYKPCA TTPTC.~ I: J I

LGVEA ELCNqqGDDCTII TDRPd,/EKLFDGMYDHFI~2YGFIk'~M VTEKPV

Fig. 7. A comparative analysis of luteovirus putative replicase amino acid sequences. An amino acid sequence from the putative BYDV NY-RPV replicase was identified by BESTFIT analysis (Devereux et al., 1984) with the SBMV putative replicase amino acid sequence. The NY-RPV region identified was then used in BESTFIT analyses with the amino acid sequence of the putative replicases from BWYV and PLRV. The NY-RPV sequence is represented twice to allow direct comparison with other viruses in the analysis. The comparable region from BYDV Vic-PAV (Miller et al., 1988a) has been included for comparison. Vertical dashes represent identical amino acids and vertical dots indicate chemically related amino acids• Asterisks represent the consensus sequence of RNA-dependent RNA polymer- ases (Kamer & Argos, 1984). Boxed regions represent the helicase and replicase motifs identified in NY-RPV ORF3. The Vic-PAV helicase motif VI is located between amino acids 173 and 188. The numbering represents the amino acid position from the beginning of the ORF in which the GDD sequence is located. RPV, NY-RPV isolate; PLRV (Mayo et al., 1989); BWYV (Veidt et al., 1988); SBMV (Wu et al., 1987); Vic-PAV (Miller et al., 1988a).

but similar sequence and gap lengths were found in BWYV (356-SPYFNGKT-38-VFAQED-194) (Veidt et aL, 1988).

It has been proposed that the luteovirus viral replicase is expressed as a result of a translational frameshift allowing two overlapping ORFs to produce one protein (Keese et al., 1990; Mayo et al., 1989; Miller et at., 1988 a; Veidt et al., 1988). As both ORF2 and ORF3 within the NY-RPV genome are associated with helicase function, it seems probable that both the helicase and the viral replicase are expressed by a translational frameshift mechanism. For the luteovirus Vic-PAV, a region for site-specific ribosomal slippage has been identified within the 16 nucleotide overlap between these ORFs. For other luteoviruses, including NY-RPV, the region of overlap is considerably greater, including up to several hundred nucleotides, but contains no obvious site for ribosome slippage.

Taxonomic considerations

The genome of the NY-RPV isolate of BYDV shares significant properties with other luteoviruses. These include (i) a replicase-encoding region presumably expressed by translational frameshift, (ii) an ORF, presumably encoding the VPg, completely contained within the 22K coat protein gene and (iii) a highly conserved nucleotide sequence surrounding the 22K coat protein gene termination codon, which potentially allows the formation of a larger fusion protein. There is also considerable similarity between all luteovirus 22K coat proteins, although the NY-RPV coat protein is more like those from BWYV and PLRV than those from the MAV-PS1 and P-PAV isolates of BYDV (Vincent et al., 1990). The greater similarity between the NY-RPV coat protein gene sequence and those of BWYV and PLRV, the similarity between the genome organization of NY-RPV, BWYV and PLRV, and the relative dis- similarity between the genome organization of NY-RPV and Vic-PAV are all consistent with the hypothesis that NY-RPV is a strain of BWYV (Casper, 1988). Hence, NY-RPV should be classified as distinct from BYDV strains similar to Vic-PAV. However, differences in aphid vectors (i.e.R. padi and Myzus persicae) and host range (i.e. monocotyledonous mad predominantly dicoty- ledonous hosts), suggest that NY-RPV should also be classified as distinct from BWYV and PLRV.

We thank C. Logan for her photographic services and M. Karanjkar for his assistance in the Western blot analysis. This work was funded by Grant 88-37263-3855 from the Competitive Research Grants Office of the U.S. Department of Agriculture and Grant 1484670 from The Quaker Oats Company. The Genetics Computer Group Sequence Analysis Software Package for the VAX was supported by NIH grant no. AI27713. This is Journal Series Article 12706 of the Purdue University Agricultural Experiment Station.

Downloaded from www.microbiologyresearch.org by

IP: 93.91.26.109

On: Fri, 25 Dec 2015 18:43:06

The N Y - R P V isolate o f B Y D V 2355

References

AAPOLA, A. I. & ROCHOW, W. F. (1971). Relationships among three isolates of barley yellow dwarf virus. Virology 46, 127-141.

BAHNER, I., LAMB, J., MAYO, M. A. & HAY, R. T. (1990). Expression of the genome of potato leafroll virus: readthrough of the coat protein termination codon in vivo. Journal of General Virology 71,2251 2256.

BARBARA, D. J., KAWATA, E. E., UENG, P. P., LISTER, R. M. & LARKINS, B. A. (1987). Production of cDNA clones from the MAV isolate of barley yellow dwarf virus. Journal of General Virology 68, 2419-2427.

BIRNBOIM, H. C. & DOLY, J. (1979). A rapid alkaline extraction method for screening recombinant plasmid DNA. Nucleic Acids Research 7, 1513-1523.

CASPER~ R. (1988). Luteoviruses. In The Plant Viruses; Polyhedral Virions with Monopartite RNA Genomes, vol. 3, pp. 235-258. Edited by R. Koenig. New York: Plenum Press.

DEVEREUX, J., HAEBERLI, P. & SMITHIES, O. (1984). A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Research 12, 387-395.

GILDOW, F. E., BALLINGER, M. E. & ROCHOW, W. F. (1983). Identification of double-stranded RNAs associated with barley yellow dwarf virus infection of oats. Phytopathology 73, 1570~1572.

GILL, C. C. & CHUNG, J. (1979). Cytopathological evidence for the division of barley yellow dwarf virus isolates into two subgroups. Virology 95, 59-69.

GORBALENYA, A. E. & KOONIN, E. V. (1989). Viral proteins containing the purine NTP-binding sequence pattern. Nucleic Acids Research 17, 8413-8439.

GORBALENYA, A. E., KOONIN, E. V., DONCHENKO, m. P. & BLINOV, V. M. (1988). A novel superfamily of nucleoside triphosphate- binding motif containing proteins which are probably involved in duplex unwinding in DNA and RNA replication and recombi- nation. FEBS Letters 235, 16-24.

HABILI, N. & SYMONS, R. (1989). Evolutionary relationship between luteoviruses and other RNA plant viruses based on sequence motifs in their putative RNA polymerases and nucleic acid helicases. Nucleic Acids Research 17, 9543-9555.

HODGMAN, T. C. (1988a). A new superfamily of replicative proteins. Nature, London 333, 22-23.

HODGMAN, T. C. (1988b). Erratum. A new superfamily of replicative proteins. Nature, London 333, 578.

KAMER, G. & ARGOS, P. (1984). Primary structural comparison of RNA-dependent polymerases from plant, animal and bacterial viruses. Nucleic Acids Research 12, 7269-7282.

KAWCHUK, L. M., MARTIN, R. R., ROCHON, D. M. & MCPHERSON, J. (1989). Identification and characterization of the putative potato leafroll virus coat protein gene. Journal of General Virology 70, 783-788.

KEESE, P., MARTIN, R. R., KAWCHUK, L. M., WATERHOUSE, P. M. & GERLACH, W. L. (1990). Nucleotide sequences of an Australian and a Canadian isolate of potato leafroll luteovirus and their relationships with two European isolates. Journal of General Virology 71,719 724.

LAEMMLI, U. K. (1970). Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature, London 227, 680-685.

MANIATIS, T., FRITSCH, E. F. & SAMaROOK, J. (1982). Molecular Cloning: A Laboratory Manual. New York: Cold Spring Harbor Laboratory.

MARSH, L. E., DREHER, T. W. & HALL, T. C. (1988). Mutational analysis of the core and modulator sequences of the BMV RNA3 subgenomic promoter. Nucleic Acids Research 16, 981-995.

MATTHEWS, R. E. F. (1982). Classification and nomenclature of viruses. Intervirology 17, 1 199.

MAYO, M. A., BARKER, H., ROBINSON, D. J., TAMADA, T. & HARRISON, B. D. (1982). Evidence that potato leafroll virus RNA is positive- stranded, is linked to a small protein and does not contain polyadenylate. Journal of General Virology 59, 163-167.

MAYO, M. A., ROBINSON, D. J., JOLLY, C. A. & HYMAN, L. (1989). Nucleotide sequence of potato leafroll luteovirus RNA. Journal of General Virology 70, 1037-1051.

MILLER, W. A., WATERHOUSE, P. M. & GERLACH, W. L. (1988a). Sequence and organization of barley yellow dwarf virus genomic RNA. Nucleic Acids Research 16, 6096 6111.

MILLER, W. A., WATERHOUSE, P. M., KORTT, A. A. & GERLACH, W. L. (1988 b). Sequence and identification of the barley yellow dwarf virus coat protein gene. Virology 165, 306-309.

MURPHY, J. F., D'ARCY, C. J. & CLARK, J. M., JR (1989). Barley yellow dwarf virus RNA has a Y-terminal genome-linked protein. Journalof General Virology 70, 2253 2256.

PocH, O., SAUVAGET, I., DELARUE, M. & TORDO, N. (1989). Identification of four conserved motifs among the RNA-dependent polymerase encoding elements. EMBO Journal 8, 3867-3874.

PRILL, B., MAISS, E., TIMPE, U. & CASPER, R. (1989). Nucleotide sequence of the potato leafroll virus coat protein Aerie. Nucleic Acids Research 17, 1768.

PUTNEY, S. D., BENKOVIC, S. J. & SCHIMMEL, P. (1981). A DNA fragment with an ct-phosphothiolate nucleotide at one end is asymmetrically blocked from digestion by exonuclease III and can be replicated in vivo. Proceedings of the National Academy of Sciences, U.S.A. 78, 7350-7354.

ROCHOW, W. F. (1970). Barley yellow dwarf virus. CMI/AAB Descriptions of Plant Viruses, no. 32.

ROCHOW, W. F. (1984). In Barley Yellow Dwarf: Proceedings of the Workshop hem at CIMMYT, Mexico, pp. 204 205. Edited by P. A. Burnett. CIMMYT Press.

ROCHOW, W. F. & CARMICHAEL, L. E. (1979). Specificity among barley yellow dwarf viruses in enzyme immunosorbent assays. Virology 95, 415-420.

ROCHOW, W. F. & DUFFUS, J. E. (1981). Luteoviruses and yellows diseases. In Handbook of Plant Virus Infections and Comparative Diagnosis, pp. 147-170. Edited by E. Kurstak. Amsterdam: Elsevier/ North-Holland.

SANGER, F., NICKLEN, S. • COULSON, A. R. (1977). DNA sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences, U.S.A. 74, 5463-5467.

SMITH, O. P. & HARRIS, K. F. (1990). Potato leaf roll virus 3' genome organization: sequence of the coat protein gene and identification of a viral subgenomic RNA. Phytopathology 80, 609 614.

TACKE, E., PROFER, D., SALAMINI, F. & ROHDE, W. (1990). Characterization of a potato leaf roll luteovirus subgenomic RNA: differential expression by internal translation initiation and UAG suppression. Journal of General Virology 71, 2265 2272.

VEIDT, I., LOT, H., LEISER, M., SCHEIDECKER, D., GUILLEY, H., RICHARDS, K. & JONARD, G. (1988). Nucleotide sequence of beet western yellows virus RNA. Nucleic Acids Research 16, 9917 9932.

VIEIRA, J. & MESSING, J. (1982). The pUC plasmids, an ml3mp7- derived system for insertion mutagenesis and sequencing with universal primers. Gene 19, 259-268.

VINCENT, J. R., UENG, P. P., LISTER, R. M. & LARKINS, B. A. (1990). Nucleotide sequences of coat protein genes for three isolates of barley yellow dwarf virus and their relationships to other luteovirus coat protein sequences. Journal of General Virology 71, 2791- 2799.

WATERHOUSE, P. M., GILDOW, F. E. & JOHNSTONE, G. R. (1988). Luteovirus group. AAB Descriptions of Plant Viruses, no. 339.

Wu, S., RINEHART, C. A. & KAESBERG, P. (1987). Sequence and organization of southern bean mosaic virus genomic RNA. Virology 161, 73-80.

YOUNG, R. A. & DAVIS, R. W. (1983). Efficient isolation of genes using antibody probes. Proceedings of the National Academy of Sciences, U.S.A. 80, 119~1198.

(Received 27 February 1991 ; Accepted 10 June 1991)


Recommended