+ All Categories
Home > Documents > Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular...

Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular...

Date post: 26-Feb-2021
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
13
Copyright 0 1987 by the Genetics Society of America Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI (atpA) and the Proteolipid Subunit of CFo (atpH) Steven R. Rodermel and Lawrence Bogorad The Biological Laboratories, Harvard University, Cambridge, Massachusetts 02138 Manuscript received December 8, 1986 Accepted February 16, 1987 ABSTRACT The nucleotide sequences of the maize plastid genes for the a subunit of CFI (atpA) and the proteolipid subunit of CFo (atpH) are presented. The evolution of these genes among higher plants is characterized by a transition mutation bias of about 2:l and by rates of synonymous and nonsynony- mous substitution which are much lower than similar rates for genes from other sources. This is consistent with the notion that the plastid genome is evolving conservatively in primary sequence. Yet, the mode and tempo of sequence evolution of these and other plastidencoded coupling factor genes are not the same. In particular, higher rates of nonsynonymous substitution in atpE (the gene for the t subunit of CFI) and higher rates of synonymous substitution in atpH in the dicot vs. monocot lineages of higher plants indicate that these sequences are likely subject to different evolutionary constraints in these two lineages. The 5‘- and 3‘- transcribed flanking regions of atpA and atpH from maize, wheat and tobacco are conserved in size, but contain few putative regulatory elements which are conserved either in their spatial arrangement or sequence complexity. However, these regions likely contain variable numbers of “species-specific” regulatory elements. The present studies thus suggest that the plastid genome is not a passive participant in an evolutionary process governed by a more rapidly changing, readily adaptive, nuclear compartment, but that novel strategies for the coordinate expression of genes in the plastid genome may arise through rapid evolution of the flanking sequences of these genes. HE genes for the constituent proteins of the T plastid coupling factor for photophosphoryla- tion (CFt-CFo complex) are dispersed between the genomes of the plastid and nucleus. In all species examined to date, atpA, atpB, atpE, atpF, atpH and atpZ (for the a, /3 and E subunits of CFI and subunits I, I11 and IV of CFo, respectively) are encoded in the plastid DNA, while atPC, atpD and atpG (for the y and 6 subunits of CF, and subunit I1 of CFo, respec- tively) are encoded in the nuclear genome and trans- ported into the plastid post-translationally (e.g., MEN- DIOLA-MORGENTHALER, MORGENTHALER and PRICE 1976; BOUTHYETTE and JAGENDORF 1978; NELSON, NELSON and SCHATZ 1980; NECHUSHTAI et al. 198 1 ; WE~THOFF et al. 198 1, 1985; WATANABE and PRICE 1982). The plastid-encoded genes appear to be highly conserved in their sequence arrangement among most higher plant plastid DNAs. at@ and atpE are clus- tered in one transcription unit, either partially over- lapping or immediately adjacent to one another, while atpZ, atpH, atpF and atpA are clustered (in this polar- ity) in another transcription unit some distance away from the atpBE locus on the plastid chromosome (e.g., WESTHOFF et al. 1981, 1985; KREBBERS et al. 1982; HOWE et al. 1982, 1983; DEHEJJ et al. 1983; FLUHR, The sequence data presented in this article have been submitted to the EMBL/GenBank Data Libraries under the accession number Y00310. Genetics 116: 127-139 (May, 1987) FROMM and EDELMAN 1983; DENO,SHINOZAKI and SUGIURA1983, 1984; BOVENBERG et al. 1984; HUTTLY and GRAY 1984; KO, STRAUS and WILLIAMS 1984; ZURAWSKI and CLEGG 1984; OLIVER 1984; PHILLIPS 1985; BIRD et al. 1985; RODERMEL and Bo- GORAD 1985, 1986). Among higher plants, the nu- cleotide sequences of all the plastid-encoded coupling factor genes have been reported in tobacco (SHINO- ZAKI et al. 1983a; DENO, SHINOZAKI and SUGIURA 1983,1984) and wheat (HOWE et al. 1982,1985; BIRD et al. 1985; J. C. GRAY, unpublished data). In addition, among angiosperms the sequence of atpBE has been reported in maize (KREBBERS et al. 1982, spinach (ZURAWSKI, BOTTOMLEY and WHITFELD 1982), barley (ZURAWSKI and CLEGG 1984), and pea (WHITFELD, ZURAWSKI and BOTTOMLEY 1983), and the sequences of atpH and atpZ have been reported for spinach (ALT et al. 1983) and pea (COZENS et aE. 1986), respectively. The nucleotide sequences of atpA and atpH from maize are presented in the current studies. Although the striking conservation of sequence con- tent and arrangement of the plastid-encoded coupling factor genes is in accord with the notion that the molecular evolution of angiosperm plastid DNAs is distinctly conservative in nature (reviewed in PALMER 1985a,b), little is known about the mode and tempo of sequence change in various regions of the plastid
Transcript
Page 1: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Copyright 0 1987 by the Genetics Society of America

Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI (atpA) and the Proteolipid Subunit of CFo (atpH)

Steven R. Rodermel and Lawrence Bogorad

The Biological Laboratories, Harvard University, Cambridge, Massachusetts 021 38 Manuscript received December 8, 1986

Accepted February 16, 1987

ABSTRACT The nucleotide sequences of the maize plastid genes for the a subunit of CFI (atpA) and the

proteolipid subunit of CFo (atpH) are presented. The evolution of these genes among higher plants is characterized by a transition mutation bias of about 2:l and by rates of synonymous and nonsynony- mous substitution which are much lower than similar rates for genes from other sources. This is consistent with the notion that the plastid genome is evolving conservatively in primary sequence. Yet, the mode and tempo of sequence evolution of these and other plastidencoded coupling factor genes are not the same. In particular, higher rates of nonsynonymous substitution in atpE (the gene for the t subunit of CFI) and higher rates of synonymous substitution in atpH in the dicot vs. monocot lineages of higher plants indicate that these sequences are likely subject to different evolutionary constraints in these two lineages. The 5 ‘ - and 3‘- transcribed flanking regions of atpA and atpH from maize, wheat and tobacco are conserved in size, but contain few putative regulatory elements which are conserved either in their spatial arrangement or sequence complexity. However, these regions likely contain variable numbers of “species-specific” regulatory elements. The present studies thus suggest that the plastid genome is not a passive participant in an evolutionary process governed by a more rapidly changing, readily adaptive, nuclear compartment, but that novel strategies for the coordinate expression of genes in the plastid genome may arise through rapid evolution of the flanking sequences of these genes.

HE genes for the constituent proteins of the T plastid coupling factor for photophosphoryla- tion (CFt-CFo complex) are dispersed between the genomes of the plastid and nucleus. In all species examined to date, atpA, atpB, atpE, atpF, atpH and atpZ (for the a, /3 and E subunits of CFI and subunits I, I11 and IV of CFo, respectively) are encoded in the plastid DNA, while atPC, atpD and atpG (for the y and 6 subunits of CF, and subunit I1 of CFo, respec- tively) are encoded in the nuclear genome and trans- ported into the plastid post-translationally (e.g., MEN- DIOLA-MORGENTHALER, MORGENTHALER and PRICE 1976; BOUTHYETTE and JAGENDORF 1978; NELSON, NELSON and SCHATZ 1980; NECHUSHTAI et al. 198 1 ; WE~THOFF et al. 198 1 , 1985; WATANABE and PRICE 1982). The plastid-encoded genes appear to be highly conserved in their sequence arrangement among most higher plant plastid DNAs. at@ and atpE are clus- tered in one transcription unit, either partially over- lapping or immediately adjacent to one another, while atpZ, atpH, atpF and atpA are clustered (in this polar- ity) in another transcription unit some distance away from the atpBE locus on the plastid chromosome (e.g., WESTHOFF et al. 1981, 1985; KREBBERS et al. 1982; HOWE et al. 1982, 1983; DEHEJJ et al. 1983; FLUHR,

The sequence data presented in this article have been submitted to the EMBL/GenBank Data Libraries under the accession number Y00310.

Genetics 116: 127-139 (May, 1987)

FROMM and EDELMAN 1983; DENO, SHINOZAKI and SUGIURA 1983, 1984; BOVENBERG et al. 1984; HUTTLY and GRAY 1984; KO, STRAUS and WILLIAMS 1984; ZURAWSKI and CLEGG 1984; OLIVER 1984; PHILLIPS 1985; BIRD et a l . 1985; RODERMEL and Bo- GORAD 1985, 1986). Among higher plants, the nu- cleotide sequences of all the plastid-encoded coupling factor genes have been reported in tobacco (SHINO- ZAKI et al. 1983a; DENO, SHINOZAKI and SUGIURA 1983,1984) and wheat (HOWE et al. 1982,1985; BIRD et al. 1985; J. C. GRAY, unpublished data). In addition, among angiosperms the sequence of atpBE has been reported in maize (KREBBERS et al. 1982, spinach (ZURAWSKI, BOTTOMLEY and WHITFELD 1982), barley (ZURAWSKI and CLEGG 1984), and pea (WHITFELD, ZURAWSKI and BOTTOMLEY 1983), and the sequences of atpH and atpZ have been reported for spinach (ALT et a l . 1983) and pea (COZENS et aE. 1986), respectively. The nucleotide sequences of atpA and atpH from maize are presented in the current studies.

Although the striking conservation of sequence con- tent and arrangement of the plastid-encoded coupling factor genes is in accord with the notion that the molecular evolution of angiosperm plastid DNAs is distinctly conservative in nature (reviewed in PALMER 1985a,b), little is known about the mode and tempo of sequence change in various regions of the plastid

Page 2: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

128 S. R. Rodermel and L. Bogorad

chromosome. Comparative DNA hybridization and restriction site polymorphism analyses have shown that sequences within the large inverted repeat struc- ture (characteristic of most higher plant plastid DNAs) are evolving more slowly than sequences within the single copy regions of the chromosome (PALMER 1985a). However, other than these global studies, detailed investigations of nucleotide divergence in the plastid DNA are extremely limited, and have been restricted in the case of protein-coding genes to com- parative studies of rbcL (the gene for the large subunit of ribulose bisphosphate carboxylase), at#, and atpE (SHINOZAKI et al. 1983b; ZURAWSKI, CLEGG and BROWN 1984; ZURAWSKI and CLEGC 1984). These analyses have demonstrated that the patterns and levels of nucleotide substitution are the same in these three genes in comparisons between maize and barley, indicating that each codon position is evolving at about the same rate in these genes in these two mon- ocots (ZURAWSKI and CLEGG 1984). One purpose of this study was to ascertain whether other plastid cou- pling factor genes are evolving like atpB and atpE in the monocots, and to investigate whether the mode and tempo of sequence evolution of these genes is similar in the dicot, as well as monocot, angiosperm lineages. The data show that the coupling factor genes differ in their patterns of nucleotide substitution both within and between the two lineages, and that these differences are manifested in differing constraints on nonsynonymous or synonymous codon selection.

Another purpose of this investigation was to exam- ine the patterns of nucleotide substitution in the 5 ‘ - and 3 ’-transcribed regions flanking the protein coding sequences of atPA and atpH. Previous comparative analyses of the flanking regions of plastid-coding genes, though limited, have shown that the 5’-flanking region of rbcL is evolving at about half the synony- mous substitution rate of the gene itself in compari- sons between maize and barley, indicating this region is selectively constrained in its sequence evolution with respect to the third codon position (ZURAWSKI, CLECC and BROWN 1984). The constraints in this region were attributed to conserved sequence requirements for promoter-binding. Northern hybridization experi- ments have established that atpA and atpH from maize are complementary to several of the same polycis- tronic transcripts, as well as to numerous other smaller mRNAs, and that the ends of some of these transcripts likely reside in the flanking regions of these genes (RODERMEL and BOGORAD 1985). It was further ob- served in these studies that all of the transcripts com- plementary to this region of the chromosome increase in abundance following the illumination of dark- grown seedlings, but that the increases are not uni- form from transcript to transcript. The data thus suggested that unless an extremely complex set of

RNAses change in activity during greening, the flank- ing regions of atpA and atpH likely contain transcrip- tion initiation and/or termination sequences involved in the photoregulation of transcript levels. Since com- plex transcript patterns have also been observed for the region of the plastid chromosome bearing atpA and atpH in wheat (BIRD et al. 1985) and tobacco (FLUHR, FROMM and EDELMAN 1983; DENO, SHINO- ZAKI and SUGIURA 1984), it was hoped that compara- tive sequence analyses of these regions might reveal conserved sequence elements that may be involved in transcriptional and/or processing activities. The data in this report indicate, however, that these, and other, putative regulatory elements in these regions are not particularly conserved among these three species. This is reflected in rates of nucleotide divergence in these regions which equal, or vastly exceed, synony- mous substitution rates (depending upon the se- quences compared). However, the data are not incon- sistent with the notion that these regions contain variable numbers of “species-specific” regulatory ele- ments.

MATERIALS AND METHODS

The plasmids pZmc527, pZmc415 and pZrl2 contain Zea mays plastid DNA restriction fragments BamHI 3, BamHI 24, and EcaRI b, respectively, cloned in pBR322 (RODERMEL and BOGORAD 1985). The nucleotide sequences of portions of these fragments containing the genes for the a subunit of CF, (atpA) and subunit 111 of CF, (atpH) (RODERMEL and BOGORAD, 1985, 1986), were determined by the MAXAM- GILBERT chemical cleavage method (1 980).

High resolution S-1 nuclease mapping experiments were performed by procedures modified from BERK and SHARP (1977). Total cell RNA ( 1 50 fig) from 16-hr-greened maize seedlings (RODERMEL and BOCORAD 1985) was mixed with approximately 100 ng of DNA (5’4abeled at a single end) in a total volume of 50 pl hybridization buffer [SOW deion- ized formamide/l mM Na2EDTA (pH 8.0)/0.4 M NaCl/4O mM PIPES (pH 6.4)) The sample was incubated for 10 min at 80” to denature the DNA, and then transferred to 41” for 16 hr. Following the addition of 400 pl ice cold S-1 nuclease buffer [0.28 M NaC1/0.05 M sodium acetate (pH 4.6)/4.5 mM ZnS04/20 pg carrier tRNA], the sample was incubated with S-1 nuclease (75 units, New England Nu- clear) for 30 min at 37”. After phenol/chloroform extrac- tion, the ethanol precipitate was vacuum desiccated, resus- pended in sequencing dye, and electrophoresed through 8% polyacrylamide/7 M urea sequencing gels next to DNA sequencing reactions of control fragments not treated with S- 1 nuclease.

RESULTS

The nucleotide sequences of atpA and atpH The maize plastid genes for the (Y subunit of CF,

(atpA) and subunit I11 (the proteolipid subunit) of CFo (atpH) were sequenced by the strategy outlined in Figure 1. It had been determined previously that atpA and atpH are closely linked on the maize plastid chro- mosome (RODERMEL and BOCORAD 1985, 1986). The

Page 3: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Molecular Evolution of atpA,H

atpH atpA pslA2 pslA1 k-- - 4 , A q -

129

A

0 '

8

Alu I Bam H I CfoI Dde I Eco R I Fnu 4HI Hind III Hinf I Hpa II Nde I Rsa I Sal I Sau 961 Taq I

100 300 500 200 400 600 800 io00 1200 i400 1600

FIGURE 1.-Map location of atpA and atpH on the maize plastid chromosome, and the strategy used to sequence these genes. A, An 18- kbp portion of the maize plastid chromosome is illustrated [from BEDBROOK and BOCORAD (1976) and LARRINUA et al. (1983)], showing the locations and relative transcriptional polarities of the photoregulated genes for the a! subunit of CF, (&PA), subunit 111 of CFo (atpH), and the duplicated genes for the P700 chlorophyll a proteins of photosystem I (PslAI and psJA2) (RODERMEL and BOGORAD 1985, 1986; FISH, KUCK and BOGORAD 1985). atpH is contained within EcoRI subfragment "s" of BamHI fragment 3 (8.7 kbp), whereas atpA is partially contained within EcoRI fragment "4" of BamHl 3 and extends into BamHl fragment 24 (1.3 kbp), as well as into EcoRI fragment "b." EcoRI fragments "b," "4" and "s," and BamHI fragment 24 served as the sources of plastid DNA in determining the sequences of these genes. B, The nucleotide sequences of atpA and atpH, including portions of their flanking regions, were determined by the MAXAM-GILBERT procedure (1 980). Sequenced fragments include those indicated by arrows.

nucleotide and derived amino acid sequences of these genes are shown in Figure 2: atpA contains 508 codons and encodes a protein with a M , of 55707, while atpH contains 81 codons and encodes a protein with a M, of 7975. The nucleotide sequences of portions of the flanking regions of these genes are also presented in Figure 2. These include approximately 100 bp of both the 5'- and 3'-flanking regions of atpH, the intergenic region between the 5'-end of atpA and the 3'-end of atpF, the gene for subunit I of CFo (RODERMEL and BOCORAD 1986); and the spacer region between the 3'-end of atpA and the 3'-end of the convergently transcribed tRNAArg (UCU) gene (S. RODERMEL, P. ORLIN and L. BOCORAD, unpublished data).

Comparative molecular evolutionary analyses of the coding regions of atpA and atpH

Multiple substitutions at the same nucleotide site and nonrandom mutation bias are two of the most important factors which can distort estimates of the number of nucleotide substitutions which have OC-

curred in two DNA sequences since their divergence from a common ancestor (KIMURA 1981, 1983). TO minimize the former, it has been suggested that two sequences should be at least 85% homologous (BROWN

et al. 1982). For this reason, comparative analyses of atpA and atpH were restricted to sequences of these genes from angiosperms. These include the plastid atpA genes from maize (this report), wheat (HOWE et al. 1985; BIRD et al. 1985; HOWE 1985) and tobacco (DENO, SHINOZAKI and SUGIURA 1983, 1984; DENO and SUCIURA 1984), and the plastid atpH genes from maize (this report), wheat (HOWE et al. 1982; BIRD et al. 1985), tobacco (DENO, SHINOZAKI and SUCIURA 1984), and spinach (ALT et al. 1983). The nucleotide and derived amino acid sequences of these genes are shown in Figure 2; only those sites which differ from maize are indicated. The comparative analyses of these genes are presented in Table 1, and show that among the various sequence comparisons the nucleo- tide conservation of atpH ranges from 91 to 98%, while that of atpA ranges from 86 to 96%; the levels of nucleotide and amino acid conservation are similar in each comparison.

The possibility of multiple and revertant changes at the same site cannot be discounted even in closely related sequences, and thus numerous equations have been derived to correct for their occurrence (reviewed in LI, LUO and Wu 1985). Since some of these equa-

Page 4: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

TOSACCO : WHEAT: KAIZE:

C C A G C G TGGATGATT - - C - - - GG TAA T ~ TTC-TCC-- A __._...___

TTGTACTTATTAGTTACTTCTAC---------CCGATAGAG~A--GAA--GTT

T: - _ C C CA G GT -. -T w: - G A ACT C H: G G A A G T A A T A A ~ ~ C ~ G T A T C ~ A A C C A ~ ~ ~ G A C A C G ~ C T . - - C A T C

c - S: T: W!

G C A G T C

T G

T

n: ATG AAT CCA CTA ATT GCT GCT GCT TCC GTT ATT GCT GCT GGA TTG GCC GTA GGC N : N N P L I A A A S V I A A G L A V G W: T: S s:

S : T G A T G T: A G G T W: A T N : CTT GCT TCT ATT GGG CCC GGA GTT GCT CAA GGT ACT CCT GCA GGA CAA GCT GTA N : L A S I G P G V G Q G T A A G Q A V W’ T: S:

C A C A T G C A C G C

C I%: GAA GGT ATT GCG AGA CAG CCA GAA GCA GAA GGT AAA ATA ACA GGT ACT TTA TTG N : E G I A R Q P E A E G K I R G T L L U: T: s:

T A T A T

W: A H: CTT AGT CTA GCT TIT ATG GAA GCT TTA ACA ATT TAT GGA CTG GTT GTG GCA CTA N : L S L A F N E A L T I Y G L V V A L W: T: S:

5 : T: W: N:

T GT-..--r...-..--------..- A T T GCTT T AAAA-----

CGG CTT TTA TIT GCG AAC CCT TIT GTT TAA TCCTAAAAAAGAAA-ATGACTCCTTTAG T A T-TCTTTCGA C

n : n L L F A N P F V * W: T: S:

T: I T TCA T T A C w: A A A C N: CCT AAA CCT ATT GAT CGG AGA GGC GAA ATT GTC GCT TGA CA4 TCT CGC TTA ATT N : A K P I D G R G E I V A S E S R L I W: Y 1 T: S F

T: G C c T G C C G T w: A A C C T n: CAA TCT CCT GCT CCG GGT ATA ATT TCT AGG CGT TCC GTA TAT GAA ccc CTT CAA n : ~ s P A P G I I s R R s V Y E P L Q w: S 1: A

T: C T T A T A W: T n: ACA GGG CTT ATT GCT ATC GAT TCG ATG ATC ccc ATA GGG CGC GGT CAC CGA GAG ~ : T G L I A I D S N I P I G R G Q R E w: T:

T: C T G C W: n: TTA ATT ATT GGG GAC AGA CAG ACT GCC AAA ACA CCA GTA GCC ACA GAT ACA ATT n : ~ I I G D R Q T G K T A V A T D T I W ’ T:

C A T G A T: W: G G M: CTC AAT CAA AAA GGT CAA GAT GTA ATA TGT GTT TAT GTA GCT ATC GGT CAA AGA N : L N 9 K G Q D V I C Y Y V A I G Q R W: G T: Q N K

1: T T C G C A G AG A G w: A T O n: GCA TCC TCC GTG GCT CAA GTA GTA ACT ACT TTC CAC GAA GAG GGG GCC ATG GAA

T: L Q R

N : A S S V A Q V V T T F H E E G A M E U:

T: G C C A C T W: T

C T C

n: TAC ACT ATT GTA GTA GCT GAA A% GCG GAT TCA ccc GCT ACA TTA CAR TAT CTC M : Y T I V V A E M h D S P A T L Q Y L W: T: 1‘‘

T: A T A T I A C U: n: GCT CCT TAT ACC GGA GCA GCC CTG GCT GAG TAT TTT ATG TAC CGC GAA CGG GAT n : ~ P Y T G A A L A E Y F ~ Y R E R H W: T:

-69

-1

54

108

162

216

273

A

390

444

498

552

606

660

714

768

C

I : - - A A A A T G C C A C G T G T CTT TC ATC G - C W: A T CCC C T - n: A T T A G A T A C ~ ~ A G T A C A T T G G C A ~ ~ T T A ~ A A ~ A A G T - A T A T C A A T A ~ A

T: CTCC CC A T _ _ __._._ T T G - ..______... ......._ A A CC w: m A A C

8: C----TpTACTGCTATTTA ........ T T C A A m A T T A G ~ ~ G T T - C T A ~ ~ A G A A m

1: T ___._ T T T --.- G A C T A W: C G G G---TC N: A G G C A T T A T T T T T - C C C T T G C T T C T T A A A G A M C A C T A ATG GCA ACC Cn: ti:

-+ ~ A T L =

W: T: V I

T: CT G T A T A W: C G T M: CGA GTC GAC GAA A T T AAT AAA ATT CTC CGC GAA CGT ATT GAA CAA TAT AA1 AM:

U: H V L T: A S N I

n : R v D E I N K I L R E R I E Q Y N R

T:G AA TA C AC C T A G C C W: A T N: AAA GTA GGG ATT GAG AAT ATC GGT CGC GTA GTG CAA GTG GGG GAT GGG ATT GCT n : K v G I E N I G R V V Q v G D G I A W: T : E K V T T L

T: GAG A 0 G G A G W: n: CGT ATT ATA GGT CTT GGT GAA ATA ATG TCA GGT GAA TTA GTC GAA m GCA CAA n : ~ I I G L G E I n s G E L V E F A E w: T: H D V A E

T : T A T C A T T G W : T T G C N: GGG ACG AGA GCT ATT GCT CTG AAT TTG GAA TCT AAA AA1 G ’ n GGC ATT GTA I T A M : G T R G I A L N L E S K N V G I V L W: 1: I N V

T: T A T A C G W: n: ATG GGC GAT GGG TTG ATG ATA CAA GAG GGA ACT m GTA AAA GCA ACA GGA AGA n : n G D G ~n I Q E G s F V K A T G R W: T: L 5

1: w. c c

T I T

C G G G . ..

N: ATT GCT CAG ATA CCC GTG AGC GAG GCT TAC TTG GGT CGT GTI ATA AAT GCT CTC N : I A Q I P V S E A Y L G R V I N A L w: P V T:

1: T C C A G W: T N: ACC TTA ATA ATT TAT GAT GAT CTC TCC A M CAG GCA CAA GCT TAT CGC GAA ATG N : T L I I Y D D L s K Q A Q A Y R Q n W: T: P

T: T C G T T A W: T G C n: TCC CTT CTA TTA AGA AGA CCT ccc GGC CGC GAA GCT TAT CTA GGG GAT GTT m n : s L L L R R P P G R E A Y L G D V F W: P 1:

T: G G AG T W: N: TAT TTG CAT TCA CGC GTT TTA GAA AGA GCC GCT AAA TTA AA1 TCT cT1‘ TTA GGC N : Y L H S R L L E R A A K L N S L L G W: T: s s

T: A c T A C G T G W: A N: GAA GGG AGT ATG ACC GCT TTA CCA ATA GTG GAG ACT CAA TCT GGA GhC GTT TCC

W: T:

T: T C T G W: c c T

T

~ : E G S ~ T A L P I V E T Q S G D V s

n: GCC TAT ATT CCT ACT AAT GTA ATT TCA ATT AWL GAT CGA CAA ATA TTC TTA TCC U : A Y I P T N V I S I T D G Q I F I S W: 1:

I T C A c c T: C C U: C T N: GCG GAT CTA TTC AAT GCC GGA ATE CGA CCT GCT ATT AAT Gn: GGI A T T TCA G‘K M : A D L F N A G I R P A I N V G I S V W: T: s

T: G G A T W: G n: TCC AGA GTA GGA TCC GCA m CAA ATT AAA ccc ATG AM GAA CTA GCT GGC AAA n : s R V G s A A Q I K A ~ K Q V A G K U: T:

T: T A A G T T W: 0 n: TCA AM TTG GAA CTA GCT CAA TTC GCA CM TTA CAA GCC m GCA GAA TIC GCC n : s K L E L A Q F A E L Q A F A Q F A U: T: L E

343

- 5 1

12

66

120

174

228

282

336

B

822

876

930

984

1038

1092

1146

1200

D

FIGURE 2.-The nucleotide and derived amino acid sequences of the noncoding strands of atpA and atpH from maize plastids. Sites which differ from maize in other angiosperm atpA and atpH sequences are indicated above (for nucleotides) or below (for amino acids) the maize sequence (see text for references). The sequences were aligned to maximize homologies; gaps in one sequence versus another are indicated by dashes. Putative Y S ~ ~ ~ ~ - D ~ ~ ~ ~ ~ ~ ~ w ribosome-binding sites are designated by double underlines, and RNA termini identified by S 1 nuclease mapping analyses (Figure 3) are indicated by vertical arrows. Putative “-1 0” and “-35” promoter elements are underlined upstream from these termini.

Page 5: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Molecular Evolution of atpA,H 131

T: T A c G T C A A T

P: TCC GCT CTG CAT AM ACA AGT CAG AAT CAA TTG GCA AGG GGT GGA CGA TI% CGG n : s A L D K T S Q N Q L A R G R R L R w:

w: T c A 1254

T: D A T Q T: GCT A G A A A X U: G G T A n: GAA TTG CTT AM CAA TCC WVL TCA AAC CCI CTC CCA GTC GAA GAG CAG GTA GCT n : E L L K Q s Q s N P L P V E E Q V A U: A I T: A T I U

T: A AC C A C G F : A U: ACT ATT TAT ACC GGA ACG AGA GGA TAT CTT GAT TCG mA GAA ATT GAA CAG GTA n : T I Y T G T R G Y L D s L E I E Q v U: T: N V G

T: G T T c CT T T AC A G w : T T n: AAG AM TTT CTG GAT GAG TTA CGT AM CAC CTA AAA GAT ACT h4A CCT CUL TTC ~ : K K F I D E L R K H L K D T K P Q F w: N T: R V T Y T N

T: C cc T G G U: C G T n: CM GM ATT ATA TCT TCT AGT AAG ACA mc ACC GAG CAA GCA GAA ACC CTT TTG 1470 U : Q E I I S S S K T F T E Q A E T L L U: I T: T E A

T: A A A G C T ATA G w: G C T T ._ n: AAG GAA GCT ATT w GAA CAG cm GAA CGG m TCC CTT CAG GAA CAA ACA TAA 1526 n : K E A I Q E Q L E R F S L Q E Q T * w: * - . * T: H D I A *

T: G A TAT ATC T G CTTA CTT-----AATM A A -.--.- GCG C GG U: T T ..________..__. T GTTGAG

1308

1362

1416

n: A T A T ~ ~ ~ C A T G T C T A C T T C C T G ~ A G T A G A A G A G ~ T C . - - . . - M A G A ~ C A T 1589

T: CA AT . -CAG C C T C----------- C M C AAMMT T G w: MAG C T ITTAG _.__. n: T T G M T C A T G C * M . T C G ~ A G . - - - - T A T ~ A ~ A M . ~ ~ - - G A A T A G A T A G A M 1649

T: T - U: T - n: AMGAT 1655 E

tions presume that mutational processes are random, while others correct for nonrandom mutational biases, the sequences in Figure 2 were systematically com- pared with one another to determine if either transi- tion (G-A, T-C) or transversion (A-T, C-G, A-C, T-G) mutational biases predominate in the sequence evo- lution of atpA and atpH from higher plants; if entirely random, transversions should occur twice as fre- quently as transitions. The data in Table 1 show that transitions occur about twice as frequently as trans- versions. Consequently, formulas of KIMURA (198 l), which correct for this bias, were used to estimate the average number of nucleotide substitutions per site ( “ K ” ) in each sequence comparison; the average num- ber of substitutions at the first, second and third codon positions (”KI,” “K2” and UK3,n respectively); the av- erage number of silent (synonymous) nucleotide sub- stitutions at the third codon position (“K,”); and the average number of amino acid-altering (nonsynony- mous) nucleotide substitutions at the first and second codon positions (“K,”). The proportion of synonymous and nonsynonymous nucleotide substitutions in each sequence comparison was also tabulated directly (“Per- cent silent substitutions” in Table 1).

The data in Table 1 show that the fewest number of nucleotide substitutions occur at the second codon position and the greatest number at the third codon position in each of the nine sequence comparisons (K1 vs. K 2 vs. K3); as expected, most of the changes at the

third position are silent (K, is approximately equal to K s , within the limits of confidence). Contrasting the patterns of nucleotide substitution in atpA vs. atpH, the data indicate that the average number of nucleo- tide substitutions ( K ) is higher in atpA than atpH in each of the three comparisons involving species in which both genes have been sequenced. Chi square heterogeneity analyses have shown that these differ- ences are statistically significant, although the differ- ence in K between atpA and atpH appears to be much less significant in the maize/wheat comparison due to the relatively small number of nucleotide differences in atpH from these two species (Chi square data not shown). Regardless, the higher nucleotide substitution values in atpA are primarily a consequence of higher levels of nonsynonymous substitution (K,), since syn- onymous substitutions at the third codon position (K,) are statistically homogeneous for both genes in each of the three species comparisons. It should be pointed out, however, that the relative proportion of nonsy- nonymous substitutions in atpA versus atpH is actually much higher than that indicated by comparisons of K,, since the K , values are serious overestimates of nonsynonymous substitutions in atpH. This arises from the assumption in calculations of K , that nearly all substitutions at the first codon position are nonsy- nonymous. In fact, the vast bulk of substitutions at this site in atpH are synonymous changes; only a single amino acid-altering change is observed in all of the atpH comparisons in Figure 2. The data in Table 1 thus show that the patterns of nonsynonymous substi- tution differ markedly in atpA vs. atpH, and indicate that the second codon positions of these genes are evolving at different rates.

To examine the mode and tempo of sequence evo- lution of atpA and atpH in greater detail in the mon- ocot and dicot lineages of angiosperms, rates of nu- cleotide substitution (per l O9 yr) were calculated from the K , K,, and K , values in the maizelwheat and tobacco/spinach comparisons (Table l) , assuming di- vergence times of 60 X lo6 yr for maize and wheat (STEBBINS 1981), and 60-100 X lo6 yr for spinach and tobacco (MULLER 198 1). Because these rates could not be calculated for dicot atpA genes (since only a single dicot atpA sequence has been reported), rate estimates derived from comparative sequence analyses of atpB and atpE, similar to those reported in Figure 2 and Table 1 , were included in Table 2. It is worth pointing out, however, that the nucleotide substitu- tion rates for atpA are likely similar to those for atpB among the dicots, since the patterns of nucleotide substitution in these two genes are very similar in comparisons between maize, wheat, and tobacco (as in Table 1).

The data in Table 2 reveal that there are no statis- tically significant differences within the monocots

Page 6: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

132 S. R. Rodermel and L. Bogorad

TABLE 1

Nucleotide divergence in the coding regions of atpA and atpH

Amino acid ho- mology

Gene n (%i

Maize/Wheat atpA 1515 97

atpH 246 100

Maize/Tobacco atpA 1524 88

atpH 246 99

Wheat/Tobacco atpA 1515 87

atpH 246 99

Maize/Spinach atpH 246 100

Wheat/Spinach atpH 246 100

Tobacco/Spinach atpH 246 99

Nu- cleic acid

homol-

:!2J 96

98

86

92

86

91

92

93

92

Transition/ Transversion

38/22 = 1.7

4/1 = 4.0

127/83 = 1.5

13/8 = 1.6

135/81 = 1.7

15/7 = 2.1

13/6 = 2.2

12/5 = 2.4

13/8 = 1.6

Percent silent

substitu- tion

75

100

63

95

60

95

100

100

95

K K1

0.0408 0.0201 (0.0053) (0.0064) 0.0207 0.0123

(0.0093) (0.0124)

0.1539 0.0790 (0.01 11) (0.0130) 0.0912 0.0507

(0.0204) (0.0256)

0.1601 0.0883 (0.01 14) (0.0139) 0.0962 0.0377

(0.021 1 ) (0.0220)

0.0822 0.0643 (0.0197) (0.0295)

0.0731 0.0513 (0.0181) (0.0263)

0.0913 0.0643 (0.0204) (0.0295)

K2

0.0100 (0.0045)

0

0.0726 (0.0125)

0

0.0842 (0.0136)

0

0

0

0

K3 K ,

0.0958 0.0805 (0.0146) (0.0129) 0.0513 0.0513

(0.0263) (0.0263)

0.3465 0.2929 (0.0336) (0.0307) 0.2484 0.2089

(0.0653) (0.0588)

0.3424 0.2969 (0.0339) (0.0135) 0.2869 0.2473

(0.0731) (0.0674)

0.1972 0.1647 (0.0560) (0.0497)

0.1805 0.1480 (0.0527) (0.0461)

0.2293 0.1825 (0.0612) (0.0871)

K.

0.0150

0.0062

0.0758

0.0253

0.0862

0.0188

0.0322

0.0256

0.0322

“n” is the total number of homologous nucleotide positions that were compared in each pair of sequences, excluding gaps in one sequence US. another. The total number of transition and transversion events in each sequence comparison is shown in the form of a ratio, while only the percentage of silent substitutions is shown for each comparison. In determining the latter value, each nucleotide substitution was considered as an independent event in those relatively few cases in which a given codon contained more than one nucleotide change. Estimates of evolutionary distance between the two sequences (in substitutions per nucleotide site) are presented on the top line in each comparison and their error variances (due to sampling errors) are given below in parentheses. K, K 1 , K I and K S were calculated according to equation (1 2), of KIMURA ( 1 98 1). K , and its error variance were calculated according to equations (1 1) and (1 3), respectively, of KIMURA (1981). The average number of nonsynonymous substitutions (K.) was estimated by K, + K2/2 (KIMURA, 1981).

TABLE 2

Rates of nucleotide substitution in plastid coupling factor genes (substitutions/site/lOg year)

Monocot/Monocot atpA atpB atpE atpH

Dicot/Dicot atpA atpB atpE at;bH

Nonsynonymous (K0/2T)

0.13 0.17 0.06 0.05 -

0.1 7-0.28 0.38-0.63 0.16-0.27

Synon mous

0.67 0.86 0.91 0.43

(K,J;T)

- 1 .Ol-1.68 1.14-1.90 0.90-1.50

Total ( K I 2 T )

0.34 0.42 0.38 0.17 -

0.46-0.77

0.46-0.76 0.64-1.07

~ ~~

Amino acid Nucleic acid homology (%) homology (W)

97 96 97 95 98 96

100 98 - - 95 91 87 88 99 92

Rates of nucleotide substitution in plastid coupling factor genes. The rates (X 1 O-’ substitutions/site/year) were calculated by dividing K, K , or K , by 2T (KIMURA, 1981), where T is the estimated divergence time between two species. Rates were calculated at the lower and upper bounds of 7’ in the spinach and tobacco comparisons. The nucleotide substitution values were obtained from Table 1, or from comparative sequence analyses of the plastid atpB and atpE genes from maize (KREBBERS et al. 1982), wheat (HOWE et al. 1985), tobacco (SHINOZAKI et al. 1983a), and spinach (ZURAWSKI, BOTTOMLEY and WHITFELD 1982).

“Dicot atpA comparisons cannot be made.

among the four genes in any of the three rate com- parisons, with the exception that the sequence evolu- tion (K/2T and K,/2T)(T = estimated divergence time between species of atpH) may be proceeding at a lower rate than the others (consistent with the data in Table 1). This is reflected in higher levels of nucleotide and amino acid conservation of atpH than are observed

for the other three genes in the maize/wheat compar- ison. There are also no statistically significant differ- ences within the dicots among atpB, atPE and atpH in any of the three rate comparisons, with the exception that atpE has a higher rate of nonsynonymous substi- tution (K, /2T). This is reflected in a much lower level of amino acid homology between the E subunit pro-

Page 7: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Molecular Evolution of atpA,H 133

teins from tobacco and spinach (87%), than between the 0, or proteolipid, subunit proteins from these species (95 and 99%, respectively).

Contrasting the rates of nucleotide substitution in these genes in the monocots vs. the dicots, the data in Table 2 indicate (with a few exceptions) that each of the three rates is about the same in both lineages when a divergence time of 100 million years between spin- ach and tobacco is used in the rate calculations (e.g., K/2T ranges from 0.34 to 0.46 in both lineages, excluding values for atpH in the monocot comparison and atpE in the dicot comparison). In contrast, each of these rates (with exceptions) is about twofold higher in the dicots, assuming a divergence time of only 60 million years between spinach and tobacco. Although more precise divergence time estimates will be needed to differentiate between these two possibilities, the data nevertheless indicate that the sequence evolution of atpE and atpH has decreased markedly in the mon-

1. The rate of nucleotide substitution (K/2T) in atpE is about two- to threefold higher in the dicots than in the monocots (depending upon the divergence time of spinach and tobacco). Most of this higher rate of sequence evolution can be attributed to a six- to tenfold higher rate of nonsynonymous substitution K,/2T) in the dicots, since the rate of synonymous substitution in atpE is at most twofold higher in the dicots (again, depending upon the divergence time). These differences in K/2T and K,/2T are statistically significant, and are reflected in much higher levels of nucleotide and amino acid sequence conservation of atpE in the monocots than dicots (e.g., in contrast to more similar levels of conservation of atpB in both lineages). An apparent enhancement in the sequence conservation of atpE in the monocot lineage has been noted previously (PALMER 1985a).

2. The rate of nucleotide substitution (K/2T) in atpH is about 2.5-4.5-fold higher in the dicots than in the monocots (again, depending upon the spinach/ tobacco divergence time). Most of this higher rate of sequence evolution can be attributed to at least a two- to threefold higher rate of synonymous substitution in the dicots, since the rates of nonsynonymous sub- stitution are about the same in both lineages. (As stated earlier, the K, values are overestimates of non- synonymous substitution in the case of atpH). These differences in K/2T and KJ2T between the monocot and dicot atpH sequences are statistically significant, and are reflected in a higher level of nucleotide con- servation of this gene in the monocots than in the dicots (98% vs. 92%), in contrast to similar levels of amino acid conservation of the proteolipid subunit in both lineages (100% vs. 99%).

Comparative molecular evolutionary analyses of the flanking regions of atpA and atpH

The nucleotide sequences of portions of the 5’- and 3’-flanking regions of atpA and atpH from maize,

ocots:

wheat and tobacco are illustrated in Figure 2. North- ern hybridization experiments have shown that these regions are complementary to polycistronic and other mRNAs (which, however, vary in number and size from species to species), and that the ends of some of these transcripts likely map in these regions (FLUHR, FROMM and EDELMAN 1983; DENO, SHINOZAKI and SUGIURA 1984; BIRD et al. 1985; RODERMEL and Bo- GORAD 1985). The data in Figure 2 indicate that there are numerous insertions (deletions) in one sequence us. another. These insertions range in size from 1 to 20 bp, though the majority are short (3-5 bp), and most are flanked by perfect (or near-perfect) direct repeats. For example, of the five gaps in the 5’- flanking region of atpH from wheat us. maize, two are single nucleotide insertions in the maize sequence, while two are short insertions in the wheat sequence (“ACT” and “TTCTCC”) and one is a short insertion in the maize sequence (“GAAGTTG”); each of these latter three sequences is repeated immediately up- stream and/or downstream in both wheat and maize. Similar events have been observed in the flanking regions of other plastid genes, and it has been sug- gested that they arise by mispairing or slippage of DNA strands during replication or repair (PALMER 1985b). It might be pointed out, however, that opti- mal levels of A and T may facilitate the generation of such events, since in the present studies these events predominate in the intergenic spacer regions (which range from 67 to 75% AT) us. the coding regions of atpA and atpH (57% A T for both genes).

T o determine the patterns of nucleotide substitu- tion in the transcribed flanking regions of atpA and atpH, the number of transitions and transversions was tabulated in each sequence comparison (excluding gaps), and the average number of nucleotide substi- tutions per site (K) was calculated (Table 3). The data in Table 3 show that the transition/transversion ratio approaches random expectations (TJT, = 0.5) in most comparisons. The lack of a pronounced transition bias in these regions (in contrast to the 2:l transition bias in the coding regions of these genes) may be an artifact arising from small sample sizes in some of these com- parisons. On the other hand, the lack of such a bias in those comparisons involving a relatively large num- ber of substitution events may be a reflection of the ease with which the record of transition changes (but not transversion changes) can be obscured with evo- lutionary distance (BROWN et al. 1982).

Due to the presence of numerous gaps, it is difficult to estimate precisely the overall degree of nucleotide divergence in the transcribed flanking regions of atpA and atpH. Nevertheless, several general conclusions can be drawn from the data in Table 3. In the first place, if gaps are included in the calculations (assum- ing a gap to be equal to one substitution event), the

Page 8: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

134 S. R. Rodermel and L. Bogorad

TABLE 3

Nucleotide divergence in the flanking regions of atpA and atPH

5’ Flanking 3’ Flanking

Gene n Gaps Ts/Tu K n Gaps Ts/Tu K

Maize/Wheat atpA 89 2 6/3 = 2.0 0.1098 122 6 2/7 = 0.3 0.0779 (0.0377) (0.0264)

atpH 110 5 2/3 = 0.7 0.0469 112 1 9/11 = 0.8 0.2048 (0.0 107) (0.0480)

Maize/Tobacco atpA 53 5 2/6 =0.3 0.1687 118 5 23/29= 0.8 0.6850 (0.06 1 5) (0.123 1)

atpH 101 7 8/8 = 1.0 0.1789 102 6 12/24=0.5 0.4784 (0.0466)

Wheat/Tobacco atpA 53 5 4/8 = 0.5 0.2703 107 7 18/25= 0.7 0.5893

atpH 101 9 8/11 =0.7 0.2170 106 5 14/30=0.5 0.6053

(0.0901)

(0.08 28) (0.1 101)

(0.052 1) (0.1074)

See the legend to Table 1 for details.

estimates of nucleotide divergence ( K ) are statistically homogeneous for the 5’- and 3’-flanking regions of atpA and atpH in maize and wheat. This indicates that these four regions are evolving at about the same rate in these two species. Second, the 5 ’ - and 3’-flanking regions of both genes each appear to be evolving at about the same rate in tobacco, though the former regions are evolving faster than the latter. Third, each of the four flanking regions is considerably more conserved in maize and wheat than tobacco.

Putative regulatory sequences involved in transla- tion initiation, transcription initiation, and transcrip- tion termination are present in the flanking regions at atpA and atpH, but these sequences appear to show intergenic and interspecific variations:

Translation initiation: Sequences complementary to the 3‘ end of the plastid 16 S rRNA from maize (SCHWARZ and KOSSEL 1980) and tobacco (TOHDOH and SUGIURA 1982) are present immediately upstream from the translation initiation codons at atpA and atpH in all three species (double underlines in Figure 2). It has been suggested that these sequences may function in a manner similar to the “SHINE-DALGARNO” ribo- some-binding sequences of Escherichia coli mRNAs (SHINE and DALGARNO 1974; BOGORAD et al . 1984). If this is the case, the data in Figure 2 indicate that there are qualitative and quantitative differences in the “SHINE-DALGARNO” sequences which are both gene-specific and species-specific (e.g., the “SHINE- DALGARNO” sequences are located further upstream from the translation initiation codon in both wheat genes). These differences may be a reflection of con- trasting constraints on translation initiation since the efficiency of translation in Escherichia coli is correlated with the stability of the interaction between the mRNA and the rRNA, as well as with the spacing between the initiation codon and the “SHINE-DAL-

GARNO” sequence (STEITZ 1979). Consistent with these differences, the regions immediately flanking the “SHINE-DALGARNO” sequences in Figure 2 do not appear to be particularly conserved; these regions of Escherichia coli mRNAs appear to be important in determining translational efficiency because they, too, interact with the rRNA (STORMO et al . 1982; TANI- GUCHI and WEISSMANN 1978).

Transcription initiation: As illustrated by the S-1 nuclease mapping experiments in Figure 3, RNA ter- mini are present in the 5’-flanking regions of both atpA and atpE from maize (arrows in Figure 2). Im- mediately upstream from both termini are sequences (underlined in Figure 2) which resemble putative “procaryotic-like,” “- 10” and “-35” promoter elements (ROSENBERG and COURT 1979) present upstream of the transcription start sites of other plastid genes (STEINMETZ et a l . 1983). Although S-1 nuclease map- ping experiments have not been reported for these regions in wheat or tobacco, these sequences appear to be conserved upstream of aptA and atpH in all three species, with the exception of the 5’-flanking region of the tobacco atpH gene. The S-1 data do not rule out the possibility that the termini identified in the present experiments are RNA processing sites, rather than transcription initiation sites.

Transcription termination: Stable stem and loop structures, characteristic of rho-independent tran- scription termination in procaryotes (ROSENBERG and COURT 1979), can be formed in the 5’- and 3’-flanking regions of the tobacco atpA gene and in the 3’-flank- ing region of the tobacco atpH gene (DENO, SHINOZAKI and SUCIURA 1984; DENO and SUCIURA 1984), but similar structures cannot be formed in the flanking regions of either gene in maize or wheat. Since such a structural element appears to be lacking in the 3’- flanking region of atpA from maize (and wheat), it will

Page 9: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Molecular Evolution of atpA,H 135

A

- . - - B

A A- A- A C ‘T

FIGURE 3.-S-1 nuclease mapping of RNA termini in the 5‘- flanking regions of A) the maize atpA gene and B) the maize olpH gene. The experiments were conducted as described in MATERIALS AND METHODS. As controls, RNA (‘-RNA”) or SI nuclease (‘-Sl”) were omitted from the reaction mixtures. The arrows indicate the protected bands.

be of interest to determine whether the convergently transcribed tRNAArg gene ( S . RODERMEL, P. ORLIN and L. BOGORAD, unpublished data; HOWE 1985) serves as a punctuation signal for transcription termi- nation in this region of the plastid chromosome in either of these species.

DISCUSSION

According to the neutral theory of molecular evo- lution, genes (or parts of genes) evolve at rates com- mensurate with their functional importance (KIMURA 1983). Consistent with this theory, the present data show that the greatest number of nucleotide substi- tutions in the coding regions of uptA and uptH occur in the third codon position, largely as silent (synony- mous) substitutions, and the smallest number in the second (nonsynonymous) codon position (Table 1). Rates of nucleotide substitution in mammalian pseu- dogenes are only somewhat greater than rates of synonymous substitution in the functional counter- parts of these genes, suggvsting that the latter events are under little functional constraint (Lr, Luo and Wu 1985). Consequently, synonymous substitution rates provide a rough measure of the mutation rate in a given gene. This rate has been estimated as 1.1 X lo-” substitutions/site/year for rbcL, utpB and atpE in com-

parisons between maize and barley (ZURAWSKI and CLEGG 1984). and in the present studies estimates of this rate range from 0.43 to 1.9 X substitutions/ site/year, depending upon the gene, species compari- son, and presumed divergence time (Table 2). These estimates a re significantly lower than similar estimates for genes from other sources-e.g., an average of 4.7 X 1 0-9 substitutions/site/year for mammalian nuclear genes (Lr, Luo and Wu 1985); 4.9 X substitu- tions/site/year for mammalian globin pseudogenes (LI, Luo and Wu 1985); and 100 X 1 0-” substitutions/ site/year for mammalian mitochondrial genes (e.g., BROWN et al. 1982; BROWN and SIMFSON 1982). T h e data are thus consistent with the notion derived pri- marily from comparative DNA hybridiiration and re- striction fragment polymorphism studies that angio- sperm plastid genomes are evolving conservatively in primary sequence with respect to other DNAs (re- viewed in PALMER 1985a).

Although the third positions of synonymous codons a re presumed to be subject to few selective constraints, there is a marked tendency for these sites to be occu- pied by A or T in plastid genes (reviewed in WHITFELD and BOTTOMLEY 1983). In the present studies, 73% of the synonymous codons in atpA and 83% of the synonymous codons in atpH end in one of these two nucleotides (Table 4). Accompanying this AT-bias in third position codon usage is a transition mutation bias of about 2:l (Table 1). It has been observed previously that a twofold higher frequency of A and T in mammalian pseudogenes is likely the conse- quence of a nonrandom mutation bias favoring C to T and G to A transitions in the evolution of these sequences from their functional counterparts, sug- gesting that such a bias may be responsible for high equilibrium frequencies of A and T in selectively unconstrained regions of the mammalian nuclear ge- nome (GOJOBORI, LI and GRAUR 1982; LI, Wu and Luo 1984). It is therefore tempting to speculate that a similar bias may be responsible (at least in part) for the fixation of high levels of A and T in relatively unconstrained regions of the plastid genome, includ- ing the third positions of synonymous codons (Table 1) and intergenic spacer regions (Table 3). Although a directionality of transition mutations cannot be in- ferred from the present data, analyses of plastid pseu- dogenes would clearly be of great benefit to test this hypothesis.

I t should be pointed out, however, that it is unlikely that a nonrandom mutation bias of this sort could alone account for the fixation of high levels of A and T in selectively unconstrained regions of the plastid genome, since even slight selective advantages appear to be capable of overriding such mutational pressures (Lr, Luo and Wu 1985). For example, examination of the patterns of codon usage in atpA and utpH from

Page 10: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

136 S. R. Rodermel and L. Bogorad

TABLE 4

Codon usage in maize plastid genes

atpA atpH Genome

Ala GCU 26 GCC 9 GCA 12 GCG 2

Arg CGU 5 CGC 8 CGA 5 CGG 3 AGA 10 AGG 3

Asn AAU 12 AAC 1

Asp GAU 16 GAC 3

Cys UGU I UGC 0

Gln CAA 25 CAG 10

Glu GAA 31 GAG 10

Gly GGU 12 GGC 6 GGA 9 GGG 13

His CAU 2 CAC 2

Ile AUU 30 AUC 4 AUA 12

Leu UUA 16 UUG 8

I O 228 1 45 3 105 3 39

0 74 0 26 0 85 0 10 2 65 0 9

1 103 1 61

0 131 0 44

0 16 0 6

2 120 1 41

4 190 0 49

5 155 0 41 4 147 2 58

0 71 0 31

5 206 0 65 1 89

3 177 2 91

atbA atbH Genome

Leu CCU 12 3 CUC 6 0 CUA 5 3 CUG 4 1

Lys AAA 19 1 AAG 3 0

Met AUG 1 1 2

Phe UUU 7 3 uuc 7 0

Pro CCU 8 1 ccc 5 1 CCA 2 2 CCG 1 0

Ser UCU 8 1 UCC 12 1 UCA 8 0 UCG 2 0 AGU 4 1 AGC 1 0

Thr ACU 8 2 ACC 6 0 ACA 10 1 ACG 3 0

Trp UGG 0 0

Tyr UAU 13 1 UAC 3 0

Val GUU 6 4 GUC 3 0 GUA 17 2 GUG 7 1

Ter UAA 1 1 UAG 0 0 UGA 0 0

82 21 74 36

152 38

114

158 100

81 42 61 20

76 78 58 10 76 22

112 63 60 44

92

101 29

122 22

108 42

8 5 1

Codon usage in the maize atpA and atpH genes, expressed as the total number of codons for each amino acid. The codon usage in other maize plastid genes is also shown. These genes include: rbcL (MCINTOSH, POULSEN and BOCORAD 1980); atpB and atpE (KREB- BERS et al. 1982); P s l A l and pslA2 (FISH, KUCK and BOGORAD 1985); psbA and rps19 (I. LARRINUA, personal communication); psbC and psbD (E. T. KREBBERS, K. M. T . MUSKAVITCH, M. CAS- TROVEIJO, H. ROY, D. RUSSELL and L. BOGORAD, unpublished data); rpS4 (SUBRAMANIAN, STEINMETZ and BOGORAD 1983); psbG (STEIN- METZ et al. 1986); and the gene for cytochrome b-559 (J. LUKENS and L. BOGORAD, unpublished data).

maize (Table 4) reveals that not all preferred synon- ymous codons end in A or T, as predicted if the AT- bias at this position were a consequence of such a mutational bias-e.g., UCC, rather UCA or UCU, is the preferred codon for serine in @ A . Since these patterns of codon usage are similar to those observed in other maize plastid genes (Table 4), the data are in accord with the “genome hypothesis” of GRANTHAM et al. (198O)-viz., that genes in a genome use the

same coding strategy with respect to choices among synonymous codons-and indicate that evolutionary forces other than nonrandom mutation are at least partially responsible for the selection of synonymous codons in maize plastid genes. The selection of syn- onymous codons corresponding to the most abundant isoaccepting tRNAs for a given amino acid, and of those codons which most favorably stabilize codon- anticodon interactions appear to be two of the most important of these factors in bacterial and yeast sys- tems (e.g., IKEMURA and OZEKI 1983). The importance of these factors in determining the patterns of codon usage in plastid genes has yet to be explored. On the other hand, it should be noted that optimal levels of A and T in these, as well as other relatively uncon- strained regions of the plastid genome, may be a reflection of other selective advantages associated with high frequencies of these nucleotides-e.g., energetic and/or deoxyribonucleotide triphosphate pool re- quirements by the plastid transcription and/or repli- cation machinery.

A central finding of the present studies is that the patterns of nucleotide substitution differ among plas- tid coupling factor genes. This is perhaps best exem- plified by the differing modes of sequence evolution of atpE and atpH. Nucleotide divergence in atpE is characterized by much lower rates of nonsynonymous substitution in the monocot lineage (Table 2 ) , sug- gesting that the epsilon subunit of CFI is more selec- tively constrained in the monocots. The data ai-e thus in accord with previous analyses of overall nucleotide and amino acid homologies suggesting that atpE is more conserved in the monocots (PALMER 1985a). It is not known whether these constraints are associated with functional differences, but consistent with this notion, differences have been reported in the catalytic activities of isolated CFI from maize versus spinach (W. J. PATRIE and D. MILES, personal communica- tion). In contrast, the sequence evolution of atpH is characterized by the virtual absence of nonsynony- mous substitutions, but by a significantly lower rate of synonymous substitution in the monocot lineage. Unless this latter difference is a statistical aberration, these data suggest that the proteolipid subunit of CFo is under equal functional constraint in both the mon- ocots and dicots, but that synonymous codon selection may be more constrained in the former. In accord with previous observations showing that highly ex- pressed genes tend to use codons corresponding to the most abundant isoaccepting tRNA species, and weakly expressed genes those codons corresponding to more rare tRNAs (e.g., GRANTHAM et al. 1981; GROSJEAN and FIERS 1982; BENNETZEN and HALL 1982), such a constraint for atpH could arise from differences in tRNA availability in the monocot versus dicot lineages. This of course assumes that the stoi-

Page 11: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Molecular Evolution of atflA,H 137

chiometry of coupling factor subunits, which favors the proteolipid subunit by at least a factor of two to one (NELSON 198 l), is not primarily a consequence of post-translational events--i.e., that atpH is a highly expressed gene relative to the other coupling factor genes. Regardless of the precise nature of the con- straint on synonymous codon selection in monocot atpH genes, it will be of great interest to determine whether a similar phenomenon underlies the sequence evolution of plastid genes encoding other highly “ex- pressed” and/or conserved proteins, such as the 32- kd herbicide-binding protein of photosystem I1 (WHITFELD and BOTTOMLEY 1983).

The levels of nucleotide substitution in the 5’- and 3‘-transcribed flanking regions of atpA and atpH equal (in the maize/wheat comparisons) or vastly exceed (in the monocot/dicot comparisons) synonymous substi- tution levels in these genes, indicating that these re- gions are relatively unconstrained in their sequence evolution. Consistent with the high rates of nucleotide substitution in these regions, the data in this report indicate that there are few putative regulatory se- quences in these regions that are conserved among all three species either in their spatial arrangement (e.g., translation initiation sequences) or sequence complex- ity (e.g., sequences flanking RNA termini; transcrip- tion termination sequences). This may not be surpris- ing, since the number (and likely location) of tran- script ends in these regions appear to differ from species to species (FLUHR, FROMM and EDELMAN 1983; DENO, SHINOZAKI and SUGIURA 1984; BIRD et al. 1985; RODERMEL and BOCORAD 1985). The present data are thus not inconsistent with the notion that the flanking regions of atpA and atpH likely contain vari- able numbers of “species-specific” regulatory se- quences (e.g., photoregulated promoter sequences in maize).

Unless “species-specific” regulatory elements are numerous in the flanking regions of atpA and atpH, the question arises why the sizes of these regions appear to be so highly conserved. As illustrated in Figure 2, the flanking regions of atpA and atpH are characterized by numerous small insertion (deletion) events which do not, in general, lead to a significant gain of nucleotides in one sequence us. another. The predominance of such events (in contrast to large insertion/deletion, transposition, and/or inversion events) has been noted by others in the noncoding regions of other plastid genes (e.g., ZURAWSKI, CLEGG and BROWN 1984; TAKAIWA and SUGIURA 1982), suggesting that, in terms of frequency, such events constitute the dominant mode of structural evolution in these regions of the plastid genome. The data are thus in accord with results from comparative DNA hybridization and restriction site studies showing that in terms of size, gene content, and gene arrangement,

angiosperm plastid DNAs are evolving conservatively in structure (reviewed in PALMER 1985b). As noted earlier, the data in this report are also consistent with the hypothesis that higher plant plastid genomes are evolving conservatively in primary sequence with re- spect to other genomes. This is indicated not only by the low rates of synonymous substitution in atpA, atpB, atpE and atpH, but also by the rates of nonsynonymous substitution in these genes (Table 2), which overall are at the low end of the range of similar rates for numerous mammalian nuclear genes, as well as several plant nuclear genes (LI, LUO and Wu 1985; S. ROD- ERMEL and L. BOGORAD, unpublished data). On the other hand, the present studies show that the tran- scribed flanking regions of atpA and atpH are not highly conserved in sequence complexity from species to species. This appears to be the case for the flanking regions of other, though not all, plastid genes (e.g., ZURAWSKI, CLEGG and BROWN 1984; S. RODERMEL and L. BOGORAD, unpublished data). The data thus suggest that the plastid genome is not a static evolu- tionary entity, and that the plastid does not respond passively to adaptive signals generated by differing, and more rapidly evolving nuclear environments. Rather, the data indicate that the plastid chromosome is an active participant in the evolutionary process, and that the rapid evolution of the flanking regions of plastid genes may be one means by which novel regulatory strategies can arise for the coordinate expression of genes in both compartments.

The authors would like to express their appreciation to LIZ ORR and PEGGY SCHULTZ for technical assistance; to ANDRE STEINMETZ and ENNO KREBBERS for guidance in DNA sequencing and S-1 nuclease analyses; and to DANIEL VOYTAS for careful reading of the manuscript. This work was supported in part by research grants from the National Science Foundation and the National Institute of General Medical Sciences. 1: was also supported in part by the Maria Moors Cabot Foundation of Harvard University.

LITERATURE CITED

ALT, J., P. WINTER, W. SEBALD, J. G. MOSER, R. SCHEDEL, P. WESTHOFF and R. G. HERMANN, 1983 Localization and nu- cleotide sequence of the gene for the ATP synthase proteolipid subunit on the spinach plastid chromosome. Curr. Genet. 7:

Sizing and mapping of early adenovirus mRNAs by gel electrophoresis of S 1 endonuclease- digested hybrids. Cell 12: 72 1-732.

BEDBROOK, J. R. and L. BOGORAD, 1976 Endonuclease recogni- tion sites mapped on Zea mays chloroplast DNA. Proc. Natl. Acad. Sci. USA 73: 4309-4313.

BENNETZEN, J. L. and B. D. HALL, 1982 Codon selection in yeast. J. Biol. Chem. 257: 3026-3031.

BIRD, C. R. , B. KOLLER, A. D. AUFFRET, A. K. HUTTLY, C. J. HOWE, T. A. DYER and J. C. GRAY, 1985 The wheat chloroplast gene for CFo subunit I of ATP synthase contains a large intron.

BOGORAD, L., L. D. CROSSLAND, L. E. FISH, E. T. KREBBERS, U. KUCK, I. M. LARRINUA, K. M. T. MUSKAVITCH, E. A. ORR, S. R. RODERMEL, R. SCHANTZ, A. A. STEINMETZ, S. M. STIRDI-

129-138. BERK, A. J. and P. A. SHARP, 1977

EMBO J. 4: 1381-1388.

Page 12: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

138 S. R. Rodermel and L. Bogorad

VANT and Y.-s. ZHU, 1984 The organization of the maize plastid chromosome: properties and expression of its genes. pp. 257-272. In: Biosynthesis of the Photosynthetic Apparatus: Molec- ular Biology, Development, and Regulation. Edited by J. P. THORNBER, L. A. STAEHLIN and R. B. HALLICK. Alan R. Liss, Inc., New York.

BOUTHYETTE, P.-Y. and A. T. JAGENDORF, 1978 The site of synthesis of chloroplast coupling factor 1. Plant Cell Physiol.

BOVENBERC, W. A., C. J. HOWE, A. J. KOOL and H. J. J. NIJKAMP, 1984 Physical mapping of genes for chloroplast DNA en- coded subunit polypeptides of the ATP synthase complex from Petunia hybrida. Curr. Genet. 8: 283-290.

Novel features of animal mtDNA evolution as shown by sequences of two rat cytochrome oxidase subunit I1 genes. Proc. Natl. Acad. Sci. USA 7 9 3246- 3250.

BROWN, W. M., E. M. PRAGER, A. WANC and A. C. WILSON, 1982 Mitochondrial DNA sequences of primates: tempo and mode of evolution. J. Mol. Evol. 18: 225-239.

COZENS, A. L., J. E. WALKER, A. L. PHILLIPS, A. K. HUTTLY andJ. C. GRAY, 1986 A sixth subunit of ATP synthase, an Fo component, is encoded in the pea chloroplast genome. EMBO

DE HEIJ, H. T., H. LUSTIG, D.-J. M. MOESKOPS, W. A. BOVENBERG, C. BIsANZ and G. S . P. GROOT, 1983 Chloroplast DNAs of Spina&, Petunia and Spirodela have a similar gene organiza- tion. Curr. Genet. 7: 1-6.

DENO, H. and M. SUCIURA, 1984 Chloroplast tNRAGIy gene con- tains a long intron in the D stem: nucleotide sequences of tobacco chloroplast genes for tRNAG'y (UCC) and tRNAA" (UCU). Proc. Natl. Acad. Sci. USA 81: 405-408.

DENO, H., K. SHINOZAKI and M. SUCIURA, 1983 Nucleotide se- quence of tobacco chloroplast gene for the alpha subunit of proton-translocating ATPase. Nucleic Acids Res. 11: 2 185- 2191.

DENO, H., K. SHINOZAKI and M. SUGIURA, 1984 Structure and transcription pattern of a tobacco chloroplast gene coding for subunit 111 of proton-translocating ATPase. Gene 32: 195- 201.

Two partially ho- mologous adjacent light-inducible maize chloroplast genes en- coding polypeptides of the P700 chlorophyll a-protein complex ofphotosystem 1. J. Biol. Chem. 260 1413-1421.

Ctone bank of Nicotiana tabacum chloroplast DNA: mapping of the alpha, beta and epsilon subunits of the ATPase coupling factor, the large subunit of ribulosebisphosphate carboxylase, and the 32-kDal membrane protein. Gene 25: 271-280.

Patterns of nucleo- tide substitution in pseudogenes and functional genes. J, Mol.

GRANTHAM, R., C. GAUTIER, M. GOUY, M. JACOBZONE and R. MERCIER, 1981 Codon catalog usage is a genome strategy modulated for gene expressivity. Nucleic Acids Res. 9: r43- 1-75,

GRANTHAM, R., C. GAUTIER, M. Gouu, R. MERCIER and A. PAVE, 1980 Codon catalog usage and the genome hypothesis. Nu- cleic Acids Res. 8 r49-r62.

GROSJEAN, H. and W. FIERS, 1982 Preferential codon usage in prokaryotic genes: the optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. Gene 18: 199-209.

The endpoints of an inversion in wheat chlo- roplast DNA are associated with short repeated sequences containing homology to att-lambda. Curr. Genet. 1 0 139-145.

HOWE, C. J., A. D. AUFFRET, A. DOHERTY, C. M. BOWMAN, T. A. DYER and J. C. GRAY, 1982 Location and nucleotide sequence

19: 1169-1174.

BROWN, G. G. and M. V. SIMPSON, 1982

J. 5: 2 17-222.

FISH, L. E., U. KUCK and L. BOGORAD, 1985

FLUHR, R., H. FROMM and M. EDELMAN, 1983

G~JOBORI, T., W.-H. LI and D. GRAUR, 1982

EvoI. 18: 360-369.

HOWE, C. J., 1985

of the gene for the proton-translocating subunit of wheat chloroplast ATP synthase. Proc. Natl. Acad. Sci. USA 7 9

HOWE, C. J., C. M. BOWMAN, T. A. DYER and J. C. GRAY, 1983 The genes for the alpha and proton-translocating sub- units of wheat chloroplast ATP synthase are close together on the same strand of chloroplast DNA. Mol. Gen. Genet. 190:

HOWE, C. J., I. M. FEARNLEY, J. E. WALKER, T. A. DYER and J. C. GRAY, 1985 Nucleotide sequences of the genes for the alpha, beta and epsilon subunits of wheat chloroplast ATP synthase. Plant Mol. Biol. 4: 333-345.

Localisation of genes for four ATP synthase subunits in pea chloroplast DNA. Mol. Gen. Genet. 194 402-409.

IKEMURA, T. and H. O Z ~ I , 1983 Codon usage and transfer RNA contents: organism-specific codonchoice patterns in reference to isoacceptor contents. Cold Spring Harbor Symp. Quant. Biol. 42: 1087-1097.

Estimation of evolutionary distances between homologous nucleotide sequences. Proc. Natl. Acad. Sci. USA

KIMURA, M., 1983 The neutral theory of molecular evolution. In: Evolution of Genes and Proteins, Edited by M. NEI and R. K. KOEHN. Sinnauer Associates, Sunderland, Massachusetts.

KO, K., N. A. STRAUS and J. P. WILLIAMS, 1984 The localization and orientation of specific genes in the chloroplast chromosome of Viciafaba. Curr. Genet. 8: 359-367.

KREBBERS, E. T., I. M. LARRINUA, L. MCINTOSH and L. BOGORAD, 1982 The maize chloroplast genes for the beta and epsilon subunits of the photosynthetic coupling factor CFI are fused. Nucleic Acids Res. 10: 4985-5002.

LARRINUA, 1. M., K. M. T. MUSKAVITCH, E. J. GUBBINS and L. BOGORAD, 1983 A detailed restriction endonuclease site map of the Zea mays plastid genome. Plant Mol. Biol. 2: 129-140.

Evolution of DNA sequences. pp. 1-94. In: Monographs in Evolutionary Biology: Molecular Evolutionaly Genetics, Edited by R. J. MACINTYRE. Plenum, New York.

LI, W.-H., C.-I. Wu and C.-C. LUO, 1984 Nonrandomness of point mutation as reflected in nucleotide substitutions in pseu- dogenes and its evolutionary imp1ications.J. Mol. Evol. 21: 58- 71.

Sequencing end-labeled DNA with base-specific chemical cleavages. Methods Enzymol.

MCINTOSH, L., C. POULSEN and L. BOGORAD, 1980 Chloroplast gene sequence for the large subunit of ribulose bisphosphate carboxylase of maize. Nature 288: 556-560.

MENDIOLA-MORCENTHALER, L. R., J. J. MORCENTHALER and C. A. PRICE, 1976 Synthesis of coupling factor CF, protein by iso- lated spinach chloroplasts. FEBS Lett. 62: 96-100.

MULLER, J., 1981 Fossil pollen records of extant angiosperms. Bot. Rev. 47: 1-142.

NECHUSHTAI, R., N. NELSON, A. K. MATTOO and M. EDELMAN, 1981 Site of synthesis of subunits to photosystem I reaction center and the proton-ATPase in Spirodela. FEBS Lett. 125 115-1 19.

NELSON, N., 1981 Proton-ATPase of chloroplasts. Curr. Topics Bioenerg. 11: 1-33.

NELSON, N., H. NELSON and G. SCHATZ, 1980 Biosynthesis and assembly of the proton-translocating adenosine triphosphatase complex from chloroplasts. Proc. Natl. Acad. Sci. USA 77: 1361-1364.

OLIVER, R. P., 1984 Location of the genes for cytochrome f, subunit IV of the be/f complex, the alpha subunit of CFI ATP- synthase and subunit 111 of the CFo ATP-synthase on the barley chloroplast genome. Carlsberg Res. Commun. 49: 555-557.

6903-6907.

51-55.

HUTTLY, A. K. andJ. C. GRAY, 1984

KIMURA, M., 198 1

78: 454-458.

LI, W.-H., C.-C. Luo and C.-I. Wu, 1985

MAXAM, A. M. and W. GILBERT, 1980

65: 499-560.

Page 13: Molecular Evolution and Nucleotide Sequences of the Maize … · 2003. 8. 1. · Molecular Evolution and Nucleotide Sequences of the Maize Plastid Genes for the cy Subunit of CFI

Molecular Evolution of atpA,H 139

PALMER, J. D., 1985a Evolution of chloroplast and mitochondrial DNA in plants and algae. pp. 131-240. In: Monographs in Evolutiona7)1 Biology: Molecular Evolutionary Genetics, Edited by R. J. MACINTYRE. Plenum, New York.

PALMER, J. D., 1985b Comparative organization of chloroplast genomes. Annu. Rev. Genet. 1 9 325-354.

PHILLIPS, A. L., 1985 Localisation of genes for chloroplast com- ponents in tomato plastid DNA. Curr. Genet. 1 0 153-161.

RODERMEL, S. R. and L. BOGORAD, 1985 Maize plastid photo- genes: mapping and photoregulation of transcript levels during light-induced development. J. Cell Biol. 100: 463-476.

RODERMEL, S. R. and L. BOGORAD, 1986 Identification, organi- zation and photoregulated expression of the maize plastid genes for the alpha subunit of CFI and subunits I and 111 of CFo. pp. 499-504. In: Regulation of Chloroplast Dtgerentiation, Edited by G. AKOYUNOGLOU and H. SENGER. Alan R. Liss, New York.

Regulatory sequences in- volved in the promotion and termination of RNA transcription. Annu. Rev. Genet. 13: 3 19-353.

SCHWARZ, Z. and H. KOSSEL, 1980 The primary structure of 16s rDNA from Zea mays chloroplasts is homologous to E. coli 16s rRNA. Nature 283: 739-742.

SHINE, J. and L. DALGARNO, 1974 The 3’-terminal sequence of Escherichia coli 16s ribosomal RNA: complementary to non- sense triplets and ribosome binding sites. Proc. Natl. Acad. Sci.

SHINOZAKI, K., H. DENO, A. KATO and M. SUGIURA, 1983a Overlap and cotranscription of the genes for the beta and epsilon subunits of tobacco chloroplast ATPase. Gene 24: 147- 155.

SHINOZAKI, K., C. YAMADA, N. TAKAHATA and M. SUGIURA, 1983b Molecular cloning and sequence analysis of the cyano- bacterial gene for the large subunit of ribulose-l,5-bisphos- phate carboxylase/oxygenase. Proc. Natl. Acad. Sci. USA 8 0

STEBBINS, G. L., 1981 Coevoluation of grasses and herbivores. Ann. MO. Bot. Card. 68: 75-86.

STEINMETZ, A. A., E. T. KREBBERS, Z. SCHWARZ, E. J. GUBBINS and L. BOGORAD, 1983 Nucleotide sequences of five maize chlo- roplast transfer RNA genes and their flanking regions. J. Biol. Chem. 258 5503-55 1 1 .

STEINMETZ, A. A., M. CASTROVIEJO, R. T. SAYRE and L. BOGORAD, 1986 Protein PSII-G. An additional component of photosys- tem I1 identified through its plastid gene in maize. J. Biol. Chem. 261: 2485-2488.

Genetic signals and nucleotide sequences in mes- senger RNA. pp. 349-399. In: Biological Regulation and Devel-

ROSENBERG, M. and D. COURT, 1979

USA 71: 1342-1346.

4050-4054.

STEITZ, J., 1979

opment, Vol. I, Edited by R. F. GOLDBERGER. Plenum Press, New York.

STORMO, G. D., T. D. SCHNEIDER and L. M. COLD, 1982 Characterization of translation initiation sites in E. coli. Nucleic Acids Res. 1 0 2971-2995.

Maize chloroplast DNA encodes a protein sequence homologous to the bacterial ribosome assembly protein S4. Nucleic Acids Res.

TAKAIWA, F. and M. SUGIURA, 1982 Nucleotide sequence of the 16s-23s spacer region in an rRNA gene cluster from tobacco chloroplast DNA. Nucleic Acids Res. 8: 2665-2676.

Sitedirected mutations in the initiator region of the bacteriophage QB coat protein and their effect on ribosome binding. J. Mol. Biol. 118: 533- 565.

TOHDOH, N. and M. SUGIURA, 1982 The complete nucleotide sequence of a 16s ribosomal RNA gene from tobacco chloro- plasts. Gene 17: 213-218.

Translation of mRNAs for subunits of chloroplast coupling factor 1 in spinach. Proc. Natl. Acad. Sci. USA 7 9 6304-6308.

WESTHOFF, P., N. NELSON, H. BUNEMANN and R. G. HERRMANN, 1981 Localization of genes for coupling factor subunits on the spinach plastid chromosome. Curr. Genet. 4: 109-120.

WESTHOFF, P., J. ALT, N. N E L ~ ~ N and R. G. HERRMANN, 1985 Genes and transcripts for the ATP synthase CFo sub- units I and I1 from spinach thylakoid membranes. Mol. Gen. Genet. 199: 290-299.

Organization and structure of chloroplast genes. Ann. Rev. Plant Physiol. 3 4

WHITFELD, P. R., G. ZURAWSKI and W. BOITOMLEY, 1983 Organization and structure of the genes for the beta and epsilon subunits of spinach and pea chloroplast ATPase. pp. 193-198. In: Structure and Function of Plant Genomes, Edited by 0. CIFFERI and L. DURE 111. Plenum Press, New York.

The barley chloroplast DNA atpBE, trnM2 and trnVl loci. Nucleic Acids Res. 1 2

ZURAWSKI, G., W. BOTTOMLEY and P. R. WHITFELD, 1982 Structures of the genes for the beta and epsilon subunits of spinach chloroplast ATPase indicate a dicistronic mRNA and an overlapping translation stop/” signal. Proc. Natl. Acad. Sci. USA 7 9 6260-6264.

ZURAWSKI, G., M. T. CLECC and A. H. D. BROWN, 1984 The nature of nucleotide sequence divergence between barley and maize chloroplast DNA. Genetics 106: 735-749.

Communicating editor: M. T. CLEGG

SUBRAMANIAN, A. R., A. STEINMETZand L. BOGORAD, 1983

11: 5277-5286.

TANIGUCHI, J. and C. WEISSMANN, 1978

WATANABE, A. and C. A. PRICE, 1982

WHITFELD, P. R. and W. BOTTOMLEY, 1983

279-310.

ZURAWSKI, G. and M. T. CLEGG, 1984

2549-2559.


Recommended