+ All Categories
Home > Documents > Nucleic Acids Research - NCBI

Nucleic Acids Research - NCBI

Date post: 27-Feb-2023
Category:
Upload: khangminh22
View: 0 times
Download: 0 times
Share this document with a friend
46
Volume 14, Supplement Nucleic Acids Research Compilation of small ribosomal subunit RNA sequences Erik Huysmans and Rupert De Wachter* Departement Biochemie, Universiteit Antwerpen (UIA), Universiteitsplein 1, B-2610 Antwerpen, Belgium INTRODUCTION In the present compilation of small ribosomal subunit RNA sequences (also named 16 S-like ribosomal RNA; further abbreviated srRNA) an attempt is made to list all such sequences published up to and including 1985. The se- quences are presented in the form of an alignment based on primary structure homology and containing information on the presumed secondary structure. The 57 sequences included are listed in Table 1 and belong to 14 eukaryotes, 9 ar- chaebacteria, 13 eubacteria, 4 plastids and 16 mitochondria. Only complete or virtually complete sequences were included. Since papers on srRNA structure are appearing at an increasingly rapid pace, some recent papers may have es- caped our attention or reached us too late to meet the publication deadline, for which we apologize to their authors. CONVENTIONS AND SYMBOLS USED IN THE SEQUENCE ALIGNMENT The 57 sequences appear in the alignment in the same order, and bearing the same number, as in Table 1. In addition to its number, each sequence is identified by the initials of the species name. The numbering at the top and the bottom applies to alignment positions, not to the nucleotides of an indi- vidual sequence. The sequences are divided in 5 blocks, comprising srRNAs from eukaryotes, archaebacteria, eubacteria, plastids and mitochondria. All the sequences are presented as RNA chains, i.e. using the symbols A, C, G and U, although most of them have been analyzed on the DNA level. A set of symbols for incompletely specified residues, listed in Table 2, has been recommended by the IUB nomenclature comittee (63,64). These symbols are only used to denote sequence heterogeneity detected at certain positions of srRNAs analyzed by RNA sequencing, in other words they denote heterogeneity due to gene polymorphism. In contrast, nucleotides incompletely identified during sequencing of a cloned gene are denoted by the symbol X. ©) I R L Press Limited, Oxford, England. r73
Transcript

Volume 14, Supplement Nucleic Acids Research

Compilation of small ribosomal subunit RNA sequences

Erik Huysmans and Rupert De Wachter*

Departement Biochemie, Universiteit Antwerpen (UIA), Universiteitsplein 1, B-2610 Antwerpen,Belgium

INTRODUCTION

In the present compilation of small ribosomal subunit RNA sequences

(also named 16 S-like ribosomal RNA; further abbreviated srRNA) an attempt is

made to list all such sequences published up to and including 1985. The se-

quences are presented in the form of an alignment based on primary structure

homology and containing information on the presumed secondary structure. The

57 sequences included are listed in Table 1 and belong to 14 eukaryotes, 9 ar-

chaebacteria, 13 eubacteria, 4 plastids and 16 mitochondria. Only complete or

virtually complete sequences were included. Since papers on srRNA structure

are appearing at an increasingly rapid pace, some recent papers may have es-

caped our attention or reached us too late to meet the publication deadline,

for which we apologize to their authors.

CONVENTIONS AND SYMBOLS USED IN THE SEQUENCE ALIGNMENT

The 57 sequences appear in the alignment in the same order, and bearing

the same number, as in Table 1. In addition to its number, each sequence is

identified by the initials of the species name. The numbering at the top and

the bottom applies to alignment positions, not to the nucleotides of an indi-

vidual sequence. The sequences are divided in 5 blocks, comprising srRNAs

from eukaryotes, archaebacteria, eubacteria, plastids and mitochondria.

All the sequences are presented as RNA chains, i.e. using the symbols A,

C, G and U, although most of them have been analyzed on the DNA level. A set

of symbols for incompletely specified residues, listed in Table 2, has been

recommended by the IUB nomenclature comittee (63,64). These symbols are onlyused to denote sequence heterogeneity detected at certain positions of srRNAs

analyzed by RNA sequencing, in other words they denote heterogeneity due to

gene polymorphism. In contrast, nucleotides incompletely identified duringsequencing of a cloned gene are denoted by the symbol X.

©) I R L Press Limited, Oxford, England. r73

.X@- 00(fO Cfr

C_

co0 co00 400o t - Ftot t-a0 0 0 00CI> 0 0 Zi"l

-_.-C.)

---- - - C.) 4.___C.) C.) C.)U;41;C_C

'.4'.- X'4

- - - - - 01 0 _1 C

0 0

000 00 0~~~~~~~ _ _.4440000004100

_-C£_-- IC 44

0000 000 0*** .i4.C X

0 0 4.4. 0, 0O>.OO 4 ooZ0 0OlOO£v

0000.1'4444Z X ZZe XZYZCCCX ZZ*ZZZg*Z nnnnXZeo;fuLoooC O-^- fZ o; O O O 3CC IC I-f

0 X t~~~

4. J C't 3

Q X K CA Y - 0(1. -O <<Q UQ > 0O N a.

- 00C .NC

U " - 0 - _ _ _

'.4 4.0 0 9 0 C. 0 <aa

IV000 _

_>D. 0 u

:1

0 :h00_ _ L.

C £40 d

0s1 G 1 i 04 ) A ) -

a0 1: * 0fi

b: -4.

~P4=0- " * @

* O C010(0

4.0.0.0X - C

4.4 Q O 4.4.£_ L4 N

I0£4)w>C4b 144.>41s 41.. '.K 0D ;4

_

.-I NCN(. 0 L-N C aO - N C '_ N O V 4 <0 t- 0 0> -O -" -t _ -4 . . . . N C> ¢9 CQ C

Nucleic Acids Research

0

0

-4

0

0

0

014

0

E040(-a

c

bo0

0

0ebec

c

S.

0I..

0

0

-4

-44.0

041.

c.cI4)

. -

-W

IV0ci

r.

4'

Et

r74

Nucleic Acids Research

.yX _ O.c 0

* 00cm OtO 00 O NC O 00 N

lCo 4olt NN0N00(1000"c~ 0 1 o "

X~cO O100gO IOIOCIO NNN N 0JU)01 01010000 -U)cm CY _ 0 0 o )o

to

0 0UuI 0 s 0 0a0 -W 0Dt 0 tC D<00 <)v x --Dbe N L aN t) 0>0. o 00

4000 ~~~~~~-0 . "0 0-'0

. >44J0.4.J -.)C. -

0~~~.00L 00 .--()I00

C'~~~~C C

L **0.0 -C1..- 0_ __

V 0 ~0~00 0 000 .0UC1 - L

00~~~~~~~~~~~~~~~~~~ ~~~~~~ C'L.04.JIL0.0 0.0-

11 C, C* - - - - - - - - 4jC>4i 4i J CL 0. C Q. A_ C

9o o o o 00C 0pw _ o 0 0 0 0 Q _

0.0 i 6J K llJ EZLo 4J AdoZe J 4_ o

= a" t=X~~~~~~~00CCaa==0 --i4 i i4 iauu 4- J" L 0 0 . aa0. .A.0 0 0 00 4 4 04.0w0 0..0.0.__0_

V~~~004J.0~~~~~~.0.0. ~ ~ 4

44 444".~( 0.00 00 00 00000000ZZ _ . .

4@*-h*WWw w 4 w _ X _ _ __C#) ^ cbe

000.0.0 00V0'00.0~~~~~~~~~~~~~cCO 0 0 0 0 E

0) 0~~~~~~~~~~~~~~) L_-_ _______ZWWPcw:zC - _ 60.0_ H

CD to - . 0 - 0.0A t0Coo ttL.bL a 0c 0L-

C%.0.) - -000.0C) a cm~~0 0 0 C.)

0) ~ ~ 0 01b

00L) a 8 L. $1.0) 4 0 .0". '- .c*01 0 C- i 0) 0 - N N

--0)00- 0. 41 - N ~~~~~0)0)

o 0+ + + 000 0

0) - - 0.41(.1a) 0000- . (.100 )100 )10010>

E 0L 0CL08 0. to00g 0 aC O0 0._(100 0.su a a0. -0. X0 aL).4. 0 o04o >o . __ 4. a

.4 a 2 L, Adc: C 4)u cZ sz* u ) c- ip c %40 6) a.n u

0 ~ ~C ,YO044z to l - O 0 -4L

644i0 0- u O. - CoL 3 c.- oa

al40. 0. 0a.04 0 0 0 0 a ..J.0 a 000 0a k

0iS . 0

0 ~ ~ * Eu - 0 X

00 0- O 0

.0 Ka N0 0. - -O - 000 A 0 000.6. 0 0cL0NSW 0> 0 -. 0 Da 0 aL S._SW -_ -D0.0 A 0 0 S.00N

0 0. 0N 4'N0N CCV u00C00 . 0 0. 00 -_.M_ 1_, .10_-.0.

0 )0 600c)-X K olu0; 000 010 004' 4.* 0.QXZ

0 t.4100.)C uoo-u01u aC)O L1100. C). to CDOtO-

cm1.0 0 0 CO) e) O)o 0 0coC-.0 IV 0 lo00 . L)0 L)e0 00

0U oo o O 0. 10_-00

1a.41" 0.00( 0- 040(101C0~'C 01.((0 4-6-_*.U410 0(0.._ -0

.0 _

*00)000.._00N0)401 C0-*00 N0140)0t.400N040)0000. NN N IO 0 0 01)4-444444440)0Ja)0)0)0)0D))O0)QQC :

r75

Nucleic Acids Research

Footnotes to Table 1a) This number precedes and follows the sequence on each alignment page and is

also the reference number of the paper from which the sequence was taken.b) This column contains the following data, if specified by the authors.

- Strain name for laboratory animals, (cultivated) variety for plants, cul-ture collection and strain number in the case of microorganisms.

- Tissue from which the DNA used for clone construction, or the RNA usedfor sequencing, was extracted in the case of differentiated organisms.

- Ribosomal RNA operon to which belongs the cloned srRNA gene in the caseof bacteria.

c) The taxonomic position is described according to the following references(68) for metazoa and protozoa, (69) for Ascomycota, (70) for algae and phy-toflagellates, (71) for higher plants, (72) and (73) for archaebacteria,(74) for eubacteria. Taxon disignations corresponding to an establishedtaxonomic level are followed by the abbreviation Ph. (phylum), Sph. (sub-phylum), Cl. (classis), or 0. (ordo).

d) The chain length is not mentioned in case the location of the srRNA terminion the sequenced DNA is uncertain.

e) Two different sequences have been reported for rat srRNA, both are listed.f) The primary structure was derived by RNA sequencing.g) Partial data on modified nucleosides are mentioned in the paper reporting

this sequence or other papers cited therein.h) A sequence missing one nucleotide (A*05) was previously reported in (58),

where complete data on nucleoside modification can be found.i) This paper contains a sequence corrected with respect to the original re-

port (59).j) The termini were allocated by comparison with a sequence from a related

species.k) A slightly different structure was previously determined by RNA sequencing

(60) and a sequence missing one nucleotide (Ug,) of the rrnB srRNA (61)was corrected in (27). Complete data on nucleoside modification appear in(60) and (61).

1) Approximately 15 nucleotides adjacent to the 5'-terminus are missing be-cause they were not comprised in the cloned sequence.

m) A somewhat different sequence was reported previously (62) without locali-zation of the termini.

n) The location of the termini is uncertain.o) The location of the termini is not according to the original paper (53) but

according to (54).

Table 2. Symbols denotinf sequence heterogeneity

Symbol Nucleotides Dresent Symbol Nucleotides Dresent

R A and G H A, U, and CY U and C B G, U, and CW A and U V A, G, and CS G and C D A, G, and UM A and C N A, G, U, and CK G and U

These symbols are used only to denote positional heterogeneity found by RNAsequencing (see text).

r76

Nucleic Acids Research

Posttransscriptional nucleoside modifications are not mentioned in the

alignment, mainly because there exist no conventional one-letter abbreviations

for all the modified residues encountered in srRNA. Moreover, data on modi-

fied residues exist only for a few sequences. References to papers containing

such data are mentioned in Table 1.

The alignment of sequences for optimal homology has been obtained without

the aid of a computer program. It is based on the existence of conserved se-

quence blocks, and on homology in secondary structure (see below). Certain

sections of several sitochondrial srRNA sequences are printed in more con-

densed format, without a space between the characters. These are sections

where it is especially difficult to discern homology with srRNAs from other

species. It has been attempted, however, to optimize homology between cor-

responding condensed-format sections in species that are adjacent in the

alignment. There are other areas where sequence alignment is rather arbitrary,

often corresponding to those loops in the secondary structure models (see be-

low) for which loop length is variable from one species to the other. In

general, sequence alignment is much more questionable in the variable areas

than in the conserved areas of the secondary structure models. The areas of

dubious homology may shrink in the future by the application of alignment

algorithms or/and availability of additional sequences for comparison. For

the moment being, their existence should be taken into account in applications

of the data for studies in molecular evolution.

SECONDARY STRUCTURE

The boxes superimposed on the sequence alignment enclose segments pre-

sumedly involved in base pairing. The boxes enclosing the complementary se-

quences forming helix 1 are numbered 1 and 1', and so on. Internal loops and

bulges are indicated by nested boxes. Bases that may belong to pairs other

than G.C, A-U or G*U (non-standard-, non-canonical or odd base pairs) are put

in parentheses. Two sets of helix numbers appear in the alignment. One set

Is listed under the uppermost block consisting of eukaryotic srRNA sequences,

and applies to these structures. A second set of numbers is listed between

the eubacterial and the plastidial srRNA sequence blocks and applies to all

prokaryotic structures.

In the case of prokaryotic srRNAs, the model proposed by Noller, Woese

and collaborators (65,66) is followed. Bacterial and chloroplast srRNAs fit

well into this model, but such variability is observed in the structure of

mitochondrial srRNAs, which show deletions as well as insertions of secondary

r77

Nucleic Acids Research

structure areas with respect to the bacterial model. Insertions prevail in

plant mitochondrial srRNAs (56) which have chain lengths approaching 2000 nu-

cleotides, while deletions prevail in animal mitochondrial srRNAs (46,47)

which have chain lengths of less than 1000 nucleotides. Among the srRNAs of

fungal and protist mitochondria, both insertions and deletions account for

structural variability with respect to the bacterial secondary structure

model. Eukaryotic (cytoplasmic) srRNAs are fitted into a variant model (7)

related to those proposed for Xenopus laevis (6) and Dictyostelium discoideum

(67) srRNAs. This model was shown to be applicable to srRNAs from 6 eukaryotes

in a previous compilation (7) of srRNA sequences, and proved to remain appli-

cable to all of the 14 eukaryotic structures presently published.

The prokaryotic and the eukaryotic srRNA models applied to the sequence

alignment are illustrated in Fig. 1 and 2. The two models are for a large

part homologous, but there are two kinds of differences, both localized in

some of the areas of variable primary and secondary structure that are labeled

Vl to V8. The first kind of difference is found in areas V2 and V4, where

eukaryotic srRNAs possess extra helices with respect to prokaryotic srRNAs.

The second kind of differences resides in area V5, which shows variability of

a different type. Here, it is not the absence or presence of a helix that

makes the difference, but the topology in which the helices are folded, in

other words the order of occurrence of complementary sequences when the chain

is scanned from 5'- to 3'-end. It should be noted that eukaryotic srRNA se-

quences can also be fitted into a model having the prokaryotic topology in

area V5, although at the cost of forming a less stable structure. A recent

review on srRNA structure (66) lists such models for five eukaryotic srRNAs,

but base pairing in the helix corresponding to P18 (our numbering system) is

questionable. Conversely, certain prokaryotic srRNAs can be fitted into a

eukaryotic-type model, which again does not seem to fit them as nicely as the

prokaryotic model, but we have not investigated this possibility for all

presently known prokaryotic srRNAs. The possibility that area V5 may undergo

a structural switch, or may be a case of evolutionary divergence in secondary

structure, has been discussed (7).

The helix numbering system (7) adopted in the alignment and in Figures 1

and 2 attributes a separate number to each helical area, which may consist of

an uninterrupted helix, or a set of helix segments connected by bulges or in-

terior loops. Helices (or helical areas) bear a different number only when

separated by a multibranched loop (e.g. helices 7 and 8), or when separated bya single-stranded segment which does not form a loop (e.g. helices 23 and 39).

r78

Nucleic Acids Research

Numbers are attributed in the order of occurrence of the 5'-proximal strand of

the helix in the sequence. Helices that are common to the prokaryotic and

eukaryotic models bear a single number from 1 to 40. In areas where the two

models differ in pairing scheme, helices are numbered in the format Pa or Pa-b

(prokaryotic srRNAs) and Ea or Ea-b (eukaryotic srRNAs) where a is the number

of the preceding common helix and b is a serial number. As an example, E9-1

and E9-2 are eukaryote-specific helices encountered after common helix 9 when

scanning sequences from the 5'-end. The common helices are not universal in

the sense that some are missing in certain mitochondrial srRNAs, but they are

present in all hitherto examined eukaryotic, archaebacterial, eubacterial and

plastidial srRNAs. Future refinements of the secondary structure models may

bring to light additional common helices and necessitate a revision of the

numbering system.

In sorting out the base pairing schemes superimposed on the sequence

alignment, we were often aided by the availability of secondary structure

models proposed in the paper reporting the srRNA sequence or in (66). How-

ever, we did not necessarily adopt such models entirely. There are two main

differences between the secondary structures proposed here and a set of 20

srRNA secondary structure models published by Gutell et al. (66). In area V5,

we assume a different topology in eukaryotes and prokaryotes whereas they

assume the same folding. Conversely, in area V3 we assume the presence of the

same topology in eukaryotes and prokaryotes while they assume a different one.

In the case of trypanosome mitochondrial srRNAs (52-54), it is especially dif-

ficult to propose a meaningful alignment and secondary structure model. Not

only are the sequences exceptionally short, but they are also very A*U rich

and hence very monotonous. Only a limited set of segments of these sequences

is aligned with other srRNA sequences according to the proposal of Sloof et

al. (52), the remainder being printed in condensed format.

AVAILABILITY AND ACCURACY OF THE DATA

The sequences presented here were entered in our database and checked to

the best of our ability for errors. They were not checked against a central

data base such as EMBL, one reason being that many of the sequences are pub-

lished very recently and there is some delay before they appear in central

data bases. We intend to carry out such checks in the future and to updatethe compilation regularly. Authors are invited to inform us about errors and

omissions of published sequences, and to send reprints of their newly pub-lished sequences.

r79

Nucleic Acids Research

34

V7

13

40I I

10 nucleotides

V8

39

V2

Fig. 1. Secondary structure model applied to prokaryotic srRNAs.Helices, defined as described in the text, are numbered in the orderof occurrence from the 5'- to the 3'-terminus, symbolized respecti-vely by a dot and an arrowhead. Forty helices present in eukaryotic*as well as prokaryotic srRNAs (with the exception of certain mito-chondrial srRNAs) are hitherto distinguished. Additional helicesspecific for prokaryotic srRNAs bear a number preceded by P. A prob-able interaction between the sequence preceding helix 1 and a segmentof the multibranched loop separating helices 1 and 2 is not numbered.Bold lines are used for areas relatively conserved in primary andsecondary structure. Thin lines are used for variable areas, numberedVI to V8.

r80

Nucleic Acids Research

32 30 33

20

34

V721

V4 27

25

E19-

36

16

15

11

10 nucleotides

V839

E9-2

Fig. 2. Secondary structure model applied to eukaryotic srRNAs.Conventions and numbering system are as in Fig. 1, except that heli-ces specific for eukaryotic srRNAs bear a number preceded by E.

The srRNA sequence compilation is available to interested scientists in

computer-readable form, either on magnetic tape or on floppy disks. An up-

dated version of a file containing 5 S rRNA sequences, in the format used in

previous publications (75) is also available. It is our hope that these com-

pilations will be an aid for studies on rRNA structure and molecular evolu-

tion.

r81

Nucleic Acids Research

I3? 3? ~~~~~~~~~~~~~? 3? IA? I'~~~~~~~~~~~~~~?INS.------------------------U A C C US66UU 6A U C CUS6C C A 6U A 6 IA UA U6CU-US6U CU CA AA 6A U UA2H....------- ------U A CC U6G6U U 6A U CCUSCC ASUA 6 -(C)A UA U6CU -US6U C U CA AAG6A U UA3 R.n------------------------ JA CC U6SU U6A U C CU 6C C ASU A IA UAU6C U -U 6U C UC AAAS6A UU AANRn.------------------------ UA CC US66UU 6A U CC UG6C C A 6UA6-CA UA U 6C U -US6UC UCA A A 6AUU ASUe.c------------------------I A C CU 6 6UUS6AU C CUS6C C A6U A 6-(C)A UA U6CU -U6U CU C AAA 6A UU A6X.1.------------------------IJ A C CUS66U U A UC CUS6C CA 6U A6-(C)A UA U6C U -U U C UCA AAS6A UU A7 A.s.------------------------IJ)A C CU6 6U U 6A UC CU 6C C A U A 6(CA UAU 6C U - U6U C UC AAA 6A UU AN S.c -------------U A UC U6 6U U 6A U C CUSCC A 6UA 6U A U AU 6C U -U 6U U CA AA GA U UA9Ud.------------------------U AA C U6 6UU 6A UC CU 6C CA 6U A 6UJC A UAUS6C U -US6U C U CA AAS6A UU A0OOn.------------------------ A A UC U 6U UG6A U CC U 6C CA 6UAS6UICIA U AU 6C U -UG6UC UC AAA 6A CU A11 S.p -------------A A UC U6 6U U 6AU C CUSC CA6U A 6U CjA UAU 6C U -U6 UC U CA AA 6A C UA12T.t.------------------------ AA C C U66U US6AU CC U 6C CA 6UMUA-I)A UAUS6C U -U6UC U UA AA UA U UA131n.s------------------------UA CC U 6SU U 6A UC C U 6CC A GA6U[EA UA U 6C U -U 6UCUC AA A 6A U UA14 ZLa.------------------------IJ A CC US66U U6AU CC U 6CCA6U AS6UICA UA U6C U -UG6UC UCA AAS6 A UVUA15 6... ------------- UA C UU66U U 6AUCC U 6C C AS6UA 6UI.A UA U6C U -U UC UCA AA 6A UU A

23317NH.h.------------------------ AU UIC CS66UIUG6A U CC US6C C 6AS66C C AU- US6C U A UC 6S6ASU CCS6A UU UIBNH.v------------------------ AU UIC CS66UIU 6SAU C C US6C CS6 GA 66U C A U - US6 CU A U US6666SUC C 6 A U U U19Hi-----N-------- AU UIC CS66UIU6A U C C US6C C6S6AS66C U AU -US6C U AUC6666U CCS6A UU C

23S.s.------------------------AVUUC C 66UU 6AU CCU6C C6S6A C C C 6AC - CS6C U A U CS6S66S6U AS6 AU A

26P.t.------------------CSA A C UAU AS6AS6U U US6A U CCUSC U C AS6AU US6A A CS6CUS66CS66CAU6C U- -U U27 E.e, ---------- A AA U US6A ASAS6UU US6AU C AU66C UC AS6AU US6AA CS6C US66CS6SC AS66C C- -U A28 P.v.------------------- UAAU US6A AS6AS UUUSA UC AUS66C U C AS6AU US6A ACS6C US66CSG6C AS66C C- -U A

32N.s-------------,-- UAA A AU - 6 A SAS6U U US6AUCCU66C UC AS66AU0A AA CS6C US66C 66CA U 6CC - -U A33 It.? .---------UUACA ACS6A AS6AS6U UUS6A U CCU66C U CA66A U 6A A CS6CUAS6 CU AC AS66CUC - - UIA34 F.h----- . IJU A CA A US66AS6A6U UUS6AUC C US66C UC A6S6A US6A A CS6C UAS6C66CA66C C - - UIA35 D.d.------------------US6A A C US66A 6AS6U0US6A U UC U6S6C U CAS6AU US6A A CS6C US66CS66CS6US6C U- - UIA36 .x.-------------------C A A U US66AS6AS6UU US6A UC C US66C UC A 6AAC6A A CS6C US66CS6CS6U 6CC - - UIA37A.n..------N------A AAUAAU66A UU6SA uccUCCU6S;CUCASSA~US6A A CS6CU66C 6 6CS6US6C U - - IA

38 E.N---. . ----US6S6A A AUS6A C 6AGU IJA U C U C CA 6S6U1AAC 6SCUS CS66U A US6C U- - IA

402... ------ C UC AUS66A6SAS6U UICS6AIU C CUI6C U CIAS66A UI6IA A CS6C U66CS66C A US6CIU-- I A

42H.s.------------------------A AUA 6 6 U USU66UC C UA 6CC U UUC UA U U-(A)6 C UCUUA 6 UA -A 6A- - UIU43Nn.&------------------------- AAA 696U UU66SU C C U6B6CCUUAUA A UU - A(A)U UA6SA6B6UA -ASA A - - UIU461X.1.------------------------ UA AIAS6U UIU6G6U C C UA 6C C U - A AUU - CA --C(A)6U 6-AUA- -A A

A8NAn.-------------------AA AA UU U UG6A UU U6A US6A U6G6CUC 6 A UU[AA C6C U6UC C AASUAC U - - US6

50 P.P. UUA C 66A U 6U AA66AUUCC UU U U UAGA66A6UB6UGAUCUA 6C UU U AUIU A A CG6CUA AUCB66U 6CA - - UIU5iP.t. A CU US66A US6U A A 6A AUUCU U U UUUAA 6 6 U 6A A6A uU UUA A C96CUAA UC AAAU6 C A- -U U

55TZ.*-----------A AUUCUNSUC [1U AU C US6S6C UC A6A A 6 A A C6CUA 6A AU 6 C U - - UNA556L-----------.a U C A UAS6U CAAA A U CU6A6U U UG6A U CCU 6 6C UCAS6AAOJA ACSC U ABC(AU)ANUG6C U- -IUA57UO.s ------AU C A U A6UC NA A AGA6 6NSASU USU6AUC C U6I6C UC A 6A AG A ACS6C UA 6 C(U)A UA U 6CU - - IA

I2 714 4

r 82

Nucleic Acids Research

A6 C C U 6 C AU 6 U C U AA6 U AC ----------6 C AC 6 - H.S. IA6 C C AU 6 C AU 6 U C U A A6 U AC - - - - - - - - - - 6 C AC 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - N.M. 2A6 C CA^U 6 C AU 6 U C U A A6 U A C - - - - - - - - - - 6 C AC 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R.nr. 3A6 C C AU 6 C AU 6 U C U A A6 U AC - - - - - - - - - - 6 C AC 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R.n. 4A6 C C AU 6 C AU 6 U C U A A6 U A C - - - - - - - - - - 6 C AC 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - n.c. 5A6 C C A U 6 C A C 6 U 6 U A A6 U A C - - - - - - - - - - 6 C AC 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V.I. 6A 6 C C A U 6 C A U 6 U C U A ABU A C - - - - - - - - - - A A 6 C C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A.S. 7

6CCU6 C ^ U 6U-CU-

6 U U : A C A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S.c. 8A 6 C C A|U 6 C A U 6 U C U A A 6 U A U - - - - - - - - - A A U U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - D.d. 9A 6 C C AU 6 C A U 6 U C U A A 6 U A U - - - - - - - - - - A A 6 U 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - n.n. 10A 6 C C AIU 6 C A U 6 U C U A A 6 U A U - - - - - - - - - - A A A U 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S.P. 11A C C CA^ U 6 C A U 6 U 6 C C A 6 U U C- - - - - - - - - - A|- U A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - T.t. 12A 6 C C A U 6 C A U G U 6 C A 6 UM^U - - - - - - - - - - 6 A A C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - n.S. 13A 6 C CA^ U 6 C A U 6 U 6 C A A 6 UW^U - - - - - - - - - - 6 A A C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Z. 0. 14A 6 C C A U 6 C A U 6 U 6 U A A 6 U(A)U - - - - - - - - - - 6 A A C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 6.9. 1S

4 5

6 U l 6 U 6 C 6 - - - - - - - - - - - - 6 6 U6 UU61U 6C6- -- - - - - -- -- 66U6 UU 6C AC6-- - - -- - - ----A6U6 UU 6U AC6- - -- - - - ---- -66Ui UC 6BA 6A - 66U6UC Uj-A U- - - - - - - - - - - - - 66U6 6UC 6 AC6---- -- - ----AUCCB UCC UU CA- - - --- - - --CUCCC6UCB6C 6C6- -- - - - - - - -CCC66

U U- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H.c. 16U U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 1.h.I7U C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 1.v.15U C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - N.D. 196 C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - .h. 20UU9-Nv. 21UU9-Nt. 226 6 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - --S.s. 2366-T.t. 24

A CACA USCA A 6 UcS C6C- CC6C- A.t. 25A C A C A U C A A 6 U C 6 A A C 66UAAC - - - - A 6 6 - U C U U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -P.-t. 26A C A C A U 6 C A A 6 U C 6 A A C 6 6 U A A C A 6 6 A A 6 A - A 6 C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - E.c. 27A C AC A U 6 CA A6 UC 6 A6 C66 U AA C A6 6 A 6 A A - A 6 C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - P.v. 29A U ACA U 6 C A A 6 U C 6 A 6 C 6 6 A C (-- 6 U)66- A 6 C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - B.s. 29A C A C A U 6 C AA6 U C6AAC 66 C - -6 6 U U U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - N.c. 30A U A C A U 6CAA6 U C 6 A A C6- - - - - - - - 6 6 6 6 - U 6 C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - .c. 31A U A C A U 6 CAAUC6A A C 6 - - - - - - - - - 6 6 6 - U 6 C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H.S. 32A C A C A U 6 C A A 6 U C 6 A 6 6 A6 UCC A 6 6 A A 6 A A A 6Ic U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A.f. 33

A C A C A U 6 C A A 6 U C 6 A A C 6 C 6 A A -_ - - - A 6 6 - 6E cu - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - n.d 35A C A C A U 6 C A A 6 U IC 6 A 6 c 6 C 6 A A u- - - - - A 6 - 66G C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M.x. 36A C A C UAU 6 C A A 6 U C G A A C 61~- - - - - - - - - 6 6 C - UCU U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A.n. 37

4 5

A C A C W U 6 C A A 6 U U C .WE-L A A U U A C - - - - - - t! A 6 C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - E.S. 38A C A C A U 6 C A A 6 U C 6 A A C 6 A6 C A - - - - - - - - - A A 6 C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C.r. 39A C A C A U G C A A 6 U C 6 A A C 6 6 A A - - - - - - - - - 6 U 6 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - La|. 40A C A C A U 6 CAA 6C U C AC G 6 6 A A - - - - - - - - - 6 U G 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - lI.t. 41

A C A C _ U G C A A G C A U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H.s 42A C A C A U 6 C A A A C C U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M.| 43A C A C A U 6 C A A A C A U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R.n. 44A C A C A U 6 CI A 6 C A U - - - - - - - - - 0 . 4

LU u A U A U 6 U A A A U U U U U 6 U 6 U 6 A A U U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - D.s. 47A C A C A U 6 C U A A U C 61A C.YA C U A(A)U U A 6 u U - U A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A.n. 4UA C 6 C A UMI)C 6 A 6 U AU A CI6U U U A U U A U U 6 A U A -A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S.c. 49A C A C AC 6 C A 6 U U UUA U U U U AFA CI 6 A A C C U U C U U U U U U 6 C U U A A A - - - - - - - - - - - - - - - - - - - - - - - P.P. SOA C A C A C 6 C A A 6 U U U U A U U U U A|C A CIA A 6 A A C C U U C U U C U U U 6 C U U A A A - - - - - - - - - - - - - - - - - - - - - - - P.t. 51-_ - - - - S2

-_ -_-_-_-------------.-t-..t. 5ACA C|AU 6 C|A A6 C AC 6|U U 6 U U U U C 66A -6 C---.-A6UU69JCUC CU 6C------ IC . 55A C A C 6CAA6 U

C6 U U 6U U U UC 6|66C66-A6-66C-UcT.a.56

A C A C A U 6 C|A A6 U C 61 AAC 6|U U 6 U U U U C 6 61 -6 A61CU666CAISA lttU"tACUNCUICCtmit6UU6UC6CCC- ---- -------_AAA6t Ct6 O.s. 574 5194 10 11s 26 131 1

"ABC-Ci. 39~~~~~~~~~~~~~~~~~~~~~

r83

A 6 C C WU 6 c U AA 6 C C A U 6 C U AA6CC CIA U6 CU AA6CC AU6C U AA6CC A U 6 C BAA6BC C A U6 C 6AA 6CC A U6CAAA 6CC A U66 6AA6CCAjU96C6A

-6CC 6 - 6 U A C A6 U 6A A A C U 6 C 6 A AU 66 C U C A U U A A A U C A6 6 USC

------- CC6-------6 C6-6--C6C6ACU 66 C UC AUUAAAUC A6 UUAU6 6UUCCUUU

-- - - - -- - -- -- - - -6CC 6--- - 6UAC A6UAAC U6C 6AU66 C UC AUUAA AU C A6 UUAU6 6UUCC 0UUU- - - - - - - - - - - - - - -6CCG- - - - 6UAC A6U6AAACU6C6AAU66 CUC AUUAAAU CA6 UUAU6 6UCCCUUU______- CC6- - - -600C 060600C 6C600066C SCUC OU AOSCOS 0^6 6UUC COOSU- - - - - - - - - - - - - - - 6 C C 6 - - - - 6 U A C A 6 U 6 A A A C U 6 C 6 A A U 6 6 C U C A U U A A A U C A 6M O 6 UfU C CUU Uv- - - - - - - - - - - - - - - C CC A - -- - 6 U - 6 66 C 6 A A A C C 6 C 6 A A U 66 C U C A A U A A A U C A 6(U)U AU66 U CCU U A

-- - - - - - - - - - -C-A - - - - U A C A 6U A A COUC6C6 A U 6 6 C UC

05 S C NOC 06U6U0 C 6U0U00 UC U U

_--- ---- - -C UU- -- - 6UAC 6AU6AAACU6C6AAC66 CCUC AUUACAA CAU6AUAAACAAAU U - AU AC A6 U 6AAAC U 6C 6AAU 66 C UC AU U AA C A6 UU U AU A U U AU U--_ _ UU----AUAC A6U6AAACU6C6AAU66 CUC AUSUAAAA CA6 UUAU A6 UAU

- U 6 A C A6 C 6 A A C U6 C 6 A U66C UC AU U A A C 6 U U A U A 6 U U U A UU00A U - C 6(A)A CU 6U 6 ACU 6 C 6 U 66 C U C U U A UC 6 U U U 6 U U U6U U

-- C6()ACU 6U 6A A A C U 6 C 6A 0 A U 66 C UCA U U A A A UC IU U U 6 U U U 6 U U UA A U U - CAM67AC U 6 U 6 A A A C U 6 C 6 A U 66 C U CAU U A A A U C AG7UU A A 6 U U U 6 U

59 6 7

__- --A6 ACC-- - ---------C6CA6C___- - --A6 ACC---- 6CA6A6C I

AU C U- C 6 U 66 CA6 C C C 6 U 6 C

- - - - - - - - - A A JA C - - - - - - - - - - - - C U C6.6 CC 6 6 C C - U 66 C

- - - - - - - C 6 666 U - - - - - - - - - - - C 6 U606 C I

U 666 6 - U 6 U6.6 C

- - - - - - - - - - - - - - - C 6 C C 666 - - - - - - - - - - C 6 C6.6C

C 6- - - - -U 6

-C G.~~~~~~~US6-05~~~~~~~

u C6S

66 - - - - - - - - - - - 6 6 U 66 C6AU-- -- - C U6 A6 U66CCUCUUU--16 CU6 C6A6 U66CC UUUCUU- C U6 C6A6 C66CC U C C - - - - -(6) U 6 U U A 6 C 66C0U660----6CCU U A6 U66CCACCC .---- --.--- A6 66CC A C C U C - - - - - - - - - 6 U 66CC U0UUC UUUC U 6 6AC6 C C 6 6CUCCUU- - - -060060 66tUCCU--- -- 6U060000666CCCCU---- - UA6UA6A6 C66C6 6 C U - - - - - - - - - - 6U 66 C

5,

66 A A A 6 CCA 6 U A A C A C 6 U 66 C CA6 CACC66 A A A 6 C U C|A 6 U A A |CC 6 U 6 C C|A A16 C|U A C|C C6AAAA6 CUCIA6UAAC AC6U66CCIAOIACIUCACCAA UA 6 CU C C 6UAAC AC 6U66UC A010A CIUACC CUACU6 CUCIA 60U0 C AC 6U6 6ACAAUSCIUSC6CCi6AC66 CU1C]AUU C C6U66UU5A[]CU[U1]ACCOUAC66 CUCA6U0CAAC6A666AU AACC UlA ACC66AC66 CU6 A6UAAC AC6U66COU AACCU ACCCC A C 66 CA 6 U A A C A|C 6 U A C C C C CU C C

16AC666 A0 C 6 6i6 A C 6 6B6 A C 6 66 A C 6 6

B6 A C 6 66 A C 6 66BA C 6 66CAC66BCAC666CAC666 C A C 6 666AC66

606

6U6606

6U6606

606

6U6605

06U6U6606

6U66U6

6U6

060U0 C 6C5U666-AAUCIiTI C 6060U0 SUCAUC 66- AAC66.MJ6 CICU060U0U6UCU666- A AACIU6CICU

A 6UAAU6uAU6666AC6 CCA 6 U A A C A C 6 U 6 6 6 U A A C C U 6(C)C UA 6 U0A C C 6 6 6 A C A A C Cj C 6

A 6 U0 A C C 6U0 AU CU0 A - CUS (C)C UA6uAACAC6uAUCJACCO3CCuA 6 U0 A C C 6U 0U CAC A C CUS6(C)C CC 6 U A A C6 C 6 A 6 C A A C C A(C)CIC6O0CI6C6U0660O00C[IIC^C)CUC6UAAC C6OUS6S6AAAUCIU6CICUA 6UAAC 6C6U6A6 U-CIU6CICJU

7

36 E..-- - - - - - - - - - - - - - - - - - - - 6 l1 66C66C66 6 6 61 U C 6C

39 C.?-- - - - - - -U U 6 UUA 6lU 6 6 C 6 A CS 616 U 6IC 6 U A A C 6C U A A61--ICCC CIC U40 2... - - - - - - - - - - - - - - - U - - - - - - - - - 6 U U U C CIA 6lU 6 6 Cl6 A A C 6 616 U 6A 6 U A A Cl6 C 6 U 61- A AC C UCl6CI C41 .t. - - - - - - - - - - - - - - - U - - - - - - - - - -16 U U U C CIA6U 6 6 C6 6 A C 6 66U 6 U A C6 C 6 U 6AAC C U6SC CC

42 H.s. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C C C 6 U U C C AUA 6 U U C A C C C U C U A A A U C A C - - - -

43 9..- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C A U A 6 A C C 616 U 61U A A A A U C C C U U A A A C A U U U - - - -U4 Ln - - -C C AU AAAC C 616 8U 6 IU A A A A U AIA 6 AIU L!6C45 M -- - - - - - - - - - - - - - - - - - - - - - - - - - - - C U A C A C C C C Ag A1 6 - A A UI6U CC CC CC U UC U -(A)S 6 U Ul - ---46 X.1.473.;.- X Tu TU1ju46 l----------- - - - - - - - - - - - - - - - - - - - --ICU(6 AU66U6AC IUI6U A 6U* U C CUCL48 A.n

- -- - -- - - - - - - - - -;-- - - - - - - AAC *UA A UA 6U A6 U 6 6 U 6 U ACCA66U6A6U A A A A 6 AU AAU UUa6 6 C

50 S.r. - - - - - - - - - - - - --

CA *UA UU

A AA Ur U 6U 6 6 Ul6 U A A AU 6I U 6IC 6 U A A U IU ^UNUUA UUM6FUAV R50P.r.---------A6CAA6A UUUUAAAUA000C6C0O6UAU69C600C 0UAAAUAU6ucOu0uluIuu(u

51 P.t - - - - - - - - - - - - - A U 6 C A A 6 A U C U A 6LU U U U A A A A U A 6 Cl6 U A U U 61 U6C 6 U A A A AU A Ul61U C U U U U UlA UIU C(6)52 C.?..53 T.b - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

54 L.t - - - - - - - - - - - - -

55Z. D. AC66M- IUU66 I6A6 16CA 6I6 C 6AAC6 6 MGC 6UAAC 6C6U 6 CI6- UC U656 T.&. _ I ccU U 6 A 6 ik A C A A 6U6 6 Cl6 A A C 6 616 U SIC 6 U A A C|6 C 6 U 6 6 6|- u C|CU 6 C|C 657 0.s. _ 6 u 66MG ------Oh"* UU6A6 AAC AAU66CC6 A C6 66USC6U U AC6C6U6 66 UCIU6C C6

1SW 16 170 ' 188 ' 196 201 216

r84

Nucleic Acids Research

I H.s.2 NM.,3 R.n.4 R.n.5 0.c.6 X.l.7 A.s.8 S.c.9 D.d.

l0 O.n.11 S.P12 T.t.13 0.s.14 L.15 6.&.

16 N.c.17 H.h.18 H.v.19 H.m.20 N.h.21 N.v.22 N..23 S.s.24 T.t.

25 A.t.26 P.t.27 E.c.28 P.v.29 D.s.30 H.c.31 N.C.32 N.s.33 Lf.34 F.h.35 D.d.36 N.x.37 A.n.

Nucleic Acids Research

6GCGUCCUCUUC6cuc6ccCuC U C uAUUS6(6)- AUAAC USU6S6UAA U -UCU A6A6C U A A U- - - C AU 6 CC6AC 6 66 C H.s. 166U CG6CU C 6CU C C UCUCC UAC UUS(6)- A UA ACUS6US6S6U AAU- U C U A 6AG6C UAAU(- - - CA U 6CC 6SAC6S66C N... 26 - U CS6CU CS6CUCC UCU C C UA CUU66-AU AAC US6U66UAAU- UC U A 6AS6CU AAU(A- - - C AU 6CCS6ACS666C R.n. 366U C 6C UC 6C UC CUCUCC U A C U U - AUAACUSU6SU66AAU -UCU A6A 6C U A AU- - - CAU 6CC 6AC 666C R.s 466UCSC U CS6C U CC UCU C CUACUU(6)-AU AACUUSU66VAAU -UCUA6 A6CU AA U(A)- - - CA UG6CC 6SAC 6 - 6 C D.C. 56 AU C6CUCC- - - AUCUIU U ACUU 6(S)- AUA AC U6US66U AAU - UC U AS6ASCU A AUM- - - C AUS6C CG6A CS6AS6C X.I. 6A U CG6U AC U- - - AU AUCCU A CUU6- AU9AACUSU6U86 AA U -UCU A 6AS6C UAAUMA- - - C A US6C AC AAU AS6C A.s. 7

6SAUAGUUC C- - - -UUUACU A US66U AU9AACCRS6 U A AU -UCU AS6AG6C UAAU A -- UC-UU AA A AU S.C. N6ACUUUCG6--6UUUUACCU U6U(6)-AUAACC6 AISUAAA-UC-6S66CUAAU(A)---CCAUACAASFC6AU6 M.. 96SAU6AAUCAA------AU U UACAU6(6)- AU A A CS66USSAAU-UCU AG6AG6CU AAU()- - -CAUC- U16UU A A U.Ls. 10SAUAAUCAA ----AVUUACAU6(6)-AUAACCSU66VAAU-UCNAS6ACUAAU(A)---CAU C-U6UUAA6 S.P. 116 AUMA AUUA ----- A 6AU U A CAUS(6)- A U A C()6SCUAA U - UIUU(6S66C UAA U(A - - -CAU6UC-UUA A AAU T.t. 126SAUSU6A C ------U6CU ACUC6(6)- AUAA CCGU AS6UA AUt-U CUA ASCU A AU(A)- - -C 6 C-ACAAA U.s. 136SA US66U A C 6------ U 6CUAC UC6(6)- AUAAC CS6SUA6VAAU-UC9ASAS6C U A AIUMA- --CS6US6C-AC A AAC Z... 146SAU6G6U AUC------UA CU AC UC 6(6)- AUAACCSUAS6U AAU- U CUAS6AS6C UANAUCA)- - - CS6U 6C - AAC(A)AC 6... 15

8 8' 9

US6UG66----S ------ACG6S6(A UA JC U C UCS6SS6A AA - - C US6A 6 C UA AU(C)C C C 6AU AACCU UUSC N.C. 16U60U56---------------AC 6S6S(AA UIA CIUC UCS66A A A- - CU 6AS 6CIUIA AU(C)CC CS6AUA ACS6CUU US6C N.h. 17U ACAG6---------------AG6AA C(S)A U AACC UC S6S6A NA - - C US6A S66CUIA A U(A)6SUIU CA U AC S SAS6U C N.v. 1NU C US66----S . ------A C(C6)66A UIA UIC C UCS66A A A- - C U6ASI6U C AIAU C C C(A)6 AU ACUS6CJU UCUA N.m. 190666AA6 ---------------AS6A66A UIA AIUC CCS6 66AAA- - C US6S6SS6UIAAUMAC U C C AU AS'U UCS6US CU N.h. 20U C(A)SS --------- 6 6AS(PDAU AC C U U68A A A- - C US6A6ASS[ AJAUMC UC CAU A ASAAA 6C A N.v. 21U00A666.---- -----A C U 6S(6)AU AAC C CUS66A A A --C USS66S6AU6A AU(C CS6AU6UA5UASS6 6~N.Ct. 22

UCGSS. ASSSS6ACAACCCC66GAAA--CUS6SSC~~~~~~~~~~~~~~~~USACC CCC A U AS A MS.. 24

US6C CC U . S----- ---6CG6G(A)AUA(C)U C C666A A A - - C UG66A(A)UUAAUMAC CS6C AU ACS6C CCUA C 6A.t. 25AS6U AG6----O ------S6(6)6S)AUWAiAC U ACUCS6A A A- -S6AS6U A6C U AAU(ACC(S)CA U6SA6AUCUA - - PAt. 266 A UG666 .---S(6S6S6(6)A UIAAC U AC US6A A A- - C6S6UA6C UAAU(AC C(SK CU AA C 6SUCG6C AA C.c. 27GA U AS6A ---------------6(6)66(6)A UIAAICU AC US6A AA- - CS6S6U66C UAAUMACC()CA US6A C GU C UA C P.V. 26GU A A GA---------------COUS66S6A UINAIAC U CC 60666AAA - - CC 6 666 C U 66A UMAC C 6 6AU606- 6SU US6UU US I.s. 29GAG6ASU .--S----- (61SS()AUIA6AICA6U CCSAA A - -S66C U6SCUAAU)CC(OCA UA ACG6UUCC US6 N.C. 30UAUSC ---------------6061S6(6)AUAACUUUU6G6AAA--C6AAA6AUAAU(A)CC(6)CAU6UA6AUCU UnA N.C. 31U AUAC ---------------S6(S)6NA U AAC U UUUN6A AA - -CS6A AAG6 AUAAU(ACC(C AUS6U AS6A UCUUIAI N.s. 32U U UACU---------------CS6S6()UAS6(CCU U UCGA6AAA - -S6A AA(AUAAU(6C C CS6AUSC6UIAAIU6 A.9. 33U AAUCA---------------6(S)SSA-AUIAS(CC CS6AAS6A AA - -UUCS6)UUAAAC AC C(SC AUA AA A AC ACAS6 F.h. 34

6SAS6US6C.---------------U CS6S(6)AUACCAS6U CS6A AA - -S6A UUG6SCUAAU(C C6 6A UAA 6SC C C(A 56. N.x. 36A C AS6G6A .C-SSS--(S-- Aj tCA6GUASA A - -CS6A CUS6 AUMAC C C 6AU6- U ~j A.n. 37

8 8' 9

UU666C--------SBA66(A1AUlFijCA6AU66AAA- -C6UUUGWi-IAU(6)CCUCAUAAUUUACU~~IA6 E.s. 38A U CSS6-A . --6 (6)A JA AICAU666UUSA AA - - CUS6(U)US6IC U AA ~U(C C C C AU AC6ACUI C.C. 39U UG6GS6A-S---SSS-66(A)6WAICAACACU66AAA--C66UU6ICUAAIU(A)CCCC6UA -G6 6C U A 6G6 Z... 40U0U6 666.--GSSS-------6 (A)AI AjC AS6C U6S6 AAA--C66CU6CUAAU CCC 6 A N.t. 41

-Os. 42- M . 4-N.m.- - - - -43C . 4

C.Rn. 44- - - - . . 4-N.t.- - - -45L . 4

.I--A MA . ....Pt. 54---------------------------------------------UUIAAU~IT~-2.-----.------- . 55

6AC--AS -- --..---.---UU--CSS--SJC--CA---AA---Uj---CC--U -- -----AJ----T--.-56.5

r85

Nucleic Acids Research

19H.s. 6UCUUAC CC CC U U- C GC GUUGUUGUUGA U 6C UUUC A U

29... 6UC UUGAC C CC CC UU C CC -UUUUG UUG 6AUU6C 6UUGC AUI39,R..G6CUUGA CC CC CC UU C C CUUGUUGGGG 66ACU6C 6U6C A U49R.n. GUCUU6A CC CC C C U UC C C GUUU UUUGG6A A CU6CUUU6C A U5 Uc. CUUGACCC C C U UU- -- GUGAUUC6GU GC A U62X.1. 6UCUGUA C C C .--------C C AG 6AAUU16C 6UUGC A U7 A.s. C CC A A CU-----------U C A CU66AAU66UU C UU I

S.C. C UCU6A C CC-----------UUUUA AU6A 6A U UUA U

9 D.d. U6UG6A C U -U--------6CA AC 6UAAUC U CA CAIS

It S.F. C C C A A C UU-----------UU6A 66AAU66C U UUA U

12 T.t. U C CUU 6UC-CU--UC C CC6U6A AEc6UUA UI

13 O.s. CCC GACU U C C-UUU-----666AUUUU66CU6C A U142Z... C C C6ACU U C C-U-----UU--666AU6UjC 6C A U15 La. C C CUACU U C U-U-------6AAUSAUUC A U

E9-11 E9-11'

U U A U C A GA U C A A A - A C C A A C C CU6 - 6 U C A 6 C C C CC UC

UUA U CA 6A UC A A A- A CC AAICIC CU6-6UIUC AI6C C C CC U

UUAUAUAAAc6A - AC CA AIUCICC 6 6 6 6 UUU(CC C

U U A U UrA U C A U U C U -A C C A AMUM C CU U C-UUA UU A 6AU AACAAAUC A AUAUCC UU--------UUA U UA 6AUAACA AAU C AAUAUU CC UU--------

U U A U UjJA6 U A U C A A U C C A U C A--------UUA UU A 6AIUAA AA 6 6C UU6AC1UCU- 6CU C C---UUA UU A AIU AA A A 6UC U ACUC 6 -U6 UCU C U-----

AUU6AU AAA 6UC A AC AC A- 6UUC UC U-----

E9-2

16NCA. C 66A-------CCUGGA-171h7AC,h. -----UCCU---------GA.19 A.V. CU] U U 6 A A U - - - - - - -

191 &U6UU9A.a -----U---------U6GAA----U-209 h 6ACU U---.h.-------ACU66AA----U- -----

21NY9. UCU6 ------.--UCU-----UAAC--229.?. CU66-----CCUU-----GAAU- -----

2323S,sCC. AAUI --C-CU-----GAAU---- ----

24 T.t. 6ICUA A --------------------------------------------------------------

25 MA.t.26 P t-Pt.. -

27 E ---.29 PP. ---.-29 Bs -A A CC-U---.s.C-- --

30Hc3 -AAG6-----A---- ---------AC- -----

31NC3 UUUC1-----.----.----UU(A-----)UCC.---32 A U U U. . ---33 L3.---?.34 F h-h,-35 L1.---4.369.;. uucUUUCUU------ ------ -----

37 A n---An.

3UEE. AUCUA A----A----UCUAU-----GUUA -----

39 C--Cr.40 Z ---2...41N41--.

42 A.s,43 U.D.44 U.n.45 3.t.46 X.1.47 U.sv.AU A.n,49 S.C.50 P.P.51 P.t.52 C.?.53 T.b.54 L.t.55 Z...56 Ta.s57 U.s.

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

---------------------------------------------------------------------

---------------------------------------------------------------------

---------------------------------------------------------------------

---------------------------------------------------------------------

---------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

----------------------------------------------------------------------

24 3. 31t 3A 33. 34t 3.

r86

Nucleic Acids Research

371° . 301. 3930 4!

| CC 6 CCC C 66 C66 6 6 6 6 G6 6C 6- 6C U U U 6 6 U 6 A C U C U A U A AC CC6 6CUCCB CC 666B6U CG66C6- CC66 C6GC- UU66U 6AC UCU A 6AUAAC CC6 -CUC C66 CC 66666U CB66CC C66 A C66CU UU66U 6AC UCU A 6AUAAC CC6 C U CCCBCC6B666U[C666 C6CC C6CB6CU UU66U 6ACUC AU A6AUAAC C C 6 B C C C C 6 C 6 6BYY 6 6BLB(6 6PUC 6 A C U 6 CUU U 6 6 U 6 A C U C U A 6 A U A A

- - - 6 C C C C 6 C C 6 C U U U 6 6 U 6 AC U C U A6 AU A- - - - - - - - - - - - - - - - - - - - - - - - - C6U C UC U C U U 6 6 U 6 A C U C U I_ A A U A A

---- - - - - - - 6*6 & !A UC UUU 6 A U 6 A u AG1A6 u A A- - - - - - - - - - - - - - - - - - - - - - - - - - - --66 6 U(U)U U 6 6 6 A6 AU c A A61 A A

- 6 U 6 U UAU U U 6 ACU A U AAUAA- - - - - - - - - 6 U 6 U C U 1%uU U 16 AU 6 AU AtU AAUAA- - ----- AU 6 A()A1U C A A 6 UAA

- - ------------ C 6 A U 6 AU U C 6U6 A U A A- - - 6 C C C 6 C C 6 AU C C 6 AU 6 AU U C U 6 AUAA- - - 6 C C U 6 U U 6 C U U U 6 AU 6 AU U C U 6 AUAA

E9-2'

, ,,41? ,,. . . . 4C C U C 6 C C 6 A U C 6 C[iC 6 H.s.C C U C6 6 6 C C 6 A U C CIA C N.n.

C C U C 6B6 6 C C 6BAU C 6 CIA|C 6 R.n.C C U C 6U6 6 C C 6 A U C 6 C A C 6 R.n.C CECC 6B6 6 C C 6A C6CCAY 6 O.c.C C U C66 6 C C 6 A U C 6 C A1C 6 X.1.IC U AU A 6 C C 6 A U C C A C Ans.CUUUUC- -A AU C 6 CAU 6 S.c.

AU UB A - 6 AU C 6 - A 6 6 D.d.C6 AUC 6 - - A A U C 6 CjU O.n.CU6AUC6- -AAUC6C6 JBG S.P.C U 6 U C 6 - - 6AU C 6 -A 6 61 t.CUCBA6ACB6- -B6AUCC6 CC6 onS.CU U 6BAC 6 - - 6AU C 6 CAIC 6 Z.n.C U(C)6 UC 6 - - 6 AU C 6 CJC 6 6..

9' 10

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -|A 6 6 6 6 C A A A 6 C|C 6 6 A A AC6CU C C G H.c. 16- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A G 6 6 6 C A A A 6 CIC 6 6 A A AIC 6 CIU C C G H.h. 17_-A_ - --B- - - - - - - - - - - - - - - - - - - - - - - - - - - - -- 6 COG A C U C C A A A|C G C|U C A G M.v. 18

- - - - --- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A C A 6 A A A 6 C 6JA A A|C 6, U C C 6 M.i. 19- - - - - - 6 U U A U 6 C 6 A A C| A A A|6 A|- U C C 6 Mt.h. 20

- - - - - 6 AU U C U U U U C U G A A A6 C A U AU - M. v. 2 1- - - - - -16 6 UEDCCC AU A A A|U6 U C C6 M1.f. 22

-------- - - -- -- -- -- -- -- -- -- -- -- - -------------1666 Uc U C C U UC CU IAA6 6 CU A A6 6 C t - S.s. 23-----------AI----C-C-C-C--CC CABABCBAAACBB6UCC AT..t. 24

6 A u

- --------A1 6 6 U U C AAAC A AgA.A~~~~~~~~~~~UICCUUCABBAAC[C

~~~~~UBA^( B) AUC A^- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A U 6 A(6A A A A 6 A U CAAA A- - - - -AU

6 AACUUUUU 6 A

- - 6 C U (A)66 6 A1-_ - - - - - - - - - - - - Is6 6AAA

99

BUUUA- A.t. 256C6A6B6 P.t. 266 A 6 6 66B E.c. 276 C A6B P.v. 286 UB C U .s. 29

6 A|6 C -- H.c. 306-AACC N.c. 31

A)A C CB N.s. 32BBLU-1C(A) M.r, 33UAULUuAA F.h. 346 A U SB C f D.d. 35B66U6CIC N.x. 36CAU U U A - A.n. 37

10

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 6 U A 6 C A U A A A 6 A 6 A A U - E.s. 38_- A 6 U 6 A A A6 6 U 6 A A A C.r. 39

- - _ A 6 C A A A A 6 6 A A A A Z. 40- - - - - - -_A 6 C A A A A 6 6 A G 6 A A W.t. 41

U.n. 42- W. S 423

- -- -- -- -- -- -- -- -- -- - --- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- - -- R.n. 44-_-- -.- - -

-_-__ -__- - - - - - - - - --- X.1.

.-v_ -_------------------------------------------- D.v. 47__-_---------------------------- --- - - - - - - - - - - - - - A.n. 48

- - U U U A U A UAAUA AAA 6 U A U S.c. 49

~ ~-_ - - - - - - - - P . t .

n~AMII~L.t...- - - - - - - - - 6 A AU C A A A Z.&. SS

- 6 A 6 C A A A T. a . 566 A A A A . s . 57

36> A7 3 39l 4v0 418 424

r87

23

567

9101112131415

Nucleic Acids Research

IA.;. CCC --- CCCS6GSC66C6ACGACC(C)AFUUC6AACS(U)CU612 M.s. C C C - - - C C C 6 U 6 6 C 6 6 C 6 A C 6A A CCCCAU U C 6C A C6I U C U 63M1.;. CC C- --CCC6UG66C6OC6G C6ACCCUUCSAACI 6UlCU6 I3 R.n. C C C - - - - U C 6 U 6 6 C 6 6 C 6 A C 6 A C C A U U C C A A C 6 U C U 6I4 R.n. U C C - - - - C C 6 U 6 6 C 6 6 C 6 A C 6A C C CAU U CA 6A C 6 U C U 6I5 0.c. C C - - - - - C C 6 U 6 U C 6 - A U U C 6A A C U C U 6I6 X.1. U C C - - - - C C 6 U 6 A C 6 6 CACA UA CAU U C 6 6AU 6(U)C U 67 A.s. 6UC - - -- UC6 C AC C 66UCAC66UCUUUCAAAUO(U)CU66US.c. 6C C----UU6US6CUSSCSAUSOUUI@JAUUCAAA*UhjCUOlV S.C. 61 C - - - - U flU6 U UCU 66 C6 A 6 UUA C A A AU6UCU C U 6

100 .. A - - - - - U U U 6 C UC 6C AUA U CAU UCAA 6 UUCA C U 6us.n. 6C----- UUU CUC6C6AUAC UCAUUCAA6U.UICU6I12T.t. sL-00 --UAffCCU-CSAOAAA^U(C)AUCiUAAS6ULIUICU6I130,. 6C C-- -CUC6U 6CCS6 C 6A CCAUCAU CAAA UU U CU 6142.T. 6CC---UUC6U CCU-C C6CA UC UUCAAA U UCIUI 614 Z.9. 6 C C - - - U U C 6 U 6 C C 6 6 C 6 A C 6 C A U C A

UA A A U U U C U 6E155G... 6 CC-- - U U US6U6C C 66C 6AC 6 CA UCAUC A A AYUU. C US61

10' 7'

16 H.c.-- - - - - - - - - - - - - _ 6 C 6 C C A CA 6 6 A U 6 C 6 6 C U 6 C 617 .h.-- - - - - - - - _ 6 C 6 C C A C A 6 6 A U 6 C 6 6 C U 6 C6SI18 .v.-- - - - - - - - _ 6 C 6 C U 6 UA 6 6 A U 6 U 6 6 C U 6 C6S19 H.&-- - - - - - - - - - - - - _ C C C C 6 6A 6 6 A C 6 U 6 A C U 6 C 620 M.h.-- - - - - - - - U C 6 CU U C A 66 A U 6 A 6 U C U 6 C 6021 M.v. - - - - - - - - - - - - - - - - - 6 C 6 C C(C6 A 66 A U A 6 6 A C U 6 C 6022 M.t. - - - - - - - - - - - - - - - - -. A C C C U A A 6 6 A U 6 6A U C U 6 C 623S.s. -- --UUCCC6UOUOA6CC CCC6A66 U 6666CUAC624 T.t. 6JC6AU CUICC C6C 66UCC 6C C 6A6 60U66666U A C6

CCCCCC

CCC

CCC

CCC

CCC

CCO

CCC

CCC

CCC

CCC

CCC

CCC

CCC

CCC

uAuCj ACU

u A u CIAIA C U

U A U CIA A C U

C A A C U

uAuCAACuU A U C A A C UUAUCAACUC~AUCJiGCO

CIA U CIAi6 C U

U A U C A 6 C Uu A U CIA A C U

U A U CIAIA C Uu A u C JA C u

25A .t.-- - - - - - - - - - - - - - - -- u C 6 6 6 6 UA U 6i 6 AiI CCC6 C 6 uU 6 6A UOU O6 C U26P.t. ACC - - - - OUCS66CCUU6OUCUACUAO6AI6C6I6IC U6AUS6CAA6UA6O6U27 E.c. A C C - - - - U U C 6 6 6|C C U C UUS6C C C A U C 6 6 AO6UO6I C C C A 6 A 6 6 6 A U Ui6 C U26P.v. UU---- C OUC6A6ACCUUC6C6CUAUC6S6IAUjAAC CCAUAU666AUUIAC6CU29 5.s. U C 6 - - - - - - - - - - C UACAC U UA CA 6(A)U 6 6 C C C 6 C 6 C 6 CA UUOAI6 C U30 H.c. - - - - - - - - - - - --.- A C CCU U U C C 6AM6 6 6 U C C A C6SUC C 6 A U UIJ6 C U31 Mc. U UU0 - - - - - - - - - U U-C CU A U 6 A 6(A)U 6 6 6 6A U6 C 6 C 6 U A UU A 6 C U32 M.s. U U U - - - - - - --A66 U U- C U A U 6 A 6E06 6 6 6 A U 6 C 6 C 6 U A U UA 6 C U33,.t. - A- -UOAAA6A C C 6UA A A66(A)USA6666AU6CO6UCCAUUA66U34F.h. U - --- --UCAAAUAUUjUAS66AUUAA6(A)U666CAU6C6UOUCAUU0Iu6CU353 .d U C U - - 6 C U U 6 C U6 C U A U CAC 6 U A A 66O 6 A 6 U C C 6 C 6 C C C A UUIAI6 C U36MN.x. UCU--6UAUACAA6CUAUCCAOCAIUAUCAU66A6UCCSCS6CCCAC(IAUIC6CU37 A. -- - - - - - - - - - - -- -| C C U 6 U A 6 6 AjC U C 6 C6C C U 6 A U UW6 C U

10' 7'

UUCIUUCIUUCIUUCIUUCIUUCIUUCIUUCUUCIUUCIUUCICUC6UUCIIUUC6OUC6

- - A I- - A I- - A 6- - A 6- - A I- - A I- - A I- - A 61- - U I- - A C- - U I- - A C- AC

U6GU AG6U C 65CC GUS6 CI u AIq u C - IC C 9IMUSSOAGUC-SCCOOSCMOMMUA6UC-SCCSUSCMOSSUAS0UC-SCCSOSCA U 6 6 U A U - U C U6 C

U6G6U AO6BC6- UA USC6

U U A 6 U U A U U

A U 6 6 U A 6 6 A- U A 6 U 6 6CMUGGOAOUG-UAUUB^MOSGUASSA-UASOB61MUOSOASUG-UAUUG61MUOGUA6GA-UAG666SMUSBUAGOA-UAGUG61

MCSSOSSOS-UAACS

MCSSOSGSS6-UAACG SICMCSSU666G-UAACGOIC006006065-UAACS SICA C 6 6 CI6 U A A

006UU66656-UAAOS oLAC066U6666-AUAAC66CAC66UC 666 6-OAUAC6C

U 66 C6 6 6 6 - U AA661C

0 6 6 U 6 6 6U U 6 6 U 6 6 6U A 6 6 U 6 6 6U A 6 6 U 6 A 6UOU6 6 U 6 A 6US66UA66

U A 6 6 U 6(SA6UA 6 6 U OASO

U U 6 6 U 6 A 6U 6 6(C)6 6 6

U U 6 6OCI6 6 6U 6 6(C)6 6 6U 6 6 U 6 6 6

6 - U A A A 6 66 - U A A A 6 6S6-UAAC66S6-UAAU666-UAAC666 - U A A A 6 6

U A A U A 6- U A A U A 6

6 - U A A C 6 66-UAAC666 - U A CAO6 66-UAAU666- UAA666

C

C

C

C

C

C

C

C

C

C

C

11

38 E.t - - - - -lu U C 6C C U A 6 6 Cia *C AU6fqC uucU6 UU C - - U 6 U U 6 6 U 6 A 6 6 - U A A A 6 6 C;39C.r. A--------- - ----- -ACC CCA6AU A6A66CU6SC6UCUUlAUUW6CUA-- A6UU66(66 6SU AAC6 SIC40 Z.|.- -- - - - - - - - - - - - - --U C CI6IC C CA AS 66SA6 6 66I C U(C)6 C 6UOC U 6 A U UA 6 C UI- -A 6 U U 6 6 U 6A66[- C A A U A 6IC41 I.t.- -- - - - - - - U C6C C C 6 A6 6A6 6 6SC U(06 C 6 UC U 6 AU U A6 C Ut- - A 6 U U 6 6 U 6 A 6 6 - C A A UA 6C

421.;.-tCACOAUC AAAASSAAC

43Mm.-ACU AUU42 R.sn. - - - - - - - - - - - - - - - - - - - - - - - - -(} - 11 1 {i(-()- --ll-- - - - - - - -C A UC-44 R..-- - - - - - - - - - - - - - - - - - - - - UU jA(C)UUA A- A6 61A16 A16 6 6 C(A)U C(A)A-- 6CIC - - - - - - - - A A U -

6AU CC16CCt lUl--CIACIAC U CC-

495.c. U OI--- AUAA1UAU`UAUUUl CUA UA6USI U UU UAOO--CU--AUAUU^LUUAIIUAIUU50 P..- -- - - - - - - - - - - - - - - - - - CAAUA(A)6 6AAAAAA6A AUA6A6C$CUUC6--U UUCSA6 (S)Sl- - - - 6 660 C51 P.t. - - - - - - - - - - - - - - - - - - - -C-CAAUA(6) OAAAAAA6AUAUA6A6C6CUUC6--A UUC6AA(B)66 - - - - 6 6 C52 C. f. IIU - - - - - - - - - - - - - - - - - - - - - - - - - -

53 T.b. - - - - - - -- - - -

-B1- - - - -

6S J l6U^UU^66U - --

-I- - - - -

-I -- - - - - - -

55.-A------------ CU''ACCUC6U'AUAUUA66U--AIUU66UCA6S-UAAA656T..s ------:A6C6CU6UUOSU6A CCU6COUIA6UAUUA6SI--ASIUU66UICA6S-UAAA6 IC

570.;s-- - A6 C U6UUUOSA 6A CCUOC6U~A6UAUUA66U--A.IjUU66UCAO6-UAAA6 I~

436 4 4 4 476 I1 4

r88

11

6 U C 6 A U U A 6 6 UOUC6AUUOO66U --A6i

6 C C 6A U UA 6 6 U - -A 6I6 C C 6 A U U A 6 6 U - - A 6

DUC 6 AUUAU -A 63ucA 6 A U U A 6 6 U -A 6CCAOAUASSA - -U6ISCCCUAIASC66- -06SCCC(A1UCiSUt- -061

Nucleic Acids Research

UACCAU66UBACCACBBBU6AC6B66AAUCA6BBUUC6-AUUCCBABAB ~6(6)A6CCU 6A 6A A AC66 CUA) H-s. 1C U A C CA U66 UBACC A C 6 66U 6ACB6666A A UCA 6 66UU C 6-A U UCC66A6A6(66ABC CU6A 6AA A C66CU(A)HI.*. 2.C UA C CAU 6 6U6AC CA C 666UAAC 6 666AA U CA 666UUC 6 -AUUCC 66AA6 A666)A6 C CU 6A 6A AA C 6 6CU(A) R.n. 3C UA C CAU 6 6U6AC CA C 666U6AC 66 66AA U CA 666UUC 6 -AUUCC 66AA6 A66(6)A 6C CU6A6A AA C 6 6CU(A)R.n. 4C U(A)C C A U 660U6 A C C A CU66 6 U BA C 66 66A A U C A 6 66 U U C 6- A U U C C6 U AS A 6U(6)A UCCCUB6A U A A A C 6 6 C U(A) U.c. 5

C UA C C AUUU66U 6C A A C 6U A A C6S66S6A A UC6666UUCS6-AUU C C 66AB6A 6)A 6C C U 6A 6AA A C66CU(A) A.s. 7C UA C C A UB66UUU C A ACB66U A ACB6666A AUA A 66U U C 6- A UU CCBAU6AB6(6)A 6C C UB6A6A A AC66CUA) S.c. UC U A C C AUUU66US6U A A C 66U A A C 6666A AUUAB66UUCU6-AUU CCS6 6AS6A(6)A6C CUS6AU6AA A U 66CUA) D.d. 9C UAC CAUUU66C UUUC A C666UUA-AC6 6A 6 6A U U A 66UUC 6 -A U U CC66A 6A 6(6)AB6C C U6A 6A AAC66C U(A) U.n. 10

C U A C C A U 6C ABU C A C 66U A AC66A 6A AUUA6U66UUCB6-AU U CCB6AU6AAA(6)A 6C C U 6A6AA AC66CUA)T,t. 12C UA CC AU 66UU66UUU6SA CB 6UBAC6S6A 6A A U U AU66UUIJCS6-A U U CCU66AUAB6(6)AB6C CUS6AU6A A A C 66CUA) U.s. 13C UA CCAUUUU66U66U6A CB 6UB6A CS66AB6A A UU AS6U6U UCU6-A UU CCUBs6AU6AU6 66A6 C CU 6AU6A A AC66BCUA) Z.n. 14C UA C CA UB66UU66UU6SA CU666UBUA C6U6AS6A AUUAU66UUUCU6- A UU CCU6AU6AB6U(6)AUC CUS6AU6A A A C 66CUA) 6.n. 15

119 12 12' 6' 13 13'

C C AC CB6U6C C CAU A A U CB66U- A CB6B6U US6U 6A 6A6C AAA6-A6C CC 6B6AB6ACB66A A UC US6AS6A CAAB6A U UC A.c. 16CC A CC 6U 6C CICAIU A AU C 66UB- AC 66 6SUU6BUS6AB6AB6 C A A6B- A6ICC C6B6AU6A C6B6A A U C US6 AB6A C A A BA UUC A,h. 17

C CA ACA AB6CCUUA UC66U - A CB6B6U UB6UB6BB6A6C A AI6B-A6IC C CB66A 6A UB66AU UC UB6AB6A CACB6A A UC ".h. 20C C AC C A AB6C C U A C6A UCB6A- A C 6U6B6C C UUB6A6AB6AB666-ABB6jC C C 66AB6AUBB6(A) C US6AB6A C A C 6(6)C C C N-v. 21

C C AC C 6;AA6CCu ~A(C6 6 60U- A 6 6 CCS6UU6U6A AB6C6U66U-AS6C C UC C ABAU SU666C(A)C UB6A SAC AAU6(6)6 C C T.ts. 24

U UA C CA AS6CCU6UCB6AU C U 6U- A 6C US66UC UB6AS6A66A C 6- A C CA 6C CA CA CU 6BU6A CU 6AS6AC AC6 6 CC C P.t. 26

U CA C CUA66C 6A CB6AU CUC U- A 6CUS66UUC US6AB6A6 6A U 6-A UCA 6C CA CA C US6U6AC UB6AS6A CAC66C C C P.v. 26U CA C CAA66C A ACSA U6CB6U- A 6CC 6A C CUB6AS6AB666U 6 - A U C 66CCACA C U 6 AC UB6AS6ACA C 66C C C U.s. 29C UA C CA A 66C 6ACBA UC6U6- A 6C C 66CC U 6AS6AU666UB6- A A CU66C CA CA C U 6S6AC U 6AB6A CA C 66C CC A.c. 30(C)C AC C UA6U6CS6AU6BA UA C 6- A 6C C 6 A AC UB6AB6A 66UBU 6- A UC SU6C CACA U US6SU6A C US6A AA U AC 6 B C C C N.C. 31(C)C A CC U A6BC 6A U 6A UA CB6U- AB6C C 6A AC U 6ASAB66U UU6 - A U CS66C CA CA UUB6B6A CU 6 AS6A U A C US6C C C N.s. 32UCACCAA6 C C UU6A U 66AU- AS6666UBUUUC U AB6AS66AA 6-6UUC CC C CACA U UU66AAC U 6A 6ACA C 66UC C Lt. 33C C(A)C C A A6U6CBA CB6A UU6ACU- AU6S666A UC UB6AB6AU6A US6- ACCCC C CACA C US6 SUA C UB6AB6A C AC 6 6BA C C FPh. 34C C(A)C CA A6 6C AU CS6A UB6S6U - A6C CB6A UU U 6A 6A66A U 6- A UC66C C ACA CUS66AA CUS6A A ACAC 6 60UCC S.d. 35C C(A)C CA,AB66CAA CB6AC6 U- AU6C U66UBUC U6AS6A66A C6-AUC AS6C CACAC U 6U6A AC UB6AB6A C ACU6B6 C C N.x. 36C U A C CA AS66CSA CB6A UCAB-A6.C U66UUC U 6AS6A66AU - A 6CACCACAC UB66SUAC UB6AB6A CAC U A.n. 37

119 12 12' 6' 13 13'

UUCA6CAC6 6 U A U 6 6 6 A 6 A C 6 C C C 6 U 6 6 C 6 M C...30C ACCCA C() A6U-A6 CU6A6A Cr,AUCA6C U666AC 39AC66C..3U AA66C66CCCA6U- 66UCC6 CAU6-ZCCACA.ACU6A406CC..4UUACCA6C6AU6UA6 U- A6 U 6U CC A A 6A U6 A C 6C CA A U6 66 AC 666CCCA C6C t.CN. 414

AASCAUCAASCACUCABCAA6--- IA I-AI. 4A 6 -CCUA6CCACA--------------------UH.s. 43

UA00 C CU A 6AA _CCUUU-- UGASA-- -A U A ACI ACIA T- CAn . 4

AACUA6CCAAIC (U) A A A A AA 6U U A-AAC 6SAU CACB6IU UB6A C tU6A A AU ACU)- - AS6O S.c. 49C(A)U UUB6B6UICUUCCUUC -AICUAIU U UI6UU6AI6UAAIA- AUICAU6CIC ACIAIUAIIA6UIA6BUA A C61A C C U P.,. 50C(A)UU666CUCUC6CU - AIfII f6 U6 A A 6 A it r AA - - AU A U 6U 6A A 61AC C t.P--51

-Ct. 52C f- 5-Tb 53. - 5

06 A C C A A 6 CC A A US A US6C 00U-AU6 C 0660U C 0000U CUB A 06- AU C A 6 CC A C A C U6 6 AC US6AS6A C AjC 6 6 CC C T-a. 56U - A[ 6CU66 U CU U U C A C 6 6 CC C O.s. 57

50t 54 5A 5i 54~~ ~ 55~ 566

r89

Nucleic Acids Research

IAH.S. C C A CWU C CA B C2M.a. C C A CA)U C C A A3R.n. C C A CA)U C C A A4 Rn. C CAC(A)U C C A A 65U.c. CC A C(AUC CA A6 X.1. C CC(A)CU C C A A7A.s. C C AC(A)U C C A A 6US.C. C C AC(A)U C C A A 69 D.d. C C A C U U C U A C I

10 O.n. C C A CMU C U A A III S.P. C CAC(A)CU C U A A12T.t. CU A CAAC UACI130.s. CC A C(AUC CAA614Z... CC AC(UCCAAG15 6... C C A CA) CC A A G

14

3 - AIAIG6- AMAIG6- AMAIGI - AIAIGII- AMA)G6- A(A)GI - AIAIGI - AMAIGI-A A6 - AMA)G6 - A(A)G6 B UEQI - AIAIGI - A(A)G6- AIAIG

14

36CA6CA6CA6CA6CA6CA;cI;cIICI6CIE6cI6CI6CICEICA

6CA6GCBC A 6 6 C6C A 6 6 C6C A 6 6 C6CA66CSCAC6 6 C6CA C6 6 C6CAC6 6 C6CA66C6 C A 6 6 CBCAC6 6 C6 C A 6 6 66CA66CICAB6C6 C A G 6 C

4'

BC BC AAAU U AC C C AC U CC CG6A C C CG6G6B - 6BAG6UAG6UG6 A CG6AAAA AGCGCAAAUUACCCACUCCC6ACC CBBG-GAGGBAGUGAI6CGAAAAA6C6C AAUUACC CAC UCCC6 ACC C666-6 A66UA6UG6ACA6AAAAA6C6C AAUUACC CAC UCCC6 ACC C666-6A66UA6UB6ACA6AAAAA6 C 6 C A A A U U A C C C A C U C C C 6 A C C C 6 6 6 - 6|A|6 6 U A 6 U|6A C 6 AA A AA6 C 6 C A A A U U A C C C A C U C C C 6 A C C C 6 6 6 - 6|A|6 6 U A 6 U|6 A C 6| A AA AA6 C 6 C A A A U U A C C C A C U CC C 6 A C 6 C 6 6 6 - 6lAG66 U A 6 UA6A C 6A AAA AB C 6 C A A A U U A C C C A C U CC C 6 C A 6 6 6 - 6BA 6 6 U A 6UG6 A C 6AAA AA A6 C 6 C A A A U U A C C C A A U C C U|A A U U C|A 6 6 - 6 A 6 6 U A 6 U|6 A C A A U A A A6C 6C AAUUACU CA UCCCAAUAC666 -6AA6U A6 U6A CAAUAAA6CGUAAUUA CC C AUCCUGA CU A66- 6A66UAG UG6AC AA6AAA6 C 6 U A A A U U A C C C A A U C C U 6BA C UBC A 6 6 - 6AG6 6 U A 6 UI6 A C A|A 6 AAA

6AAAAUU66C CAAUCCUAAUU A66 -6A6 C C AG6U6A CAA BAAA6C6C AAUUACC CAAUCCU6ACACG666-6 AB66UA6UG6ACAAUAAA6C6C AAUUACC CAAUCCU6ACAC666-A6A6A6UA6UA6ACAAUAAAGCBCAAAUBACCCAAUCCBACAICGB-GABBBAUGACAA5UAAA

15 15' 3'

16 HAc. C66 CUA C -16- 6 6 C6 C|A 6 C A 6 6 C 6 C 6 A A A CIC U U U A CU 6 U AC16 A A17 H.h. C 61616 C C CIU A C 616 - 6 6 C|6 C A 6 C A 6 6 CB6 C 6 A A A C|C U U U AIC A CIU 6 U A C|6 A A I18 N.V. C 6|6|6 C C C|U A C 6|6 - 6 6 C|6CIA 6 C A 6 6 C|6 C 6 A A A C|C U U UAIC A C|U 6 C A C|6 C A19 H... C 6|6|6 C C C|U A C 6|6 - 6 6 C|6CI 6 C A 6 6 C|6 C 6 A A A C|C U U U A|C A C|U 6 C A C|G C C I20 h.h. C AI6B6 C C C|U A C 6|6 - 6 X C|6 C A|6 C A 6 6 C|6 C 6 A A AAAC U U UBAC C A|U 6 C 6 6|6 C A2111.V. C jA6 6C1CCUA CAA666CCAAG66C 6C 6AA AC CUCC6]C AAU6CAAC6AA226N.?. C AB6|6 C C CIU A C 616 - 6 6 CB6 C A C A 6 6 C|6 C 6 A A A CIC U C C6C AAU 6 C A Cl6 A A23 S.s. C AG6 C C C AUACG6|6 - 6 6 CB6 C AC A 6 6 C|6 C 6 A A A C6 U C CCAAU 6 C 6 CB6 AA24 T.t. CGC CCU A C 66- 6 U6 C A1C A 6 6 C|6 C 6 AA UBAC U C C AU 6 C 6 616 C A

25A.t. AA26P.t. A6G27 E.c. A 628P.V. A6G293.s. A6G30 N.C. A 631 N.c. A 632H.s. A6G333.. A^A34 F.h. A 635 D.d. A 6366M.x. A6G37 A.n. A 6

CBCC

CUCC

cuccl

cucc

cucc

14

UAC6UAC6UAC BUAC6UAC 6UAC6UACaUAC6UAC6UAC BUACIUACIUAC6

B-6AG6-B6A6-6A66-6A66-6A66-6A66-6A66-6AG6-6A66-6A66-6A66-B6AB6-6A6

14'

6CE6CI6CA6CP6CE6BC6CIB6C6CEB c66BCBCI6 C A

6CA6U66CA6U66CA6U6GCA6U6iCA6UA6CA6U66CA6UA6CA6UA6CA6U66CA6UA6CA6U66CA6U66CA6U6

4'

6 6 6 AA U A UA66 6 A U U U6 6 6 A AU A U6 6 6 AA U A UG 6 6 A A U C U6 6 6 AA U C UA66 6 A U U UA66 6 A U U UAG66AAUAUA 6 6 AA U AAA66 6 A AU A666AAUU6 6 6 AA U U U

UBBACAAU666C

|U C C 6|C A A|U G 6 A CBCCGCAAIU66ACuBCCCAAU666CuucA cAAu66AcUUCACAAU66AC

BuCCB1CAAUBB6C15

38 E.s. WA|AUUC C 616 - A A 616 C A 6 C A 6 U 6 A 6 6 A A U U UIU C C6CAAU666 C 6 C A 039C.r. A6GICACCIUAC6 6-6A 66CA6CAG6U6 A66AAUUUUUCG6CA AU66CG6C A40z... ^6uCCI*66666*6^U666FUlC6C^U8C6^416l.t. M*uc1-^1clc6666FUlc6c^61c6^

6 U 6 C 6 _ UA A s 6 6 6 A C U CC 6 AIB6U6C6 A UAA66 66ACUCC6A6U6C6 A UAA66 66ACCCCAA6UGC GIA UAA6UA 66ACCCC6 AECC6U6 A UAA66 AAACCCC6A6UC6[ C66666A6CCCCAAB6U6C6 C 6666 AAACCCC AA6CUU6 A 666C6 CUCACCCC6ACC 6C6A C6666 CCACCCC6A

G6CCU6 AUCCA 6CC U6CCC IG6CCU6 A UCC 6CA A U6CCGCIG6CCU6 A U6CA6CC A U6CCC I16CC U6AUCA CC A U6C C C6UCU6 AC66A 6CAAC6CC6CIG6CCU6 C66A 6CAA U6CC6CI66 U C U 6 A U 6 BA 6 C A 6 C C 6 CG6UCU6 AUA6A6CAA6CC6CI6 C C U 6 A A C C A 6 C C A 6 U A 6 CI

I CCU6CAACCA CCA U6CC6CII6CCUA6C6CA6C6ACCCGCB6CCU6 A C6CA 6C AACCC6C

I 6C- -AC66A 6CAAC6CC6C I

15' 3'

[ CC SUAC6 G A 6 C A AU A C C 6 CI6C- -6AC6A 6CAA U6CC6 CI6 CC66 6 CAACU6C C6Cc ACG66A 6CAA U6CC 6C

42 H.S. - --CC AC AA ACA6 C A 6 U 6A U UA A C CIU U U A 6E AA ACB6 AA A|6 U UAACU A A 6C1 UU AC U A A443U .. - - CC CC A CC 6 - 6|A C U C A|6 C A 6 U 6|A U AA A BAGU BUAA 6BC A A A AAACl6 A A A B6BBAU UB (A A 6U UIA U A CC U C44 R.n. - - - C C C A C G6 -6 A C U C AI6 C A 6 U 6A U A AAAUAAUBA 6IC A ABU6 A A CG6 A A A6 U UAC UMAA C UIAJU A C C UC451.t. - - --C CCAC 6 GA A ACA6CAG6U6ACAA AAAUU MAA6C CABUAAACAAA6AA AAB6AUAUUB6C U A

AAAj -

46 X.1. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A A A 6 A A C A A A 6 C U C 6 A 6 U U A A)6 U -

47 I.Y. -UUAUA AUUAU - UUU UC 6 CAuuA uAAA6 A A AU^^ ^UUUA A AUA C A UU -4 A.n. TCAAUUB A6CuA 6 U 6A6 6 A A U AU U 6 6 U C AAU6CC6AAA6CGAA CAuAACAUU 6 6AA49 S.C. C A AfU AU A" U A - 6 A* C A 6(C)A 6 U 6 A 6 6 A A U A U U 6 6 AcC A Ai U C 6 A A A 6 A UBql CA u CA6U UllU Uu UA A50P.P. UA±U1C --6-GA- CAAA' 6 66GA AUU U6G6C AAUC6CCUUAA6C6U6ACCCACAAAUU6AUA-49BPot. CAAA U UC ABAA- GA- GCA )A UA 666GA BA U UGU CABA GAB6C U A GABC6U G6A BCCC A CB A AU BU6AB UJA51 Pt. _U A-- BCB,1 C--- G- GA-- CAGAAGUGGGGAA^UUlU BUGGC AAUB1C BCBU AAGlC GBGAuCCC AG6C AAA*U BBAU

53 T.b.-54 L.t.55 Z. UAICC CIC 61 - 6 16C Al6C AG6U6166B6A ABUCBU|U G6AIC A AIUG6 6Cl6A AABCC 6AIUC CAl6G AIAIUAB(C)B BlSAT... GBIAICBCCICACGIB-GGGIGCAIBCAGBGIB BGAAUC UIBBAICAAIUBGBCIBAAAIGCCCCGIAIBCCA(GCA*IAIBABC)BCIG57 U.n. GJACj C ACG-66 G-C GC A[666^UCI66IJ^U66Ij^IC C6 UC^JG^ (C6I

S7l S 4 2F

r90

6CAE6AAE6CAE6CAI6AAE6AAI6AAE6AAE6CUE166A6AAI6AAE6CAI

Nucleic Acids Research

C A A CA A C A CA6IA CUC UUUC- 6 AG6 CIC C6UGUAA UU0G6 A A U6A UA C AUUU0(A A A U -------------- Oc. 5

U A A CG6A U6C A6GAC UC A UC C - 6A6G - GC C C U6 U6A UU66 AAUAG A6UAC AC U UAA AU --------------A.s. 7U A ACGAUAC A GG6C C C AUUC -666- UC U- GUUAA UU 6 6A A U6 A 6UAC A.AJU6 UA)AAU--------------S.c. 8U AUC A A U AC UAUC C U U UUG66AG-----6GCA A UGU6A A UGAAC A C AA A UAUAAA- S .d. 9

UAAC66-~~~JCCGAA6CC-UAU-60U-OC- - - 666AOO6CAAU6A6UA6AA~~~~~~UIU AAC-S------- . n. I11

U A AC AAUAC C 66GC6C6 UU A - GU6- U C U - 6 6U AAUU 66AA U6 A 6U AC AA UC U A AAU-Z-----n----L. 14

16 16' 17

U GC 6A ------GG66C AUAC AGOUCC ------ U C 6CU U UU(C)G(0)G AC C U --------NH.S. 19

UVOCUC------A U6CAC - - A6C AOU------ 6 GC U UU(U)CA A6UGUO-N------------.v. 21

U 6AGU606AU 6A A 66 C CUUUA - 66GGU00- 6 U A A A 6C C U UO C(A)C COG A A A6 AU--------------------A.t. 25U 6C AG6 A U6 A A 6 C CC U C -GOO6U U -G UAA A CUG6C UUU U 6U A C GGA AC 6A AA A 6C C 066 6 6C U A A U ----- Pt. 26U6U A UGA AG6A AG6 CC U UC - 6 6 G'U - 6UA A AOU ACU U UC AG6 C 66 6 6A 6 6A A 66 6A6 U AAA G U0UA AU-----Ec. 27U6U A UG6A A 6AAOGC C U UA- 6 6 600U- GU AA AG6UA CU UUC AG6C G66 6 A 66 A A G U606A U A A A U U A AU-----P.v. 26UO6AG6UGA UG6A AOG6UUUU C- O6AUC -1GU A AACU CUG6UOUGUUA6G6A AG6A A C-AA U AC CG6UU~COGA AC - - - Bi. 29

06 AG6URA UG6A C 6 C C UUC - 06 00-:6 U A AA 6C U C UG06UU0UA A GA A6AAA(A)A U AOA UA60NOON A-Nc-- . 31U 6AAG6AUGAA6 GC UCqUA OG6U C -GUA A AC UUC UUU U AUAU AAG6A A UAAAG6C AOUA- .? . 33U 6C AGG6AAG6AC(A)GC C CUCUG6 6 U(C)-GA AACU6 CUUUUIA UUC666GA AUA A ACC UA C UUAC----------F.h. 34

16 16' 17

U 6 ASAA)6iA[c6 6C CU UU -6 6 6U U -,JA AiiiU(UCUU U UC UC A A A 6A AGA A6------------------- E. 38U 6C A66 A A6A16 6CC U 6U - 6 66C -I6U A AACU6C U UU UC UC A 6A6A A 6A A6------------------- C. 39U 6 6A6I60U06 A AI6 6C C U AC - 6G6UC -I6UC AAC UU CU UU U C UC 6 6A 6AA 6A AA-Z----------n . 40U 6AAG-N.t.-6U 41CUUC66A6AA6A . 4

CCC---NI;, 42

-----N.---m.-43W 4

-Rn.-- 44

4

6IA AUGA A A6UUAUU AU A AU AAUACAJAUAACGIAUUIAUAUC6UAU IIC ----------URAAAAGAAAUA A.n. 48-1 6AI AUAAAAAUAUU J AUUUAINC0-- KS.C. 49

~~~~~~~~~~P.,. 50- -----I NOAAIIm e -IM AAAAA PA. 51

.Cf. 52- - - - - - - - - - . 5.-..T.b.--- 53-- -- .. 5.L.t.54---- - . 5

0fiiAiU jA A iA A iT TCA AU6C C 6C U- UGU A A AGscZUCU UCWGU CG6AG6UICG6CG6A------------------ 2 ... 55UI AG6UIGA AG6A AIOG CIA AUO6C CG6C U- UGUAA AG6C U CIUUUCG)U CG6AG6UIGCG6CG6A----------------- -- .a. 56

G6AA6A AG6G66C A AUGCC 6 C U- 86 U AA ANOC U C UU UC(GUC 6 AGOIUG6C GC GA ------------ --Os 571 646 656 66 7 696 696 786

r91

Nucleic Acids Research

1 H.s.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C U U U A A C 6 A - 6 6 A U C C A U2 N.e ---- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C U U U A A C 6 A - 6 6 A U C C A U3 R.ft - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C U U U A A C 6 A - 6 6 A U C C A U4 R.n. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C U U U A A C 6 A - 6 6 A U C C A U50.c.--A- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ---C C U UM AC 6 A -6 6 A U C C A U6 X.1.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C U U U A A C 6 A - 6 6 A U C U A U7A.s.-- - - - - - - - - - - - - - - - - - - - - - - - - - C C UU AA C 6 A 6 6 A U C C A U8 S.e.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A C U(A)A C 6 A - 6 6 A A C A A U9 DAd. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C U C U U A A U U - - A A C A C A A U10 O.n.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C C U U U A A - 6 6 A U C A A Uit S.pr. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C C U U U A C 6 A - 6 6 A U C A A U12 7.t.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C U C URU)6 A1 - 6 6 A A C A A U13 O.s.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C C U A 6 - 6 6 A U C C A U14Z8---- - ------------- -C -C--C--U-U C CC U A 66A -666 AC CC A

15 s.s - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C C C U U A A C 6 A U 6 6 A U C C A U

17'9

16 H.C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A 6 6 U 6 6 U 6VCt - C 6 A A U A A 617 HAh. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A 6 6 U 6 G U 6(C)A - C 6 A A U A A 618 H.v.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A 6 6 C 6 6 U C 6 A - 6 6 A A U A A 619 H.* - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A 6 6 6 U C(U)C -0A)6 A A U A A 620t NAh. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A A U A A C U 6 6 - - U Cl 6 A A 621 K.v - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - --R---AAA^ -(6)6 A A U A A 622 ,, ., - - U UED- 6 6 A A UA A 623 S.s.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A A A A 6 6 C 6 6 6 - 6 6 A A U A A24 T.t. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A 6 6 A 6 C C 6 6 6 - C 6 A A U A A

26 PA - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A U C C C C 6 6 6 U C A U 6 A C 6 6 U A C C 6 U A - A A A1! 627 E.¢.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A C C u u u 6 C U C A U U 6 A t 6 U U A C C C 6 C - A 6 A A 628 P.,.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A C C UU U 6 C A A 9 U 6 A C 6 U U A C C C 6 C - A 6 A A 6 A A 629 Ls.- - - - - - - - - - - - - - - - - - - - - - - ---- ---- 6 6 C 6 6 U A C c Cr_ 6AC6 6 UA C C U A A- C @A A 630 H.C - - - - - - - - - - - - - - - - - - - - - - - - - - --_ - - - - U U U 6 A C 6 6 U A C C C 6 A - 6 A86 A A 631 l§.C. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -: : AA UU C V U U A UC)U 6 A C A 6 U A C C U 0 A - C A A1 A A 632 N.s. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A C U U U AU (C 6 A" C A 6 U A C C U U A - C A 61A- A 633 B.?- - - - - - - - - - - - -- - - - - - - - - - - - - -- - - - -- - -- - - 6 U A"C U 6 Ub6UAU68 UAUUA A- U &"^I34 F.h- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 6 U C U A A 6 U A 6 C U 6 A A U 6 U A C C C 6I1 A A vAU A 635 D.d.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - U C A C A 6 C 6 u A c u 6 A C 6 6 U A C C U Ui C- A 6 A C36 N.x- - -- -- -- -- -- -- - -- - - -- -- ------ -- ---- - - cA CC* 6 C ;ii6AC@66U tC6 C6 6AA 637 A.n- - - - - - - - - - - - - -------------------------- - - - - - - A A 6 u 6 A c 6 6 U cC C A - I I A Ap A A

179

3B E.§.- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A A U 6 A C 6 6 U A u u u I A - 6 6 A A u A A 639 C.r - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - U U C U 6 A C 6 6 U A U C U 6 A - 6 6 A A U A A 640 La| - - - - - - - - - - - - - - - - - - - - - - - - --- - - - - - - - - - - - - - - - - - - - C A A U 6 A C 6 6 U A U C U 6 A - 6 I A A U A A C41 NA -. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C A A U 6 A C 6 6 U A U C U 6i 6 - 6I A A V A A_

42 K.s- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A 6

45 Lt. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - VU A*1--- - - - - - - - - - - - - - - - - - - - - - - -

61X.1.-CUUAGGACA

47 Lv- -- - - - - - - - - - - - - - - - - - - - - - - - --

-- - - __- A_A____________^^&

47A.* ------------------------VAICCUUAAVICGA-6vAUCAU

51 P.P - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

51 P.t. - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -1-

S3 T.b---- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -1-C

Z.*---------------------------------------------- VCAVIA&e*l eCVC -mgii 4q56T..a--- - - - - - - --- - -- - - - - - - -- - -- - - - - - -- - -- - - - - -- - - - - VCADIA eAIG Cgc* l^ "9 eAAA57o.s - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V*CAUS CAII C V CI A 401 A

1 71 1 7 1 73 1 74 1 756 1 76 1 7T

r92

Nucleic Acids Research

81?. . .I

U 66 A 6UAG6AU 66 A 6U 66 A 6U 66 A 6U66A6UG666UAGA6U 6 XA 6U 66 A 6U66A6UA AG6U 66 A 6U 66 A 6U 6A)A 6

r66 c A A 6 UG66CARA6II66 c A A 6U66CAA6AI66 C A A 6 U66 C A A 6 U66 C A A 6 U66 C A A 6UA66CAA6AU66 c A A6U66 C A A 6 U66 C A A 6 U66 C A A 6 U66 C A A 6 U66 C AA 6 U

C -U66U6CCC-U66U6CCC - U 66 U 6 C CC -U66U6CCC -U6AU6CCC-U66U6CCC-U66U6CCC-U66U6CCC-U66U6CCC -U66U6CCC - U6A6 U 6 C CACA U 66 U 6 C CC - U6A6 U 6 C CC - U6A6 U 6 C CC- U6A6 U 6 C C

18

6ACU6 66CAA6AC-C66U6CCC6ACU6 66CAA6A C-C66U6CCA6CU6 66CAA6A C-C66U6CCIA6CAU666CAA6A C-C66U6CC #

66CU6 66CAA6U U-C66U6CCIA 6CU6 66C AA 6ACC- C66U6 C CC666666CAA6U C- U66U6UCiC 666 66UAA ACU -UA G6 CC

CCCC- 66CUAA CU-UC6U6CCdCACC- 66CUCAA CU-C6U6CCICACC- 66CUAA CU-CC6U6CCCCACC- 66CUAA CU-CC6U6CCICCAC- 66CUAA CU-AC6U6CCCCCC- 66CUAA CU-AC6U6CCdCCAC- 66CUAA CU-AA6U6CCdCCAC- 66CUAA CU-ACUAUCCU6CAC- 66CUAA CU-CC6U6CCC6CAC- 66CUAA CU-CC6U6CC ICACC-66CAACU-CC6U6CCdiCACC-66CAAACA-CU6A6CCd#CCAC- 6CAAAAUU-CC6U6CCd

18

CACC- 66CUAACU-CC6U6CCiCAUC- 66CUA CU-CU6U6CCIC AUC -6G6C U A C U-CU6U6C CI

I6 C A 6 C C 6 CI6 C A 6 C C 6 CU6CA6CC6CI6 C A 6 C C 6 Ck 6 CA 6 C C 6 CI6 C A 6 C C 6 CA6CA6CC6Ck6CA6CC6Ck 6 C X 6 C C 6 CI6 C A 6 C C 6 CA6 C A 6 C C 6 CA6 C A 6 C C 6 CA6 C A 6 C C 6 CA6 C A 6 C C 6 CA6 C A 6 C C 6 C

A6 C C G C C 6 CC6 C C6 CC 6 C

I6 CCC6 CC 6 CA6CC6CC6ClIX XX XX X X XU6CA6CC6CA6 C C 6 C C 6 CA6 C C 6 C C 6 CAG C C 6 C C G C

A6CAGCC6CG C A 6 C C 6C6 C A G C C 6CA6 C A 6 C C 6 CC6 C C6 CC 6 C

6 C A 6 C C 6Ck6CA6CC6C6 C A 6 C C 6CC6 C C6 CC 6

6 C A 6 C C 6C6 C A 6 C C 6C6 CA b C C 6CC6 C C6 CC 6 C

U6CA6CC6CA6 C A 6 C C 6 Ck6 CA 6 C C 6 Ck6 CA 6 C C 6 C

66UAI66UAi66UAI66UA66UAI66UAI66UAI66UAI66UAI66UAA66UAA66UAA66UA66UA66uAl

AUUAUUACUAUUAUUAUUAUUAUUAUUAUUAUU

CC A 6 C U C C A AUC C A 6 C U C CC A UC C A 6 C U C CA A UC C A 6 C U C C A A UCCA6CACCAACC C A 6 C U C C A A UC C A 6 C U C C A A UCCA6CACCAAUCCA6CACCCAACCACACCCAAC C A 6 C U C C A AUC C A^JC U C C A A UC C A 6 C U C CA A tC C A 6 C U C C A A UC C A 6 C U(ACC A A t

6 C 6 U AUA.;UU AA C6 6 C A M.s. 1A6C6 UUAUMA AAAA6U AU6CUU6C N1.m. 26 C 6 U AUN.yUU AAA 6 U U 6|C U|6 C A R.n. 36 C 6 U A U(A)UU AAA 6 U U6ACUC 6 C A R.n. 4G6C6CACA)UCA A A6UU6 CUU6CA 0.c. sU6C6UUA(UAUAAAA6UUA6CAU6C X.1. 66 C 6 U A UMAU U AA A6 U U 6 C UU6 C 6 A.s. 7A6C6UAUAAUAUAA6UU6UAU6CA S.c. 8A 6 C A U AU A C U AAA6 U U 6AU U|6 C A D.d. 96 C 6 UA UA U U AAA 6 U U 6AU U 6 C AO.n. I06 C A U A UA U U A AA 6 U U6AU UA6 C A S.r. 11

6 C 6 U AU(A)U(U)AAA6 U U6AU UC6(CA T.t. 126C6AAUA UUUAA6AUUA6CA U.s. 13U6C6UUAAUUUUAA6AUUAUUU6C 2Z.. 14h6C6UAUAUAUAAA6A6AU6CA 6... 15

18' 2'

:6GUA ACC66CA6UCC 6 A6U6AU66CC66UUAAACC66CA6 CCA 66UA66CCI66UAAUA CC66CA 6UC A C6A 6 U6UU A6C C66 UAAACC66CA6CC6A6 6AA6A6CCX6UAIAAACC6XXXXXXX6A6U66U66CCI|66UAAU] CC AC 66C C C6 A6U66UA6 CC6UAACAC C 66 CA6C ACAA6U66U66CC66UAAUACCA6CUCC6C 6A6U66UCG66666aUA CCA6CCCC6C 6A666UCA66

66AUAAUCA66666C UAA6C6UU6U UC66UAAUAC6U666U6CA A 6C6UUAAUCI66 UAA AUC66A 6666CUA A6C 6U U A AUC6 6UAAUAC 66666 U6CAA 6 C6UUAA U C66UAAUAC6UA6666C 6C6UUUUCC I66AUAA U A C 6UA66U66CAA6 C6UU6 UC CI66UAAU ACAUA66U66AA6C 6UUAAUC CI66UAAUACAU66A66C 6C6UUAUCC I66UAAUAC66A66 AUC C 6 A6C 6UUAUC CI66UAAUAC66A66AUCCAA6C6UUAUCC66UAAUAC66666U6C AA6C6UUAAUCI66AUAAC6666A6C A6C6U6UAUC I

--18'9 2'

66 UAA UA C666A6AU6C 6 A6C6UUAUC C666UA AUAA CA6A 666U6C AlA 6CU U 6UCC66UUAA6ACA666AU6C A A6C6UAUCCI66UAAUACA6A66AU6AC A A6C6UUAUCC

E18

GA U C U UAAU U 6 6 6 A.c. 16A U C U UA U U 666 AH.h. 17

6 A U A U UAA U U 6 6 6 H.v. 18AC U A U UA1UU 6 6 6 H.9.19

C UA U UAAC U666 N.h. 20ACUCAUAUA666 N.v. 216AUAAUAUA666 N.t. 226U6AAUUACU666 S.s. 236 U 6 UAC 666 T.t. 24

66 A AAI6 6 A A U U IA6 A AU U I

B 6 A A U U IB 6 A A U U I6 6 AA U 6A6 6 A U U UA66AUUUI6 6 A U U UI6AUAUUI

6 6 A A U U 06 6 A A U UI6 6 A A U UI

C U 6 G G A.t. 25C U 6 GG P.t. 26C U 6 G G E.c. 27C U GG 6 P.v. 28U U 666 D.s. 29C U 6 A6 R.c. 30UU666 N.C. 31UGU666 N.s. 32UAU66 3.f. 33U U 6 6 6 F.h. 34C U 6 6 6 D.d. 35UU66G6 N.x. 36U U 6 6 6 A.n. 37

P18

6 6 A A U UAAU U 6 6 6 E.s. 386 6 A A U 6AU U 6 6 6 C.r. 3966AAU6AAUU666 Z... 406AAUA6AAUU666 N.t. 41

66UU- 6 UCAAAU -UC6U6CC66UU- 66UAAAUU -UC6U6CC66UU- 6UAAAUU -UC6U6CC66UU- 66UAAAUC -UC6U6C CAUU-A66AUCAAU -UC6A6CL

ACAUGACCAAAU-ACBUUCCACCUAuACUsAACC -UU6U6CC

CUANUUCCAAAA^C-AU6U6CCCAUBUBCCAAUAC-AAUAUCCU U-- AC U A A U U - U66 6 U 6 C C

U - - - - -IC A 6 A A A U - 6 6 A U 6 CU - - - - -flC A 6 A A A U - 6 6 A U 6 C CWCCCC-UUCUAAlCU-CC6UBCC

CCCC USCAAAlCU-CC6U6CCC CCC

A 6 C C A C C 6 C 6 UAqA C 6 A AA C C CA A 6 U(C)A A U A6 A AU C C - - - - --. 6 6 H.s.A6CCAC C6C 6 UICU4AC6AIUUIAACCCAA A(C)UAA UGAUCU UC------ 66 N.m.

A 6 C CA C C 6 CU 6 U|CA UA C 6AU U IAA C C CA AA (C)AUA A U UA U- - - - - - UIU U(C)6 6 R..A6CCACC6CE6UUCAUAC6AI6JJACCCAAUl(C)AAACA 66U-A--^UG CUU Nt.

A 6 C 6A C 6 CI6 6 U CA UAC C6 A UU AUA CAAAA U A U A UU6 - - - - UA 6 U UA 3.;.A 6 C C CU C 6 CI 6 U U U A C 6 A I A A A C u CIUIA 6 U 6U U C A -CI U u U UI 6 6 X.I.iA6CA6UC6C6$UUAANACCAAUAAUACAAAUAAAUUA UCUC---- AAUA6UrU AD..A6CA6UC6C1U6UA CACAAAAf666CUA6C6UUAAuCAuAAAu-----366 S.c.

A 6 C A 6 C C 6 CS66 U AAUVAC A U 6A16 U A 6 C UA 6 C 6 U U AA U C 6 U C U U U A 6 P.r.A6CA6CC6C6CUA UACAU6SA6UA6CUA6CUUUAAUC6UCCUUAUU6A6 P-t.A6CC6UU6C66UUAAU AUCUAUUCUUUUAAA U A U UA UAC - - - - - - - - - - - C.f.A 6 C C 6 U U6C66UAA U C U AU 6 CU U U AAAU A U U A UAC - - - - - - - - - - - T.b.A 6 C C 6 U U 6C166 UAA^ qu c u A u 6 CUUtUlU A U A U U A U A C - - - - - - - - - - - L.t.A6CA6CC6C166UAJ1N6ACU6A666666 AA6U6UUCAUAUACt6.6AU6ACU666Z.A 6 C A 6 C C 6 C166 U AAU6 A C 6 6666666 CAA 6 U 6 U U C U U C6A6 A A U 6AiC U 666 T..A6CA,6CC6C6UUAUCC6666666C66UUCAAU6UUC UUC66AA AUA6C U 666 OB.

s ' 816 828A1 4

42434445464748495051525354555657

r93

7 1. . . . . .

.. . . . . .I

Nucleic Acids Research

850 860 870 880 890 900 910

2... .I..AAIAA.CIC..AI.U..A ....I...AIC- ..IC...I....I..C.I....I....I....IH.S. U U A A A A A G C U C G U A G U U G G A U C U U 6 - 66 A G C- 66 G C G 6 6 C G G U C C G C C G C G A G G C G A G C C A C C 62 M.I . G U U A A A A A G C U C G U A G U U G G A U C U U 6 - G6 A G C - - - - - G G G C G G G C G G U C C G C C G C G A G G C G A G - U C A C C G3 R.n. G U U A A A A A G C U C I U A G U U G 1 A U C U U 6 -G6 A G C - 6 A G C G G 6 C G 6 U C C G CC ICG A G6 C G A G C U C A C C G4 R.n. G U U A A A A A 6 C U C 6 U A G U U G G A U C U U G G G A G C-- G C G UG C G G U C C G C C G C 6 A G G C G 6 C U C A UC GS O.C. G U U A A A A A G C U C G U A G U U G G A U C U U G U G G A G G G U G C G U A G C G G G C G G U C C G C C G C G A G G C G A G -C C A C C G6 X.l. G U U A A A A A G C U C G U A G U U G G A U C U U G G G A U C - - - - - G A G C U G G C G G U C C G C C G C G A G G C G A G C U - A C C G7 A.S. G U U A A A A A G C U C G U A G U U G G A U A U AG - 6U C U C - - - G U CC G6 U G 6U G C C G C C U C A C G G U G G U C A C U G8 S.C. G U U A A A A A 6 C U C G U A G U U G A A C U U U G- G G C C C - - - - - G G U U G G C CA U C C G A U U U U U U C G - - U G U A C U G G9 P.d. G U U A A A A A G C U C G U A G U U G A A G U U U A -A G G U U - - - - - U A C C G G G U U U A U G U C A U U U A C C A - C U U C G U G G U10 O.n. G U U A A A A A G C U C G U A G U U G G A U U U C U - G G G A G - ----A G C G C C A A U G U C G U C U U G U U G A C --U G U G C A G C

11 S.P. G U U A A A A A G C U C G U A G U U G G A U U U C U - G G G A G A G C G C C A A U G U C G U C U U G U U G A C - - U G U G C A G C12 T.t. G U U A A A A A G C U C G U A G U U G A A C U U C U - G U U C A - - - - - G G U U C A U U U C G A U U C G U C G U G U G --A A A C U G G A13 O.s. G U U A A A A A G C U C G U A G U U G G A C C U U G G G C G C - - - - - G G C C G G G C C G G U C C G C C U C A C G G C A G G C A C C G14 Z.a. G U U A A A A A G C U C G U A G U U G G A C C U U G- G G C C G - G C C G G G U G C C G C C G C C G U A C G G G C A G A - A C C G15 G.m. G U U A A A A A G C U C G U A G U U G G A C C U U G - G G U U G - G G U C G A U C G G U -C C G C C -U C C G G U G U G C -A t C G19

16 H.c. C C U A A A G C G U C C G U A G C U G -

17 H.h. C C U A A A G C G U C C G U A G C U G18 H.v. C C U A A A G C G U C C G U A G C C G -

19 H.m. C C U A A A G C G U C C G U A G C C G -

20 Pt.h. C U U A A A G NG U CC G U A G C U GM .V. C C U A A A G C G U C C G U A G C C G --_22 I t. C C U A A A G C G U UE G U A G C C G -213 S.S. C C U A A A G C G C C U G U A G C C G -

24 T.t. C U U A A A G C G C C C GU A G --G

25 A.t. C G U A A A G C G C A CE G G C G -

216 P.t. C G U A A A G C G U G C G C A G G C G -217 E.C. C G U A A A G C G C A C G C A G G C G -

28 P.v. C G U A A A G C G C A C G C A G G C G -

29 P.s. C G U A A A G G G C U C G C A G G C G30 H.c. C G U A A A G C G C G U G C A G G C G -

31 c. C 6 U A U A 6 6 G U G C G U A G G C G -

32 M.s. C G U A u A 6 66U G C G CG --G33 Pf Uy U A A A G G A G C G U(A)G G U G-.34 F h U U U A A A G G6 U G C 6 U A G G C G35 Pt.d. C U A A A 6 G C A C G U A G G C36 Y. cl U A A A G C G C G U G U A G G C -G37 A.-n- C U A A A G C G(C)C U 1 G G C G -

19

38 E.g. C G U A A A AG U U U A G G C G -

39 C.r. C G U A A A G C U C U G U A G G U G -

40 Z.m. C 6 U A A A 6 C 6 U C U G U A G G U G -

41 W.t. C G U A A A G G U C U GUA G -G

42 H.s. C G U A A A G AJG U G U U U1U A G A UIC --------------------------------------- - - - - - - - --43 A.*. C G U A A A A G U G U C A A 1U A UA1 - - - - - - - - - - --------------------------------------44 R.n. C G U A A A A C G U G C C A A clu A U A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

45 B.t. C G U A A A A C 6 U G U -

46 X.l. C GU A A A C G 6 U 6 A A A G U - -

47 D.s. GA A U U A A A U U 6 A U U U A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -_48 A:,n. U U A A A G G 6 UW ACC U - - - - - - - - - - - - - - - - - - - - - - - - - - - - _49 S.c. U UU A A A G 6 AUCCGU A A A3 - - - - - - - - - - - - - - - - - -

S0 P.P, C U A C A G A G U 6 Ai A G C G G C 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

51 P.t. C GU A C A G A 6 U G A| U C6 G C G|- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

54 L.t. - - - - - - - -_SS Z.B. nU A A A G G G C A CG U §G G -G- - - - - - - - - - - - - - - - - - - - - -

56 T.a. C G U A A A G G|G C A C| UAG G C G - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

57 O.s. C GU A A A GG| C A CG AAG G C G|- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

1851 ~~~~~866 870 886 896 900 16

r94

Nucleic Acids Research

?n930 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~9809,~...I

C Ct -6G U (CC-ACGC(CC(CUU6 C C U C U C 6 G6LJG C C U C C C C 6 A U 6 C 8(C686AG8C(8 A686000UC C[L6 -0G00CGC C C 0880A 8,O-

C00CjGUC88 AGCCG6 66C6CCGGCUCUU((88 CA U G66U(--88U 686C((80 - 0 U 6 6 U C 800,6(08, C~j 0 AG X.c. P

C C U C8 ALC -6GACAUUC U 6 C A U C A A C U G A A U C66 CU C U Ui C A C U ,166 ~U C68 C LIUG 8 6 GC CG A t.. 1U C C A 6I C U C - - 6 A C C C U A-LU C U 6C 6 6 C 6W A U6 C 6 C C C U6 6C C U U8 UC U6 6 C C 66 6 0 0 C6 U 0 C, C6,P 6 C 6 0-. IGGCG6CUCUUCCAUCCU--UCUGCC6UCAUCCI-UUICGGUA6UU 8U---6CGUC66UCG U GGGIUICC6[AL Za.- 1406 GC6G6C UC -UUCCAUC C U--UCU C6U8UU6AC6-jCUUUCGG6U AUCU UACU--GIC-IUAC66UCGU GCCIC UC A AU6 S... 11

Elg9-1 E19-1'

....1..vH.v I8

H..Mh.v221

M, f 22s 23

t.- 242

.A0.. 25,-P .t26`

`

.-P . 29

. 302.H 313.C 32M 333

F-h- 34

1.-,h 3336337

.-A 38

.Cd. 393

.-C . 40t.- 414

s.N.t 42

4- .. 434

.-M 44

.-R . 448.-1.t 46

.. 47

A.P 484.. 49S. 4

P-.. 51

...t 523.-,. 535It.-T 545.-A 5

920 930 940 950 968 970 988

r95

Nucleic Acids Research

990 1?0 11 12014 105?

3 R.n. CGU U UA C U UGU6AA AA AA U UAG A G U UU CA A AGCAAGG CC CG AG CC GC CU AGGA U AC C G- CA GC U A GGAA UA A

A Xlc. CGGLIU UA C UU U GA A A A A A UUA GA UUUGCCAA A GC A GGC CGC6AGUC G CC UAGGA UACC GU-C A GCUCA GG AA UA A

7 A.s. C GU U UA C U UGUA AC A A A U UAG6A GUC UUA A A GC A6GGUXC A CCAC G C CU GA A UA AC A - C A GC A UGG A AU G ABS.c. CU UUUA C U UUA AA A A A U UAGAGUGGUU CAA A GC A 6 - CGU A UU GC UC GA AGUA UA U- UA GC AU G GAA UA A9 DGd. AU UUC A C UG6UGAG6A A A A UU G UGGUU U AAA A G CA GG -CGU C UCG CC UG AUC UU U UG CA G CAU GG UA U 6AG0OOn. A UUUU A C(C)UU GA GA AA A UU AG(A)G U GUUCC A 66C A6 6- - -C UCGC G CCGG AA UA CAU - UA GC AUG G AA UA A11 S.r. ~JUU UUAC(C)U UG6A GAA AA U UAG(A)GUG6UU CC AGGC AGG - - - CU CG6C G C CGGA A UA CA U- UAG6C A UGG AA UA A121T.t. A UU UUA CU(G)U GAA AA A A UUA(G)A G UA6UUC CA GG6C A G- -- U UUUA GC CC GA A UAC A U -UA 6C AU G GAA UA A130O.s. C C GU UA C U UGU6A A GA A A U UAGAGUGC UC A A A C AA- - C CA UCG6C UC UGG A UA CA U- U AGC AU G GGA UA A140Z... C CG UAUA C U UUA AG6A A A ULUA GA GU C UC A A AGC AA- - C CA U CGC U CU G GA UA CAU - UAG6C A UGG6A UA A15 Gm. CU GU UAC U U UGA A GAA A UGUAGA6U G C UC A A AGC A AG- - CC UA C GC U CU G UAGUA CAGU- UA GC AUG G GA UA A

E19-2 E19-2'9

16Hc-A----. ----GCUGAA--CAAGU----CC6AAC17H,h.---------------------------------------------------------- GC U GA ACAAUC ClOAn---------, -------------G6C CACGAGGU UC19H.m---------------------------------------------------------- G CCAC6A GG UC C20 M.h----------------------------------------------------------GA(UA UA CAAUCC21Mv------,-GU----CCAG----UAAGU----CC--GU 6UAA22M4.----.-GCU----UGAUAA--GUCU--6C AUAA232 s-----.n.------GCCCA---CCAA----GUCG--GC CCAA24Tt-1.1. -----GCC----CGGCAA---GUCG-------6 G6CA

25At-----.t ----.----A----AUUU----AAGU----CAUAUU 'G26Pt------.-GU----U----UAA--GUACA6UAA27Ec-----.- ----GUA---AGUC---AGUUA28Pv-.-----CAAU----UAAUCA~UUAA29M--9----.----.-GU-----UUCU----UAAG----UCU--6U UUAA30H3----.c.----GACA----U(AA-----GACAUCUA6 C31 Mc.-----U CA----U6UUUUUAG U32M s---M ---- ----CAAG---U--UUUUUIA U33B3-3 ----GACU----GGUAA----G--6A GUAA A34Fh-4-----.-GC----CUGU----UAAG--GC UUAA A350.d---6.-----GU(A---GU----GAA--GUCA--G(AGU AG C363 x-----.x ----CCU----GACA----AGUC-----G--CC ACAA37An----A.n.-----GUUAA--UCAA----GUCU-------6 AUCA

38 Eg----------. . ------------- GU C(A)A 6 U J u U U39 Cr.---------------------------------------------------------- 6C U C ~A G UC U400.Z.&----------------------------------------------------------G6C UU WiAAGU C C41 N t--Mt.---C---UUG-UCCA 6 U

42 H s---M--.43 M m---Mm.44 R n---Rn,45 D--LI.46 X 1---.1.47 D w-D.--.48An----.----.------AAA----AACUAAAIIAA49 S--.c.50 Po -. 1-AU--Si t---.--.. 6 U C A U A52 C.f.--m-m--m-AAUA.53 T.bA. ----AJAX--AA AUAAA54 L.t.------------ NAWXmWmmrMMXXV4JuMIACLiMU CMM1AMCUMUI55SSZ. ---- 6(0 U656TSAT.----.---------U---AA---66---6AAU06 066

"6 I 1004? 101? 102? 103? 104 1 1 056

r96

Nucleic Acids Research

UGGAAUAGG(A)C-C(G)CGGUUCUAUUUUGUUGG-UUUUCGG-AACU-G(A)6G(C)CAUGAUUAAAGAGGrr 'GGCC H.S.UGG A A U A G G(A)C-C(G)CGGUUCU A U U U U GUUGG-UUUUC GG-AACU-G(A)GG(C)C AUGA UUAA GAIGGGACG GCC r.N.

UGGA A U A G G(A)C-C(G)CGGUUCU a UUUUGUUGG-UUUUCGG-AACU-G(A)GG(C)C AUGA UUAA GAIGGGA C GG CC G.re.U GG A AUC A C. G(A)C-C(G)C GG UU CUAAUU UU GU UG G- UU U UC GG -A AC U-G(A)G G(C)C A U GA U U AA GA|G GGA C G|G C C O-c-lU G6G A A U A G G(A)C- U C C GG U U C U A U U U U G U U0G 6 - U U U U C G6G - A A C U-G GIC)C A U G A U U A A AGGG A C GG C C X..UGG A A U AG TAC-CUCGGUCUU A U U AUGUUGG-UUUUCUG-GACU GAGGUA AUGG UUAA GAAGAC AIG A C A.s.U A GA A U A GG ACGUUUGGUUCUA UUUUGUUGG-UUUCUAG-GACC-AUCGUGAAUGA UUAAU(A)GGGA C G|G U C S.c.UGAAACAUGACAUUUUA(C)GCUAUU-----GG-UUUGCGU-(U)UAA-AGUGUAAUGAUUAAU(A)GGGA(U)GGAU tid.UAGAAU AGUGAC--UUAUCGUCUCCU UUGGG-UUUGAGG-GACU-GAAGU AAUGA UUAAU(A)GGG A U AIG U U O.r.UASA A U A GG AC--UUUAGUCUC CUUUGUUGG-UUUGAGG-GACU-GAAGU AAUGA UUAAU(A)GGGA U A|G U U S.F.U G6 A A U A G G|A C - - - U A AGUCCA UUUUAUUGG-UUCUEGG-AUUU---GGU AAUGA UUAAU(A)GGG A C AIG U U T.t.C A U C AU A GG AU-UCCGGUCCUAUUGUGUUGG-CCUUCGG-G A U C - GGAGU|AAUGA UUAAU(A)6GG0 C A|6 U C O.s.C A U C AU A GG|AU-UCCGGUCCU|AUUGUGUUGG-CCUUC|GG-GAUC-GGAGU|AAUGA UUAAUA)G6GGACA GUC 2...C A C C A C A GG U-UCUGAUCCU|AUUGUGUUGG-CCUUCGG-GAUC-GGAGU|AAU6A UUAAC(A)GGG A C AGUC 6.0.

E19-3 E19-3'

GUUGGG|A A A|U C U GUCCG|CUUAAGUUGG G|A A A|U C U G U C C G|C U U A AA U C GG G|A A A|U C C GCCAG|CUCAAGUCGG G|A A AUC C A C C C G|C U C AACUUG A6AAAACC CCCGGCUUAACU6U0t.AAIU0CUCU6GIC0UAUC UGGUG_AJA AUC UC AC GGC UUA ACCCCUIUAAAUCCCCGGICUCAACUCCUIGAAAICCCCAGGICUCAA

UUG|GG A C C G|G A A G A|C H.C. 16U U G|G G A C C G|G A A G A|C H.h. 17UUG|GG A C C G|G A A GG|C h.v. 18U U G|G G G C C G|G G AGAIC H.a. 19U A GIG G A C C G|GG A G A .6M.h. 2CUUGIGG A C C GIG G A GAIG A.v. 21UUG|AGGCCG|GGAGA|G M.f. 22U A G|G GGG C G 0GG A GA|G s.5. 23U A GIG GGGICGGGAG AIG T.t. 24

GGGGUG|AAAUC|CCAGAGGUGGUG|AAAUC|CCCGGGGAU6GUGAAAUC|CCCGGGGAUGUGAAAGC|CCCGAGGAUGEGAAAGC|CCCCGGG6AG66U G A A A GUC C06 6 GA

GAGGU|AAAG CC CG GAGGAGGUUAAA6UCC 66A66 GUGUGUA AGUUUCCGGG6GGUGAAAGACGGOAGGGGGUGAAAUCCCACGGGGUGUG|AAAGC|CCUCAGGUUGUCIAAAGCI686666

P19

CUCAACUCAACUCAACUCAACUCAACUCAACUCAACUCAACUCAACUCAACUCAACUCAACUCAA

A A U 6 U U]AA A A G|U C A A A G|C U U A AA A U G U C|A A A UAAC C A 6 6 G|C U C A AG C C G U CIA A A U CIC C A G G G|C U C A A6 C C G U C|A A A U CC C A G G G|IC U C A A

CUCUGGCCUGGGCCUGGGCUUGGGCCGGGGCUCCGACUCC6GCUCC66CC6UAACUAUC6CC6U66CU6A66CCUCAU

AA C - - U G C C U U - U G A U - ACAA C - - U G C C A U - U G UG-A CAA C - - UG C A U C - U G A-ACAA C- -U G C A U C - U G A A C

A8 G - -GUC A U U - G G A A - A CA A C-- CC U u - G G AA - A CU U C- - GC C U U G - AAl A - CLUU C- - G C C U U G - A A A - C

A U 6C A G C -U G -A CC A G-- U G C C C U -U G A U A CAA C- U G C C U U -U G A U A CAA G-- U G C C C CCG AFA CAC- - 6 6 C A A U- 6 G AIA - A C

|U G G G U A U CIU U G|U G C A A G G C|U A GIU G G C A A G CIU U GO6CAGF-GGCUAG

IU G G G G A A CIU U GUGGG)AUGUCUUGG U|UUUA C - A GG U UUUA CLU - A GU G U C A G U CU U GU G A C6 G G C|U U G|U G C A C A A C|U U GIU G U U G U G CIU U GjU G A U U G A C|U A G

IUT

A GUAUGGA|AG A|G A.t. 25A GUGCGGC|AGA|G F.t. 26A GUCUCGU AGA G E.c. 27AGUCUUGUAGAGA v. 28AGUGCAGAAGAIG k.s. 29AGAGAUGGAGAAG He. 30AAUGCAAGIAGAIG M.c. 31AAUGCAAG AGA G M.s. 32AGUAOCAGUAGAG 8.9. 33A A U A C A C AG A|G F.h. 34A A U C C G G AG A G D.d 35A G U G C C GG A X AG Mx. 36A G U A U G G UA G GG A.n. 37

U U GA G U A U G G UAGG E.s. 38U U G6A 6 U[]C G G UIA G GIG C.r. 39U G 6|A G UAC G G U|A GG G Z... 40U G G|A G UJC G G U|A G .t. 41

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A C C C C C U C C C C A A H.s. 42------------------------------ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A U A A A U A A A M... 43

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C U C A U A A R.n. 44- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - U A A A 6 C A C C A U A C C A A A 1.t. 45M.mX.. 43

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - -I UAA A AIW .S. 47DC - - - - - - - - - - - - - - - - U A A A U - - - - - - - -|I6 A 6 Ul- A CIU u A U U U A C U A 6|6UU U U AUU| U A|A A.n. 48

- - - - - - - - - - - - - - - - - - - - - - - - - A A - U U A U A U A U U A U A A U U U A 6lA 6 U U A A U AIA A AIU S.c. 49A UAA C|U A U U U - U UU|- - - - A U A U - - - - - -p[A A A---A U A U A - | 1 R ULKU UUU U A|U P.P. so-AA8-A-8-GCCA8AUCAAAAAAAA 45

__ _ - _~~ ~~~~~ ~~~~~~~~~~~~~R _ _ -- - -- - - - - -- - - - -- - - - - - - - - - - - - - - - - - - - - - - C.f. 52_x_K 8 t R [ ~~ ~ ~ ~~~~~~~~~~~~~~~O--- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - L t. 54

AMAA 6 U 6|A A A 6 - U C C C A|A A A A 6U G 6 C 6 6| AU - - - 6 C U(C)U - C 6 A| - AC C(A)A U U C A C|U U 6 A 6 U 6; A 6 A CIA 6 A|G Z.| 55A(A)A 6 U 6|A A A 6 - U C 6 C C A|A A A A 6|U 6 6 C 6 6| AU - - - 6 C U(C)U - C 6 A| - AC C(A)A U U C A C|U U 6 A 6 U 6 A|6 A C|A 616 T.. 56AMA)A 6 U 61A A A 61- U C 6 C C A|A A A A C|U G 6 C 6 6 A U - - - 6 C U(C)U - C 6 A -C C(A)A U U C A C|U U 6 A 6 U 6 A|6 A C 6 O.s. 57

107C1080 ,09C110 111668 1128

r97

6789

1112131415

CGGGCAGG C-G UCCAGC-GGA A7-ACUGUUCAGCICGGGCAGG C-G UCCAGC-GGA A-AC UGUUCAGCCUGGCGG6C-G UCCGGU-GAA A-A C C[C G6UGGCCGGGUGGGACG UCCG6C-GGA A-A CCjIAUCG0CCC 6 G U6G GL U CA GG6-3 A A A - C UG- U A U(U)U C

iCCAGAGGAC- GGCAG6-6AU A-C UG-CUGGACCCCGUGAGA A-U GCUGGAGAU A-C UA-UUAGGCiCCG6G6A C-GGGGGGC-GAU A-C UG-GUGGGCICCGCUGGGGGG GGGG6C-AU A-C UG-CCGGGC

P19

C0UUUGGAAG6--G6CAUU-AAAA-jiEIUGCU(A)GACIiCCUU6GACC --6GCAUU-GGA6UArCU A G6AGCIiCCCUGGACA --6GCGGU-GGA A-AICU ACC1AAGCiCCCUGG ACA --GGCGGU-GGAA-A C U AA AAGC

Nucleic Acids Research

U U.s. 6662 M.a. 6663 R.n. 6664 R.n. 666S O.c. 6G 66 X.1. 6667 A.s. 6668 S.c. 6669 D.d. 666lO O.n. 66611 S.r. 66612 T.t. 66 613 O.s. G G 614 Z.a. G 66156... 6 6 6

6 6 C A U U C 6 U)U U 6 C 6 C C 6 - C U A 6 A 6 6 U 6 A A6 6 C A U U C 6 UMA)U U 6 C 6 C C 6 - C U A 6 A 6 6 U 6 A A6 6 C A U U C 6 UWAAU U 6 C 6 C C 6 - C U A 6 A 6 6 U 6 A A A66 C A U U C 6 UMA)U U 6 C 6 C C 6 - C UAA GAG G U G A A6 6 C A U U C 6 UAU U G U 6 C C 6 - C U A 6 A 6 6 U 6 A A I66 C A U U C 6 UMA)U U 6 U 6 C C 6 - C U A 6 A 6 6 U 6 A A66 C A U U C 6UCG U 6 C 6 A C 6 - C UA 6 A 6 6 U 6 A A I6 6 C A U(C)6 6 UAAU U C A A Ua6 - U C - 6 A 6 6 U 6 A A A6 6 U 6 U U C A U(A)U U 6 6 U 6 6 6 - C G A 6 A G 6 U 6 A A I66 C AU AUG U(AUA AAU -AU U- C AG A 66 A A AU6 6 C A U U A 6 UAU U U A AUA G - U CAJ6 A 6 6 U 6 A A A6 6 C A U U A 6 U(A)U U U A A U A 6 - U C A 6 A 6 6 U 6 A A A66 C A U U C 6 U)U U U C A U A 6 - U C A 6 A 6 6 U 6A A A6 6 C A U U C 6 UMA)U U U C A U A 6 - U C A 6 A 6 6 U 6 A A A6 G C A U U C 6 UAAU U U C A U A 6 - U C A 6 A 6 6 U 6 A A A

20

C U GA GG - - - 6G U(A)C G U C U 6 -G U A GG AC U G A G G - - - G U(A)C G U C U G G G - 6 U AGG AU C G A G G - - - GG U()CG U C CG G 6 - 6 U G66 AC A G A 6 A - - - G UA)C G U C C G G G - G U A 66 AlG U G A G A - - - G G UA)C U G C C GG G -G U A6U 6 A6 A C A A G- - - GG U(A)C U C C A6U66 - 6UGG C 66 U U A 6 C - - - G 6UGA)C U C C C G 6 - 6 U A6A66GCGG6 G ---G GU(A)CUCCCGG(A)-6UAGG66 C C 6 C C - - -G 6 U()C U C C 66 66 - 6 U A 666

6 U A A 6 U - - - G G A A U U C C G A 6 U - G uFA 6 A 6I6666AU- - -A6AA UUCC6C6A-6U A 6CA I

6 66 U - - - A 6 A A U U C C A C 6 U -I 6 U A 6 C G

6GG66U- --A6AA UUCCAC6U- 6UA 6C6 I6 AG AG U - - - 6 G A A U U C CAC6 6 U- A 6 C 6G6AUAGAC- -- GGAA UUCCC6AU-6 AC6C66UAA6C- - - 66AA UUCCAU6U-6GUA C6 I6UAA6C- - - G6AA UUCCAU6U-6U A 6C6 IGU666C- - -GGAAAUUCGUGG6AU-UA6CG46 U A 66G C - - - G G A A U G U G A C AEH--GU A G C 666U66C- - -G6AA UUCCA6GU-6UAGGA IG G U 66 C - - -G 6 A A U U C C C C A(A)-GlJG A 6G U A 6 C G - - - 6G A A U U C C AG G U-t f-U A|6 C 6

20

GOU A A 6 - - - U U U C C A 6 U -U6 C 66 C A G A G - - - G 6 A A U U C C A U G U - 66 A 6 C 6G6CA6AG- - - 6AA UUUCCGGU- 66A 6C6I6CAGAG- - - 66_AAj UUUCC6 GGAU-A 6C66i

6 U 6 A A A6 U 6 A AA6 U6 A A A6 U 6 A AA6 U 6 A A A6 U6 A AA6 U 6 A A A6 C 6 A A A|6C 6 A A

6U6AA6 U 6 A AAUGU6 A AAG6 A AU

IU6AAIU6AAA6UGAAA6U6AAAIU6AAAIU6AAAU6AA A

IU6AUAiU6AAA

6U6AAiU6AAAi6GAAAAG6AAA

UUCUUCUUCUUCUUCUUCUUCUUCUUClUCUUCUUCUUCUUCU UC

UASGSiIU U6G6 A CUGU66A CDUU66A CUA66ACUU66A CAU GGAC

DUG66U

UC6GAUUCAG6AUUA6A UU A66 A UJUG66 AUJUG66 AU

I. . 7.? . . .

C 66 C 6 C A A(G)A C 66C 66 C 6 C A A(G)A C 66C 66 C 6 C A A()A C 66

C 66 C 6 C A A(6)A C 6 ACG C 6 C AAA C 6 AA A U 6AU6 AAIG AC ():U3UU6AAWA)CUMICUC A U C A A(6)A U 6 A

U U A A A(6)A C U AA6UU A A A(G)A C U A

UUAUUAA6(6)ACUAAAUUAU6UAA(6)AC6AAUUAU6AAA(6)AC6AAUUAU6AAA(6)AC6A

CCA6A6C6AAA6C^ CCA6A6C6AAA6CIC C A 6 A 6 C 6 AA A 6 ChC C A 6 A 6 C 6 A A A 6 CkC C A 6 A 6 C 6 AA A 6 CIC C A A A G C 6 A A A 6 CIC A A C U 6 C G A A C A AIC U A C U 6 C 6 A AA 6 CIC U U C U 6 C 6 AA A 6 CICAUAA6C6AAAGCC U U A U 6 C 6 A A A 6 CC UA A U 6 C 6 A A 6AiCAACU6C6AAA6CCAACU6C6AAAGCCAACU6C6AAA6C

20'

U C C UA iTi) A U C C U 6 6 A C G(G)A C C 6 C C 6 6 U 6 6 C 6 A A A 6 CU C C U G U A A U C C U 66 A C G(G)A C C 6 C C 6 6 U 66 C 6 A A A 6 CU C C 6 U A A U C C U 66 A C G(6)A C C A C C 6 A U 66 C 6 A A A 6 CU C C U 6 U A A U C C U 66 A C G(G)A C C A C C 66 U A 6 C 6 A A A 6 CA CCU 6UA AUCC C 66 6A G()A C C A C C U AUG 6 C 6A AG 6 CA A6U UU6 AUC C UU 66A G()A C C A C C U A U66 C 6 A AG 6 CC C U AUA U C C C 666 A G(A)A C C A C C U AG6 6 C 6 A A 6 6 C

UC C U A AA(A)C C 66 6 A 66 A C C A C C A 6 U 6 C 66 A A 6 CU C C U A U A A U C C C C 6 6 A 6(G)A C C A C C A 6 U 6 6 C 6 A A A 6 C

AU C 6 AUGA|A U A U U C G 6 A G 6 A A C 6 C C A 6 U 66 C 6 A A 6 6 CU 6 C 6 U A 6|A U A U G C 6 6 A 6 6 A A C A C C G A U 6 G C 6 A A 6 6 CUU6C GUA6A6 AUCUAG6 6AAAUACC66UG6CGAA66CU6C 6UA6A AU6U66A6 6AAUACC66UG6C6AA66CU6C 6UAG A6 AUA6U666A6AC ACCCA6U66C6AA66CU6C 6UAA6 AUAUC666A6 6AACACCC6U66C6AA6GCU6C 6UA6AAUUAAU66AA 6AACACCU6U66C6AAA6CU6C 6UA6 AU AAU66AA 6AACACCU6U66C6AAA6CI 6 |L6 A UA U C A C 6 A A 6 A A C U C C 6 A U U 6 C 6 A A 6 6 CU 6 CC A S6 A3uAU 6 U C A C A 6 A A C A C C 6 A U U 6 C 6 A A 6 6 CU C C6 A U C U 6 6 A 6 6 A A C A U C A 6 U 6 6 C 6 A A 6 6 CU U C U A 6 A UWAU 6 G G 6 A G 6 A A C A C C G 6 U 6 6 C 6 A A 6 6 CU 6 C|UA 6 AUA U C U 6 6 A A1 AA C A C C A 6 C 6 6 C G A A A 6 C

20',

U 6 C 6 U A G A 6 A U U 6 G A A A 6 A A C A C C A A U 6 G C 6 A A G 6 CU 6C 6 U A 6A6 AUA U 6 6A6 6A AC AC C A6U 6 6C 6AA 6 CUGC AUU6A6 AUC6GAAA 6AACACCAAACGC6AAAGCUAC 6 U AUA C 6 66A A 66 A A C U C C 6 6 U 6 6 C 6 A AUA 6 C

42M.s. U AUAAGC- UACCC-u-SAA U 6 UAC6GAAUGU43U.. U A6AAA. C CUAAC -UUAAuGU U6AUAAUA CC AAUU ACCUAAACUCAAU AAC GAAAUA44 .n.U A-- - - UAUCC AAU C UUAAUA6AU6AAAAUUCAUU6- UAUUC CAAA6CC C A AUUAACG6AUA45 .t. UA666 U- - - 6AAU(UAAC-UA6CU6UAAAAA6CC AUG6--UUAAAAAAAUAUAAA U6AC 6 AAA 6U

476D.. UAAAC--------- -U UAA U P u uU6AUUACG--U-UGUAU GAA-A--C--C--C-- -----U-A-

46A.n. 6AA66A-- -A 6A A CUU66C-6UA6A AUA UAUACUC6UAUAUAUlAC AGACU66UAAC66C6AA66C49S.c. UAAUAAU--A6AAUAUAAU - 6U AAA6AU6AAAUAA UAAU AAUAAUUAUA6ACUAAUAUUAU6U6AAA56P.p. GU(U) UGU---IU6 G,6 ICUU GU A-F CU GAU UAA*^^U AC UGGGAG6UUUACAGGAACAUACAAC-GCGAAAGCSliPt. GU(A)AGU---IU GAGCU A6U A-lCU AG6U AAUA^^lJCAGGGAGUUUACAGGAA^CAUACGAC-6CGAAAGC52 Cd..S5 P.P. 6 A 6A 6 U - - - U 6 A Ag CU U 6 UU A A A - 6lU 6 A A A UA A 1U AC A6 C§-ASU C AC A 6 6AA CA UC C A AC 6 C 6 A A66 C

S5T.a. 6 6 A 6 U - - - 66 A|U U U C 6 U 6 - 6AUU6 6-6 U 6 A A A|U C C|6 U A 6 A U C|U A C 6 A A 66 A ACC C A A 6 C 6 A A 66 C57 U.s. 6 A 6 A 6 U - - - 66 A A|U U U C 6 U 6G- 6 U 66 U 6 A A A|U C c166 A 6 A U c U C 6 A A 6 6 A A C 6 C C A A A A 6 C 6 A A 6 C

113 114 11 1176 1166 119

r98

16 A.c.17 R.h.16 M.v.19 H.D.20 M.h.21 M.v.22 At.23 S.5.24 T.t.

25 U.t.26 P.t.27 E.c.28 P.v.29 D.s.30 H.c.31 A.c.32 M.s.33 6.f.34 F.h.35 D.d.36 A.x.37 A.n.

38 E.A.39 C.40 Z..41 A.t.

.1

Nucleic Acids Research

12on 121?A U U U B C C A A B A A U B U U U UCA U U AA

A U GUSU C C AA 6A A U 6UU U UI AI GUA AA U UU 6C C A AB A A U 6 U U U UIC UA A AA U U U S C C A A 6 A A U B U U U U CA U U A AAUUU6 CCA A6A A 6UU U U IC A GUUA AA U GUSU C C A AS6A A U 6U U U UL.AJU U A ABUUIUCCAASAA*^USUUUUICAUUAAl

6 U U U 6 C C A AB6 A C6 U U UUICM U U A AA U U C A C C A A A U A C(G) C C C(C)A U U A AA U U U 6 C C A A6 A U6 U U U UICA U U A AA U U U 6 C C A A 66 A U 6 U U U UCA U U A AC U U U S C C A A A I A U I U U U UICMA U U A AA U U U B C C A A 6 B A U B U U U UICMA U U A AA UU U S C C A A 6 5 A U 6 U U U UMCA U U A AA UU U B C C A A 6 S A U B U U U UMCA U U A A

19'

124?1_ , , .123?

21

1 12on . 121?A U C A GA U A C C 6 U C B U A H.s.A U C A 6A U A C C 6 U C 6 U A M.D.A U C A 6 A U A CC 6 U C 6 U A Ron.A U C A 6A U A C C 6 U C 6 U A R.n.A U C A 6AU A C C 6 U C 6 U A O.c.A U C A 6 A U A C C 6 U C 6 U A X.1.AUCA6U ACCC6CC CUA A.s.AUCU6AUAC C 6UC 6U A S.c.A U C A 6A U A C C 6 U C 6 U A D.d.AU C A 6A UAC C 6 U C C U A .n.AUCA6AUAC C6UC CUA S.P.AUCAIAUA CCUC IUA T.t.A U C A 6A U A C C 6 U C C U A O.s.AUCA6AUAC C6UC CUA Z..AU C A6AUACC6UCC GU S.&.

21'

6 CCUCA 6 6AAA C 6A6 UCCS CCUCA66 A6AA C66S UCCCA CCUCA6 AAA6 C6BA UCC6 UCUCU6 6AIAA C 6A6 CCCIAUCUCACCA6AACC6 UCC1ACUU6UCU 6AA C66 6 UCCI

6 6CUAACU 66AA CC6 6 CCU6 CCCC6CUA 6AACC 6 CCC

I

BB666C66C ACACC 6 CCiC

I66CUGACU68UCCAUnjACUIA A U C C C C U6B6 6 C C1CTA C UI6 6 C C C C C U 6A6C 6 6 A C U_6 C CC C C U 66A C A AA CU6 ACUCUCU 66UCUU6A ACU6 6 C U A U C U 66 AC A U U A U C U6 6CUUACU66C U U 6UUA U U6 6CUUACU 66 CUU6UU UUAA6CUCACU 6 ACU6 AAACUIA 6 C U U A C U 6 U 6 U 6 A U UI6 6CCACCU 6BCC6U6UUU I6 6CCACCU 6AC66UAACU'6 C6CUACU 6JCCAU AC U

ACUUU(U)CUAT 6CCA AIII CUI6 C UCUUC GBU6B6 CCC C6 AA CU IlAIC U C U 6C I6 6 6C C 6 AC CUA C U C U 6 C U 6 6 C C 6 A C C U

6 A C A 6 U 6 A 6 6 6 A C 6l AA 6 C U A 6 6 6 UC U C 6 A A C C 6 6lSA1CABUISASISBAC GAAAB^ CUASBBUICUCBAAICC BSlSAlCSSUIBASIBBACBAAABCUAS6SBGCUCSAAICCBBA C 6 6 UG6 A 6B6 6 A CA AA6 C UA 6 6 6 C 6 A AIC C 6 6

BACA6B6UI6AS B6S6A CS6AA AB6C UUBS6S6A6SUC 6A AA C CBS

BAlC BBU BAS6UAACSAAABCCABSBSC BC BAAlCCAlC BU S6 C CCS BC AlCAC AC U16A1666 CB6AA AS C U B6666A6CA AAC 66A 6 G6IC A A C 6BA A A6 CS 6 GA6 C 6A AAC

GA1CSCUICABISUBC BAAASl C BGBBSBAS6C AAAlC ABB

BSAC 6 GU A 6I U BAC S6A A AG6 C G SAS6C6C6A AAC

6 A|C 6 CU

6A 6 AABC AA ABC6 U 6 66B 6 A 6|CACA 6 6A C C U1.C.ZI6 C ABC6 A A AG6C 6GUGGB6 BAS6CA A AC AS66 AC 6 C U|6 A 6 6 U 6 C 6A AqA6 C 6 U 6 6 6 6 A 6 C A A A C A 6 6

A1C SCU BgCA CB B6C S6CB A|C 6 C U|6 A 6|6 U 6 CW A A[A:6 C 6 U 6 6 6 6 A 6 C A A A| C 6 6B A|C 6 C U16 A 616 CA C 6 A A A 6 C 6 U 6 6 6 6A 6 C A A A|U A 6 6|

6 A C A C U(6BA U 6 C U C 6 A A A 6 U 6 U 6 6 6 UIA U C A A A C A 6 6B A 6 C U|6 A 6|C ACA A 6 C 6 U 6 6 66 U C A A A C A 6 616A S6C UG6 A 616U 6C6AAA6C 6U6666A6CAAACA666 A|C 6 C UG6 A 6 A C 6 C 6BA A AB6 C 6 U 6 6 6A6A 6 C A A AC A 6 6BAC6CUJC.AIU66(AICSAAA6CUA666B AC AAA666B

19' 21

A c 6c uG A AC6A A I

6BAC AC UGI A6A6 ACA A 0SAC AC U 6A 6A 6 AC 6BAA I

AC AC UI6A6jA S6ACB AA I

C6CUS6A666A6C A ICAI6C U6 66 6 6 A 6C 6A AIUA6 66C UA 6666B A6CACAAUA6 6 6

C6CUA6666SAS6 C 6A A UCE6 B

A U U A 6 A UA C CC 66 H.c. 16A U U AB A U A C|C C 66I6 U A H.h. 17A U U A 6 AU ACC C 6B6|6U A H. I8A U U A 6 AU AC|C C 66|6U A H.&. 19A U U A 6A U A C C C 66I6 U A N.h. 20AUUA6AUACCCC666UAI.n. 21A U U A 6 A U A C|C C G 6|G U A NM. 22AUUA6AUACACCC6A6UAS.s. 23A U U A 6 A U A CICC C USG U A T.t. 24

A UU A 6 A U ACA U U A 6 U ACA U U 6 A U ACA U U A 6 A UACAU U A 6 A UACAU U A 6 A UACA U U A 6 A UACA U U A 6 A UACAUUAGAUACAUUA6AUACA U U A 6 A U AA U U A 6 A UAA U U A 6 A U AC

CCUS6S6 U A A.t. 25CC U 6|6 U A P.t. 26C C U 6B6 U CE.c. 27C C U 6|G U A P.v. 28CC U 6|6 U A B.s. 29C C U 6 6 U A Hoc. 30CC U A|6 U A A.c. 31C C U A|6 U A M.s. 32CCU6B6UA[t.f 33C C U 6|6 U A F.h. 34C C U 6|6 U A M.d. 35C C U 6|6 U A .nx. 36C C C U16 U A A.n. 37

21'

A U U A 6 A UA CCC U 6 - E.S. 38

A U U A 6 A U A C|C C U A|G U A C.r. 39A U U A 6 A 6 ACAC C CCAG U A Z... 40U U A 6 A U A C|C C C A|G U A t. 41

6 t C U U UA- - - - - - - - - A C A U C U 6 A A A 6 A A UAIA U U C U A - - - - - - - - -- 6 U C A u * *i-J C A CA1AGuAuCuACu---- -A--CUUUALAAUC16ACAC6A6 AC C C U A - - - - - - - - - - C A A U A 6 C C£6 A CJ6 C A CU A U A

-IUUUUAUAAAAUAAUAAAU- - - U AAUUAAAAAAIUCCUUCUAUSUAAIA CU AC UU 6(6)ACAASI

U AUAl U U A A A U A U U A A C US A |AU 6 6 A U U A A A

G1A C UCA) C AUA UA A A A A A U A C 6 C U|6 A A A

S61A C UMAA C A A U - U A A A A A A U Cc AA CCB6AA AI

A 6 C U C U C U 6 U C|C C U A C C 6 A C 6 C U 6 U G C|6 A A

1206 12,1 122l

C UAAGA C CC A AA CUG66SGA U UAS6A UA C C C C AC U A A. s.B C U A A 6rA C C C A A A CIU 6 6 6 A U U A 6 A U A C|C C C A|t U A H.S.SCGAAGAFCCCAAACIUBSGAUUASAUAClCCCAlCUA Rn..B C U A A 6 A C C C A A A C|U 6 6 6 A U U A 6 A U A C|C C C A C U A Ron.SCAUABA6CCCAAAC UG666AU A6AUACACCCCACUA Ut.

I C U A 6BBA A A C A A A CIU 6 6BA U U A 6A UA C|C C CAC U A X.1.U U U AAUAA A A AAACAU6 6l U U 6 A U AC.CC U A.UU A D.

BC U U 6SUA 6 CA A 6AA 6 6A U UA 6A UA CIC C UA6 UA A.n.ACUAASUA6BCB6AAAC6GAUUC6AUACACC6G6UA S.c.IUAUA66AAUA6AAA66CAUUUUA CC CUA UGA P.P.BAUACUAAUA6AAA6SC AUUUUAAACCUAU A P.t.AUBAAAAB ACGAUBA ----U SAUA tUG UAAA C.t.AU6 UAA 6 6 AU UAA U - - - U 6 AAAAU 6 UAALT.b.U 6 AUAA 6 6AU UAA - - - U 6 6 AAAU U AAAU Lot.

6 C A U 6 6 6B 6A C6 AC A A U U 6 A UAC CCU 66 UA Z...SCABU66A666CBAACA66AUUAA6UACCCUB6UA T..6BC ABUB6B6C6C6SAU6C6U AUACCCGUS6-UA UOs.

1236 1246 1266

r99

U C A A6AAC 6 AUCAABA6AAC6AU C A A|6 A|A CB AUCAASA6AAC6AUCAABA6AAC6AUCAABA6AAC6AUCAAA6AACA6AU C AAA6AA C 6AUCAASA6AAC6AUCAASA6AAC6AUCAASA6AAC6AU CA AA6A CA6 AUCA A B6A AC 6AUCAABAAC6 AUCAABAAC6A

A AG6 U C 66 A 6B6 U U C 6 AA 6 A C 6AAS6UCBSA6SBUUCSAASA*CSAAB6UCSBABS6UUCBAASA^CBAAGGUCSBABSUUCBAABA^CSAAS6UCB6SABBUUUBAABA^CBAAS6UCBSABS6UUCBAASA^CSAAB6UUAGASS6UUCSAABS6CSA6 U CU 6 A 66|6 U U C 6 A A|6 A C 6

AEA6 U U U 66 6|6 A U C 6 A A|6 A C 6AABUUUBBSSA*UCSAABA&CSAABUUABSGBA*UCAAABACSAASUUASSSSACUCAAASA^CSA B6UUBBSBB6CUCSAABA^CSAAU6UU6C6A6A6AUCAAC6AC6AA6UUA6666ABGUSCBASCGCSAAACS6

I

23

1415

42434445464748495051525354555657

Nucleic Acids Research

1iN.,. 6SU UC C6A^CC A UI20...BOUCC .ACCAU.2 1H.|. 6 U U C C 6 A C C A U A3 R.n. 6 U U C C 6 A C C A UN4 R.n. 6 U U C C 6 A C C A UO50.c. 6 UUCC6AC CAC6X.I. 6 UUCC6AC CAUA7A.s. 6 UUCUAAC CAU8S.c. 6 UCUUAAC CAU9 D.d. 6 U C C A A A CUA U

10 OOn. 6 U C U U A A C CA U11 S.P. 6 UCUUAAC CAUU12T.t. 6 UCUUAACUAUN130... 6 UCUCAAC CAU142.4. 6UCUCAACCAUU155... UUCUCAC CAUA

128Q

IACIACIACtACtACACAC

hACAC

tACtACtACACIACAC

60

606060B0JA

60

J6 C C 6 A CJ6 C C6 A CU 6 C C 6 A CJ6 C C 6 A CU 6 C C 6 A CJ6CC6ACJ6 C C A ACJ 6 C C 6 A CJ6 U C 6 A CJ6 C C 6 A CJ6 C C 6 A CUJAC C 6 A CJ6 C C 6 AC6 C C 6 AC6 C C 6 AC

E18'

In 1 30.? 131? .132 133?

6EC[i AF]C 66C6 6 6 AU-- A s A 6 6 6 6

U 66C 6AU6 C66C66 C 6UUAU - - ---- - -UCCC AU6ACCC6CC6 66CU 66C 6AU6 C66C66 C 6UUAU - - - --- - -UCCC AU6ACCC6CC6 66COU66C 6AU6 C66C66 C 6UUAU - -- - ----UCCC AU6ACCC6CC666C

:U 66C 6AU6 C66C6I6 C UUAU -- UCCCC AU6ACCCCAC66CU AC6 6AUC C66C6J6C AUUU - - -- -- --UCCCAU6ACCC6CC6ACC A 6 C 6SA C 6 C 6 6 A C 6 U UA - - - - - - - - U U 6 A U6 A C U C C 6 C 6 C

U A 6 6 6 A U C 6 6 6 U 6 6 U 6 U UlU - - - - - - - - U U U A A 6 A C C C A C U C 6 GEC A 6 6 6_ U C 6 6 U U A A A U uU- - - - - - - - U U C-A A U U U A A U C 6 6C

AU 66A6A UC66A66C -C6C- - - - -- --UUAUC-- C6CCUUC66 C:UA666 A UCAA666C6 - - C6C-- --- --UUUAUC- - C6CCUUU66 C

U C6 6A UC66CU66AA UAAAU-C--6CCA6 UC6ESCC6A U

CA 66 AIUC 6C66 A 6UU 6C- ----A-- -UUAU6 6ACUC C 6CC6 C

C A666 A UCA6C66- 6IUUAC- - - - -A-- - UUAUAC66ACCCCU 6 C

CA E66A6 UCA6C66EAUU 6C-- -- ----UUUUA6 6ACUCC6C U66 C

E21-1 E21-1'

16 H.c. 6 U C C U A 6 C U 6 U A A A C 6 A U 6 U C C 6 C U A 6 6 U 6 U 6 6 C 6 CA 6 6 CUA C 6 - - -|A 6 C C U 6 C 6 C U 6 U 6 C C|6 U -

17 H.h. 6 U C C U A 6 C U 6 U A A A C 6 A U 6 U C C 6 C U A 6 6 U 6 U 6 6 C 6 C A 6 6 CUEAC 6 - - - - A 6 C C U 6 C 6 C U 6 U 6 C C|6 U -

16 H.v. 6 U C C U A 6 C U 6 U A A A C 6 A U 6 C U C 6 C U A 6 6 U 6 U 6 A C A C A 6 6 C U|A C 6 - -|A 6 C C U 6 U 6 U U 6 U 6 C C|6 U -

19 H... 6 U C C A A 6 C U 6 U A A A C 6 A U 6 C U C 6 C U A 6 6 U 6 U 6 6 C 6 U U 6 6 CUA C 6 - - - - - - - -A6 C C A 6 C 6 C U 6 U 6 C C6 U -

20 Nh. IU CC C A 6 C U 6 U A A A C 6 A U 6 C 6 C 6 U U A 6 6 U 6 U)U C A 6 U 6 A CCA C 6 - - - - 6 U C A C U 6 A(6)6 U 6 C c16 A -

21 .Nv U C C U 6 6C, 6 U A A A C U C U6 C 6 A A C U A 6 6 U 6 U C A C C U 6 6 6 CCU C 6 - - - - A- 6 C C C A 6 6 U 6 6 U 6 C CA6 A -

220,9. 6JUCCU66CC6UAAAC6AU 6U66ACU U 6 U6UU 666(A)U66CUCCA6----E---6CU6(C)CCCA6U6CCS6A-23 S.s. 6 U C C C 6 6 C6 U A A A C 6 A U6 C 6 6 6 C UA 6 6 U 6 U C 6 A 6 U A 6 6 C U|U A 6 - - -|A 6 C C U A C U C 6 6 U 6 C CS6 C -

24 T.t. 3u c c C 6 6 CEC6U A A A C 6 A U6 C 6 6 6 C A6 C U 6 U C 6 6 C C 6 6 6 C U|U A 6 - - - - - - - -16 6 C C C 6 6 C C 6 6 U 6 6 CC6 U -

25 A.t. 6UCCAC6C C6UAAAC6At26P.t. 6EUCCACGC6CCAUAAC6At27 E.c. 6|U C C A C 6 CC6 U A A A C 6 A t28P.v. 6UCCACC6CUA6UAAACA6293.s. 6SUCCAC6CCS6UAAAC6A30 H.c. 6JU C C A C 6 C|C 6 U A A A C 6 A31M.c. 6SUCCAC6C C6UAAAC6A3211.s. SUCCAC6C C6UAAAC6At333.l. 6UCCACACA6UAAAC 6AL34 F.h. 6U C C A C 6 CC U A A A C 6 A35 D.d. 6SU C C A C 6 C6 U A A A C 6 A t36..x. 6SU CCAC6CC6UAAAC6At37 A.n. 6 JUC C U A 6 CC 6 U A A A C 6 A

6AAU6AC6CCCCAA6A6C6CA6UUACU6UC66U66C6UCAACUS6UU6UU666UC U- --A--A--- ---- GC6ACUCA6UAAC6 U C 6 A C U U A66 A 6 6 U U 6 U 6 CCC U - - - - - - - - - - - U 6 6 C 6 U 6 6 C U U C C6 U C 6 A U U 6 A 6 6 U U 6 U 6 6 UCU - - - - - - - - - - - - U 6AA C U 6 U 6 6 C U U C U6 A 6 U 6 C UAA 6 U 6 U U A 6 6 6 6SU U - - - - - - - - - - - C CSC CC C U U A 6 U 6 C U6 A 6 U 6 C UA6 6 U 6 U U 6 6 6 UAUC - - - - - - - - - - - 6 C C C 6 6 U 6 C C6 A 6 U A C UAA 6 U 6 U U 6 6 6 UAA - - - - - - - - - - - - - - - - --C U C A 6 C 5 C U6 A 6 U A C UAA 6 U 6 U U 6 66S 6 UA A - - - - - - --.C U C A 6 C 6 C U6 A A U A C UC6 C U 6 U U U 6 CA6 A UA - - - - - - - - - - - - - - - U A C AS6 U A A 6 C 6 6 C6 A A U X XXXX C U 6 U UA 6 C6AUA - - - - - - - - - - - - - - - U CA6 U U A 6 C 6 6 C6 6 A U 6 C U AS6 A U 6 U C 6 6 6 6 U -- - - - - - - - - - - AU CU U C 6 U 6 U C6 A 6 A A CUPA- 6 6 U 6 U C 6 U 6 66iU U - - - - - - - - - - - 6 C C C 6 C 6 6 U 6 C C6 A A C A C U A 6 6 U 6 JU U 6 C 6 U uUC - - - - - - - - - - - 6 A

P21-1 P21-2 P21-2'9M E.g. I1U C U U 6 6 C|C 6 U A AA C U A U A66 A U A C UAA 6 U 6 6 U cu 6 A AA - - - - - - - - - - - - - - - 16u6 C U 6 C u16 U -

39 C.r. 6 U C C C A 6 C C 6 U A A A C U A U 6 6 A 6 A C U A 6 U 6 C U 6 C C 6 C A A - - - - - - - - - - - - - - - 6 CA 6 U 6 C US6 U -

40 e.6.U C C U A 6C.C6 U A A A C 6 A U 6 6 A U A C U A6 6 U 6 C U 6 U A C U C - - - - - - - - - - - 6 A C CI U 6 C A 6 U 6 CUl6 U -

41M.t. AUICCUAC6C6AUAACE6A U66AUACUA66C6CU6U6CA6CA - ----------6ACC C6U6CA6U6CU16U-

42 H.s. U]C U U A6 CC C U A AA C C U C AMC A 6 U U A AA - - - - - - - - - - - - - - - - - - - - -

430 ... LLCU U A 6 CCCA U A AA C C UA AA U MA) UAAA-- - - - - - - - - - - - - - - - -

443... UOCUUA6CCCUAAACCOUUAAAAUAAUUAAAC--------------- ----- --- ------------------

45 1.t. U C U U A 6 C C C U A A A C A C A 6 A U A A U U C A U - - - - - - - - - - - - - - - - -

46 X.1. U IIC C U A I CCCA U A AA CU U U UA C 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

473... AUVAUAAAAU6UAA-UAAAU-46A.n. ISCCAA6CA6ACAAUSAO^ IiU6AU-CAUAIACUA6A"AAAAA6UCUUOA6ACOAOAAAFU0I&tUCUOuA0A-49Lc. OOUCUABUOA6UAAACUAUO6AAUACI-AALUUA!UUAUAI----AUUA------ AUAUUAUAIVAUAAUAAUI*-50 P.P. C U A U A CIU 6 U AA A A 6A UA6 A6 U AU U UAA U A C U AA 6 U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

51 P.t. 6 U C U A U A CAU6 U A A A A6 A Uj6A 6 U A U U UAA UA C UA A 6 U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

52C.t.

53 T.b. A U C : UU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

54 L.t. U U AU---- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

552... 8ICCA C6UAAAC6SAOI6U6UU1CTDCCUF6FUCUICUAC-SCUAC6CUOCIJUTUCI668t1CC-S T.. SIU C C AU 6 C C 6 U A AA C 6A UI6A 6 U 6 U UCIS C C C U U 6 I U CUI- -- - - - - - - - - - - cCCI6 6 A U C A 6 C CIC C -

570.S. I CCAU6C C6UAAAC6AU 6A6U6UU C 6CCCUU 6UCU- - - - ---- - -C-----AC6C A 6UC*C6C CC-

1276 1206 1296 1306 1316 132 1133I

rlOO

6C -6 A -6 6 -6 6 -6 C -6 C -6 C -6 C -C A -U A -6 U -6 A -6 U -

Nucleic Acids Research

134?135~~~~~~~~136?137?13S?~~~~~~~1 139? 14U

- Ascu ucc6Rs AAACCcA- AARucuuuRG(6)UUCCRRGGG666ARUAuR6-uu6CA-AAA6CU6AAACU UAAA66 A. s.

- A6CUUCC666AA(AC CA-AA 6UCUUU66(6)UUCCRG6G66G6GA6UAUG66-UU6CA-AA6CUAAACUUAAA66 R.n. 3- A6C U UC CG6R6AAAC C A -A A6 UC U U UG6(6)U U CCG6G6GG6G6AG6U AUG66 -U UG6C A- A A 6C UGAAAC U UA AA6 6 Rn. 4- A6C U U C CR6A6A(A)C CA -AAGU C UU U 66(6)U UC CR6G6GG6G6AG6U AU66G- U UG6C A- AAG6C U 6AA AC U UA AA 6 6 O.c. S- AG6C U UCCG6RA6A(A)C CA -A A 6 UC UUU6(6)UUC C 66G6G66AG6U AU66- UU6C A -AA6C U 6AAAC UUA AA 6 60X.1. 6- AGC UU C C6R6A6A(A)C CA - A A 6U6 U U UG6(6)U UC CG6G66G6AAG6U A U66G- U UR6CA- AA 6C U6AA AC UUAA A 66 A.s. 7- A CC UUA C 6A 6A AMAUC A-A A6UCU U UG6(6)UUCUG6 6 66AAGUA U6 6 -U C 6CAA A 66CU6AAAC UUA A AG6 S.c.- A C CUU6U6 AA~UCfUR6A6GU6U U A6AUU C CG6 6 6G66AG6U A UR66-(U)C6C A -A 6(U)C UG6AA AC U UAA A 66 11.d. 9- A C CUU A ARAA(AU CA -AAGUC U U U6(6)U UC U66G6G666AR6U A UG66 -U C 6CA- A66GC UG6AA AC U UA A A 6 0.n. 10

.---6 A 6A 6AA(C UA -A AG6UC UU U 66(6)U UC UG6RG6RG6AA 6U AU616U A C 6C A- AG6U CUGAAAC U UA A A 6 ITt. 12A C CU AUR6IAAMAU C A- AA6U C UU U 6 (6)U U C C 6R6G6RA6A6UU66G- UCR6C A- A6R6C U 6AA AC UUAA AR66 Ons. 13

C AIC C UuA U6A6GA A(AU C A - A A 6RUC UU UG66(6)U U CCGR66G6666AG6UAU66- UC 6 CA -AGG6C UA AA A CUUAA A 66 Z.n. 14

E21-2 E21-2' 22 22' 1'

- A 6 66 A A 6 CC RAG AA C66C RA CC 6 CC 06 66---------A AG6U iIC RU -CU RCA - A 6 6 A U 6RA AA C U U A A A 6 6 A.c. 16- AR6RR6A AR6C CR6AR6A AR6CR6R6AC CR6C C UR666-R ----- A AG6UIA CG6U- C UR6C A- AG6 RA U6AAIAC U UA AA A6 .h. 17

A-ARA AG6CCG6AR6A AR6CR6AR6C CR6C CUR66---------AAR6UIAIC6U - C CR6C A- AG6G6AUG6IAAIAC UUAAAGA66N.Y. 10

A-A R A A R6C C G6A R6A A R6C RA R6 C C G6C C 06 6 6 .--- - - - - - - A A G6 U IA C 6 U - CC 6RC A - A G6 6 A U R I AA IA C U U A A A A66 H... 19

A G6R6R A A R6C C G6UUA A R6C C C R6C C R6C C UR666R .--- --6A G6U A CG6- UC R6C A -A G6AC U G6AA AC UU A A A G6 S.s. 23-ARRRAAA C CR6U U A AR6C C CR6CIC16C CUGR . G--AR6UACR6- C C16C A- AG6RCUGAAC U UA A A I t. 24

AARC U AA CR6C AUURAARUC6ACUC CR6C C URG6-G--------6AR6U ACR66- UCR6CA-ARAC66URUAAA AC UC A AA 6 6 P.v. 29-A6U U CA CR6CA AUA AR6C ACUC CR6C CUR66-G-----6AR6U A CU6 -C CR6C A-AGG6UUG6A AA C UC A AA A66Hc. 30-ACU AA CR6CA U UAAG6U AC UC CR6CC UG6AR6-U--------AG6U AUG6C - UCR6C A- AG6AG6UR6A A AC UC A A A6NHc. 31

AG6CG6AA AR6CAUU AAR6U A UUC A C CUG66-G-----AG6U A CG6C -CR66C A- A C 6GGUG6AA ACUCAA AG Ut6 . 33A-AC6A A AR6CR6U U AAG6U A UUCCA C C UR66 .G---AG6U A C6C -C CG6C A- AG6GR6UG6AA AC UC A A AR6 F.h. 34-AUUAA CR6CG6U UAAR6C AUC C C 6C C UG66-G--------6AG6U A CG66- U CU6C A- AR6R6C URA AACUCA AAR6 A 1.d. 35

-ARC A ACCRU UAAR6UG6U U CC 6RC C UR666RR .--- R--AG6U ACRC-ACj6CA- AI6UUG6AAAC UCA AA6 A.n. 37

P21-3 P211-3' P21-i' P18 22 22' 1'9

- A[6U UAA CA C 6jU UAA6U A UCC C6CC U 666--------- 6A 6U W C 6C - U UR6C A - C AA 6U 6[ AAAC U C A A A6 E.g. 30- AIR C U A A C C6GUUAAR6U C UC C CG6C C UR66-R--------6ARUIAIUR6C - UCR6C A -AG6ARUG6IAAIA CU C A AIAR6 C.n. 39- A6GCIU AA C6GCI6GUUAAR6U AU C C CR6C C UG66-G--------6AR6U AIC RU- U C RCA - AG6A AUR6IA A A C UC AAIAG6 Z... 40- A6CIU AAC[6C6UUA AC UAu IC6CCCU66R6-R-----A6UACG6U-UC6C A- A6AAU6AAACUCIAAA 6 A.t. 41

.---UC AACAA AA CURCUC6CCARAA---------CACUAC16A-6CCAC-ARCUAAAAC UCAAARA66H. s. 42----U UU AA CA A AA(C)U A UU UR6C CAG6AR.--- ---A A C U A C UA- GC C AU - AR6CUUA AA A C UC AAA 6 6 N... 43CU AC AA A AUU AUUU6CCAG6AR6---------AA CU AC UA -G6C UAC -AG6C UU AA AA CA CA A AR66R.n. 44AAA C A AIA A U UA U UCG6CCAG6AR.--- ---UAC UA C UA-G6CAA C-A6C U UA AAAC UC A AA 6 It. 45

.-----CAA AAA-_-U C6CC(A)6AA-----------CUACGA-6CCUA-A6CUUAAAACCCAA ARG 0.1. 46. -U6CUAAA---------UA6UAAUA-RUUAU-6UUCUAAACUAA A AA D.;. 47

- Af7AU6 AAA[F1UJ6UAA6CAU-UC'CIACCUCAA-G-----AGUAIAUA-UGRCA-ACAUAUAACU6AA A UC A-n. 40- ALUJ6AAA"~6AAA6UAU-UCCIAECC6AA- . G----A6UAICGU-UA6CA-AUAAU6AAC A A AAC S.c. 49

UAACACAAAAAUACUCC6CRUU6G-U--------A6UAAC6-CC6CA- A6UCA6AACA ACIAR6 P.,. 50UAACACAAA AAUACBUCCrCUUrt R------------UA6UAAA6-CC6CA-A6UCA6AAA UAACIA6 P-t. 51

C-t-M 52. .b. 53

.It. 54L 5AUA6U A AC AC CC 6C U6 66 6AUiI AC 6 6- UCR6C A -A 6A C GAA6 A CU C AAA 6 Z.&. 55

-AICU A A C 6CUG6AIA AC AC UC C 6C C UGGI---R.---IG-6AG6UIA C6-UC-GC A -AIGAC C6IAAIC U CIA AIAR6 T.a. 56CACUUC6CCU66----------6 CR6C A -A 6A C C 6 C UJC A A[ 0.66;. 57

1 1346 1356 13,~ 1376 1306 1396 o6

rlOl

Nucleic Acids Research

14t0 1420, 143 14A A U U 6 A C 6 A A6G66 C A C - C A C C A 66A6 U 6 6 A 6 C(C)U 6 C 6 6 C U U A AA A U U 6 A C 6 6 A A 6 6 6 C A C - C A C C A 6 6 A 6 U6 6 - 6 C(C)U 6 C 6 6 C U U A AA A U U 6 A C 6 6 A A 6 6 B C A C - C A C C A 6 6 A 6 U6 6 - 6 C(C)U 6 C 6 6 C U U AA A U U 6 A C 66 A A6B6 6 C AC-CA C C A 6 6 A6B 6 6 A 6 C(C)U 6 C 6B6 C U U AA A U U 6 A C 6 6B A A B B C AC-CA C C A B A B6 U 6 6 A U C(C)U 6 C 66 C U U A AA A U U 6 A C 6 6 A A 6 6 6 C A C - C A C C A6B6B A 6 U 6 6 A 6 C(C)U 6 C 6B6 C U U A AAA U U6AC 6 6AA666C C- CA C(C)A 6(A)SU 6 B6A C(C)U6C66C UUAAA A U U 6 A C 6 6 A A 66B6 C A C - C A C U A 6 6 A6 U 6B A 6 C(C)U 6C 6B6 C U U A AA A U U 6 A C 6 6 A A 6 6B6 C A C - C A AMS 6 A6 U 6 6 A 6 C(C)U 6C 6B6 C U U A AA A U U 6 A C 6 6 A A 6 6B6 C A C - C A C C A66 A6 U 6 6 A 6 C(U)U 6 C 66 C U C A AA A U U 6 A C 6 6 A A 6 6B6 C A C - C A C C A6 6 A6 U 6 6 A 6 CU)U 6 C 66 C U C A AAAU U6AC 6 6AAC(A)6CA - - CAC(C)A6AAU A6A6AC(C)U6C66C UUAAA A U U 6 A C 6 6 A A 6 6B6 C A C - C A C(C)A 6 6 C 6 U 66 6 6 C(C)U 6 C 6B6 C U U A AA A U U 6 A C 6 6 A A 6 6B6 C A C - C A C(C)A 6 6 C 6 U 6 6 A 6 C(C)U 6 C 6B6 C U U A AA A U U 6 A C 6 6 A A 6 B B C A C - C A CIC)A 6 6 A 6 U 6 A 6 CC)U 6 C 6 6 C - U A A

23 24 25

1l.4 .

14 1 1471UUU6 ACUCAC C666 AAACCUCACCUUU6 ACUCAACAC666 AAACCUCACC

UUU6 ACUCAAC AC666 AAACC UCAC CUUU6 ACUCAACAC666 AACCUCACCUUUB6 BCU C AACAAC 6BAAACCUCCCUUU6 ACUCAAC AC666AAACCUCACCUUU6 ACUCAACAC666 AAACUCACCUUU6ACUCAAC[UC666BAAAACUUACCUUU6 ACUCAAC C666 AAAACUUAC CUUU6 ACUCAAC C666AAAACUUACCUUU6 ACUCAACAC66A66AACU CACUU UU6 ACU C AAC A C6 66 6 A ACUUAC C6UUU6 ACUCAAC AC666 6AAACUUACCUUUBACUCAACAC66BBAAAC UUACCUUU6AC UCA AC666ABAAAC UUAC C

26 26'

16 H.c. A A U U 6 6 C 6 6 6 6 A 6 CACC -UAA CMUA C C 6 6 CA66 BA6 C C U 6 C 6 6 U U|U AAUU C U C A 6C C A C A U C U C A17 N.h. A A U|U 6 6 CI6A66 6 6 A6 C AC - UA C(A)A C C 6 6CA6 6BA6 C C U 6 C 6 6 UU U AU U 66AA C U C A A C 6BC C 6 6A C A U C U CAC C18H.v. A A U|U 6 6 C 6 6 6 6 6 A 6 C AC - U|A CMA) C C 6 6 A 6 6 A 6 C C U 6 C 6 6 U U A A U U 6 6 A C U C A A C 6|C C 6 6 A C A U C U C A C C19H.s. A A U|U 6 6 C 6 6 6 6 6 A 6 C A C - U A C(A)A C C 6 6 A 6 6 A 6 C C U 6 C 6 6 U|U U A A U U 6 6 A C U C A A C 61 CC66 A| C A U C U C A C C20IK.h.A A U|U 6 6 C 6 6 6 6 6 A 6 C A|C - C A CMA) C 6 6 6 U|6 6 A 6 C C U 6 C 6 6 U|U U A A U C 6 6 A C U C A A C 6|C C 6 61A A U C U C A C C21NI.Y. A A U U 6 6 C 6 6 6 6 6 A 6 C A C - C A C(A)A C 6 6 6 U 6 6 A 6 C C U 6 C 6 6 Ull UU A A U U 6 6 A U U C A A C 6 C C 6 6 6 C A U C U C A C C22 M.. A A U U 6 6 C 6 6 6 6 6 A 6 C A C - C A CMA) t 6 C 6 U|6 6 A 6 C C U 6 C 6 6 U|U U A A U U 6 6 A U U C A A C 6 CC 6 6 AC A U C U C A C C23S.s. A^IAUU6 C IBIBGGGAGCAlC- UC (AIACCGBABBAGlCCU BC BBCUC ^UUAAIU BGIAUC AACB3C CU6 ACUCU CAICC

23 S :nAsB C(A) BAACCU6B A A U C U BU A

24 T.t. AIA UiU 6 6 CBG66 G6 CAB C - CA C(A)A6 6 6 6 U6 A6CUU 6 C 6 6 C|U U A A A 6 U C A A C 6 AA C C U U AC

25 A.t. A

26 P.t. A

27 E.c.A28 P.v. A

29 .s. A

30 H.c. A

31 N.C. A

32 .s. A

33 .f. A

34 F.h. A

35 d. A

36 U.x. A

37 A.n. A

AU

AU

AU

AU

AU

AU

AU

AU

AU

AU

AU

AU

AU

UB6C i 6666CCCU 6 A C 6 6 6 6 A C C CU 6 A C 6 6 6 6 6 C C CU 6 A C 6 6 6 6 6 C C CU 6 A C 6 6 6 6 6 C C CU 6 A C 6 6 6 6 6 C C CU 6 A C 6 6 6 6 A C C CU6AC 6 66GACCCJ 6 A C 6 6 6 G 6 C C CU 6 A C 6 6 6 6 6 C C CU6AC 6 66B6CCC6U6AC 6 6666CCC6U6ACB6 666CCCG

23

--C

--C

--C

--C

--C

--C

--C

--C

--C

--C

C(A)A 6 C6 6 UI C(AA 6 C6 6 UCMAIA 6 C6 6 UC(A)A 6 C6 6 U

A C(A)A 6 C6 6 UCAA 6 C 6 U UCA)AA U6 6 U

h C(AA 6 U6 6 UCIAUA C 6 6 A

C(A)A 6 C 6 6 A

CMNIA C6 6 Uh CMA 6 C6 6 UC(A)A 6 C6 6 U

24

66666666666666666666666666

A6cAU6U6BUU 6 A U 6 U 6 6 U

A6CAU6U66UA6CAU6U66U

6 C A U 6 U 6 6 UG C A U 6 U 6 6 U

A6CAU6U66UA 6 C A U 6 U 6 6 UA A C A U 6 U 6 6 U6 C A U 6 U 6 6 U

A 6 U A U 6 U 6 6 UA 6 C A U 6 U 6 6 UA 6 U A U 6 U 6 6 U

25

BUUAUUAU

lUUAU AUAlUUAUUA0UU ANlUUAlUUAUU A

UUANuuBAUUAI

UUCI

UUC6BUCS

UUCB

UUCB

UUCB

UUCB

UUCB

UUCB

BUCB

26

AABCAACI

A U 6 C A A C IA U 6 C A A C IA U 6 C A A C IA A 6 C A A C 6A C 6 C A A C GA A 6 C A A C AA A 6 C A A C AAU6AUAC6A U 6 A U A C IA U 6 C A A C IA C 6 C A A C IA U C C A A C I

C 6 ~ASAA C C U U ACA6AAACCUUAICBAABAACCUUAICBAABAACCUUAC6BAA6AACCUUAC 6 A AG6 A C C U U A

ICBAABAACCUUAICBAABAACCUUAICBAABAACCUUAC 6 A 6G A A C C U U A

ICBABBAACCUUACBAABACCUUAC B6'A A A C C U U A

C A A A A C C U U A

6'AA6AA

ccCC

ccccCU

ccCC

ccccccCC

CcCC

38 E.s. AA UU 6A CfBG 6 G 6 C C C 6- - cl C(A)A 6 C 6 6 U 6 6 A 6 C A U 6 U 6 6 U|U U A AUUC6 U 6 C A A A C 6A A C C U U A39 C.r. AIAUVU 6 A C - 6 6 6 A C - C 6B- CtAC(A)A6 C 6 6 UI66BAU U A U 6 U 6 6 AU UA IU U C 6lA U A CA A C 6IC 6 AA6 AA C C U UAIC C

40NZ.. AtAUU6AC6666CCC6 --C ACMA6C6 6U 66 A6C A U 6 UA CI6I U6CAA66IC6AAI6AACCUU ACC41 M.t..A UU 6 A C6 6 6C 6 CC6-- CAC(A)AB6 C 6 6 UA661A 6 C AU 6 UAUUA U U C 61AU 6 C A A A 6|C 6 AAA6 A A C C U UBACC

42 H.s. AC C|U 6 6 C 6BiU6 C U U C AUA U C C CUCCUA6 6 6A 6 C C U 6 U U C U A - 6 A U A A A C C CC C A A C C U C A43N.s. A CU U66CjJ UACUUUAUAAUCCAUACUAA6A66A6C CU6UUCUA UAAA-(U)C6IAUAAACC CIC6(C)UICUACCUC AC C44 R.n. A C U|U 6 6 C6U C U U U A U A U C C 6UCCU A 6 A 6 6 A 6 C C U 6 U U C UA U A AWC 6 A U A A A C C C|C MUMUC U A C C U U AC C45 .t. ACU U C6C6CA6C6CUU6CUUAUAUC UCU)CAB6AA6C6CA6UCAU6UUAU-AUCA6AUAAACCCC6AUlAAACCUCCC46 X.. A C U U 6 6 CI6B6 U 6 C U C C A A C C C U A 6 A 6 6 A 6 C C U 6 U UCU16 UA -AU)C 6 U A C C C CC (C)U1A AAC C U CAC C

47 l.n. AU U U 6 6 C C6UAUUUXI U A&-

U C C C A 6U

66A} AC C U 6 U U 0 U 6 U A - A U C 6 A U A A U C C A1C 6 A U16 6 C C U U lC U

4NA.n A UU 6E|C 6 C U[6 A-- 6 A|(6 UC 6 u16 A6 UACU 6 U U AU U EU AAUU C 6 A U 6AU C C 6 C 6 t)AAAAA C C U |CC

49S.e,. AAUA6BC 6B UCAB --CUU-A)A6CA6UBBAACAUBUUAUAAU)UC6AUAAUCCAC6A(C)UAACUUUACCP.,. A A UUB6CBBA C UU -U UC(A)A ACBB6UBB ABCU BUUAAJUB6CB6AUAAUCCAC6U A AAAC CUUA

51 P.t. AAU UB6CB6B66BACUUB6--UUjC(A)A AC66UB66ABCAUBUB AUUUUAAB6C6AUAAUCCAC6UAAAACCUUA52 C.f. UCAAAA U U AA - - - -

53 T.b. - -- AUCAAAA- - - UUUA - - - - - - - - - - - - - - - -

54 - - - AA A- -C U AA - - - - - - - - - - - - - - -

552.Z TA UBU6A 6BB 6B BCC UB6- CJi A -66U66A CAAU 6U 61UU UAAAIm Uj C 61A U AC A A CBW6 ~ A AA C CUB A[rA56 T.a. A U|U 6 A 6 6 6 6 C C U 6 - - CIACA)A6 C 6 6 U|Is'6 A6 C A U 6 U 6 6 U|U U A A|(U)C 6A U A CA A C 6 C) A A C C UU A C C57GO.s. A IACA U U U UBAAUC 6A UACA A CB6 AAA A C C UUA

1416 14A 143 14 145 14,i4 1476

r102

1 H.s.2 N.*.3 R.n.4 R.n.5 O.c.6 X.1.7 A.s.8 S.c.9 D.d.

10 U.n.11 S.P.12 T.t.13 U.s.14 2...15 B.O.

Nucleic Acids Research

140

C U6(C)C C U 6 A - - - -C 6 6(C)C C B 6 AC 6U (C)C C 6U6 A - - - -C 6U C)C C U 6 A - - - -

C U GCC)C C U 6 A----A U 6(C)C C 6 U - - - -A 6U(U)C C A 6 A - - - -A A UICIU AA U A - -A 6 U(U)C C A U A - -A U 6UU)C C A 6 A - - - -A6CC AAUA- - - -A U6 UU)C C A A---A U U(U)C C A A- - - -A 6 U(U)C C A U A - - - -

27 28

15lS0?1 _ t151? .U 152?G A u U GAU A 6U U U 6A U - AU C U CUCU U C6 A U U 6 A C A 6 A U U 6 A U - A G C -U C U|U U C|6UAUUA6CA6AUU6AU-A6C -UCU UUC6AUU6 CA66UU6A U- A6C - UC U UAC6AUUA6CA6AUU6AU- A6C -UCUUU C6AUUA6CA6UAUU6AU-A6C-AUCAUUUC6AUUA6 C A6AUU6A6- A6C - UCUUU C6UAUUA6CA6UAUU6A6-A6C-UCUUUC6AUU6ACA U6ACUAAA-AA6-UCUUUC6AUU6ACA6AUU6AU - A6C -UC UUUC6UAUUA6ACA6AUUA6U-A6C - UCU UUC6UAUUA6ACA6UAUU6A6-A6C -UCU UUC6AUU6 ACA6UACU6A6-A6C-UCUUUC6AUU6 ACA6UACU6A6-A6C-UCUUUC6-UUA6CAA6ACU6A6-A6C-UCUCU

U C 6 UAU - -U C C 6U IU C G U -- U C C 6U iU C 6 A U - - U C C 6 U CU C 6 A U - - U C C 6 U IU C 6 A U - - U C U 6 U CU C 6 A U - - U C U 6 U IU U 6 A U - - U CA 6 U Il!LU 6 A U - - U U U 6 U I,jU 6 A U - - U C U A U 0IU U 6 A U - -U C U A U CIU U 6 A U - -U C U A U I|U U 6 A U - -U C U U U I|U U 6 A U - -U C U A U IIU U 6 A U - -U C U A U CJU U 6 A U - -U C U A U CUA-U

CA6-UA6UAA-UA ACC A - UA 6 UAA - U 6 ACAC U A - C A G UA - U GA CC A 6 U G66C A 6 - U 6 A CC A 6 - CUA -A - U 6A UC A - CU 6 - A -U 6A CC A 6 - C A - A -U 6AU|C C - C A U- - U 6A CC A G - CA 66 - A - U 6 AA

GU(C)A BUAU A U -S A C - C UIU)A C C66U(C)IAGUUU 6AU-UA C-CU(U)ACC6UAUC)A6UAUUU U-U6AC-CUMU)AUC

BuC)IA UCCU6AU-16 6-CUCuA CU

6U(C)666 6 A -BAC - U CIUIA C U

U66C(C)A66U:UU AC -UA C - CUGU)6CCC(C)A 66U U U 6AC -C UU) C UU66 C(C)A

66AA C

AC C UMUIB C C

66C(C)A66lCUAAC-6AlC-CU(U)6CC

CGA 6- - 6 C U A C U 6 -

C 6 - AG- - 6 C U A C U 6 -

A C 6 AC- - 6 C U 6 U A 6 -

- - U6AA- - 6 C C A C U 6 -

- U 6A- -UA C U 6 -

- - U 6A - -A C6 C U 6 -

- - U 6 A - - C A AG6C U 6 -

- -U6 C 6 C 66 -

- - 66 A - - C 6AIG C U 6 -

A6CUCCACCCU66UCUACUCA66UCAA66CA666CA666CC666CC666CUA66UUU66CA666U

27

UU6ACA U UC 6 UUUUUU GU6CA6U6IUU6ACA U6C-66 AAC-UUACCAIUU6ACA UCCA-C66 A 6-UUUUCAIUU6ACAUCCA-6C6 A U-CCUUUAIU U 6 A C A U C C U - C U 6 A C A -AU C C UAIUUUAC UCCU-CU6UAA -CCC6A0UAAU-6ACA UCCA6U6U A A-6CUAUAIU6ACAUCCA6U6 A AA-6CUAUAI

UUA AA U6BC AU6 UB6AA- 6AU6U 6U U 6 A A 6 U Uj0- 6U 6 A A U- U A 6A U AUU6ACAUCCA-C6A6A C-CCUCCCU U 6 A C AU C C U -C A 6 U -C C U U CAUU6AC AUCCC -CCA6 U-CUCUU 6

28

A 66 A U|U U G A C AUA - - C UA 6G6 -AM U U U|IA 6 UU U 6 ACAAU6UC -A A iC C U CU CA I

A 66 G C|U U 6 A C AU GC C - 6 C U - C C U C U U|IA(U)U6 BCU U A C AU BC C - C 6U - C C U C U U|l

S6AASABAIA6A

A6A A

BA6AA6AA

BA6AD6AABAAA

BA6N6AAN6BA66AAN

6 A AAA A A

A A

6 A A A

ICAUUBUCCUU-CAUAU CU- -G6CCCC _ 6 AACU 66 UUU - 66 U - 6C U C 6 AA A6 A6 CC Uc 6 C A t -A CU6A6AA-U 6U-6CCUUC66UAA- -CCUA6A 6-A CUAA666- AUU-6CCUUC666AA- -C6CU6 A6-A CUAAAC-UUC-CCCUUC6666 --CA6A6 U 6-A CUUAUC 66-AG U -6CC C U U C 6666 A C A 6A6 A 6-A CiU AU A 6 U -A6 A- 66 - U U - - - - - - CA U U 6A 6 -A CU A UA 6 U -A A6 A - 66 - U U-U- -- A-- CA U U 6A 6 -A CCAU6UC- 6U-6--A6CAAUCA--CC6CU6U- 6LiU A U U U A- A6 U - 6 - - A6 C A AU C A- - C A cA6AA -C466A66-6 6U-6CCCUUC6666ACCU6 Ju 6-A C

6UA666-A 6U-6CCC6CAA666AACU6A6 A 6- tCIC 6 A 6 A6 - A16 U - 6 C C U U C 6 6 A 6 C 666 6A 6 - A C

28'

A66 A.t. 25A66 Pt. 26A 66 E.c. 27A 6 G P.v. 29A 66 1.s. 29A 66 W.c. 30A66 N.c. 31A66 N.s. 32A6U6 It. 33A 66 F.h. 34A 6 6 D.d. 35A 6 6 Hx. 36A 66 A.n. 37

4iUBGGAG-GIUU-GCCCAAACGGAI- -CUUGAnC-AIA66BAGAGU6-GIGU-BCCCUCGGGAAI--CGCGGIAIC-AlCIAGG

416 A 6 A 6 6 - 6 6 U - 6 C C U U C 6 G 6 A A - - C 6 C 6 6 AC - A CA6

A C C U C|U u 6 C U - - - - C - A 6 C C U A -lu A U A C C * U C U|U C A - 6 C A - AA C C C U 6A U 6 - - - - - - - - - - - - - - - - H.S. 42A U C U CU U 6 C U A A U UCN-6 C C U A -IU A A C.6C CI u C u u C A - S C A - AA C C C U A A A A - - - - - - - - - - - - - - - - A.&. 43(C)C U U CU C 6 C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - U AA U -U CA 6 C R.n. 44AMAUUC UC---------------------- ----- ----- ----- ----- ---- U-AAU-ACAUSW .t. 45A C U U C U 6 C C A A A C CC 6C--- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - X.1. 46U A A A Uu U 6 U A A U C A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - --- - - - - 6 U M. 47A C A 6 UUUU6 A A U A U UU- UA C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A.n. 40A U A U !U U 6 A A U A u A - U A A U A A - U U A- - - - - - - - U U AU A - A U U - AU U A U Al- U U A C- - - - - - - - - - - - A 66 S.c. 49A6 CFU-UUA U U UU U - U U CU UUAIUU() 6C UU A A U - 66 - C U(6)- AAAU U U U UA 6 A- - A A A A A A16 - 6 U(:U .P.,. 50A6C 6 U UUUUU-UUCUIUU -A U UM 6C U CAU- 6 6 - CUM-AAUUUUUA6 - -AAAAAA6 -A6AU()P.t. 51

~~~~~~~~~~~~~~~~~~~~~~~~~C.?.52

---------------------------------------------------------------------- T.T.b.S5--- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - LAt. 54A6CC CIUU 6ACA AAUC6-AAAAC- AAAACCUGUCUUUAAAC- - AU66U CuuAC--u uuCAUl-- CAB66 La. 55A 6 C C CU U 6 A CAU U 6-A AIC A A C - A A A AIC C U 6 ulc C U U A A 6A6- U 66 U A C U 6 A C- - ulu U C A U A|-- C A66 T.s. 56

A6CCCUU6AC AUAU6-AACAAC-AAAACC U6UCCUUAC -666-AU 6B6 UACU6AC--UUUCAUA--CA O.s. 57

140 149 1506 1516 15A 153s 0M6

r103

C A - C 66BA C A -C A - C G 6|A C A -C A - C 66 A C A -C A - C6U6A C A - 6C A - C 66U A C A - 6CA - C 66 AA-AC A - C U 66 AA-CA-CAAUAA-6U A - U A 6|U A A - 6C A - U A 6U -C A - U A 6|U Al-C A - 6 A 6 A A 6 - 6C A - U A 6|C A A - 6C A - U A 6|C A AC A - U A 6 -

* 154?6 U- 6W.s.6- U U - 6 U .e.6- UU - 6U6 R.n.6- U|U - 6U R.n.

6 - U U - 66 .c.6- UU - 66 X.1.6-U - 66 A.s.6-U - 66 S.c.- U - 6A6Dd.6- B U - 66 O.n.

6 - 6U - 6U6S..- 6|U -6U6 T.t.6- U U - 66 0.s.

i - 6UU - 6U6 Z.m.;-UU-U6 U.B.

A G C C C|C - G A - - -A UC C C|C - 6 A - -A 6 C U C|C - 6 A- --GU6C A C|C - 6 A --G G A UU A - 6 A - - -A GU A GC - G A- - -A G6 G6 |C - G A - - -GU6G6 GA- G A - -CUUUC-A- - -

111213145

A6 A7 A.c. 164 G *A G H.h. 17A6 NH.v. Is

I 6UAI6 H.D. 19k6 A 66 M.h. 206 A 6 6M.Nv. 21

R16UA6 Nt. 22I16BA6 S.s. 23R6UAI T.t. 24

E.C . 38C.r. 39Z.a. 40N.t. 41

Nucleic Acids Research

..? .... I... I. .

I H.s. (U)I 6 U 6 C A U 66 C C 6 U U -(C)U U A2A.&. (UIG 6 U 6 C A U 66CC 6 U U -(C)U U3 R.n. (I6 6 U 6 C A U 66 C C 6 U U -(C)U U4 R.n. (U)6 6 U 6 C A U 6 6 C C 6 U U -(C)U U A5G.C. (U)G 6 U 6 C A U 6 6 C U 6 U U -(C)U U6 X.1. (U)6 6 U 6 C A U 6 6 C C 6 U U -(C)U U A7 A.s. (U)6 6 U 6 C A U 6 6 C C 6 U U -(C)U U A8S.c. (U)6 6U6 C A U 66CC 6U UU CUC9 D.d. (U)6 6 U 6 C A U 66 U C 6 U U -(C(U U A10 U.n. (U)6 6 U 6 C A U 6 6 C C 6 U U -(C)U U A11 S.r. (U)6 6 U 6 C A U 6 6 C C 6 U U -(C)U U A12 T.t. (U)6 6 U 6 C A U 66 C C 6 U U -(C)U U I13 0.s. (U)G 6 U 6 C A U 6 6 C C 6 U U -(C)U U A14 Z... (U)G 6 U 6 C A U 6 6 U C 6 U U -(C)U U15 6... (I 6 U 6 C A U 6 6 C C 6 U U -(C)U U A

25 A.t. (U)6 C U 626 P.t. (GIG C U 627 E.c. (U)G C U 621 P.. (GG C U 629 1.s. (G6 6 U 630 H.c. (U)G 6 U 631 I.c. (U)G 6 U 632 .s. (U)6 6 U633 3.. (U)6 C U 634 F.h. (U)6 C U 635 D.d. (U)6 C U 636 A.x. (U)6 C U 637 A.n. (U)6 6 U 6

C AwnIS6 Cu6U)C-6 U C A

C A U 6 6 C U GIU)C - 6 U C A

C A 166 CIU 6U)C - 6 U CAC A U 6 6 C U 6GU)C - 6 U C A

C A U 6 6 U U GIU)C - 6 U C A

C A 6 6 U U 6U)C - 6 U C A

C U 6 6 U U 6GU)C - 6 U C A

C A U 6 6 U U GIU)C 6 U C A

C A U66 U U 6GU)C- 6U C A

C U 6 6 CU 6GU)C - 6 U CAC AU66CUGGU)C-6UCAC MAG 6 C U 6U)C - 6 UC A

C A U 6 6 C U 6GU)C - 6UC A

29

38 E.G.I(U)66 U6 EVA U 6 6 C U 6U)C - 6U CI39 C.r. (U)66 U 6 CIA U 6 6 C U 6SU)C - 6SU C A

40 2.. (U)66U6 CIA U66 CU 6GU)C -(6)U C A

41 .t. (U)6 6 U 6 CAU 6 6 C U6GU)C - 6 U C A

6UU6UU6UU6UU6UU6UU6UU6UU6UUIUU6UGUU UU UIIUGU

6UG

66-U66A66-U66P66-U66166-U66G66-U66G66-U66G66-U66166-U66G66-U66G66-U66G66-U66G66-U661I6-U66A86-U66A66-U66A

6 C U C 6 - U 6 U C6 C U C 6 - U 6 U C6 C U C 6 - U 6 U U6 C U C 6g@U 6 U U6 CUC 6-U6UC6 C U C 6 - U 6 U C6 U U C 6 - U 6 C C6 U U C 6 - U 6 C C6 CUC 6-U6CC6 C U C 6 - U 6 C C6 C U C 6 - U 6 U C6CUC 6-U6UC6 CUC6-U6UC

30 31

6 C U C

16CUC

6-U6UC6-U6CU6-U6CC6-U6CC

6C6A6C6AiC6AiC6A6 C6A6C6AiC6Ai6GAiC6AiU6AiU6A6UaA6C6AP C6A6 C6A

UUUCUC U6IU U U 6 U C 66U U U 6 U C U 6U U U 6 U C U 6U U U 6 U C U 6U U U 6 U C U 66GU U U 6 U C U 66U U U 6 U C U 6 CU U U 6 U C U 66U U U 6 U C U 66U U U 6 U C U 66U U U 6 U C UU U UCUGG

UU U 6 U C U 66UUUG6UC U66

31' 32

lS590 16G 161?6

UUAAUU CC6AUA-AC6AAC 6A6 CUC UUUAAUU CC6AUA-AC6AAC 6A6

UUAAUUC C 6AUA- A C G6 C 6 A6A C UC UUUAAUUC C6 AUA-AC6AAC 6A6ACUCUUGAAUUC C6 AUA- AC6 AAC 6 6ACUCUUUAAUUCC6AUA-AC6 AACGA6A6CUC C

UUA AU UC C 6A UA AC A AC 6A 6A CUC UUUAAUU 6C6AUA-AC6AAC 6A6ACCUUUCAAUU CC6 AU-ACA66C 6A6ACCUCUUAAUU CC6 UUA-AC6 AAC 6A6 CCUUU AAUU CC6 UUA-AC6 AAC 6A6A CCUUU AAUU CC6 UUA-AC6 AC 6A6ACCUUUUAAUU CC6 UUA-AC6 AAC 6A6ACCUCJUAAUU CCGUUA-AC6AAC6A G6 CCUCJUAAUU CC6 UUA-AC6AAC6A6 ACCUC

32' 30' 33

U6A6 (CIG U C C U 6 UA AG6UUC AG 6 C A AC AC GA A C C C

6 U 6A 6G6CG IC C U 61UU A A 6UC A 66 C A - A C6A6CI6 A G6A C C C 66 U 6GA6 SC)6U C CU 6UUA A6U C A6 6 C A - A C6A6CI6GA6 A C C C

UG6 AG6 6C)6 UC C U6 U U A AG6U C A 6 C A ACI6A6CI6 A6 A C C C 66 U 6 A A G6C)A U C C U 6 U U A 6 U ICA66IC A - A C I6AGC 6 A 6 A C C C A

C 6A 6C)6U C C U 6UUA A6UC66UA - A C66C6 A 6A C C C6UG6 AG6 6C)G U C C U61U A AS U C A 66 C A - A C A 6 C G6A6 A C C C A

6 U 6 A A A U 6 U C C 6 6 U U A A 6 U C C 6 6 C A A C 6 A 6 C 6 A 6 A C C C C[UG6 A6GU6UCC6G6UUAA6U CCGGCA-ACISA6CG6 A6ACCCC

6UA6A6AU6U 666 U 6 A 6 A U 6 U U 6666U6AAAU6U U6666U6AAAU6U U6666 U 6 A 6 A U 6 UGU6666UA6 A6AU6U G66C6 U 6 A 66 U 6 UU 666C6 U 6 A 66 U 6 UIU 6666 U 6 A 66 U 6 UC66 C6 U 6 A 6G6 U 6UG 66C6 U 6 A 6 A U 6 UGU 6666 U 6 A6 A U 6UG U 6666U6A6AU6UIU666

31' 32

U AA 6 UJU A A 6 UiU A A 6 UU A A 6 UU A A 6 U

JU A A 6 UU A A 6 UJUAA6UJU A A 6 UJUAA6UJ U A A 6 UU U A A 6 UU U A A 6 U

CCCG

CCC6CCCG

CCCG

CCCG

CCC6CCC6CCC66CCACCCG

CCCG

CCC6

329

:6A6CC16A6CC16A6CC16A6CC16A6CC16A6C

C16A6CC16A6CC16A6CC16A6C]6A6C30'

6CAACCCU6CAACCCU6 C A A C C C U6 C A A C C C U6CAACCCU16 c A A CccCu16 C A A C C C U6 c A A CccCu6 c A AM cCCu6 C A A CC C C6 C A Acc cC6 C A A C C C U

33

6U6A6AU6UU66 6UUAA6UCCC6CA AC6A6C6 C A ACCC6 U 6A AG6 6 U A U A 6UUA A 6U C UA GA-ACA66 C6C A A C CCU6UAA66U6U U666 UUAA6UlCUC61CA-ACl6A6Cl6CAAClCCU6UAAG66U6U U666I1UU C6U CCCA6-AC G6AG 6CCAACICC

43 H.m ---------------------------------------------------__ _ -_-___ _ -__ -____ __ __ __ __44 R.n. C UAUA GAC CA C CCUUCAC CC AA A A A B6C A C U-AA----------AA- 6GU)ACAA6ACA45 St. C AUAUACA CCUAC-UUCA CA6ACA-CCCAUAAAA A6A6AA -A-A-A--------A6AA 6C6UAAUIAU46 X.1. CUA AUACA CACC IC -6GCCAS6 C CC- UC6UCUAIU]IA6A C .-- - - - - - - - -U - UUA 6Ci CIUA-AUUU47 6.Y. U U A U A 6 U U- A CA.6A A U A - U UU U A A A 6 A- - - - - - - - - - - --UIA U AAIUIC - U UUAuA UU48A.;. (C IGCU6 C6UU- UUUA6UGAAU-6UC6U A A cu66U A 6 ICIV JU A A C U49 S.c. (CG6 U C A 116 116 C- A GU C 6U-6 C U SC A A A U GUU A1 A U GA A A

6 U 6AC C A A A A C C

50 P.P. WamU U 6 CA 66C US(U)C -(MIU C A6 U U C -(U)6 U U UU6AIAAU(U)UijA6 A AU UA A 6 U UjA U A AI CIS AA C16 CSCAACicEW51 P.t. 6UAIU U 6 CAU66C U 6(U)C -(6)U C AG6 U U CG6 -(U)6 U UIU U 6A,AAU(U)UAA16AAU UA AG 6 Ulu UUUA U A AA C16 AA C 6 CAA CICC52C.t.--- -A

53 T.b-- - --

54 M - -- - - - - - - - - - - - - - - - - - - - - - - - - - ___ __552... (U)6CU6CAIS66SCUS(UIC-6UCA6CUCS-U6UC A6U6Al6AU6UU(U)6UC AAUA6UCC(U)lUA-AC6A6CSAAACICFCU56 T.a. (GIG C 6SC AIU 66CIU A A16 A I66|U C U|C C A A A A A A C|57G.s. ( B C-SIC ABC6U C6516 1CG 16 16U6 1^6UCA -AC ABCS CC

1 1554 161 57^ 1584 1541 1 1616

r104

16 H.c.17 H.h.18 H.v.19 H.m.20 N.h.21 lN.v.22 A.f.23 S.s.24 T.t.

29 30 31

A 6 6 U 6 C U 6 6 C C 6 C C - 6U CA 6 C U C 6 - U(AC CAIG GUG6C AIU 6 6 C C 6 C C - 6 U C A 6 C U C 6 - UmAC CA 6 6 U 6JC AIU 6 6 C C 6 C C - 6 U C A 6 C U C 6 - U)C CIA6 6 U 6 CAl U 6 6 C C 6 C C - 6 U C A 6 C U C 6 - U(C CIA6 6 U 6 C AU 6 6 C C 6 U C - 6 U C A 6 U U C 6 - U(C Uj6 6 U 6[JAJG 6 C C A U C - 6 U C A6 C U C 6- U(MC CA 6 6 U 6 C A U 6 6 C C 6 C C - 6 U CA 6 C U C 6 - U(C CA 6 6 U6 CA 66C C6UC - 6 A2 6 C UC 6 - U 6 U6U6 CAU66CC6UC-6UA 6CUC 6-U6UUA66U6CAU66CC6UC-6UCA6CUC6-U6CC

,I . . .

:A - A C"A - A C:A - A C:A - A (:A - A (:A - A C:A - A C:A - A CJA - A C:A - A C:A - A C:A - A C:A - A C

Nucleic Acids Research

I1e11e ltq , 1 1 16~ ~ ~~~~~~~~~~~~~~~7?lt

-6 6 C A U 6 C U A A C U A 6 U U A C 6 C 6 A C C C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H.s.-6 6 C A U 6 C U A A C U|A 6 U U A C 6 C 6 A C C C - - - - - - - - - - - - - - - - - - - - - - - - - - ".B.-6 6 C A U 6 C U A A C U A 6 U U A C 6 C 6 A C C C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R.n.

C 6 6 C A U 6 C U A A C U A 6 U U A C 6 C 6 A C C C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R.n.- 6GC A U 6 C U A A C U A 6 U C A C 6 C 6 A C C C - - - - - - - - - - - - - - - - - - - - - - - --- - - - - - - - - - - - - - - - - - - - P.C.-U CC A U 6 C U A A C U A 6 U U A C 6 C 6 A C C C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - X.l.

A A C C U A C U A A A U A 6 U 6 6 t A 6 C A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S.c.6i C C U 6 C U A A C U A 6 U lU A U U U A U U A 6 U CIS A U A UIA 6 A C(6)A U A 6 C UU ulcuTM 6 6 6 6 U|U Ui6 6 A A U 6 A ulu U C 6 D.d.-A 6 C C U A C U A A C U A G U 6 A U U C 6 U - - - - - .n.-A 6 C C U A C U A A C U A 6 U cU A C C6 U -_ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S.P.-A A C C U 6 C U A A C U A 6 Uu G C U U 6 U - -_ - - - - - - - - - - - - - - - - - - - - - - - - - - T.t.-A 6 C C U 6 C U A A C U A 6 C U A U 6 C 6 6 A 6C - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - O.s.

-A6CCU6CUAACUA6UICUiC66ACCC-N---------------------.;,

-A6 CCU 6 CUA AUA6CGUG6UC6AC.A-.---------------------------- - -I.e.

34

-CA-

-CA-

-CA-C A-

C A -

CUCCUA A U U 6 C C AI C A 6 U A C C C|U U U - H.c..C(U)C C U A A U U 6 C C A 6 C 6 6 U A C C C U UU- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M.h.C U U C U A A U U 6 C C A 6 C A 6 C A 6 U bu C 6 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ".v.

.U C C C U AU U 6 C CA 6 A 6 C A 6 C C U U 6 - - -.N..

.C 6 A 6 C A 6 I6IC C 6j3U U 6 A C C U U C G6- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H.h.

.C C C U AU6 C A C U A C U U U C U C C 6-- - l.v.

.C C C U UA6 U U()C 6 C 6 6 AU C C U U C6.- - - MN.t..CCCCU 6 U U6 6 UA UU C U 6 6 AC U C C 6-- - - - - S.s..C C C C U A6 U U 6 C U C C C 6C U C U U C 6 - - Tt.

101112131415

161718192021222324

-C6 - CCCUUA6iU6CC A6 -UU------------------------------------------------ A.t. 25-U6 _CCAUUA6SJU6CUIA-ICAUUC-------------------------P.t. 26

-U A-UCCUUU6UUCCC A6 C6 CC---------------------------------------------- E.c. 27-U A-UCCUUU6UUICC A6C6C6UA---------------------------------------------- P.v. 28-U6-A UCUUAGUU6CCiT§CAUUC------------------------- 3.;. 29

-UA-_UCCCUA6WUU61CC[rFUC6A6A6.-----------------------N.c. 30

-UG-UC6UUA6Ui^ACUIAAICAUUAA.N--------------------- .e. 31

-U61-UC6UUA6j JACUjAAICAUUAA-N---------------------- .;. 32

-UA-UCUUUAGIUUAICUIAAICABGUU-3---------------------- .?. 33

-UA- U6UUUA6|U6CCA6CAC|6UU----------------------- -.8.-UA-UG6AUA6|UUGCCAG6CAAG6UA-3--------------------- .d. 35

--UUUUUA6UU6CCj 9CAUUC-6----------------------.n. 37

34

-C!LJUCUUUAGUUGCcA1.--------------------------C.r. 39

-C[R3U6UUUASUUR1CCAgCUAUG------------------------ Z... 40

~~-NU .t. 41

-CACUU^UAUU6CAUUU--CAU-U-U-U-- --UUUU-U-U-U-UU------UU---UUU-----------------------------81 A U U U 6CU AU U --------------- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-C6j_ UUU6U6UE UU61 US6A6ICAU6C6CCUAAI_MN F-CC~ _ UUUUCUC UU6CIUsA6* AU C6C6CI8lU S-CC_UUUU6U6 UU6C U6j6j BC6

1626 163 164 1656 166 1676 s6

N.s. 42N.m. 43P.n. 44U.t. 45X.1. 46P.v. 47A.n. 48S.c. 49P.1. SOP.t. 51C.t. 52T.b. 53I..t. 54Z... 55T.a. 56O.s. 57

ri05

Nucleic Acids Research

43R.n.---~~~~~~IHS - - --. - - - - - - - - - - - - - - - - - - - - - - - - -

3 R. --. - - - - - - - - - - - - - --.

58D.c.-4R.n - -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- - - -- -..........................................- - __.-.IOO.n. ----------------- -- -- -- -- -- -- -- -- -- -- -- -- -- -- - -- -............................-- -.- -..-

1US.c.-

12T.t..--. ,..13 0.;. ---------------.__14Z.-.15 6.|..

16HNc.------------------------------------17HN.h.--------._........

20HN.h.-------_.._....___.,,_,_____21 Nv. ---.........22 N.?.23 S.;.---......__,,24T.t.--

25At.---------.- . .. . .. _

26 X.1 - - -- -- -- -- -- -- -- -- -- -- ----------------------------------- - -..................................-.- -.- - -..---

27 E.c. -----------------------------------298P.v.-----..........__.,,,,,

29g.S ----- ------------- - -- -- ------ ---- -- - --- - - - - - - - - - - - - - - - - - - - - - - - - - --...... ... . ....

9 PAc- - -- -- -- -- -- -- --- - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - -- - - ---.- - -.--.-.- -.-.-.-.-.

329 .;.------...... .

33..---,- ..,-...34F.h.------------------------------------35D.d.------------------------------------

39 E.g.------'-'-39Cv.r---------. ...

10 Zo - --. - - - - - - - - - - - - - - - - - - - - --_

13 O.s ---- - -- -- -- -- -- -- -- -- ----- - - --.- ... ..-..-----------

45 3.t. ---------- . . .

14 Le - - -- -- -- -- -- -- - -- - - - - - - - - -- - - - - - - - -- - - - - - - - -- - - - -.-.-.-.--.-_-_- - - - - -- -_-._ -.- _ .

16 H.c - ---------------- - -- -- -- -- -- -- -- -- -- -- - -- -...........................-- - -- - -..- - .-.

617.1.----------_ ...................... _ _.,,__,__ ,_

le P.t --------------------.-.-.-- -.-_- - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - -_._- - - -..-.

47.;..-------------------------------.-.---

53T.b.-_

16t96 170 1716 ,1n a,01 1s61

10O6

Nucleic Acids Research

.CCGA6CF~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~8?..18------C----- -----C----- ----- .--rC2GWs

.CCGABC--------- ----- ----- Or.-- 3GG H&

------C-----A----C1----- ----- ----.-----,A 4AG.,

-GUCAUCUC CUGC~~~~~~~~~~~~~~~~~~~~~~UUCA A 6SA6MTu-U(A)G UC UGAC6AA.d9.116~~~~~~~~~~~~~~~~~~~~~~ucuCCU6CsC DdO

91

.

- -06G S. r. 110-AT.t.

- - 12--.P-CA

- -- Os.---13---- .t

-CA- - .--14---C .s

.OAAG.m.------------ ------1AU 56

-Ac- 16- 1A A G o

-N.h.-17H c

- .18--.h-li

- - -

.19--.v-N..

- - - 20--.2-

- - - - - - - - --.- - - -21C . 2

--------------------------------------------------------------N.v. 22.S.s. 23

- - - - - - - - - - . 2---- - - - - - - - - - - - - - - - - - - T.tS. 24

.At. 25- - -. 2

.Pt. 26

- - - - - - - - - - - . . 2

-

- - - - - - - - - - - - - --27-- - P 2

.P.v. 28

- - - - - - - - - - - - - -- 2

-

- - --s.- - - - - - 29--- - - - 2

.lIe. 30

- - - - - - - - - - - - - - - D . 2------ -- - - - - - - - - - - - - -

li,. 313-N.s.

- - - - - - - - I -- - - - - --32- - N 3

-0.1.

- - - - - - - - - -33- M . 3

.Ph. 34

- - - - - - - - - -. 3

-0.

-- -- -- -- -- - ----.35- - - -- -.. 3

-N.e.

- - -36- P . 3

-As. 37

- -- - - - - -- - - - - -- -- N . 3

-- - - -3A 3

.r.. 39----------- ----------- ----------- .. 3

-i.e.

-- -- - - - -- - -- - - - -- -- - - - ----40- - - - - C . 3

.lit. 41

- - - - - - - - - - - - - - . 4

-N.s.- - -42N 4

---l----------------i---------------- .. 43-Os. 44

-- -- -- -- - -- - -- -- - - -- - -- - - - - - - . . 4

.3.t. 45

- - ---- - -- - -- - -- -- -- -- - - - - -- - - -- 0 . 4-------- - - - - - - - - - - - - - - - - - 8.1-.46t4------- - - - - - - - - - - - - - - - - - - - - - - - - - - - .-X. .

47

-

- - -- - - - - - - - --.- - - --4-6- 0 . 4

.S.c. 49-- --- -- - - - -- -- - - -- - - - - - -- - --- - . . 4

-P.,.

- -- - - -- - - - - -- -- - - - - - -- - - - 50---- - -- - . . 4

-L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----.t. 54

I1766 s,,6 ieo6~~~~~~~~~~~~~~~~sa6M, sAAI .5

M A Ors 07

Nucleic Acids Research

iNs.UC 6CGC6U CCCCC AACU31.n.UNC6C6UCCCCCAA^CU3 Ron. UCC 6 C 6 U C C C C C A A C U4.n. UC 6 C6NUCCCCC AACU50. UC 6 6C6CU CCCCC AACU6X.I.U 66NC6U C---C AACU7 As.. 6 AUACN6--C--UCUN S.C. UU6CU 16 UACACU91S.d. 66UAC1 AUAA ACU10 O.n. C U A A 6 C 6 A A C CC AC UllS.P. CUAAI6C66AU UUGACU12 T.t A A UA A CA 6C U U 6 U A C U130.s. C CUCC6 C UA6 C U14Z.m. CjFC)UC6CNANUUA6CU15 6... C C U C C A A 6 C U

a'uuuuuuu009uuuu

C ICCC I

CCICICLCIC IC IC lC I

34,

16 N.C.17 H.h.19 N.v.19 N.D.20 Nbh.21 N.v.22 Not.23 S.;.24 T.t.

_|C:666A C U 6 6 U AC U6 6 U_ _-6Nu 6ACU6N66UAICAuu6NU)I

-_ C U 6 6 C U 6 61;U AC A U U A 6 AAI---U 6CU6 6 CU666A C AUA6 6 6A_A6A6UU A c CAC UNCU CI

----_ AUJCCN66(C)A ACUAAN66.C-6UCCA61CCACCACUA66N

- - - -6 6 C 6N 6 6 6 6 C A C A C U A 6 6 6

. .? ,,, !..I. .|.?. .1l7?.... ...U? . . ,I.,I'o

Uv A 6 AN6 N A C A - A 6 U 6 6 C6N N C A 6 C C A - C C - C 6 A 6 A U N 6 A - 6 C A A(U)A A C A - 6U U A 6 A N N A C A - A 6 N 6N C N UUC A N C C A - C C - C 6 A 6 A UU 6 A - 6 C A A(U)A A C A - 6U U A 6 A 666 A C A - A 6NUN N C N N U C A N C C A - C C - C 6 A 6 A U N 6 A - 6 C A AIU)A A C A - 6UA6 A666ACA-AN6U66C 6UUCA6CCA-C C--6A6A AU 6A- CAAI )AACA -CUCA 6AA666 C A - AC U 6C 6 U U CA CA - CC - - 6A6AUU 6A - C CA A(U)A A C A - 6U U A 6 A 6 6 6 A C A - A 6 U 6 N C6 U U C A N C C A - C A - C N A 6 A U C 6 A - 6 C A A(U)A A C A - 6U U A 6A N N N A C A -(A)N N N N CN6 CU A N C C A - A-N A N A 6NN A - N C A A(U)A A C A - 6UUAN6A666A CU-AUC66UUUCAAN6CC6-AU A6A6NNUAN6N6 CAANUIACA -6UNA6A666AC UU-(ACCUNNCUCAA 6CA6-6(C)66AAN6UNC6C 6C A AI(U)AAC A -6U N A 696 A CU - UU NU6 - A C U A A A C AC- AA 99 A AN6U UN6 A66 CA ANU)AA C A - 6UNA 6 A N66 6 A C U - NU 66N A C U A C A C - A A6 6 A AN6NU UNU A 6 6 C A AIUNA A C A - 6UNA N A 66N C N AA N N C A A U A A NC C A - A UN A A N U NU A A 6 6 C A A()A A C A - 6U A N A 6N6 C U -(A)U6N6 C C N6 U U A 6 6C C - A(C)6 A A6 UU U6 A 6 6 C A AIU)A A C A - 6

U U 6N A - U 6 C C6 U U A 6 6C C - 6 C 6 - A A6 U U UN6 A 6 N C A AUA A C A - 6U UA6N A N16N A C U - A U 6N6 C C N C N U A 6N C C - A C 66NA A N6Nu C A ANA A C A - 6

35 35' 33'

6NACU-(N6CCNCUN6CCAA ANC6-6N(A)NAANAACN6N6CAACN6 N -66 A C U-(6)C C 6C N N C C A A A C N -6(A)6N6A A 6N A A C 6N6 6 C A A C 6N6N A - 66 A C N -(6)C C 6C UN6C UA A A N C N -6(A)6N6 A A N6 A A C 6N6N C A A C6NUN A - 66 A C U-(6)C C 6NUC N C U A A 6A C 6- 6NA)6N6A A 6N6 A A C 6N6 C A A C66U A - 66 A|C C -(6)C C U C U|t C U AA A 6 6 6 - 6(A)6 6A A 6 6AAU 6 6 6|C AA C 6 6 U A - 66 A C C -(6)C U A 6C 6C UA A N C NA- NAN6N A A6 6 A CN66N C A A C N ANUA-66 A C C -(6)C C A 6U 6A UA A A C U6 - 6A066NA A N 6 A6N U N C A C N UN A - 66 A C U-(6)C C 6 6C 6- UA A 6C C 6- 6A6 6 A A6 6 A 6 6 6 C CAC6 6 C A - 66 AIC U -(N)C C 6N6CIN - UA AlN C C 6- 6(A)N6IA A6 6A 66N66 C N NC6 A-N

25 A.t. - - - - - - U A 6 U|U 6 6 6 C A C UNCN A 6 6 626 P.t. A- - - - - - -ANAU6 A 6 CACU U A A U 6 C27Ec.N- - - - - - --CC NAAU6 6 6 A ACU A A 6 6 A2N P.v A- - - - - - - UNN6 C 6 6 6 A A CUC A A A 6 6 A29 I.s. - - - - - - -. ANUN 66NCACUCU AA 6 6 U30 N.c.-- - - - - - - A UNC 6II66JN UACU A 6 6 6 A31 Mo.e. - - - - - - - A 6NU 6 A 6 A uc uAC AAC A32No.s.----N----666A CNCUAACN6A33L.f..-------AU U6A6CCUA6A6A34 F.h. - - - - - - A A 6 6 U 6W6 6AC U C U A A A C A35 .d. - - - - - - ANU 6U 6 66NCUA C U A UU C A36 N.x-- - - - - - - AAA6U 66C UC UAN6 A 6 637 A.n. - - - - - --A6UU666CACUCUA6A6A

34,

NANANANANANANANANANANANAAA

CU-NDC66CC U -(N)C C 6N6NC U -(6)C C A 6 UC U -(6)C C 6 6 UC U -(N)C C 6 6 NC U -(N)C C C 6 NC U -(N)C U A 6 UC U -(N)C N A NcC U -lCC U CC U -j UC UC U -(N)C C C 6 6C U -(N)C C 6 6 NC U -(NIC C 6 6 U

35

NAUAANCCNANANNSAAN6 NNEC6 A C A A A C C6 - AON66 A AN6 6N6NN A U A A A C U 6 - 6(A)6 6A A N N U 6 6N6 16 A U A A A C C 6 - 6(A)6 6 A A6 6 UN6 66N6 A C A A A C C N - 6SA)6 6 A A 6 6N 6N6 6 16 A C 6 A C C NB - lAN)6 6 A A 6 6 6 6 6 1u -AAN6CUA- AN)66AA66N 66N1

NU-AAN6CUA- (A)6N AA66 U6N1N6U AN ANU6CU-A616 AAN66UN66616CAA ACA6jAN AN6NAANN66ANN66UUNAACC N66-(AN6NAA6NNUNN61

N U U A A A C C N -6(A)6N A A N 6 U N N 66 A C A AACCN-CANA ANNN66 N

35' 33'

3C Ea - - - - - - - - - - C UU C)A- U A AAU^6^ u*C U -(XC U 6 6U|U ^U U -ACC 6- 6(A)CG61 66A^C A C C( 5- A39 C.r.-- - - - - - - - UUNBN6UCUCUAAAN6A ACU-(6)CCA U|6U-AA6CU6-8(A)N6AA6 6AN6 6 A 6 A CN(U)C A -A

40 Z..-- - - - - - U U N6 6 A C C r U CANA C C -(N)C CN66 U A 6CC 6- 6NA)6 6NA A 6 NA6 A AU 6 A(6)(C)C A - A

412N.t.--A-A-N-N-C6A6U U 66 A A C C NAICA-AACCN6 6UAA CAA NC-AI6CCNN6AAA6A6 CNU A-C(U)CA-A

430.s.--------------------AN6UNANA-AAN6UAA6CAAAANAAUCAAACAUAAAAACNUVIAN6CA-A

46 xL. - - - - - - - - - - - - - - - - -- e F uc -l------------A---------------------------------- - yAAaN ANIAC-A

46P.p.----------v9OIAVACAO*AAvIAu-VA466AAvuAUUCAv-uuvAC CAACAs CA SIu A- Al

52 C,n - - - - - - - - - - - - - - - - - - -1 - - - - - u- - - - - - - - - - - - - - - - - - - - - - - - - - - - -1 - - - - - Z- - -

53 S.b- - - - - - - - - - - - - - - --- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -154Lt- -- ------- -------- -------- - - - - - - - - - - - - - - - - - - - --UU - - - - - - --s"xpl

5I osP.A. C TFCC A C A i6 1lU1C -W C A 6U0AUANN ANCA INI-U NA A A ANI6. 6AC(A)6 CA

P.t.-----UU^*^^Il^U*O)^ll^U*UU*-"jU NA(AIS(Sl"#"CAAA*

53T.b.---------.-----__________ ___________.54L.t..-------------________________._ ___

5A1. a. AA^ .INNC NCU ICCI NC Al C NIC AC - (NICC ANNNANANAU C NN- N(ACINSAA C3 At-C N(U )C A-

579.,. c ACNANlNWCNc6@ ll le -(N)t Co c AS NNAuNANC N- Alus lu U NAC IC A-*wI135I 1 1371 13511

ri08

A NNA C 6(U)CI- AA UNA C NINIC A - AA N 6 A C N(U)C A - AA U 6 A C 6(U)UA - AA U 6 A C 6(U)C A - AA U 6 A C 6(U)C A - AA N 6 A C 6(U)U A - AA N 6 A C N(U)C A - AA N 6 A C 6(U)C A - AA C 6 A C 6(U)C A - AA C 6 A C N(U)C A - AA 6 A C6(U)C A - AA C 6 A C N(UNC A - A

JIJIJIJI

I

Nucleic Acids Research

191? ., 19210

B- A U (U)C C B B66 - A U 6(U)C C 6 B 66A 6-AU 6(U)CC66666 - A U 6(U)C C 6 B 6 B6 - A U 6(U)C C 6 6 6 6

A 6-AU 6(U)CCU666ft B - A U 6(U)C C 6 6 6 Bft B - A U GtU)C C U 6 6 6

B AA C 6(U)U C U 6 6 66 - AFU A(C)C U U 6 6 66-AU6BU)CCU66 66 - A U 6(U)C C U B 6 6

A6-AC 6U6CUC6 6A 6 - A U 6(U)U C U 6 6 6A 6-AU 6(U)UCU66 6

6 - A U 6(U)U C U 6 6 6

27'

193? ,,

C U B C A C B C BICIB C UC U 6 C A C 6 C 6(C)6 C UC U 6 C A C 6 C 6(C)6 C UC U 6 C A C 6 C 6(C)6 C UC B B C A C 6 C 6(C)6 C UC U 6 C A C 6 C GICIG C UC C 6 C A C 6 C 6(C)6 C UC C 6 C A C 6 C 6CIG C UC C 6 C A C B C B(C)6 C UC C 6 C A C 6 C 6(U)6 CUC C B C A C B C 6(U)6 CUC C 6 C A C 6 C GIC)S U UC - 6 C A C B C 6(C)6 C UC C B C A C B C BICIB C UCC BC ACA 6C(C)6CU

25'

194? I... .5 ... I....

4 C A C U 6 - A C U 6 6 C U C A 6 C G U 6 U 6 C C U A H.s.f C A C U 6 -A(C)U 6 6 C U C A6 C 6 U 6 U 6 C C U A M.a.f C A C U 6 -A(C)U 6 6 C U C A 6 C 6 U 6 U 6 C C UA R.n.4C A C U 6 A A(C)U 6 6 U U C A 6 C 6 U 6 U 6 C C U A R.n.C'C U6-A(C)U66 C UCA6C6 U6U6CU UA .c.kC A C U 6 - AA C 6 G A U C A 6 C 6 U 6 U 6 U C U A X.I.CAC U6-BAA6AA UCA6C6 C6UCCU A..shC CAU -A6 A C 6 6 A 6 C 6 U C U A S.c.f C A U 6 -U A 6 6 A AA C A A AA A 6 6 C - - - - D.d.f C A C U 6 A C 6 C A U A A 6 A 6 U A C U - .n.f C A C U 6 -A C 6 CA U A C A 6C6 6 UA CU S.p.C AAU 6 - A C U 6 6C CA A A 6 U A U U- - T.t.CC ACU 6 A U6UAUCC AA C 6A 6UA U A UA 0.s.

C A CU - A U 66U A U C C A A C 6A6 U A U AU A Z..C A C U 6 - AU U A U U C A A C 6 A 6 U C U A U A 6..

36

G U C - - A G U - AiU 6 C C C C|G A A - - U|6 G 6 C U 6 66 CA ACiC 6 C 6 6 6 C U|A C A|A U 6 - 6 U C 6 A6 U C - - AI6 U - AIU 6 C C C C|6 A A - - U 6 6 6 C U 6 6 6 C|A A C|A|C 6 C 6 6 6 C U|A C A|A U G - 6 U C 6 AG U C - - A|6 U - A|U 6 C C C C|6 A A - - U|6 A 6 C U 6 6|6 CUA C|A|C 6 C 6 6 6 C U|A C A|A U 6 - 6 U C 6 A6 U C - - AIG U - AIU G C C C C|G A A - - U|G U 6 C C G 6|6 C G|A C|AIC 6 C 6 6 6 C U|A C AAU - 6 C C G A EG U C - -AJB6C - AIU 6 C C C C| AA - - U|U A U C C G G|6 C U|A C|A|C G C G 6 6 C U|A C AA6 - 6 A C A 6 66 U C --A G C - A U G C C C CBG A A - - UAC U C C U 6 6 C U CAC A C G C 6G 6 C UA CAA6 - 6 C U A 666 C C-t U - A|U G C C C C|G A A - - U|C C C C U G 6|6 C U|A C|A|C 6 C 6 6 6 C U|A C A 6 - 6 U U A 6 GG U C - - G C - A|U G C C C C|6 A A - - A|C U C C C 6 6|6 C C|6 C|A|C 6 C 6 6 6 U U|A C A A 6 - 6 C A 6 6 I6 U C - - A6U-|U G C C C C|G A A - - A|C C C C G G 6|6 C U|G COAC G C 6 A 6 C U6 C A AG-6C6666

ftCAAIACAAI

ftCAAI

ftCAAI

ftC AAIftCAAI

ftCAAI

4CAGF

6 6 6AA6 C C A A.c. 166 6 6AA6 C CA H.h. 176 6 6 UUC U A HA. 19

6 6 6 A C U A H... 196 6 6(U)C 6 A M.Ah. 2066 CU 6CUA A.v. 21G 6 G|U U C C G A At. 226 6 6 A U 6 CA S.s. 236 6G A U C C jA T.t. 24

6U C--(U CA

6U C--CC-

U C-- AUC -ABUC JA U C -A

AUC- - AUC -AAU C -- C -

AC U-- AU - AAUC -- C -

AU C -- C -A

6U C-- AUC- A6 U C --UC -A

6U C- -(C)UC -AGU C- - A UC-A

6 6 C(C)C U UB6 6 C(C)C U U

6 C(C)C U U6B C(C)C U u

B6 C C(C)C uU6 C C(C)C U U

B6 C C(C)U U U

C C)CIC U U

C6 6 C(C)C U U

B6 6 C(C)C U U

G 6 C(C)C U UB6 6 C(C)U U U

U6 C C(C)C U U

29'

6 U C - -AU C - AU GC C(C)CU UG U C - -A 6C -AIU 6C C(C)CU U6UC--AU C-A U6CC(C)CUU I6UC--AU C- AU6CC(C)CUU i

A--C

A--U

A--C

A--C

A--U

A--U

A--U

A--U

A--C

At--C

A--C

A--U

A--C

G G C U G 6G G U GG6 IA C C A G 6A G U A G6A C U U G GUCUUG6UCCUGGU C C U G6U C C 6 6

6 U C C G G6C C U A G 6ACCAG6U C C U G 6

27'

BCU

C UU

BCUIBCUg

6CU16CC Ul6CU

6CUBCU

6CUl6CU

C A C A C 6 U 6 C UA C|A|C A C 6 U C A UAC ACAC6U6CU

P C|A|C A C 6 U G C UtC A CAC6U6CUC A C A C 6 U 6 C UC A C A C 6 U 6 C U

kC AC A C 6 U 6 C UIC A C A C 6 U 6 U UC A C A C 6 U 6 C UC A C A C 6 U A C UC A C A C 6 U 6 C UC5'CA C 6 U A C U

25'

A - - U| UC C U 6 6|6 C A[C[iC A C 6 U 6 C U|A C A A Uf--CIAUCCU66I6ClUCIAICAC6UAAUACAAUt- -UIBCCCU66I6C6ACIAICACBU6CUACAAU- - UjG C C(C GAU)16C61 JCA^C 6 U6C UA^CAA^U

A C A A U|G - 6 U 6 G U gA C A 6 UA C A AU 6-66666 BA C A 61A C AAG - 6 C G C A UCAC A A IA C AA U - 6 C A 6 A UCAC A A 0

A C A A UB6 6 BC 6A C A

|A C A A U|6 - 6 C C 6 6 U|A C A A I

ACAAUG6-6CU66ACAA

ACAAU6-66B666ACA6AACAAU6-6AU66UACA6AAC A AU|6 - C C 6 CA C AA APACAAUG6-6CC66UA^CA6AACAAut6-CUCCB6A1CAGI

B66 A C6A A.t. 256 GGU Pt. 266A 6 A A 6 C 6 A E.c. 276A6A AA 6 C 6A P.v. 26B6 6 C A 6 C 6 - .s. 29C 6AA6CA A.c. 30

6AUUCAA tc. 316_A_G1U U 6 C A M.s. 326 Gc ^6CI-cluA M.t 33

5iLC A F.h. 34

6 6 61|A 6 C 6 A Md. 356C 6 U U 6 C CAM.x. 366BA 6|A C 6 C 6 A A.n. 37

36

6-66UUAABA CAAUA ASUBCAA6 - 6 U U 6 6 6A CAAUC A 6 6AC6 -66C666 ACAAA 666UC6C6AI -6CC666 ACA 666 UCGC6A

E.C. 38C.r. 39Z... 40N.t. 41

6 6 U - - 6 U A|- 6 C C C A - U|6A 6 6 U 6 6 - C A A 6AA U 6 6 6 C ulA C Au A c uC -c H. s. 426 6 U - - 6 U A - 6 C C A A - - - - - - - U 6 AWA) U 6 6 - 6 A A 6 A A A U 6 6 6 C U A C A U - U C U l- - - - - - - - - - - - - - - - Nl.m. 436 6 U - - 6 U A - 6 C C A A U - - - - - - - 6 A A 6{36 6 - A A A 6 A A A U 6 6 6 C ulA C A U ulu - U C U U U C - - - - - - - - - - - - - R.n. 4466U--6UA-6CCIA-UA6AlAA 6AAAU6A--6-A-A-AAU666CUIACAUU-UCU A-----A-m--------*.t. 456 6 U - - 6 U A - 6CA UA U - - - - 6 A A 6 U 6 6- 6AIA6AA U 6 6 6 C UIAC AUU C U UA - - - - - - - - - - - - - - X.1. 4666U--6U-6!CUU ^ U- - -- - - U U ^ 6 l- - 1^ IU ^ U 6 6UUA C A AIU A - A AJU U U A - - - - - - - - - - - - - - M.. 47

6 U C - -AU C - A 6 C AUUA - AUL U b U 6 6|6 CUA UA6 A C 6 U UA U -S ICC U U |AC AA A|6 6 6 A U16 C 6 A.n. 49AUC- -I! ^AUAA U6U)C(C)AUu A U- -AUAUuB6A UA6AC6 U -. .--MAAI6AGA696U-------M_A S.c. 49

GUC- -CUU- AC(6)6UCU6C6- -U AC6CU666CvCACACU6UU6CUAAiA A-6UAAIAAACA AUA6AAAAAACIA P.r. 56

6 U C - C U U - A C(6)6 U C U 6 C 6 - - U AC 6 U 6 6 6 C U C A C 6 U 6 U UA C A IU 6 - 6 UA A AAA C A AIU A6 A AA A CA P.t. 51

-T~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.b.53

-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L.t.S469U C - C 6 C A[U666 C(C)C 69U A - -96 66C U96 66CC A CA C A C U69 SC U A C A A U 6 -6 C AA AUG6 A C A A UG666A AB6C A A 556UIC--C6lC-AU66C(C)CUUA --UI666CUGG66CCIACIAICAC6EuSCUIACAAuI6-6CAAUIjACAAIUJ666IAA6CAA T.i. 566U9C- - C 6| U 6 6C- C U UA - - U 6 6 6 C U 66A6C CA CC A C 6 U 6 CU5AC A A U|6 - 6 C A AU7C A A U 6 6 61A A 6 C A A T.a. S

12 1 1946 1196I

rl09

6UC--6UC- -6UC- -6UC- -6UC- -6UC- -6UC- -6UC--6UC- -6UC- -6UC--6UC- -6UC- -6 UC - -6UC- -

6u66U66U6

6U66U6

BUG:BUBBUBBUB6U6

6U66 uBGGBUB6

U 6 C C(C)U UU 6 C C(C)U UU 6 C C(C)U UU 6 C C(C)U UU 6 C C(C)U UU B C C(C)U UU B C C(C)U UU B C C(C)U UU B C C(C)U UU 6 C C(C)U UU B C C(C)U UU 6 C C(C)C UU B C C(C)U UU 6 C C(C)U UU G C C(C)U U

29'

1U

2

13

4567

q

1112131415

Nucleic Acids Research

1H.s4 CCC- C6tCC 6CA-s6 UC6-666UA CU6US6AM CCA(U)U-C6 UU6UAUU66AUC66SAUUSCAASU2M-. CCC- U6 CC6CCCA-6C6C - 666UAACC C6UU6AACCC CAU)U - C6 U6AU6666AUC 6666AUAUCAAUU3

C- AC6CC 66CA- 66C6 -666UACC C6UU6AACCC A(U)U- C6 U6AU6666 AUC 6666AUU6CAAUU4 U.n. CCiVAU C6CCS6CA- 66C6ICl-666UVACCC6UVVACC CCA(U)U-CO U6ASU6666AUC GSSSAUUSCAAUU9S.c. CCJtjVU lCC 6SCA- USC6t- --66UAACCCSUUSAACC CCA(V)U- CSVUA^UA 6GASCSGSA^UUS CAAUU

7 A.,. CC C -(UISU -CC SAGA-U6 AC(C)-66 W X CC UCUUVC -CS6 6 U USU AU 66 A CUSC AA 66

S.c. ACC -66jCC - 66UC AACSC6^U U 6 * UCC6U- C6UU C UGUSSAVASASCASUGUAAUU

.d. 3C C 6U C C 6S6 A 6 6S6 A C A A C CC AU C U A UA 6 U 6 U U 6 A A U U

lOO.n. UCC)CCASCUCC6A6A- 6S6SACUM6SU AUC--6CUAACAUC USUC- C 6US6AUUU6AUAUAU CU UU6AAUU1US.p. UAC)CCB6CUCC6A6A- 66ICIASCVS6SU)AAUC--ASCAA UAVC6U-CS6U6AU6666AU A6AUCU UUGSAAUU12 M.t. IU]C C-U6 U CCVU 6656 A 6S A C- 66 CU UA U VA A C C A6 U-CS U U A 66 A VA SIU)U CU U U A AU U

130..s 6CC- U66U)AUCCUACA- SICC 666AACC-UAA6CUC6U6661AAAUUUC AU-C6U6AU6666AUAS6AUC AUU6CAAUU14 Z... C6C - 6CC 6ACA-66CC - -66-UAACUU666AAAUUUCAUC6 U6AU6GGA6UA6AU C AUU6CAAUU156.3.6 SD C UUSC66C AC A -U66JCC- 6SSUAAVCUU- U A6 CAU-C6U 6SUAV GU AUA6AUCAVU 6 CAAUU

36' 37

IN-.e,-- CUCC 6A6A-17H.h. - CUCC 6A6-toH.v.------ UCUC 6AAA-19Hl-,-_ CCCC 6A6A-20M.h.-- CACC 6A6A-21HNv.------ CCCU 6AAA-22.Nt------ CACU 6AAA-23S.s.------ CCUC 6AAA-24M-t---C Ct 6AAA-

25AMt- ACA26 P.t.-- - - - - AC C C27C.e.------C CUC2P.Y-- - - - -CCUC293.. - - - - - -AACC30H.c. - 6CC31 N.C -- UCCU323.s, UCCU333.?. - S--S-6C6634F.h.------ 6CU835M.-..--- _ 6ACC36 3.x.-- - - - C C C37A.n..A--- 6CC

UClA-

SCSA-

UCAS A -

6C6A-SC6A-AC6A-SUSA-SUSA-GUSA-UCAA-SCSA-SCSA-SCUA-

3 CE.g - UU 6U6 -

39C..--. ._- CUC 6U6A-40Za-.-U CUC 6C6A-41N.t ---- -U CCt 6C6-

66A6 U-C6eUTAU-CUC6 A 6 Sc u A A U- C U C

6A sAA-C6C jAU-CUC

G66 6A-C6CUU :AU-CUCI6U6 A-66AUAU)-CUCI6 66 A -C6C6 -t CUIU A6S-A66UAA-CUC66 6 6 - AC6C V--CC

I6G66 66 CAU-CCC

UGUC UA-CCUC6S 66-A A CCC

A6AUCA-A6C AC-CUCA 6 C A A6 C S6 A A - C ('C6SUVA-A6CCAAU-CCC6SV -A6 C-C 665UU66S-A6CE±.:U-CUC.C6UUA-UjFTiJAAU-C C

CA6C|A-AA AAUCUS6UU6- UC CAA -CCC6 66 A- 6CVUAA C-6CC6VU AA-A CAAAU-CuVC

A A

UA A

:CUAAA:C UA A A

:UUAAAVA AA

UUC6AU-C6UUA6UUC6A6UUSAU666C U6AAACUUCS6AU-C6U A6UUC66AUUUAU666UCU6AAACUUUC6AU-C6UA6U UC66AU U6A666CUU6AAACUUUCU6U-C UA6UUC66AU U6C666UUU6AAACCC U6UC - C6UAA UU C6UAU 6SC666UU6UAACUCUA6U-C A6UUUC66AU C6UU66C U6UAACUCCU66C-CU U6UUC66UAUUG6A66C U6UAACUC C U6C - C UA6UU6 6US AUCU6 A66CU6AAACCC CC 6C -CCCA6U A6 U6AUC6666C US6C AACU

C AAAA6CCAU--C|UCA6UUCGUAUA^UAAASCCAUU-CKUUAUCCGUAUA UAAAI6 U 6C 6 U-CAU C6UC 6 AUAUAAA6USCU6U-CU6UACC66AUAVCA*AAUCUSUU-CUUASUUCCGAUA C A A A U C USS U C UC AU6 U U C 6 A U

A6SAAA6CC6UU-CICCA6UUC66AUAAAAAA|CCAUU-CIUCAUUUCUSAUA AAAAACC6UU- C UCA6UUCG6AUAAAA&U-CCUCU-CUCAUUUCGA*UA^AAAAUCCAUU-CACAUUUCS6UUAAAAAACUCUU-C CAUUCCUSAU

A*UAAAACC6USSC|UCGUUSCASAUICCAAA}CC6USS-C|UCAUUCAUAU

U6CACUIC6CAUIUGA66UIU66A6UIC6 CA8UIU6CUCUIU6AA6UIUGAAUUIC6A AUUA6A6SUIU6CA6UIU6GA6UCU6CA66

37

U6CAACUUCA ACU

U6CAACUU6CAACUU6CAACUU6CAACUU6CAACUU6CAACUU6CAACCU6CAACUU6CAACUG6CAACUUSCAACU

UC A6 66A UIU6UA66CU6A ACUCAUUUCS6UAUjUUUAUC}U6CAACUUC A6UU C 6SA UIU 6CA66 CU 6CAA C UU C A GU C USA uI C AGUCjS GCAAC U

42 Ns- --- ---------------------- ------ CA---cc-Ck^^ cuAC 6AuA 6C C C u A U 6 A AC U

4.P.n.1 --------- --- ---- --V- -VCAC A--- ACUAAACAAAAUU UC6SAAACU

5P.t. ----- UCAA-CC----C.... U--UV VUUVA0AAAA C-C CASUAC6SAAIUUSUVSCUSAAAC47 . A- UC-UAA--- S----CU-VACAAA----CCUISAV C-CISCASUUUUCIUSCIAAACU

49^.n. U: ----- CUSAUA - GCIS -AGCSAA-- CCSISAAA JAUVSC-CIUVCASUUCSSAIUSU CVCU6CAACU

I 20 201103

rhlO

36'

AAAlUT-TTI3AA--UCUVTT UTTCUUASC-C.AS6CU-A6CII1U..CSUIUUAAAICCCAAC-C66U6U-A6CUAAAUCC AAAAACCC6UC-C,66 6 U-A6 C U A-AC A c C6UC-I

I

Nucleic Acids Research

UCCCCAU6AACGA661UCCCCAU6AAC6A5GIUCCCCAU6AAC6A66IUCCCCAU6AAC6A6GIUUCCCAU6AAC6A661UUCCCAU6AAC6A6GIC C C - C A U 6 A A C C A 6 6 I1U G C U C A U C A A C 6 A 6 61Ij0 A UC A U 6 A A C 6 A 6 G I

AGAUCUU6AACGA66dAGAUCUU6AAC6A66166 A U C U U 6 A A C 6 A 6 61UG6UCUUCAAC6A661U66UCU UCAAC6A66IU66UCU UCAAC6A66I

37'

20 ,I

AUUCC A 6UAUUCC CA 6UAUUCC CA6UA U U C C C A 6UA U U C C C A 6UAUUCC COA 6U

A A U(C)C C U(A)G UA A U U C C U A 6 Uk A U U C C U U 6 UI A U U C C U()G Uk A U U C C U(A)6 Uk A U U U C U(A)6 UkA U 6 C C U(A)6 UA A U 6 C C UA)G UA U U C C U(AI6 U

24'

.. .. I...Y?....I...2~ ... I .. .. I ..

A A 6 U 6C 66G U C A U A A 6 C U U GC 6 U U G A U U A A 6 U C C C U G C C C U H.s. IA A 6 U 6 C 6 6 6 U C A U A - A 6 CU U 6 C 6 U U 6 A U U A A 6 U C C C U 6 C C C U N.e. 2A A 6 U 6 C 6 6 6 U C A U A - A 6 C U U 6 C 6 U U 6 A U U A A 6 U C C C U 6 - C C C U R.n. 3A A 6 U 6 C 6 6 G U C A U A - A 6 C U U 6 C 6 U U 6 A U U AA 6 U C C C U 6 - C C C U R.n. 4A A 6 U 6 C 6 6 6 U C A U A - A 6 C U U 6 C 6 U U6 A U U A A6 U C C C U G - C C C U O.C. SA A 6 U 6 C 6 6 6 U C A U A - A 6 C U C 6 C 6 U U 6 A U U AA G U C C C U 6 - C C C U X.1. 6A 6 6 CG C A A 6 U C A U U - A 6 C U U 6 C 6 U C 6 A U U A C 6 U C C C U 6 - C C C U A.s. 7A A C 6 C A A 6U C A U C - A 6 C U U 6 C G U U 6 A U U A C 6 U C C C U 6 - C C C U S.c. 8AA 6 C 6 U A A U C A U U - A C|C U U A U G C UG A A U A U 6 U C C C U6 - C C C U D.d. 9A 6 C 6 C A A 6 U C A U U - A C|C U U 6 C 6 C U 6 A U U A A6 U C C C U 6 - C C C U O.n. 10A 6 C G C A A 6 _UC A U U - A CC U U 6 C 6 C U6 A U U A A6 U C C C U 6 - C C C U S.P. 11

A A 6 U 6 C A A 6 C A U C - A6 C U U6 C6 U U G A U U A UU U C C C U G - C(C)G U Tlt. 12A A 6 C G C 6 A 6 U C A U C - A6 C U C 6 C U U 6 A C U A C G U C C C U 6 - C C C U O.s. 13A A 6 C 6 C 6 A 6 U C A U C - A 6 C U C 6 C 6 U U G A C U A C 6 U C C C U .- C C C U Ls. 14A A 6 C G C 6 A 6 U C A U C - A 6 C U C 6 C 6 U U 6 A C U A C 6 U C C C U G - C C C U G... 15

38 38' 23'

C G C C C U C A U G A A 6 C U 6 A U U C 6 G U(A)G U A A U C 6C 6 U G U C A G C - A 6 C G C 6 C6 6 U 6 A A U A C G U C C C U 6 - C U C C N.C. 16C 6 C C C U C A U 6 A A 6 C U 6 6 A U U C 6 U()G U A A U C 6 C 6 U 6 U C A G C - A 6 C G C 6 C 6 6 U 6 A A U A C G U C C C U G - C U C C H.h. 17C 6 C C C U C A U 6 A A6 C U6 6 A U U C 6 G U(A)G U A A U C G C A U U U C A A U - A 6 A 6 U 6 C 6 6 U 6 A A U A C 6 U C C C U 6 - C U C C H.v. 19C A C C C G C A U G A A 6 C U 6 6 A U U C 6 6 U UA)6 A U C 6 C A U L U C A 6A - A6 A 6 U 6 C G 6 U 6 A A U A C 6 U C C C U 6 - C U C C H.S 19C A C C C G C A U 6 A A 6 C U 6 6 A A U C C 6 U(A)6 U A A U C 6 C 6 U U C A A C - A UA C 6 C 6 6 U 6 A A U A U 6 U C C C U 6 - C U C C A.h. 20C 6 C C C A C 6 U 6 A A 6 C U 66 A A U C C 6 U(A)GU A AU C 6 C A 6 U U C A UA- A UA C U 6 C 6 6 U 6 A A U 6 U 6 U C C C U G - C U C C N.Y. 21C 6 CC C U C A U 6 A A 6 C U 66 A A U6 C 6 UA) U A A U C 6 C 6 U 6 UC A U A - A CC 6C 6 C 6 6 U G A A U A C 6 U C C C U 6 - C U CC A.. 22C G C C C U C 6 U 6 A A C 6 A 6G AA U C C C U(A)6 U A A C CC C 6 6 6C CU A C -AC C C C 6 C 6 6 U 6 A A U A C 6 U C C C U 6 - C U C C S.s. 23C 6 C C C U C 6 U 6 A A C 6 U 6 6 AA U C C CU(A)6U A C C 6 C 6 U 6UC A C C - A|A C 6 C 6 C 6 6U24A A U A C 6 U C C C U 6 - C C C C M. 24

u 6 A A 6 U U 66 A Au c 6 c U(A)6 u AU 6 A A 6 U C 6 6 A A U C 6 C U(A)6 U AU 6 A A 6 U C 6 6 A A U C 6 C U(IA6 U AU 6 A A 6 U C 6 6 A A U C 6 C U(AG U Au6 A A 6 C U 6 6 A Au c 6c UA6 u AU6AA66C66 AAJC6CUAI6U AU 6 A A 6 C C 6 6 A A U C A C UMAIG U AU 6 A A 6 C C 6 6 A A U C A C U(A)6 U AU 6 A A 6 C U 6 6 AU U C 6 C UIA)6 U AU 6 A A 6 U U 6 6 A U U C 6 C U(A)6 U AU 6 A A 6 U U 6 6 A A U C 6 C U(A)6 U Au6 A A 6 6 A 6 6 A A UcC6 C UAG U AU 6 A A 6 6 C 6 6 A A U C 6 C U(A)6 U A

24'

A U C 6 C A 6 A U|C A 6 C -A U6C U G C 6S UA U C 6 U 6 6 A U|C A G A - A U|6|U C A C 6 6UAUC6U6 AUCA6A- AUG6CCACG6UUA U C 6 U A 6 A U|C A 6 A - A UG6 C U A C 6 6 UAUC6C66AUCA6C- AUG6CC6C66UAUC6C666UCA6C -AUA CC6C66UAUC6 C6AAUCA6CUAU6U C 6 C 66UAUC6C6AAUCA6CUAU6UC6C66 UAUC6 C6CAUC A6C CC616 C 6C66UA U C 6 C 6 U C A 6 C AA A C 6 C G 6 UAUUC6AA6UCA6C-A6CUC6G6U6AUC6CA9AUCA6C -AC CU6CG6U6AUC6C AG6GC A6C -ACU6C660U

38 38'

C|6 C C U A C A|U 6 A A 6 C CG60 AA U C 6 C U(A)G UA 0C 6 C C 6 6 UIC A 6 C U A U i C 6 6 C 6 6 U 6 A A U A C 6 U U C UIC 6 - 6 6 c c E.j. 38C 6CCUACAU6AA6CC66 AAUC6CU(A)6UAAUC6CCA6U CA6 CUAU AU 66C66U6AAUAC6UUCC C6-6E)UC C.r. 39C 6 C C U 6 C A U 6 A A 6 C A 6 6 A A U C 6 C U(A)6 UlA U C 6 C C 6 6 U C A 6 C C A U A C 6 6 C 6 6 6 A A U C C 6 U U C C C 6 - 6 6 C C Z.*. 40C6CCU6CAU6AA6CC66AAU C.6!CjUA6U AUC 6!U CA6CCAUA C66C 6U6AAUUC6UUC C C 6-6)6CC N.t. 41~~~-UA AA6A6UIC CUAA66A6UAUUUA6UAUAAAUUAAA6AAUA6A -U6AICUUAAUU6AAUUAU6 CAAU6 AA6SU N.. 43- U4JAA6US C AAGUA6^UAUU^CUA6j C

U AIAAUU AA6 jj*U A6 A-j^6 jCUU AAUUlSAAUA6A6CAAU6 - AAU U.n. 44AAS 6 6 AC A A A 6 6 A 6 6 A U U U A 6 U A 6 U A1A U U A A 61A U A 6 A - 6 C U U A A U U 6 A A U A 6 A 6 C A A U 6 - A A 6 U R.n. 4UA UA CAAA 6 6 A 6 6AUU UAVI A AACUAA 6AUUUASA-[CUUA6UuI6AAUUA66CCAU6-AA6C .t. 45

AUC AAAA66C66AUUUA6CA AA6AWA A CAA6AB6 A6UU(CCUCUUUAAAAC66CC CU6-6A6C X.1. 46AX A U U U UU 6 A A 6 6 U 6 6 A U U U 6 6 U A 6 U A1A A U U A UIAA- A 6 A - U A1 U A A U U u1I U U U U A 6 C ULU A - A A A U M.. 47

CSACUACAU6AAUAAU6AAUUACUA A COUSNAUCACCAD UCAC6AU6AAU-UAAAAAAMC-A6~ffA.s. A8C6AUUAUAUSAAAAAA6AAUU6CUAIUA ACGUAA UASUA UACS SUAAU-AUUCUAAC-UBUIJ!JS.c. 49U|6 6 AU U 6 AAI6 A A 6 A A A6U|U U A6 C A U AU A6 U A u

BUc6Uv

u IAh A AU A A A AC U AS -

A 6 U P.c. 50U66AA6CAUA6AAAAA6AAAUCUUU(A)SUAAUC6CAA A AA6U-A U UC6 A AAAUAUAA6U CA-AAUC P.t. 51

--- --- --- --- --- --- --- --- --- --- --- --- U A - 6 6 U C.?. 52.-AAA(-A-AG-U- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 6 6 U T.b. 53

--t-- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- A - A 6 6 U L.t. 54

Cl6 6 6 A A C A U 6 A A 6 U U 6 A A U C 6 C U(A)6 U AIA U C 6 C 6 6 A UIC A 6 C - A Ul6lC C 6 C 6 6 Ul6A A U A U 6 U A C CI6 - 6 6 C C T ..................................56C66 6AACAIU6AA6UU66AIAUC6CU(A6UIAIAUC6C66AUICA6C- A 6UCC 6C 66 U6AAUAU6UACCC6-66CC T.a. 57

2 205A2U '204 U0.210

rl 1 1

A uA UAUA LAUA LAt!LALAA UA6L6 L6 UGU

6 A 6 U G CA C U 6 C I

A6 c u c cIG A C U C C I6 A C U 6 C I6 A G A 6 C IA6A C U C I6AcuucI6 A C U U C IG A C C U C U6 A C U 6 C IA6 c U c Ci6 C C U 6 C 0

37'

6 A A U A C 6 U U C C6 A A U A C G U U C C6 A A U A C 6 U U C C6 A A U A C 6 U U C C6 A A U A C 6 U U C C16 A A U A C 6 U U C C6AAUAC6UUCU6AAUAC6UUCU16 A A U A C 6 U U C C6AAUAC6UUCC6 A A U O C 6 U U C C16 A A U A C 6 U U C C6 AA U A C 6 U U C C

C 6 - 6 G C C A.t. 25C G - 6 6 U C P.t. 26C 6 - 6 6 C C E.c. 27C 6 - 6 6 C C P.s. 20C 6 - 6 6 CC U.S. 29C G - 6 6 CC A.C. 30C 6 - 6 6 U C A.c. 31C 6 - 6 6 U C A.s. 32C 6 - 6 6 CCC .?. 33C 6 - 6 6CC F.h. 34C 6 - 6 6 CCC .d. 35C 6 - 6 6 CC A X. 36C 6 - 6 6 CC A n. 37

23'

ccccccccccccc

213 2149 ,. 2159 §, 2169

. 217?

CU A C U(A)C C6 A GAU 66 U(U)A 66U66 6 C C(C)U C 6CC C C CCM6 66C U A C U(AMC C 6 A|U U 6 6 A U66 AU U(UA 6 U66 A 66CC C U C 6 6 A|U C 6 6 CIC C C 6 C C 6 6 6 6C|U A C U(A)C C|6 A|U U 6 6 A U 66 U U)A 6 U|6 A6 6CC C U C 6 6 A U C 6 6CC C C 6 C C 6 6 6 6C U A C U(A)C C|6 AGU U 6 6 A U666U UA 6 U66 A6 6 CC C U C 6 6 A|U C 6 6CC C C 6 C C 6 6 6 6C|U A C U(A)C C|6 A|U U 6 6 A U 66 U U()A 6 Ul6 - 6 6 C C C U C 6 6 A|U C 6 6 C C C 6CC 6 66 6C U A C U(A)C C|6 AGU U 6 6 A U 6 6 U U)A 6 U 6 AS U C C U C 6 6 AIU C 6 6 C C 6 C C 6 6 6 6CIU A C U(A)C Cl6 AU U 6 A A(U)6 A U U(U)A 6 U|6 A|6 A A CU U C 6 Al- C 6 A C C 6 C C A 6 6 6 C(CUA6 U(A)C CG6 A GU6 A AU 66 C U(U)A 6U6A6(6)C CAC A 66AGUC 6 C UU(A)6 A 6A(A)66CAUCCU(A)C6CA ACAAUG6AA AAIAA6SUUUUUAucu6

C C U(A)C C6 AG UUC GAG6 U A CC6 AAACCIAAUUU66 A|C C 6 C 66 A A 6UI---C U C C U(A)C C G6A|U U U 6 A 6 U 6AAC C 6 6 U 6 A AC A U U U166AC C 6 C 6 6 6 6 C U --

C U U 6 U(A)6 UGA A - C 6 A A U 6 6 U C U6 6 A U 66 A 6 C 6 A C A - - - - -

C U C C U(A)C Cl6 AGU U 6 A A U 6 6 UAC C 66 U6 A|A 6 U 6 AUU C 6 6 A|! C 6 C 6 6 C 6(A(C 6 6 6 6 6C U C C U(A)C C6 AU U 6 A A U 6 6 UIC C 6 6 U6 AA 6 U 61U U C 6 6 A C U C 6 6 CICG C - - - A ClC U C C UAAC C AU U 6 A A U 666 C C_6 6 UPA 6 U 6U U C 6 6 A U 6 C 6 6 C 6(A)C 6 u 61A 6

39

16 H.c. U U G C A C A|C A C C G C C C G U C A17 H.h. U U G CI AIC A C C G C C C G U C A18 H.v. U U 6 C|A|C A|C A C C G C C C G U C A19 H... U U GC|AG C A|C A C C G C C C G U C A20 H.h. U U G C|A|C A|C A C C G C C C G U C A21 M.v. U U 6 C[A C A C A C C G C C C G U C A22 M.f. U UG CGA|C A|C A C C G C C C G U C A23 S.s. U UG CCA|C A|C A C C G C C C G U C G24 T.t. U U 6 CAC A|CA C C G C C C G U C G

A A U C(A)C C CA A U C(A)C C CA A 6 C(A)C C CA A U C(A)C C CA A C C(A)C C CC A C C(A)C C CC 6 C CAC CC C C(A)C C CC A C C(A)C C C

6 A6 AAA6IAA

AII A G6U 6 6 6 6U C 6 6 A UG6U 6 6 6 6 U C 6 6 A U6 U 6 A 6 6JC C 6 6 A UGAGASGGUCCGSAU6 U A 6 6 U C AU66 A U16 U U 6 6 6 U U C A 6 6 U|A A A 6 6 6 U U U 6 6 A U6 C 6g]6 A A A 6 6 6 6 UG666A6UUCUCU6C

G-A6 A6 A6A6A6A6AA

6 6 C C 6 6 C - - - - - - - - - - - - - - - - - - -

6 6 C C 6 6 C - - - - - - - - - - - - - - - - - - -

6 6 C C A C C- - - - - - - - - - - - - - - - - -

6 6 C C 6 6 C - - - - - - - - - - - - - - - - - -

6 6 A U 6 U AU -- - - _6 6 C C UU 6 6 C - - - - - - - - - - - - - - -

6 6 C C A U A 6 U C|- - - - - - - - - - - - - - - -

6 6 U C C C U U 6C-- - - - - - - - - - -

6(6)C C C C U C 6 C UG66- - - - - - - - - - -

25 A.t. U U 6 UAC A26 P.t. U U 6UGA|C A27 E.c. U U 6 U|A|C A28 P.v. U U 6 U A C A29 3.S. U 6 UAC A

30 H.c. UU 6 aACA31 N.c. UUGGACA32M.s. UUGU ACA33 M.. U U 6UGAC A34 F.h. U U 6 U|A|C A35 D.d. U U 6 U A C A36 M.x. U U 6UGA|C A37 A.n. U U 6 UGAC A

C A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C A|C A C C 6 C C C G U C A

Ic A C C 6 C C C 6 U C A|C A C C 6 C C C 6 U C AICACCGCCCGUCAC A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C AC A C C 6 C C C 6 U C A

38 E.g. U U 6 UAC A|C A C C 6 C C C 6 U C A39 C.r. U U 6 U|A|C AIC A C C 6 C C C 6 U C A40 Z... U U 6 U AIC A C C 6 C C C 6 U C A41 M.t. U U 6 UAC A|C A C C 6 C C C 6 U C A

C A CC-UG A 6UAGUAUUACCCGAAGUAG U 6Cf C U- A A C C 6C A C CA)G66 6 A G C 6 6 U C U C 6 C C A G A A G U A 66 A AG C C U - A AC CIC A C C(A)U 6 6 6 A G U 666 AU U S C A A A A G A A G U A 66 U A 6 C U - A A C C UC A C C(A)U 6 6 6 A 6 U 6 6 6 U U 6 C A A A A 6A A 6 U A 6 6 U A 6 CAU- A A C C U-C A C C(A)C 6AG A 6 U U U 6 U A A C A CCC A A 6U)C 6 6 U 6 6 U -A A C C AIC A C C(A)C 6 A A A 6 U C 6 6 C A A C A C CC 6 A A 6(U)C 6 6 U 6 A6 6 U -A A C CA-C A C C(A)U 6 A 6 A 6 U U 6 6 U A A U A C C A 6 A A 6(U)A 6 6 UAC U U- A A C C AC A C C(A)U 6 A 6 A 6 A A 6 6 UA A A A C CLA 6 A A 6(U)A 6 6 U A G(A)A U - A A C C A -A 6 C C(A)U 6 6 6 A 6 C C6666 6 U A C C U6 A A 6 U A - - - C6 U -A A C C - _A 6 C CMAU 6 6 A A 6 U U 6 6 666U A C C _J A A 6XU A - - - - - C6 U - A A C C - - _C A C CWAC GA A A GAU C 6 6 AUGU U A C C C GAA G(CC 6 AGA C C - A AIC CAAG C .C A C C(A)U 6 6 6 A 6 U C 6 A U U 6 C U C C A 6 A A A U C A U C U C A C C -ACA C C(A)U 66 A A GAG 66 C C(A)U 6 C C CGA 6 G Uf|C C C U- A A C C 6 .

39

CACC(A)U66 A A6UC66CU6(U)6CCG A6(U)6AUU A UCUU -6CCUA --- - -- - -C A C CA)U 66 A AG C U 66 A A C USC U C C A A 6(U)C 6UU A C C C ULAI - - - - - - - -

CACU(A)UAA666 6C(U)66CC(A)66UUU6A6)6UCAUUACCCUAACC -- - - - - --

CACCU(A)U 6 6 6 CU66CC(A)U66 -6(U)C6UUACCUU C C

42 H.s. 6 C 6 U i C A C A C C 6 C C C 6 U C A C C C U C C U C A L6 U A U A|C U U C A A A 6 6 A C A U U U A A C U A A A A C C C C U A C 6 C A - - -426.;. GC A^CAICACCGCCCGUCACCCUCGCAA UCAU^**^CAAAGGCAU ^ U^UUAC CCCU 6CGACCA-- -

46 AJC 6 A C A C C C C 6 C A C C A A A LA A AA A U VA A CA A6 A A A U CA U A A C6 A

44 R.n. A C 6 C A C A C A C C 6 C C C 6 U C A C C|C U C C U C|A AU U 6 AU6ACAUV C A C A U A U A - - - - - - - - - - - - - - - - - - -

45 Lt. A C 6 C A C A C A C C6 C C C 6 U C A C CC U C C U ClA*A A V A 6A U6 Al U C V - - - A A C C C U A U - - - - - - - - - - - - -

47 l.w. AU 6 fUl,ACACAU C 6 C C C 6 U C 6 JUC U U AU U A U U - - - - - - - - - - - - - - -

49 S.c. IUC6ACUUC4CUCAUCAC 6CuueucuaC 6ccUU AA C AU AUU AU Ul CUciuuuluulldUAuutUAUINA9fV l A V ASGP.P. CCU CACAUCU6CCCAUACAC6CUCAAA6A6UUCAAAUCGU66AA6CU6AAUUU---SlP.t. CAU CACACGU6CCCAUCAC6CUCAAA6A6AUGUAUAU CUAAUU A6 CGU6AA GUICCCMUUR-R--

52 C.f. A - - - - -UU 6UU6C C CA C CAAUUU --------- -- -- -- -- -- -- -- -- -- -- -- - - - - ---- -

53 T.b. A - - - - - U U 6 U U 6 C C C A C C A U U U U U A] A U A A A A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

54 L.t. A - - - - - U U 6 U U 6 C C C C C A U U C U U U 6W A A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

552..8.C6U1~IFICACC6CCC GCAICACCCUCCG66AC66C61CCC6A26CAGC66A1CCAA1C2AACACCC2U61

1 2116 2126 2136 2146 2156 2166 2176

rl 1 2

Nucleic Acids Research

1 H.s.2 H.&.3 R.n.4 R.n.5 G.r.6 X.1.7 A.s.A S.c.9 D.d.10 6.n.Al S.r.12 T.t.13 6.s.14 Z.o.15 G.D.

.. I..

U U 6 U _C AUU6U A CA

U U 6 U A C AUU6U A CAUU6U CA

U U 6 U A C AUU6U A CAUU6U A CA

U U 6 U A C AU U 6 U A C AU U 6 U A C A

UU6UUACA

U U 6 U A C AUU6U C A

UUA6UACA

211? , 21

C A C C 6 C C C G U C G

CACCGCCC6UC6C A C C G C C C 6 U C 6

CACC6CCC6UC6C A C C 6 C C C 6 U C 6

CACC6CCC6UC6CACC6CCC6UC6CACC6CCC6UCGCACC6CCC6UC6CACCGCCC6UC6CACCSCCCGUCSCACC6CCC6UC6

C A C C 6 C C C G U C G

CACCGCCC6UCGCACC6CCC6UCG

I ... I.

_-

Nucleic Acids Research

19iII?I 2220? 221?,222

8C 6C C ---------------- CA C6 6 C C - 60C 6JT1C8GA6 A C6 6 UC6A ACU(U)G- ACUAU C UA M4.s.U C E6CC----------------C A C 6 C C C USC B6A 6C 6IC UGA A A6A C 6 U C A A CU(U)6 - A C UA UC UA N.D. 2U CRUCC----------------C A COG C C UGU6- C 6 6 A6CIC U6A6AA6A C 66UC 6A AC U(8)6- A C U A UC U A R.n. 3U C 66CC----------------C AC86C C U UGC 66A 6 CIC U6 ABA A 6A C 66U CG6A A C U(8- A C UA UC UA R.n. 4U C C C---------------C-CAC66cCC U6C66AAACA6C8AACAA8AC U6C 6A AC U(8)G - A C UA UC UA D.c. 5C88CC ---------------AC6 CC C U6 6 C6 6l C C 6A8A A 6ACBA UCA A AC UMS- A C UhUC U A X.1. 6

MAG CUCC.---6 (06CUGC8CU 66UG UU68 AAA8866U8 UUC AA AC U(8)G-A UC(C)UU UA A. s. 7_681--------6C-------A AC UC C(AU C UCMA16A 6C 66A6A A U U66A)C AA ACU -(8)6U6CAU U 6 S.c. 8------------------------ 6 6C AAC AC8U6 AUAjQA A A UUAAA AALA_UC8UC -A UU 688UUUD.d. 9

.--------------C UC 6UAU U CfUi6US8AJA.A A 8CUA UAAACCA U- AUCAC UU A 8.;n. 10----------------------------- C UU8CA8UC IjUJ6U $AIA A AUCUA6UA AA CC AU-A UC AC U UA S.P,. 11

c1 IIU uC 6.--------- C CG6C C cCCC6 A)C - 6BUC -96- C 6A6AA 6UCC AU U6AA CC U U- A UC A U UUA 8.s. 13

39,

.----------------------C1AA~C 6CC 6 6UCI6AIAU C U 6 6 CUU C GC N.ma. 19-------------------------------------------AU86 A UA U 6 UCI6AIAU C(8G6 66UUUU6 C N.h. 20------------------------------------------U U U C UA 6 6 UCL]AACC U 666CUC A 6C N. v. 21

----------------------------------------- AUAA6U6 66 66AUCGA AC UCC U UUCC C 6CS.s. 23.------------------6 C AAC A6 6U 66 6 66(A)C 6A 6C A 6A 6AACU C CC T.t. 24

.------------------C AA - 66A66C96 CjUJA CCAC66 UA- 6 66UC A6CA.t.25-------------------------------------UAA -66A 6G6 C 6CU UACCAC66 CG6- 6G6 UUC 6U P.t. 26-------------------------------------UC 6- 66A66 6C 6C U UACCACUUU 6- U6AUU C AU ECc. 27

.------------------U C 6-G66A66 6C 6C UUACCACUUUG- U 6AUU8C AU P.;v. 28

.------------------1I8UrA 66A 6CCM C C6(C)C 6A A66U6 - 66 ACA 6A U 8.;s. 29-------------------------------------U UCA!68A 6CCj6 C C6(C)C 6A AIS66U 6-[ii6UC 6A U N.c. 30------------------------------------- uU 6 6 A 6A 6C 6C U (C)CCA AIS66UA -I66ACU A 6C N.c. 31

.8-----------------uuU-8 6AW8C 6C UU(C)CC A A66 U A - 66ACU A 6C M.;s. 32

-----------------------------------------6CAA6 A C 6UC CU A 666UA-IjA.. CC 6A U F.h. 34-------------------------------------A A U -6 CA6cC8(8)C7UAC66 UA- 66C C6 A UD.d. 35

------------------------------------ 8C6C86 6 666C6(C)C6A66 U A-666CU6A U A.;. 37

39,

.G------------------6A A AA 6A 66 A A AUA(C)CUFI 6 6C(C -)U 66CU 6 6U E.. 38

-------------------------------------6 UAA6 A 6 66 6 A U 6C( CJ 66 C U-(A)G6 CU(U)GC Z... 40.------------------6 C AA 6 -86686A U6(C) C2. -6 C 6-(6)6 6C UA 6U N.t. 41

-------------------------------------------------- UA AACACAC AAUUAACA C AA X.1. 46

UAU~AUA6CA6 ---------8UUAUAUA UAU UUAUA A 6 UUA A A U UUj 6CAU C -AU A C U U A.n. 48

4A-ONU8UCMC -- - -C AA A8A A8AS8UA A A C6AAACC UA-U86 A AA8U P.t. 51

C UUCUBU 6 UA C C A C UA6U6CCACAAA 66C CU U U66 U6 6UCIU UA U U66C6CA UA C CAC 6886U6- 6 6UUC UUC T.a. 56

1 2136 2196 -o6 2216 2-26 2236 2246

rl113

Nucleic Acids Research

4Ac.

6 AjG6 A AGU A AA A6 U C 6 U AAC A A 6U U UC C 6U A 66U6A A CC U6 CG66AA 66 A UCA U UA--------------6 Ol. J6A6VAMA6UAA AA 6 UC 6UAA CA A66 U UUC CG6UA66U 6 AAC C UG6CG66A A 6 A UC AHUUA--------------

9 I.d. 6 A 66(A)A 66AGA AGUC6 UCGAAA C AA 66UAUC C 6U AG66G6AHACCU6C 6G6A UG6 A UC A UUUU-------------10 0.n. 6GA GG(A)A 66AA A 6UC 6UA A CA A 6 6UUUC CG6UAGA66U6AACC U 6C66GA A6 6A UC AU UA--------------

130D.s. IAI66M(A )6AG6AA A 6U C6UA AC AAG66UU UC C 6UA 6G6O6AA C C U6C 6G6A A6GA UC AU U6 --------------

150G... 6WA6 6(A)A 6 6A6 AA 6U C 6U A AC AA 6 6 U UC C 6 UAG6U6 AACC U6 C 66A AG66 AU CA UGU6--------------40 40'

10 H.v, 1AAG6 6(6)6 CU UAGAAUC6 A A C A AG6UA 6C C 6UA66 6 6A A UC UG6C 6 6C U 66AU CA CCOUC CU-----------19 A.s. HAIAGG6(6)GAAAUUAAGU C 6U AAC AAUA6OHC C6UAAGGGGH6ACAGUCU6C66C U66A UC A CC UC C U-----------20Wh.h. A AH6 66(6)G6UUAA6HAU CG6U A AC A A6G6OUAG6C C 6UA 6 66HAAUC UGC 66GC UG6 AUCA C CCUC CAU-----------21 W.v. GA 6 66(6)66U6 AGA 6UC6UAWAC A GA6UA 6C CG6UAAG66WAAC C UG6CG6G6C U GA UC AC C UCC------------22 M.?. GHGAGGG1A-166CGA AGOUC GUA ACAAGGA6UAG6C CG6U AGG66G6AHAC CAUG6CGG6C U GA UC AC C UACCU-----------23 S.s. GA 6 66(6)6 6A 6GAHA 6 U C 6GUAHA C A AGGOHUG6C C 6 UA GGGGA6 CAGAcC 6c66 COG H SAUC A C C A C A------------24 T.t. GA6G6(6)G6UG6AAGUC 6U AAC AAG66UAHC CG6UAGGGG6A AC CUGCG66C U 6AU C AC CUC C ------------

25 A.t. A C UG6(6)6G6U6AGA 6UCG6UA AACAAGGOUAG6C CG6UAWGGG66AA C C UGC 66CAUGG AU C AC C UCCAUUUC A .26 P.t. GWAC UG(6)G6UGAGAOUCG6U A ACA GGUAWC CG6UAUCG6G6AAG6U 6C 6G6C U6 6A UC AC C UC CU UUC A -------27 ECc. 6A C U6(6)GGU6AGAAG6UCGOWAA CAWGGUA AAC CG6OWAGG6GG6A ACC UG6C 66 UGUG6AAUC AC C UC C HA---------2A P.v. 6 ACAU6(6)6G6UG6AA 6AUCG6U A A C AGGA66UA A C CGUAG6G6H6AAC C U6CGUGU6G6A UC AC C U C C UUA --A .---29 RIs. GHAOUUG(G)GGOGAA WGUCG6UAWACWAGAGOUAG6C CGOHUAOUCGG6AWAGGOG6CG6CAUG6A U C AC C UOC CUOUUCO.---30 H.c. A UAUG(G)GG U6GA GOUC OUWAAC HAGAGO6UWG6CCGUA U CGGH6AGAGOG6CG 6C U 6 UC AC C UC CAUUUC U---.

32 H.s. 6 AUUG6(6)6GGU6AGAH6GUCGWUAHAC HAGAGOUAUC CG6OWACG6GG6WHAC GOUG6C GG6AOUG6AUC A C COUC COOSU CU-------333.?. GAC UO6(6)GG6CUA AA6GUCGOUAHACAAWGGUAWGC CG6UAHCCGGAAWG6UG6CG6COUG6AHAC AC COUC CUOUUCAU-------34 F.h. A AC UG6(6)GGCUAAA UCGUA A C AAGGUAHXX XOXUA C CG6GAAHGIXOXXOXXGXXX6A ACWACCAUC C UAUUCOU-------35 D.d. GWUU 6(G)6GOUGWAGAOUCG6OHAA CHAA66UAGG6 C CGOWGAGGG6A A C C UG6CGGC UGG6AOUC AC COUC COUOU---------36 Hm.;. ACOUG(G)GGGWU6AGAOUCG6OUAAC AWGA66UAG6CCGUAG6G 6A AC C UG6CG6G6CAUG6AUCHAC COUC C UOUCAU-------37W,A.nGWCAuG(G)GGOAWAG6uC6UCGOWACWWGAG6UWAGCCGUA CCGG6AAWGGOGOGG66C UGGIAOuC A C CAuC CUAo.-

40 40'

39 Cr..ACGWUCA(6)6G6OUWWGA6UC GWAAC WWA6GGUAWG6G6COUA COUGG AAWIGGOG6G6C C COUGI C AC AC C AC C UUC---------

42 H.s. UJH 6 A 6 6 A 6 A C A H 6 0 C 6 0 A A C HAU6 6 A A A 6 0 6 A A C U 6 6 A H A 6 A 6 C A C A A 6 6 A C 6 A A C---------------43A.GGGWWAGCAWWGAHG ACGWHAOCOGWAO43R.&. 6GA6GA6GGWGWAOUAWA6AUCGOWUAACWAWGGOUA AG6CWAUAHCOUG6WWWAOAAG6U6U6CAAU6GWWOAA.UAAU------45 ..6A6A66A3..GGGWAWBCAWA CWAGABWWGA6CWAOWACOUGG6WWAWS6OGAG6C U U66A AUWAWoU---------------460X.1. 6AA6AGWG6 C CWAWAGUC 6OUWR6A C CWAC UG6AWGAOCG6GRA6CWACCACC6 A CR AWO--------473D.w.. 66A66OWAGAOAS UCG6WWCWOWGOAUOWUACOU6GRWWWRO 6U- CUOAR UAWOAA----------------486W.n. 6WAOUUHB6OUROUAHWROUCB6AWAWOUAOUSSOOUCRO6OWWAOARUWHWUo6CWACHS6WoASUW6A AUUA--------------49 S.c. WHOU WAGU6CB6WAAR6UOU6SAAWAUWACWA6 00UACCGOUA 6 S6WWAC C USCG6O6U6S6COUWUWAWUAOWAUO UC U UAWAOAUAUU5HP.P. GWAC6AGGW6GARU6AWSOGCWACWAWARROUWACCS6ROUA(OS6SAWWCOUO6 C CG6GOUR6AG6CAOG6WAUO6WWAC A6AWC C WHAAUA51 P.t. 6AAUC 6G6AW6USWAAG6UOUGACWAC AWAGROW AC C 6 6 UMUM)GAWAC(U OGC C6GOUSS6AG6C A A6BOUGWAWAC A AC C A AOU52 C.?. UAAAAAU--WAWAWACG6OG6CWAGOUWWAOOU-WWARA)UWAMCOUUAOW AAAWWWWUW)CWAoU----------------------53 T.b. --WAAAWAAOWUCAAUCWBUWAAO-AWAWAOAOAWOWWWWWOWUWRO-----------------------54 L.t. - - UAAWBAC6HWAAC 6 6CAGCGUWA AOUU- WWAOWAOUAOUOOUAOWWWAWAOWAAOWAOOAUOU----------------------552...::AC UAGGGGUOIWWAG UCG6OWAWACWWAGR6AWC C 6 666AAWGWC C UGA6GU66COU6WAOUU6WAOAUCC------------561. SAC UAGGGGU AA6WWUCG UAWACWWGG6OWUG6C C 6 UWA6G6GWWAC CAUO6GU66C U GW6AGUWW6AOUC c------570O.s.ACU 6 6U AGCGGOAW6GUCG6OWWAC AAGGHUG6CCGWGGWCO6UAGOG6AAGU66COUSGWAUOUGWAAOUCC------------

2 224 2271 2-4 2294 2346 2316

rl 14

Nucleic Acids Research

2320...,. I,...-- 0.,. 1_-. . 2

R.n. 3R.n. 4O.c. 5

____--- X.1. 6___

- 1,6,A. . 7S.C. 8D.d. 9

- .n. 10S.P. 11

T.t. 120. S. 132S.. 14

----6... 15

.Hc. 16H.h. 17

-H.v. 18R.D. 19A.h. 20M v. 21M. . 22Sn. 23T.t. 24

A.t. 25P t. 26E.c. 27P.v. 28

_ .s. 29".c. 30- c. 31

Mn. 32- f. 33

F.h. 34-D.d. 35

M x. 36A.n. 37

En.j 38Cr. 39Z... 40MN.t. 41

- - - - - - H.ns. 42M-- - - - M. 43

_- R.n 44-- - t. 45- - X. l. 46-D.Y. 47

- - - - - - - - A.n. 48C U U A CA - - S.c. 49U A A A UG C C P.r. S0C A U G CA C C P.t. 51- - - - - - - - C-.? 52

T.b. 53- - - - - - -- L.t. 54

2... 55------ T.a.5S

-- - - - O.s. 57

2320

r1 15

Nucleic Acids Research

ACKNOWLEDGEMENTS

The present study was made possible by an F.K.F.O. assistantship granted

to E.H.. We thank Erna Dams, Raymond De Baere, Lydia Hendriks, Hilde Van den

Eynde and Peter Willekens for their cooperation in the correction of the srRNA

computer files.

*To whom correspondence should be addressed

REFERENCES1. Torczynski, R.M., Fuke, M., Bollon, A.P. (1985) DNA 4, 283-291.2. Raynal, F., Michot, B., Bachellerie, J.-P. (1984) FEBS Lett. 167, 263-268.3. Torczynski, R., Bollon, A.P., Fuke, M. (1983) Nucl. Acids Res. 11, 4879-

4890.4. Chan, Y.-L., Gutell, R., Noller, H.F., Wool, I.G. (1984) J. Biol. Chem.

259, 224-230 (1984).5. Connaughton, J.F., Rairkar, A., Lockard, R.E., Kumar, A. (1984). Nucl.

Acids Res. 12, 4731-4745.6. Atmadja, J., Brimacombe, R., Maden, B.E.H. (1984) Nucl. Acids Res. 12,

2649-2667.7. Nelles, L., Fang, B.L., Volckaert, G., Vandenberghe, A., De Wachter, R.

(1984) Nucl. Acids Res. 12, 8749-8768.8. Hogan, J.J., Gutell, R.R., Noller, H.F. (1984) Biochemistry 23, 3322-3330.9. McCaroll, R., Olsen, G.J., Stahl, Y.D., Woese, C.R., Sogin, M.L. (1983)

Biochemistry 22, 5858-5868.10. Elwood, H.J., Olsen, G.J., Sogin, M.L. (1985) Mol. Biol. Evol. 2, 399-410.11. Same reference as 10.12. Spangler, E.A., Blackburn, E.H. (1983) J. Biol. Chem. 260, 6334-6340.13. Takaiwa, F., Oono, K., Sugiura, M. (1984) Nucl. Acids Res. 12, 5441-5448.14. Messing, J., Carlson,J., Hagen, G., Rubenstein, I., Olesen, A. (1984) DNA

3, 31-40.15. Eckenrode, V.K., Arnold, J., Meagher, R.B. (1985) J. Mol. Evol. 21, 259-

269.16. Hui, I., Dennis, P.P. (1985) J. Biol. Chem. 260, 899-906.17. Mankin, A.S., Kagramanova, V.K., Teterina, N.L., Rubtsov, R.M., Belova,

E.M., Kopylov, A.M. , Baratova, L.A., Bogdanov, A.A. (1985) Gene 37,181-189.

18. Gupta, R., Lanter, J.M., Woese, C.R. (1983) Science 221, 656-659.19. Leffers, H., Garrett, R.A. (1984) EMBO J. 3, 1613-1619.20. Yang, D., Kaine, B.P., Woese, C.R. (1985) System. Appl. Microbiol. 6, 251-

256.21. Jarsch, M., Bbck, A. (1985) System. Appl. Microbiol. 6, 54-59.22. Lechner, K., Wich, G., Bock, A. (1985) System. Appl. Microbiol. 6, 157-

163.23. Olsen, G.J., Pace, N.R., Nuell, M., Kaine, B.P., Gupta, R., Woese, C.R.

(1985) J. Mol. Evol. 22, 301-307.24. Leinfelder, W., Jarsch, M., B8ck, A. (1985) System. Appl. Microbiol. 6,

164-170.25. Yang, D., Oyaizu, Y., Oyaizu, H., Olsen, G.J., Woese, C.R. (1985) Proc.

Natl. Acad. Sci. USA 82, 4443-4447.26. Same reference as 25.27. Brosius, J., Dull, T., Sleeter, D.D., Noller, H. (1981) J. Mol. Biol. 148,

107-127.28. Carbon, P., Ebel, J.P., Ehresmann, C. (1981) Nucl. Acids Res. 9, 2325-

2333.

rl 1 6

Nucleic Acids Research

29. Green, C.S., Stewart, G.C., Hollis, M.D., Vold, B.S., Bott, K.F. (1985)Gene 37, 261-266.

30. Woese, C.R., Debrunner-Vosbrink, B.A., Oyaizu, H., Stackebrandt, E.,Ludwig, W. (1985) Science 229, 762-765.

31. Iwami, M., Muto, A., Yamao, F., Osawa, S. (1984) Mol. Gen. Genet. 196,317-322.

32. Frydenberg, J., Christiansen,C. (1985) DNA 4, 127-137.33. Weisburg, W.G., Oyaizu, Y., Oyaizu, H., Woese, C.R. (1985) J. Bacteriol.

146, 230-236.34. Same reference as 33.35. Oyaizu, H., Woese, C.R. (1985) System. Appl. Microbiol. 6, 257-263.36. Same reference as 35.37. Tomioka, N., Sugiura, M. (1983) Mol. Gen. Genet. 191,46-50.38. Graf, L., Roux, E., Stutz, E., Kossel, H. (1982) Nucl. Acids Res. 10,

6369-6381.39. Dron, M., Rahire, M., Rochaix, J.-D. (1982) Nucl. Acids Res. 10, 7609-

7620.40. Schwartz, Z., Kossel, H. (1980) Nature 238, 739-742.41. Tohdoh, N., Sugiura, M. (1982) Gene 17, 213-218.42. Eperon, I.C., Anderson, S., Nierlich, D.P. (1980) Nature 286, 460-467.43. van Etten, R.A., Walberg, M.W., Clayton, D.A. (1980) Cell 22, 157-170.44. Kobayashi, M., Seki, T., Yaginumi, K., Koike, K. (1981) Gene 16, 297-307.45. Anderson, S., de Bruijn, M.H.L., Coulson, A.R., Eperon, I.C., Sanger, F.,

Young, I.G. (1982) J. Mol. Biol. 156, 683-71746. Roe, B., Ma, D.P., Wilson, R.K., Wong, J.F.H. (1985) J. Biol. Chem. 260,

9759-9774.47. Clary, D.O., Wolstenholme, D.R. (1985) Nucl. Acids Res. 13, 4029-4045.48. K6chel, H.G., Kuntzel, H. (1981) Nucl. Acids Res. 9, 5689-5696.49. Li,M., Tzagoloff, A., Underbrink-Lyon, F., Martin, N.C. (1982) J. Biol.

Chem. 257, 5921-5928.50. Seilhamer, J.J., Olsen, G.J., Cummings, D.J. (1984) J. Biol. Chem. 259,

5167-5172.51. Same reference as 50.52. Sloof, P., Van den Burg, J., Voogd, A., Benne, R., Agostinelli, M., Borst,

P., Gutell, R. (1985) Nucl. Acids Res. 13, 4171-4190.53. Eperon, I.C., Janssen, J.W.G., Hoeijmakers, J.H.J., Borst, P. (1983) Nucl.

Acids Res. 11, 105-125.54. de la Cruz, V.F., Lake, J.A., Simpson, A.M., Simpson, L. (1985) Proc.

Natl. Acad. Sci. USA 82, 1401-1405.55. Chao, S., Sederoff, R., Levings, C.S. (1984) Nucl. Acids Res. 12, 6629-

6644.56. Spencer, D.F., Schnare, M.N., Gray, M.W. (1984) Proc. Natl. Acad. Sci. USA

81, 493-497.57. Brennicke, A., Moller, S., Blanz, P. (1985) Mol. Gen. Genet. 198, 404-410.58. Salim, M., Maden, B.E.H. (1981) Nature 291, 205-208.59. Rubtsov, P.M., Musakhanov, M.M., Zakhariev, V.M., Krayev, A.S., Skryabin,

K.G., Bayev, A.A. (1980) Nucl. Acids Res. 8, 5779-5794.60. Carbon, P., Ehresmann, C., Ehresmann, B., Ebel, J.P. (1978) FEBS Lett. 94,

152-156.61. Brosius, J., Palmer, M.L., Kennedy, P.J., Noller, H.F. (1978) Proc. Natl.

Acad. Sci. USA 75, 4801-4805.62. Sor, F., Fukuhara, H. (1980) C.R. Acad. Sci. Paris Ser. D 291, 933-936.63. Nomenclature committee of the International Union of Biochemistry

(1985) Eur. J. Biochem. 150, 1-5.64. Cornish-Bowden, A. (1985) Nucl. Acids Res. 13, 3021-3030.65. Woese, C.R., Gutell, R., Gupta, R., Noller, H.F. (1983) Microbiol. Rev.

47, 621-669.

r117

Nucleic Acids Research

66. Gutell, R.R., Weiser, B., Woese, C.R., Noller, H.F. (1985) Progr. Nucl.Acid Res. Mol. Biol. 32, 155-214.

67. Olsen, G.J., McCarroll, R., Sogin, M.L. (1983) Nucl. Acids Res. 11, 8037-8049.

68. Kaestner, A. (1969) Lehrbuch der Speziellen Zoologie, Gustaf FisherVerlag, Stuttgart.

69. Ainsworth, G.C., Sparrow, F.K., Sussman, A.S. (1973) The Fungi, an Ad-vanced Treatrise, Academic Press, New York.

70. Fritsch, F.E. (1971) The Structure and Reproduction of the Algae, Univer-sity Press, Cambridge.

71. Engler, A. (1954) Syllabus der Pflanzenfamilien, Gebrdder Borntraeger,Berlin.

72. Balch, W.E., Fox, G.E., Magrum, L.J., Woese, C.R., Wolfe, R.S. (1979)Microbiol. Rev. 43, 260-296.

73. Schnabel, R., Huet, J., Thomur, M., Zillig, W., Sentenac, A., Stetter,K.O. (1983) in Endocytobiology, vol. 2, pp. 895-912, E.A. Schenk, W.Schwemmler eds., De Gruyter, Berlin, New York.

74. Woese, C.R., Stackebrandt, E., Macke, T.J., Fox, G.E. (1985) System. Appl.Microbiol. 6, 143-151.

75. Erdmann, V.A., Wolters, J., Huysmans, E., De Wachter, R. (1985) Nucl.Acids Res. 13, rlO5-r153.

r118


Recommended