Introduction
The nucleolar biosynthesis of eukaryotic rRNAs consists of three major steps. First, RNA polymerase I synthesizes a large precursor RNA (pre‐rRNA), in which the 18S, 5.8S and 25/28S rRNAs are flanked and separated by external and internal spacer sequences (
Hadjiolov, 1985). After transcription, the rRNA regions of the pre‐rRNA undergo extensive covalent modifications. Many precisely selected nucleotides are methylated at the 2′‐
O‐hydroxyl position, and several uridine residues are converted into pseudouridine (
Maden, 1990;
Eichler and Craig, 1994). Finally, the mature‐sized rRNAs are nucleolytically processed from the modified pre‐rRNA (
Eichler and Craig, 1994;
Venema and Tollervey, 1995;
Sollner‐Webb et al., 1996).
In vertebrates, the mature 18S, 5.8S and 28S rRNAs contain >100 2′‐
O‐methyl groups and ∼95 pseudouridines (
Maden, 1990;
Ofengand et al., 1995;
Ofengand and Bakin, 1997). Although substantial evidence points to a catalytic role for rRNAs in protein synthesis (
Green and Noller, 1997;
Nitta et al., 1998;
Schimmel and Alexander, 1998), the function of modified nucleotides remains entirely speculative. Since pseudouridylation and ribose methylation sites cluster on the universally conserved functional centres of rRNAs and their positions show significant conservation during evolution (
Maden, 1990;
Ofengand and Bakin, 1997), we can anticipate that the modified nucleotides contribute to ribosome assembly or/and function (
Lane et al., 1995;
Ofengand et al., 1995).
In the nucleolus of eukaryotic cells, 2′‐
O‐methylation and pseudouridylation of pre‐rRNA is accomplished by a large number of different small ribonucleoprotein particles (snoRNPs) (
Smith and Steitz, 1997;
Tollervey and Kiss, 1997). Each snoRNP contains a specific small nucleolar RNA (snoRNA) and a set of associated snoRNP proteins. The 2′‐
O‐methylation and pseudouridylation guide RNAs possess distinctive structural elements and are associated with different sets of proteins (
Maxwell and Fournier, 1995;
Smith and Steitz, 1997;
Tollervey and Kiss, 1997). The methylation guide snoRNAs carry the conserved C, C′, D and D′ box elements that are essential for both the nucleolar accumulation and function of snoRNAs (
Caffarelli et al., 1996;
Cavaillé and Bachellerie, 1996;
Cavaillé et al., 1996;
Kiss‐László et al., 1996,
1998;
Watkins et al., 1996). The methylation guide snoRNAs select the target ribosomal nucleotides by forming a 10–21 bp Watson–Crick helix with rRNA sequences. This snoRNA–rRNA interaction, in conjunction with the D and C or D′ and C′ boxes of the snoRNA, provides the structural information for the methyltransferase activity to methylate the correct ribosomal nucleotide (
Cavaillé et al., 1996;
Kiss‐László et al., 1996,
1998;
Tycowski et al., 1996).
Information on the structural requirements for accumulation and function of pseudouridylation guide snoRNAs is much more limited. This group of snoRNAs share a common secondary structure that consists of two major hairpins connected by a hinge and followed by a short tail (
Ganot et al., 1997b). The single‐stranded hinge and tail regions contain the conserved H (consensus AnAnnA) and ACA box elements, respectively (
Balakin et al., 1996;
Ganot et al., 1997b). In vertebrates, the box H/ACA snoRNAs, similar to the box C/D methylation guide snoRNAs, are processed from introns of pre‐mRNAs, whereas in yeast, most H/ACA snoRNAs are transcribed from their independent genes by RNA polymerase II (pol II) or, less frequently, are processed from pre‐mRNA introns or polycistronic pre‐snoRNA transcripts (
Maxwell and Fournier; 1995;
Balakin et al., 1996;
Tollervey and Kiss, 1997). Previous studies suggested that the human intron‐encoded U17 and U19 box H/ACA snoRNAs are processed from the removed and debranched host introns by 5′→3′ and 3′→5′ exonucleolytic activities (
Cecconi et al., 1995;
Kiss and Filipowicz, 1995;
Kiss et al., 1996). Supporting this view, recent studies on the biogenesis of yeast box C/D snoRNAs demonstrated that debranching of host intron lariats by the Dbr1p RNA debranching enzyme (
Ooi et al., 1998;
Petfalski et al., 1998) or endonucleolytic cleavages of polycistronic pre‐snoRNAs by endonuclease III (
Chanfreau et al., 1998) are essential for snoRNA production. These cleavage reactions provide the entry sites for exonucleolytic activities responsible for processing of mature snoRNAs (
Petfalski et al., 1998; C.Allmang, P.Mitchell, E.Petfalski and D.Tollervey, personal communication;
Qu et al., 1999).
The ACA box motif that is located three nucleotides from the 3′ end of box H/ACA snoRNAs, together with the adjacent 3′‐terminal stem, plays an essential role in the processing and/or stability of the RNA. This structural motif determines the correct 3′ terminus of the RNA, most likely by binding snoRNP proteins and thereby protecting the snoRNA sequences from the processing exonucleases (
Balakin et al., 1996;
Ganot et al., 1997b). The H box was proposed to contribute to the 5′ end formation of box H/ACA snoRNAs (
Ganot et al., 1997b). Consistent with this, it is required for the accumulation of at least the intron‐encoded members of H/ACA snoRNAs.
The pseudouridylation guide snoRNAs select the substrate uridines by forming two short base‐pairing interactions with rRNA sequences that flank the target uridine (
Ganot et al., 1997a). The two rRNA recognition motifs occupy the opposite strands of an internal loop, termed the pseudouridylation pocket, which is located in the 5′ and/or 3′ hairpin domain of the snoRNA. In this study, the essential elements for accumulation and function of pseudouridylation guide snoRNAs have been analysed by expressing various mutant yeast snR5, snR36 and human U65 snoRNAs in yeast cells. Our results demonstrate that the pseudouridylation guide snoRNAs, in marked contrast to the methylation guide snoRNAs, have to meet very strict and complex structural requirements to ensure efficient snoRNA accumulation and guide RNA function.
Discussion
The nucleolar maturation of eukaryotic rRNAs is assisted by an unexpectedly complex population of snoRNAs (
Smith and Steitz, 1997;
Tollervey and Kiss, 1997). While a few snoRNAs are required for the nucleolytic formation of mature‐sized rRNAs (
Maxwell and Fournier, 1995;
Sollner‐Webb et al., 1996), most of them direct the site‐specific 2′‐
O‐ribose methylation (reviewed in
Maden, 1996;
Peculis and Mount, 1996;
Tollervey, 1996;
Bachellerie and Cavaillé, 1997;
Tollervey and Kiss, 1997) or pseudouridylation (reviewed by
Maden, 1997,
Peculis, 1997;
Smith and Steitz, 1997) of rRNAs. We report here a comprehensive analysis of the structural elements essential for accumulation and function of rRNA pseudouridylation guide snoRNAs.
The pseudouridylation guide snoRNAs feature a highly conserved ‘hairpin–hinge–hairpin–tail’ secondary structure with two conserved sequence motifs, the H and ACA boxes. Previous works (
Balakin et al., 1996;
Ganot et al., 1997b) together with this study (
Figure 1A) demonstrate that the box H and ACA motifs are absolutely required for accumulation of both mammalian and yeast H/ACA snoRNAs. These single‐stranded sequence motifs most likely represent protein‐binding signals that are recognized by snoRNP proteins common to this class of snoRNPs (
Henras et al., 1998;
Watkins et al., 1998). However, it seems very unlikely that either the H (consensus AnAnna) or the ACA (consensus AcA) motif alone could provide sufficient information for binding of snoRNP proteins. Supporting this assumption, the H and ACA boxes are always located in the close vicinity of the 5′‐ or 3′‐terminal helical stems, respectively, that are also required for snoRNA accumulation (
Balakin et al., 1996;
Figure 1B). Alteration of the distance between the ACA box and the 3′‐terminal stem interferes with the accumulation of yeast snR11 RNA (
Balakin et al., 1996). In this study, we demonstrate that the position of the H box relative to the 5′‐terminal stem determines the 5′ end of the intron‐encoded yeast snR36 (
Figure 2). These observations strongly support the notion that the H box together with the 5′‐terminal stem, and the ACA box in concert with the 3′‐terminal stem, constitute the recognition signals for snoRNP proteins. Most probably, snoRNP proteins associated with the 5′‐ and 3′‐terminal ‘stem–box’ structural motifs protect the snoRNA sequence from the processing exonucleases and, thereby, control the correct 5′ and 3′ end formation (
Balakin et al., 1996,
Ganot et al., 1997b).
In yeast, the majority of H/ACA snoRNAs are synthesized from independent transcription units by pol II. Selection of the transcription initiation site and the co‐transcriptionally added 5′ cap determines the 5′ terminus of these RNAs and, therefore, they undergo maturation only at their 3′ ends. The other group of H/ACA snoRNAs that are processed from intronic or polycistronic pre‐snoRNA transcripts undergo both 5′ and 3′ end maturation. Apparently, the basic structural requirements for accumulation, such as the presence of the 5′‐ and 3′‐terminal stems and the H/ACA boxes, are identical for both groups of snoRNAs. However, our results demonstrate that the steric structure of the 5′ and 3′ end‐processed snoRNAs has to conform to more rigorous requirements. When transcribed within the intron of the yeast actin pre‐mRNA, neither the yeast snR5 that is normally transcribed by pol II from its own gene (
Figure 1A) nor the human intron‐encoded E3, U17, U19, U64 and U65 (our unpublished results) snoRNAs accumulated in yeast cells. However, when expressed under the control of the
SNR5 promoter, all these snoRNAs accumulated efficiently (
Figures 1 and
3; data not shown). It is notable that the human U65, when it was synthesized by pol II and carried a 5′‐terminal m
3G cap, not only accumulated, but also directed the pseudouridylation of yeast rRNA (
Figure 3). Moreover, accumulation of a mutant version of the intron‐processed snR36 RNA carrying two unpaired nucleotides in its 5′‐terminal stem was rescued when it was transcribed from the
SNR5 promoter and possessed a m
3G cap (
Figure 1B). Collectively, our observations suggest that the 5′‐terminal cap structure, through stabilization of the snoRNA transcripts, contributes to the efficient accumulation of box H/ACA snoRNAs in yeast cells.
Thus far, four common snoRNP proteins, Gar1p, Nhp2p, Cbf5p and Nop10p, have been identified for yeast H/ACA snoRNPs (
Balakin et al., 1996;
Ganot et al., 1997b;
Henras et al., 1998;
Lafontaine et al., 1998;
Watkins et al., 1998). Since the Cbf5 protein shows striking structural similarities to known pseudouridine synthases (
Koonin, 1996;
Watkins et al., 1998), it is most probably the enzyme that is responsible for the synthesis of ribosomal pseudouridines (
Lafontaine et al., 1998). Therefore, each H/ACA snoRNP particle can be considered as a site‐specific pseudouridine synthase. While the Cbf5p provides the catalytic activity, the snoRNA component of the particle provides the specificity for the rRNA pseudouridylation reaction. Indeed, demonstration that novel pseudouridines can be introduced into the yeast 25S rRNA by manipulating the rRNA recognition motif of pseudouridylation guide snoRNAs (
Figure 8) proves that all the information necessary to select the correct pseudouridylation sites is carried by the RNA component of the snoRNP particle. These experiments also provide direct evidence that pseudouridylation guide snoRNAs select the target uridines by forming direct Watson–Crick base‐pairing interactions with the target rRNA sequences.
The two major structural domains of box H/ACA snoRNAs, the 5′hp followed by the H box and the 3′hp together with the ACA box, share striking structural and functional similarities. Pseudouridylation pockets are found equally frequently in the 5′hp and 3′hp and, even more tellingly, many snoRNAs carry pseudouridylation pockets in both the 5′hp and 3′hp domains (
Ganot et al., 1997a). It has been documented experimentally that the 5′ and 3′ pseudouridylation pockets of yeast snR5, snR34 and human U65 snoRNAs can direct pseudouridylation of rRNAs at two different positions (
Ganot et al., 1997a;
Figure 3). The H and the ACA boxes are located normally ∼14 nucleotides downstream of the catalytic centre of the corresponding pseudouridylation pocket in the 5′ or 3′ hairpin, respectively (
Ganot et al., 1997a;
Ni et al., 1997). Alteration of the wild‐type spacing between the ACA box and the 3′ catalytic centre of yeast snR8 impairs the efficiency and correctness of rRNA pseudouridylation directed by the 3′ pseudouridylation pocket of this snoRNA (
Ni et al., 1997). This shows that for selection of the correct ribosomal uridine, in addition to the snoRNA–rRNA base‐pairing interaction, the pseudouridine synthase activity also relies on the position of the ACA box relative to the catalytic centre of the snoRNA. Although not yet experimentally supported, it is easy to imagine that the H box that is located 14 nucleotides downstream from the 5′ pseudouridylation centre possesses a function analogous to that of the ACA box.
Demonstration that the yeast snR5, snR34 and the human U65 snoRNPs, and probably many others, possess two independent catalytic centres for rRNA pseudouridylation implies that these snoRNPs carry two copies of the Cbf5p pseudouridine synthase (
Ganot et al., 1997a;
Figure 3). Moreover, the notion that the 5′‐ and 3′‐terminal domains of these snoRNAs are functionally equivalent presupposes that they bind the same set of snoRNP proteins. This view was strongly supported by recent purification of yeast snR42 and snR30 box H/ACA snoRNPs (
Watkins et al., 1998). The isolated snoRNP core particles contained three common snoRNP proteins, the Gar1p, Nhp2p and Cbf5p. Electron microscopy revealed a highly symmetric bipartite structure for these complexes and, intriguingly, predicted a molecular mass that would be consistent with a particle consisting of a snoRNA and two copies of each of the Gar1, Nhp2 and Cbf5 proteins. The detailed architecture of box H/ACA snoRNPs remains to be understood. Since the Cbf5p lacks an apparent RNA‐binding motif, it seems unlikely that it would bind directly to the snoRNA. Another H/ACA snoRNP protein, the Nhp2p, would be a more likely candidate to bind to box H/ACA snoRNAs, since it contains an RNA‐binding motif also present in ribosomal proteins (
Koonin et al., 1994;
Watkins et al., 1998). The Gar1 snoRNP protein, although it has been reported to interact
in vitro with snR10 and snR30 snoRNAs (
Bagni and Lapeyre, 1998), seems to bind to the snoRNP particle through interaction with the Cbf5p (
Henras et al., 1998).
The 5′hp and 3′hp domains of box H/ACA snoRNPs act apparently in a highly co‐operative manner. Destruction of any of the functionally essential elements—the H or the ACA box and the helical stems bracketing the pseudouridylation pockets either in the 5′ or 3′ hairpin—impeded rRNA pseudouridylation mediated by both the 5′‐ and 3′‐terminal pseudouridylation centres (
Figures 6 and
7). We envisage that to construct a functional snoRNP complex, a direct or perhaps an adaptor protein‐mediated interaction is required between the two sets of snoRNP proteins which are bound to the 5′ or the 3′ hairpin domain of the snoRNA. Of course, this model would also explain why H/ACA snoRNAs that possess only one functional pseudouridylation pocket still contain two hairpin domains. The hairpin element that lacks a pseudouridylation pocket is required to provide scaffolding for snoRNP proteins to construct the functionally active bipartite structure of the snoRNP.
Similarly to pseudouridylaton guide snoRNAs, many 2′‐
O‐methylation guide snoRNAs feature two rRNA methylation centres (
Kiss‐László et al., 1998). The two rRNA methylation domains consist of an rRNA complementary sequence that is followed by either the C/D or C′/D′ box motifs. Interestingly, this bipartite structural organization is preserved even in those methylation guide snoRNAs which do not contain an RNA recognition motif next to the internal C′/D′ boxes (
Kiss‐Lászó et al., 1998). At present, the significance of these intriguing structural and functional parallels drawn between the rRNA methylation and pseudouridylation guide snoRNAs is nebulous. However, it might underscore further the notion that the two major classes of eukaryotic snoRNAs evolved from a common ancestor molecule (
Ganot et al., 1997b). In the future, an understanding of the molecular mechanisms of the snoRNA‐directed rRNA modification reactions will provide us with more insights into the complex world of small nuclear RNAs and may also facilitate the understanding of other RNA‐guided processes, such as certain RNA editing mechanisms.