- Open Access
Now on display: a gallery of group II intron structures at different stages of catalysis
Mobile DNAvolume 4, Article number: 14 (2013)
Group II introns are mobile genetic elements that self-splice and retrotranspose into DNA and RNA. They are considered evolutionary ancestors of the spliceosome, the ribonucleoprotein complex essential for pre-mRNA processing in higher eukaryotes. Over a 20-year period, group II introns have been characterized first genetically, then biochemically, and finally by means of X-ray crystallography. To date, 17 crystal structures of a group II intron are available, representing five different stages of the splicing cycle. This review provides a framework for classifying and understanding these new structures in the context of the splicing cycle. Structural and functional implications for the spliceosome are also discussed.
Group II introns are mobile ribozymes capable of self-splicing and retrotransposition . As retrotransposable elements, group II introns have invaded the genomes of most life forms and enhanced genomic diversity in all domains of life. In this way, they have played a crucial role in the evolution of modern organisms [2, 3]. At the present time, they remain important in archaea, bacteria, and unicellular and multicellular eukaryotes because they ensure the correct expression of certain housekeeping genes and because they hinder the distribution of other harmful mobile genetic elements [4, 5]. Of particular interest to the field of RNA processing, group II introns are considered evolutionary ancestors of the spliceosome, which is the ribonucleoprotein complex essential for pre-mRNA processing in higher eukaryotes, including humans [6–8]. Finally, group II introns are potentially useful medical tools, because they can be artificially reprogrammed to insert into desired DNA or RNA sites [9–11]. Consequently, they are macromolecules of great microbiological, biotechnological and pharmacological interest.
Group II introns catalyze splicing in a series of SN2 reactions (Figure 1). Briefly, in the first splicing step, a water molecule or the 2′-OH group of a bulged adenosine in D6 attacks the 5′-splice junction, forming an intron/3′-exon intermediate. After the first splicing step, the intron is believed to rearrange and prepare for the second splicing step . During this final step, the 5′-exon performs a nucleophilic addition to the 3′-splice junction, releasing ligated exons and the excised intron in a linear or lariat form. Finally, the lifecycle of a group II intron can also include reverse splicing of the excised intron into target positions within genomic DNA of the host organism, along with retrotranscription via an intron-encoded maturase, culminating in a process known as retrohoming or retrotransposition. At a molecular level, the reverse splicing reaction involves the same target recognition elements and proceeds with the same stereochemistry as the so-called spliced-exon reopening (SER) reaction, by which the free intron specifically recognizes and cleaves the ligated exons in vitro[13–15]. Therefore, SER is considered a biochemical mimic of retrotransposition.
The functionality of group II introns is mediated primarily by their intricate and stable three-dimensional structure. Historically, the structure of group II introns was elucidated throughout a 20-year long, stepwise process. Initially, phylogenetic studies showed that, despite their relatively poor sequence conservation, all group II introns share a common secondary structure and are composed of six domains (D1 through D6, Figure 2) [16–20]. Three major classes of group II introns have been identified and designated IIA, IIB and IIC. The group IIA and IIB classes are approximately 900 nt long and are found in bacteria, archaea, mitochondria and chloroplasts, while the introns belonging to the group IIC class are shorter (approximately 400 nt) and they are present exclusively in prokaryotes, representing the most primitive lineage of group II intron ribozymes . More recent work has indicated that additional families of group II introns exist, and as new sequences are discovered, useful new classifications are being developed . Over time, a series of biochemical experiments performed primarily on the group IIB ai5γ intron from yeast mitochondria (reviewed in ), on the group IIA and IIB introns from the brown alga Pylaiella littoralis, and on the group IIA intron Ll.LtrB from Lactococcus lactis led to the definition of tertiary contacts and to the design of tertiary structure maps [23–25], which provided a concrete understanding of functional intron architecture. Ultimately, a breakthrough in understanding group II intron structure-function relationships was made possible by a crystal structure of the self-spliced form of the group IIC intron from Oceanobacillus iheyensis (O.i.) . The crystal structure demonstrated how D1 of the intron forms a compact scaffold, which encloses the other intron domains, and presents the exon recognition elements (exon binding sites, EBSs). By contrast, D2 and D4 project away from the intron core, enabling them to encode sequence insertions and open reading frames. D3 acts as an interaction hub  further stabilizing the structure thanks to its characteristic internal loop and conserved S-turn. Most importantly, the highly conserved D5 forms the active site, where the catalytic triad (C358-G359-C360, the numbering is for the O.i. group II intron), the two-nucleotide bulge (A376-C377), and the J2/3 junction (A287-G288-C289) join into a major-groove triple helix. Only D6, which contains the branch-site adenosine (A406) and which connects to the 3′-splice site, could not be visualized crystallographically because of its intrinsic flexibility [21, 28].
A detailed description of the structural features specific to each domain, and of tertiary interactions amongst the domains, has already been reported . However, a wealth of new structural information about group II introns has recently become available through a series of new crystallographic studies [29–31]. For the first time, these structures depict the intron at different stages of the splicing cycle (Figure 1, Table 1), revealing the positions and the roles of critical functional elements, including reactants and substrates before and after catalysis, and in multiple alternative conformations. Moreover, some of the new crystal structures also define the position and identity of key metal ions, demonstrating how different types of metals stabilize the intron structure and participate in catalysis .
The purpose of this review is to provide a framework for classifying these new structures and understanding them in the context of the splicing cycle. After providing a brief summary of all available 3-D structures of the group II introns, the catalytic cycle will be outlined in a step-by-step fashion. Each catalytic event will be presented in a way that highlights structural details while describing the experimental strategy used to capture each state crystallographically. Finally, the implications of all group II intron structures for interpreting spliceosomal function will also be discussed.
Overview of available group II intron structures
Five different constructs have been used for crystallizing the group II introns to date. They all correspond to the Oceanobacillus iheyensis group II intron. Its wild type sequence was initially modified by adding a GAAA tetraloop to the terminus of the D2 stem, by inserting an RNA hairpin in place of D4, by truncating the D6 stem to about half its length, and by supplying native exons at the 5′- and 3′-ends . These modifications resulted in the construct named here OiD1-6. From OiD1-6, two other constructs were derived, specifically by mutating the catalytic residue G359 to adenosine (construct OiD1-6-G359A, ), or by removing D6 and the flanking exons (construct OiD1-5, ). Finally, from OiD1-5 the construct Oi5eD1-5 was obtained by adding the short 5′-exon sequence UUAU at the 5′-end, and the construct OiD1-5-C377G was obtained by a point mutation at the catalytic position 377 .
Using these five constructs, 17 different structures of the O.i. group II intron have been published [26, 27, 29–31] (Figure 1, Table 1). All of these structures are highly isomorphous to each other, with pairwise root-mean-square deviation (RMSD) values in the range of 0.6 Å to 1.5 Å. Their high similarity shows that the overall intron scaffold does not undergo major structural changes during the splicing cycle. However, the active site elements do show distinctive features in each structure and five different stages of forward and reverse splicing can be discerned.
Two structures describe conformational rearrangements that occur between the first and second splicing steps. These are 4FAR (2.86 Å resolution) and 4FAU (2.87 Å resolution) .
The postcatalytic state of the intron is represented by structure 3IGI (3.13 Å resolution) .
Seven structures reflect the ligand-free, linear form of the intron. These mimic the state of the ribozyme that is released after exon ligation, and were obtained using construct OiD1-5 crystallized in the presence of different metal ions: K+/Mg2+ (4E8M, 3.50 Å resolution), Rb+/Mg2+ (4E8P, 3.28 Å resolution), Tl+/Mg2+ (4E8Q, 2.84 Å resolution), Cs+/Mg2+ (4E8R, 3.36 Å resolution), NH4 +/Mg2+ (4E8N, 2.96 Å resolution), Na+/Mg2+ (4FAX, 3.10 Å resolution), and K+/Ba2+ (4E8V, 3.99 Å resolution) . A ligand-free form was also obtained for the functionally impaired C377G mutant (4FB0, 3.22 Å resolution). Most of the ligand-free structures represent active (K+/Mg2+, Rb+/Mg2+, Tl+/Mg2+, NH4 +/Mg2+) or partially active (Cs+/Mg2+) states that mimic the retrotransposable form of the intron before it binds target substrates .
Four structures correspond to the retrotransposable form of the intron after target substrate binding. These structures were obtained by crystallizing the spliced (OiD1-6) or ligand-free (OiD1-5) intron with oligonucleotides that mimic ligated exons. They are 3EOG (3.39 Å resolution) , 4E8K (3.03 Å resolution) , 4E8T (3.34 Å resolution) , and 4FAW (2.70 Å resolution), respectively .
The precatalytic state
Upon transcription, the O.i. group II intron folds spontaneously into a stable tertiary structure, forming a ribozyme that is highly reactive in the presence of Mg2+. Therefore, to trap the intron in its precatalytic state crystallographically (Figure 1A1), it was necessary to deactivate the intron and prevent hydrolysis at the 5′-splice site. Two different inactivation methods have been used, namely site-directed mutagenesis  and metal-ion replacement .
The first approach (structure 4DS6) involves mutation of an invariant residue (G359) belonging to the catalytic triad motif in D5 [32–36]. Since G359 is part of a helix, in which it forms a G•U wobble pair with the partner strand, adenosine was chosen to substitute guanosine and form an A-U pair. Considering that the atoms shaping the intron active site are primarily backbone oxygen atoms, the G359A mutation was expected to cause only minimal modification of the RNA structure . Indeed, in comparison to the wild-type intron, the structural perturbation in the mutant is very limited (overall RMSD = 1.2 Å). As expected, the mutation permits visualization of the 5′-splice junction. Constrained by the tight base pairing of the 5′-exon to EBS1, the junction adopts a sharp kink and forms an unusually small angle of approximately 50° between the two phosphate groups that flank the scissile phosphate . Surprisingly, however, the perturbation of the active site induced by the G359A mutation was sufficient to prevent binding of catalytic metals, which explains why activity is abolished almost completely . The cause of this loss of metal ion binding was explained by later studies, which elucidated the network of interactions that anchor metals in the core .
The second approach for trapping the precatalytic state (structure 4FAQ) involved the use of Ca2+, a structural but nonfunctional analogue of Mg2+. Ca2+ has long been known to act as an inhibitor of Mg2+-dependent enzymes  and it is also known to inhibit group II introns . Ca2+ possesses a bigger ionic radius relative to Mg2+, and it does not facilitate the formation of the trigonal bipyramidal transition state at phosphorus that is typical for enzymes that catalyze phosphodiesterase SN2 reactions [39–42]. Although its physical-chemical properties are different from those of Mg2+ – Ca2+-bound structures must be interpreted with caution – several informative structures of endonucleases were solved in their precatalytic state by replacing Mg2+ with Ca2+[42–44]. Under these conditions, the overall intron and the geometry of its active site are not significantly affected (overall RMSD = 0.84 Å between structure 4FAR obtained in the presence of Mg2+ and structure 4FAQ obtained with Ca2+). Therefore, the Ca2+-bound structures opened up the way for visualizing all reactants in place for catalysis, including the metal center, the splice junction, the catalytic triple helix and the nucleophilic water molecule (Figure 3).
Taken together, the structures of the precatalytic state establish how the intron mediates two essential attributes of splicing, namely efficiency and fidelity, using the EBSs and a four-metal heteronuclear center.
Splicing efficiency is tightly linked to the architectural organization of the metals in the active site. Four metals have been shown to be involved in catalysis . Two (M1-M2) are obligate divalent ions occupied by Mg2+in vivo, while the other two (K1-K2) are monovalent ions, likely occupied by K+in vivo. Furthermore, M1-M2-K1 are interconnected by single oxygen atoms and therefore they form a bona fide KMgO metal cluster . These ions are interconnected by three hexagonal rings of interatomic bonds, as in other organic clusters involving phosphorus (III) and phosphorus (V) oxides, but possessing 13 vertices (Figure 4, ). The formation of such a cluster results in a specific and highly constrained local architecture. The interconnection between the metals explains why the entire metal center is so easily disrupted when the active site residues adopt a conformation that shifts the position of the metal ion ligands, and which differs from the catalytic triple helix (vide infra). At the same time, the apparent rigidity of the properly assembled cluster mediates tight binding of the metals to the active site even in the absence of ligands (vide infra), a property that makes group II introns efficient mobile genetic elements.
By contrast, splicing fidelity is linked to the appropriate pairing of EBS-intron binding site (IBS) elements. Structure 4DS6 shows that the formation of the EBS1-IBS1 interaction is sufficient to place the 5′-splice junction correctly in the active site even if other elements, including the metal cluster, are not well positioned. Also the intron structures solved using OiD1-5 in a ligand-free state (vide infra) provide an illustrative example of how splicing fidelity is achieved. Specifically, OiD1-5 possesses a short poly-G sequence (GGG) at its 5′-end, and this fails to interact with the EBS1 site. This sequence is artificially inserted immediately downstream of the T7 promoter in order to enhance the yield of in vitro transcription by T7 RNA polymerase [46–48]. Because the GGG sequence is different from that of the native 5′-exon (UUAU) and therefore does not possess any complementarity to EBS1 (AUAA, Figure 2), the 5′-splice junction in those structures is flexible and completely excluded from the active site, even when the catalytic metal center is intact . Thus, EBS1 is highly specific in choosing its partner nucleotides at the 5′-splice site, as supported also by biochemical evidence .
Putative position of the branching nucleotide
No crystallographic data is available to describe the position of the 2′-OH group of the branching residue involved in splicing by transesterification (Figure 1A2). However, its position can be inferred based on the identification of the nucleophile in the structure that describes the hydrolytic reaction (4FAQ) . Certainly, predicting the correct position of this branching residue in the absence of experimental data is difficult because the nucleophilic adenosine and D6 form few interactions with the rest of the intron . It is known that the branching nucleotide needs to be an adenosine to achieve maximal splicing efficiency, but this residue does not control the fidelity of the reaction and other nucleotides are also compatible with branching albeit with low efficiency . Indeed, in the spliceosome, the splicing machinery corresponding to group II introns in eukaryotes, the branch site has been studied extensively and it has been shown that the precise location of the branch site is not always tightly fixed [52, 53]. Additionally, the branch site nucleophile is usually bulged or dynamic within the D6 stem but even this is not an absolutely conserved requirement [51, 54, 55]. However, despite these uncertainties, it is possible to model the position of D6 using the steric constraints imposed by the other active site elements and by the geometrical requirements of the SN2 reaction that is typical of group II intron splicing (Figure 5). These models show that a limited number of conformers are sterically allowed in which the trigonal bipyramidal geometry is maintained.
A conformational transition to the second splicing step
After the first splicing step, the intron active site becomes rearranged before performing the second transesterification reaction. Specifically, D5 is known to become rearranged, thanks to the flexibility of its two-nucleotide bulge motif [12, 56], while D6 toggles between an active state coordinated to the κ-coordination loop or the D1C helix, and a silent state forming the η-η' interaction with D2 [21, 28, 57]. However, biochemical experiments such as crosslinking studies  and all available crystal structures suggest that a group II intron possesses only one catalytic site for both the first and second splicing steps [12, 24, 58].
Given these observations, one might assume that the reactants for the second splicing step, which remains crystallographically uncharacterized, are already aligned correctly for catalysis in the precatalytic state. However, this is not the case, as long-range interactions involving the second splicing step reactants have been shown to form only between the first and the second splicing steps, or to affect selectively the second and not the first splicing step (that is, the γ-γ' interaction, the interaction between the first and the penultimate intron nucleotides, the IBS3-EBS3 interaction and the η-η' interaction [59–61]). Furthermore, in the structures, the nucleophile of the first splicing step is located near the EBS3 site, in the identical position that must be occupied by the 3′-splice junction during the second splicing step . Therefore, there is also a structural incompatibility that prohibits accommodation of all reactants in the same active site at once. Consequently, a rearrangement of the active site between the splicing steps is likely to happen.
In light of recent structures, more detailed hypotheses about such a rearrangement can be proposed. The structures suggest two types of conformational rearrangements, one involving a movement of the hydrolyzed scissile phosphate (Figure 1A1), the other a movement of the J2/3 junction and the two-nucleotide bulge (Figure 1B). The first conformational rearrangement, which directly follows 5′-exon cleavage, was visualized by crystallizing Oi5eD1-5 in the presence of the physiological, catalytically functional ions Mg2+ and K+ (structure 4FAR, reference  and Figures 1 and S1 therein). Upon hydrolysis, which occurs during the crystallization process, the 5′-exon maintains coordination to M1 through its 3′-OH group and is not displaced significantly from its binding site, as expected since the 5′-exon is the nucleophile of the second splicing step. Instead, hydrolysis induces relaxation of the kinked RNA backbone at the 5′-splice junction and the hydrolyzed scissile phosphate is released from the active site. Specifically, the free phosphate is displaced by about 4 Å, where it interacts directly with the K2 site, which evidently plays a direct role in organizing, and potentially liberating, splicing products. The second conformational rearrangement was visualized in a structure of Oi5eD1-5 solved in the presence of Li+/Mg2+ (4FAU) . In this structure, the 5′-exon has undergone hydrolysis and one observes an equilibrium between two conformations in the active site: the catalytic triple helix conformation and an inactive toggled conformation. The conformational change involves two residues in the J2/3 junction (G288-C289) and one residue in the two-nucleotide bulge (C377, D5), all known to be dynamic elements of group II introns [12, 58]. In the inactive toggled conformation, which is visualized most clearly when the intron is crystallized in a buffer of Na+/Mg2+ (structure 4FAX, see reference  and Figure 4 therein), G288 rotates by about 90° around an axis connecting its C5′ and C3′ backbone atoms, while the cytosine moiety of C377 rotates by about 70° around the glycosidic bond. Both residues in the inactive toggled conformation are stabilized by a new network of interactions. Among these, two involve the 2′-OH groups of both residues, which do not form any interactions in the triple helix conformation typical of the precatalytic state. These interactions are particularly interesting because the two hydroxyl groups had previously been demonstrated to be important in catalysis using biochemical methods, but their role was unclear until now [32, 34]. In addition to disrupting the triple helix, the conformational rearrangement also moves the RNA ligands that are essential for anchoring the M1-M2-K1-K2 metal center. This causes the interactions between catalytic ions and the 5′-splice junction to be broken and facilitates the release of the latter.
In summary, therefore, a concerted conformational rearrangement appears likely to promote the transition to the second splicing step. Considering the central role of the residues involved in the rearrangement, we cannot exclude the notion that the inactive toggled intron conformation might also occur at other points of the splicing cycle, and we would like to suggest two scenarios to support this hypothesis. First, the inactive toggled conformation may represent an intermediate conformation that occurs while the intron folds into its active, precatalytic state. This hypothesis is supported by the fact that a mutant designed to stabilize the inactive toggled conformation (C377G) shows a tenfold reduction in the rate of the first splicing step in addition to its pronounced defect in the second splicing step (see reference  and Figure S5 therein). Second, the opening of the triple helix and the consequent disruption of the active site metal cluster may be important for successfully terminating the splicing cycle, when the ligated exons must be released from the active site to form a free intron. The inactive toggled conformation would prevent ligated exons from being rehydrolyzed through SER, which is a prevalent in vitro side-reaction that represents a major problem for productive splicing in vivo.
Second splicing step
The second splicing step remains an important area for future structural studies, as it has not been fully elucidated by existing structures. Two sets of structures would be required to describe its mechanism at a molecular level, namely the structure of the state preceding cleavage of the 3′-splice junction and the structure of the postcatalytic state. While the latter can be represented by structure 3IGI (Figure 1C), which corresponds to the postcatalytic linear intron harboring products of the splicing reaction in its active site [26, 27]; the former structure is not yet available and can only be deduced from modeling exercises (Figure 1C).
Specifically, modeling the geometry of the 3′-splice junction before cleavage can be done on the basis of the following considerations. First, the position of the 3′-OH group of the 5′-exon, which acts as the nucleophile on the 3′-splice junction, can be derived from structures 4FAR and 4FAU (see above and ). These structures show that, after the first splicing step, the 5′-exon does not change its position within the active site and that it remains bound to the EBS1 site. Second, the position of the catalytic metal center can be deduced from the structures of the postcatalytic states of the intron (3IGI, 3EOG, 4E8K, 4E8T and 4FAW [26, 30, 31] and vide infra). These structures show that, after catalysis, the metals occupy identical positions as in the precatalytic state (see above). Therefore, it can be expected that in the second splicing step the metal center reassembles in the same conformation as in the first splicing step, after being transiently disrupted by the swinging and toggling mechanism described above . Third, the structure of three residues around the 3′-splice junction (penultimate and last intron nucleotides and first exon nucleotide) can be modeled de novo, based on the known positions of other intron residues with which they engage specific tertiary interactions previously identified by biochemical experiments [60–62]. The penultimate intron nucleotide engages in an interaction with G1 , whose position can be derived from structure 4FAR. The last intron nucleotide forms the γ-γ' interaction  with A287 (J2/3 junction), whose position is determined by structures 4DS6, 4FAQ, 4FAR, 4FAU, 4E8M, 4E8P, 4E8R, 4E8Q, 4E8N, 4E8V, 4E8V, 4FAX, 4FB0, 4E8K, 4E8T and 4FAW. Finally, the first exon nucleotide (IBS3 site) base-pairs with residue A223 (EBS3) , and the structure of this IBS3-EBS3 interaction can be derived from structures 4E8K and 4E8T. Finally, the model of the 3′-splice junction must also consider that the scissile phosphate prefers to adopt an Rp stereochemical configuration before nucleophilic attack, as determined by phosphorothioate substitutions . Based on these structural and biochemical constraints, we modeled the reactants of the second splicing step. Here we present two possible models, both compatible with available biochemical data and possessing favorable structural geometry. In the first case, which has already been proposed , the 3′-splice junction is modeled in a kinked conformation. In the other case, the junction adopts an extended conformation instead (Figure 6).
Postcatalytic ligand-free state
Upon completion of the splicing reaction, the ligated exons are released from the active site and the free intron is liberated in a linear or lariat form. While the structure of the lariat form is not yet available, many structures have been obtained for the linear form (4E8M, 4E8P, 4E8R, 4E8Q, 4E8N, 4E8V, 4FAX and 4FB0; see Figure 1D) .
To obtain the structures of the excised intron in a ligand-free state (that is, without any bound exons or ligated exons), it was necessary to prevent co-crystallization of exon-like fragments deriving from the splicing reaction and from self-degradation of the intron . To this end, we utilized construct OiD1-5, which folds spontaneously during transcription in vitro, and adopts a homogeneous, active conformation after purification, yielding a free, multiple-turnover ribozyme that is a good mimic for the postcatalytic state of the intron . The ligand-free intron structures are nearly identical to the available ligand-bound ones, which is a fairly typical case for protein enzymes and ribozymes that catalyze two-metal ion phosphodiester cleavage reactions . All residues are visible in the electron density and only the EBS1 site is slightly disordered, as expected given the absence of base-pairing with a corresponding IBS1 sequence. Despite their overall similarity to the ligand-bound states of the intron, the ligand-free intron structures show remarkable features, particularly in terms of the catalytic metal ions.
First, the ligand-free structures show that, even in the absence of K+, monovalent ions like Tl+, Rb+, Cs+, Na+ and NH4+, and divalent ions like Ba2+ can support the correct folding of the intron scaffold. Therefore, these structures unambiguously reveal the identity of numerous important metal binding sites. These observations demonstrate a remarkable adaptability of group II introns, and potentially other large RNA molecules, to different metal ions. This is important given that metal ions are very useful tools for studying large RNAs, not only crystallographically [30, 64], but also spectroscopically [65, 66] and biochemically .
Second, the ligand-free structures show that the catalytic metal center M1-M2-K1-K2 is correctly bound within the active site when the intron is crystallized in the presence of physiological ions (Mg2+/K+), or any other ions that support chemical catalysis. This observation is surprising considering that the metals – in particular M1 and M2 – are less tightly coordinated and more exposed to the solvent in the absence of the exons. Indeed, in the ligand-free structures M1-M2 are bridged by a water molecule that occupies the position of the scissile phosphate oxygen . This water molecule is therefore likely to represent an important element in the ligand-free active site, because it completes the KMgO cluster. The integrity of the active site in the ligand-free intron supports the observation that this ribozyme is a highly efficient retrotransposable element.
SER and retrotransposition
The structure of the empty, ligand-free intron sets the stage for understanding the mechanism of its retrotransposition into genomic DNA or into RNA (Figure 1E) . The first retrotransposition step (which is a reverse-splicing reaction) is thought to be approximated in vitro by the spliced-exon reopening reaction, where ligated exons are bound and then attacked by the free intron, because the chemistry of the two reactions is known to be identical [13–15]. Both the precatalytic and the postcatalytic states of the SER reaction have now been characterized crystallographically using RNA substrates (structures 3EOG, 4E8K, 4E8T and 4FAW [30, 31]).
The precatalytic state of SER was first visualized in 2008, when a self-spliced intron was co-crystallized with an oligonucleotide mimicking ligated exons (structure 3EOG) . In another approach to visualizing the precatalytic state of SER, construct OiD1-5 was co-crystallized in the presence of Ca2+ with an oligonucleotide that corresponds to the sequence of native ligated exons (structures 4E8K and 4E8T) . These latter structures revealed the presence of an intact active site, whose geometry is highly reminiscent of that of the precatalytic state preceding 5′-exon hydrolysis. The scissile phosphate of the substrate is located between the M1 and M2 sites, presenting the pro-S oxygen atom at approximately 2 Å from each of the two metals. The stereochemistry of the scissile phosphate in the structure is thus in perfect agreement with previous biochemical experiments that had predicted a preference for the pro-S configuration on the basis of phosphorothioate substitutions . Moreover, the 5′-exon portion of the oligonucleotide tightly binds to the EBS1 site, while the 3′-exon nucleotide shows a well-defined Watson–Crick base-pairing only for the uridine at the scissile position (IBS3) with the corresponding EBS3 adenosine. M1 coordinates to the leaving group (the 3′-OH of the nucleotide in 5′ to the scissile phosphate), while M2 coordinates to the scissile phosphate oxygen, in agreement with the two-metal ion mechanistic hypothesis . By contrast, the structure of the post-hydrolytic state of SER was obtained using the OiD1-5 construct, bound to the same oligonucleotide used for solving 4E8K and 4E8T, but co-crystallized in the presence of physiological ions Mg2+ and K+ (structure 4FAW) . This structure currently represents the structure of the intron at the highest resolution ever achieved (2.7 Å) and so far the highest resolution structure of a noncoding RNA longer than 200 nucleotides, except for the ribosomal subunits. In this structure, the 5′-exon portion of the oligonucleotide is visible in the electron density, as it forms base pairs with the EBS1 binding site in the same position as in the pre-hydrolytic state. By contrast, the 3′-end has been released and, as happens for ligand-free structures, the KMgO cluster is completed by a water molecule bound between M1 and M2.
The structures of the IBS-EBS interactions and of the metal center of the SER reaction are particularly significant, because they help in understanding the mechanism of the second splicing step, as discussed above. Furthermore, a solvent molecule coordinated by C358 in the catalytic triad and by M2 can also be identified in the precatalytic state (structures 4E8K and 4E8T) at about 3.2 Å from the scissile phosphate, in a direct line with the scissile P-O bond . This positioning, which is identical to that of the nucleophile for the first splicing step, suggests that this solvent molecule likely represents the reaction nucleophile of the SER reaction. Therefore, it represents the most likely location occupied by the nucleophile of the first reverse-splicing step, namely the 3′-OH group of the last intron nucleotide. These observations further corroborate the hypothesis of a single major active site for group II introns  and shed light on the molecular mechanism of the retrotransposition event. Certainly, to obtain a more complete visualization of the reverse-splicing reaction, it will be necessary to crystallize the intron in complex with DNA substrates.
Implications for the spliceosome
Besides revealing the molecular mechanism of different stages of the intron splicing cycle, the structures described so far also provide new evidence to support the idea that group II introns may be functionally and structurally related to the spliceosome [6–8]. Therefore, we will briefly discuss how the intron structures contribute to a deeper understanding of spliceosomal architecture and function.
Group II introns and the spliceosome have many strong analogies. Sequence conservation analyses revealed precise correspondence of active site motifs in the two systems . Specifically, the catalytic triad is well conserved within intron D5 and in the spliceosomal snRNA subunit U6 , the J2/3 junction (intron D2-3) corresponds to residues in the conserved spliceosomal ACAGAGA box (U6) , and the two-nucleotide bulge motif (intron D5) is likely to correspond to bulged residues either in the internal stem loop of U6 (U80, [71, 73]) or in U2-U6 helix I (A25, [30, 74]). Mutations at any of these conserved positions have similar effects in the two systems [14, 58, 75, 76]. Besides sequence similarities, the two macromolecules also share similar preferences for the stereochemical configuration of the scissile phosphate in the two splicing steps [15, 63, 77]. Moreover, the metal ion requirements are strikingly similar in both the intron and the spliceosome. Not only are both machineries selectively dependent on magnesium as a divalent ion [4, 78], but they are also both tightly controlled by monovalent ions, that is, potassium [50, 79]. Finally, both macromolecules are known to pause in transiently inactive states to regulate the transitions between the different splicing steps [30, 80].
In the light of these analogies, it seems plausible to believe that the mechanistic details learned from the new intron structures may be pertinent for spliceosomal splicing. In particular, the structural arrangement of the active site motifs and the reactants, the identity and the coordination of the metal ions in the catalytic heteronuclear center, and possibly the dynamics of the conformational toggling observed for the group II intron may have similar correspondence also in the spliceosome. Two specific hypotheses have been proposed, each agreeing with different sets of experimental data and differing in the choice of the toggling residues and in how the spliceosomal elements are positioned in the active site . Other scenarios are also possible and further studies on the spliceosome are required to obtain a more detailed representation of its active site.
Certainly, at present, it is very hard to picture with atomic precision a similarity between the approximately 150-kDa monomeric group II intron ribozyme and the approximately 12-MDa, heteromultimeric spliceosomal ribonucleoprotein. Recently, though, a significant milestone in this direction has been achieved with the determination of the crystal structure of Prp8, a spliceosomal component that interacts directly with all active site elements . Importantly, the Prp8 structure suggests that none of the protein motifs possess catalytic activity, thus reinforcing the current belief that spliceosomal chemistry is performed by the RNA subunits . Even more interestingly, the structure reveals that Prp8 folds around an overall positively charged cavity whose dimensions exactly correspond to the conserved RNA components within the group II intron active site . Evolution seems to have substituted the group II intron scaffold, which is provided by the noncatalytic intron domains (predominantly D1), with the protein scaffold of Prp8, presumably to achieve a finer regulation of splicing fidelity and a more elaborate coordination of the interaction network with other spliceosomal components and regulatory factors. Within this shell, catalytic elements similar to those of a group II intron (for example, U6) are still believed to reside in the core of the spliceosome, suggesting that an RNA element similar to the group II intron D5 is conserved from bacteria to humans.
Overall, the combination of all new structures of group II introns and spliceosomal components reinforces the hypothesis that the two systems may share a common catalytic core and a common mechanism for arranging their reactants and controlling the transitions between the chemical splicing steps.
The large collection of available group II intron structures has recently brought our understanding of splicing mechanism to a new level.
Future work is now likely to focus on the characterization of D6, and the structure of conformational states that participate in branching. Hopefully, these types of structures will reveal the position of the branching nucleotide involved in the mechanism of the first splicing step and will pave the way for visualizing the structures of the branched intron/3′-exon intermediate and of the ligand-free lariat intron. Furthermore, structures containing D6 will reveal the conformation of the 3′-splice junction in the precatalytic state, and in the state that immediately precedes the second splicing step.
Eventually, all these structural snapshots will enable the creation of a movie that depicts each stage of the splicing cycle at high resolution. These pieces of structural information will be valuable not only for understanding the reaction mechanism of group II introns, but for understanding pre-mRNA splicing in general, as group II introns share many structural and mechanistic features with their spliceosomal cousins.
MM and SS are currently postdoctoral associates at Yale University. AMP is the William Edward Gilbert Professor of Molecular, Cellular and Developmental Biology and Professor of Chemistry at Yale, and a Howard Hughes Medical Institute Investigator.
Exon binding site
Intron binding site
Protein Data Bank
Michel F, Jacquier A, Dujon B: Comparison of fungal mitochondrial introns reveals extensive homologies in RNA secondary structure. Biochimie. 1982, 64: 867-881. 10.1016/S0300-9084(82)80349-0.
Mattick JS: Introns: evolution and function. Curr Opin Genet Dev. 1994, 4: 823-831. 10.1016/0959-437X(94)90066-3.
Koonin EV: The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate?. Biol Direct. 2006, 1: 22-10.1186/1745-6150-1-22.
Pyle AM, Lambowitz AM: Group II introns: ribozymes that splice RNA and invade DNA. The RNA World. Edited by: Gesteland RF, Cech TR, Atkins JF. 2006, Cold Spring Harbor: Cold Spring Harbor Press, 469-505. 3
Chillon I, Martinez-Abarca F, Toro N: Splicing of the Sinorhizobium meliloti RmInt1 group II intron provides evidence of retroelement behavior. Nucleic Acids Res. 2011, 39: 1095-1104. 10.1093/nar/gkq847.
Cech TR: The generality of self-splicing RNA: relationship to nuclear mRNA splicing. Cell. 1986, 44: 207-210. 10.1016/0092-8674(86)90751-8.
Rest JS, Mindell DP: Retroids in archaea: phylogeny and lateral origins. Mol Biol Evol. 2003, 20: 1134-1142. 10.1093/molbev/msg135.
Sharp PA: Five easy pieces. Science. 1991, 254: 663-10.1126/science.1948046.
Garcia-Rodriguez FM, Barrientos-Duran A, Diaz-Prado V, Fernandez-Lopez M, Toro N: Use of RmInt1, a group IIB intron lacking the intron-encoded protein endonuclease domain, in gene targeting. Applied and Environmental Microbiology. 2011, 77: 854-861. 10.1128/AEM.02319-10.
Guo H, Karberg M, Long M, Jones JP, Sullenger B, Lambowitz AM: Group II introns designed to insert into therapeutically relevant DNA target sites in human cells. Science. 2000, 289: 452-457. 10.1126/science.289.5478.452.
Mastroianni M, Watanabe K, White TB, Zhuang F, Vernon J, Matsuura M, Wallingford J, Lambowitz AM: Group II intron-based gene targeting reactions in eukaryotes. PLoS One. 2008, 3: e3121-10.1371/journal.pone.0003121.
Dayie KT, Padgett RA: A glimpse into the active site of a group II intron and maybe the spliceosome, too. RNA. 2008, 14: 1697-1703. 10.1261/rna.1154408.
Jarrell KA, Peebles CL, Dietrich RC, Romiti SL, Perlman PS: Group II intron self-splicing. Alternative reaction conditions yield novel products. J Biol Chem. 1988, 263: 3432-3439.
Mikheeva S, Murray HL, Zhou H, Turczyk BM, Jarrell KA: Deletion of a conserved dinucleotide inhibits the second step of group II intron splicing. RNA. 2000, 6: 1509-1515. 10.1017/S1355838200000972.
Podar M, Perlman PS, Padgett RA: Stereochemical selectivity of group II intron splicing, reverse splicing, and hydrolysis reactions. Mol Cell Biol. 1995, 15: 4466-4478.
Simon DM, Clarke NA, McNeil BA, Johnson I, Pantuso D, Dai L, Chai D, Zimmerly S: Group II introns in eubacteria and archaea: ORF-less introns and new varieties. RNA. 2008, 14: 1704-1713. 10.1261/rna.1056108.
Zimmerly S, Hausner G, Wu X: Phylogenetic relationships among group II intron ORFs. Nucleic Acids Res. 2001, 29: 1238-1250. 10.1093/nar/29.5.1238.
Michel F, Umesono K, Ozeki H: Comparative and functional anatomy of group II catalytic introns – a review. Gene. 1989, 82: 5-30. 10.1016/0378-1119(89)90026-7.
Qin PZ, Pyle AM: The architectural organization and mechanistic function of group II intron structural elements. Curr Opin Struct Biol. 1998, 8: 301-308. 10.1016/S0959-440X(98)80062-6.
Toor N, Hausner G, Zimmerly S: Coevolution of group II intron RNA structures with their intron-encoded reverse transcriptases. RNA. 2001, 7: 1142-1152. 10.1017/S1355838201010251.
Pyle AM: The tertiary structure of group II introns: implications for biological function and evolution. Crit Rev Biochem Mol Biol. 2010, 45: 215-232. 10.3109/10409231003796523.
Costa M, Fontaine JM, Loiseaux-de Goer S, Michel F: A group II self-splicing intron from the brown alga Pylaiella littoralis is active at unusually low magnesium concentrations and forms populations of molecules with a uniform conformation. J Mol Biol. 1997, 274: 353-364. 10.1006/jmbi.1997.1416.
Dai L, Chai D, Gu SQ, Gabel J, Noskov SY, Blocker FJ, Lambowitz AM, Zimmerly S: A three-dimensional model of a group II intron RNA and its interaction with the intron-encoded reverse transcriptase. Mol Cell. 2008, 30: 472-485. 10.1016/j.molcel.2008.04.001.
de Lencastre A, Hamill S, Pyle AM: A single active-site region for a group II intron. Nat Struct Mol Biol. 2005, 12: 626-627. 10.1038/nsmb957.
Hamill S, Pyle AM: The receptor for branch-site docking within a group II intron active site. Mol Cell. 2006, 23: 831-840. 10.1016/j.molcel.2006.07.017.
Toor N, Keating KS, Taylor SD, Pyle AM: Crystal structure of a self-spliced group II intron. Science. 2008, 320: 77-82. 10.1126/science.1153803.
Toor N, Keating KS, Fedorova O, Rajashankar K, Wang J, Pyle AM: Tertiary architecture of the Oceanobacillus iheyensis group II intron. RNA. 2010, 16: 57-69. 10.1261/rna.1844010.
Li CF, Costa M, Michel F: Linking the branchpoint helix to a newly found receptor allows lariat formation by a group II intron. Embo J. 2011, 30: 3040-3051. 10.1038/emboj.2011.214.
Chan RT, Robart AR, Rajashankar KR, Pyle AM, Toor N: Crystal structure of a group II intron in the pre-catalytic state. Nat Struct Mol Biol. 2012, 19: 555-557. 10.1038/nsmb.2270.
Marcia M, Pyle AM: Visualizing group II intron catalysis through the stages of splicing. Cell. 2012, 151: 497-507. 10.1016/j.cell.2012.09.033.
Toor N, Rajashankar K, Keating KS, Pyle AM: Structural basis for exon recognition by a group II intron. Nat Struct Mol Biol. 2008, 15: 1221-1222. 10.1038/nsmb.1509.
Abramovitz DL, Friedman RA, Pyle AM: Catalytic role of 2′-hydroxyl groups within a group II intron active site. Science. 1996, 271: 1410-1413. 10.1126/science.271.5254.1410.
Boulanger SC, Belcher SM, Schmidt U, Dib-Hajj SD, Schmidt T, Perlman PS: Studies of point mutants define three essential paired nucleotides in the domain 5 substructure of a group II intron. Mol Cell Biol. 1995, 15: 4479-4488.
Chanfreau G, Jacquier A: Catalytic site components common to both splicing steps of a group II intron. Science. 1994, 266: 1383-1387. 10.1126/science.7973729.
Peebles CL, Zhang M, Perlman PS, Franzen JS: Catalytically critical nucleotide in domain 5 of a group II intron. Proc Natl Acad Sci USA. 1995, 92: 4422-4426. 10.1073/pnas.92.10.4422.
Schmidt U, Podar M, Stahl U, Perlman PS: Mutations of the two-nucleotide bulge of D5 of a group II intron block splicing in vitro and in vivo: phenotypes and suppressor mutations. RNA. 1996, 2: 1161-1172.
Vipond IB, Baldwin GS, Halford SE: Divalent metal ions at the active sites of the EcoRV and EcoRI restriction endonucleases. Biochemistry. 1995, 34: 697-704. 10.1021/bi00002a037.
Erat MC, Sigel RK: Divalent metal ions tune the self-splicing reaction of the yeast mitochondrial group II intron Sc.ai5γ. J Biol Inorg Chem. 2008, 13: 1025-1036. 10.1007/s00775-008-0390-7.
Horton JR, Cheng X: PvuII endonuclease contains two calcium ions in active sites. J Mol Biol. 2000, 300: 1049-1056. 10.1006/jmbi.2000.3938.
McConnell TS, Herschlag D, Cech TR: Effects of divalent metal ions on individual steps of the Tetrahymena ribozyme reaction. Biochemistry. 1997, 36: 8293-8303. 10.1021/bi9700678.
Syson K, Tomlinson C, Chapados BR, Sayers JR, Tainer JA, Williams NH, Grasby JA: Three metal ions participate in the reaction catalyzed by T5 flap endonuclease. J Biol Chem. 2008, 283: 28741-28746. 10.1074/jbc.M801264200.
Viadiu H, Aggarwal AK: The role of metals in catalysis by the restriction endonuclease BamHI. Nat Struct Biol. 1998, 5: 910-916. 10.1038/2352.
Martin AM, Horton NC, Lusetti S, Reich NO, Perona JJ: Divalent metal dependence of site-specific DNA binding by EcoRV endonuclease. Biochemistry. 1999, 38: 8430-8439. 10.1021/bi9905359.
Vipond IB, Halford SE: Specific DNA recognition by EcoRV restriction endonuclease induced by calcium ions. Biochemistry. 1995, 34: 1113-1119. 10.1021/bi00004a002.
Cotton AF, Wilkinson G: Symmetry and structure. Advanced Organic Chemistry:A comprehensive text. 1980, New York, USA: John Wiley and Sons, Inc, 58-59. 4
Helm M, Brule H, Giege R, Florentz C: More mistakes by T7 RNA polymerase at the 5′ ends of in vitro-transcribed RNAs. RNA. 1999, 5: 618-621. 10.1017/S1355838299982328.
Pleiss JA, Derrick ML, Uhlenbeck OC: T7 RNA polymerase produces 5′ end heterogeneity during in vitro transcription from certain templates. RNA. 1998, 4: 1313-1317. 10.1017/S135583829800106X.
Sampson JR, Uhlenbeck OC: Biochemical and physical characterization of an unmodified yeast phenylalanine transfer RNA transcribed in vitro. Proc Natl Acad Sci USA. 1988, 85: 1033-1037. 10.1073/pnas.85.4.1033.
Jacquier A, Michel F: Multiple exon-binding sites in class II self-splicing introns. Cell. 1987, 50: 17-29. 10.1016/0092-8674(87)90658-1.
Daniels DL, Michels WJ, Pyle AM: Two competing pathways for self-splicing by group II introns: a quantitative analysis of in vitro reaction rates and products. J Mol Biol. 1996, 256: 31-49. 10.1006/jmbi.1996.0066.
Chu VT, Adamidi C, Liu Q, Perlman PS, Pyle AM: Control of branch-site choice by a group II intron. Embo J. 2001, 20: 6866-6876. 10.1093/emboj/20.23.6866.
Query CC, Moore MJ, Sharp PA: Branch nucleophile selection in pre-mRNA splicing: evidence for the bulged duplex model. Genes Dev. 1994, 8: 587-597. 10.1101/gad.8.5.587.
Query CC, Strobel SA, Sharp PA: Three recognition events at the branch-site adenine. Embo J. 1996, 15: 1392-1402.
Schlatterer JC, Crayton SH, Greenbaum NL: Conformation of the Group II intron branch site in solution. J Am Chem Soc. 2006, 128: 3866-3867. 10.1021/ja0578754.
Chu VT, Liu Q, Podar M, Perlman PS, Pyle AM: More than one way to splice an RNA: branching without a bulge and splicing without branching in group II introns. RNA. 1998, 4: 1186-1202. 10.1017/S1355838298980724.
Eldho NV, Dayie KT: Internal bulge and tetraloop of the catalytic domain 5 of a group II intron ribozyme are flexible: implications for catalysis. J Mol Biol. 2007, 365: 930-944. 10.1016/j.jmb.2006.10.037.
Costa M, Deme E, Jacquier A, Michel F: Multiple tertiary interactions involving domain II of group II self-splicing introns. J Mol Biol. 1997, 267: 520-536. 10.1006/jmbi.1996.0882.
Ho Faix P: Conserved Nucleotides in the Joining Segment Between Domains 2 and 3 are Important for Group II Intron Splicing. 1998, : University of Pittsburgh
Chanfreau G, Jacquier A: An RNA conformational change between the two chemical steps of group II self-splicing. Embo J. 1996, 15: 3466-3476.
Costa M, Michel F, Westhof E: A three-dimensional perspective on exon binding by a group II self-splicing intron. Embo J. 2000, 19: 5007-5018. 10.1093/emboj/19.18.5007.
Jacquier A, Michel F: Base-pairing interactions involving the 5′ and 3′-terminal nucleotides of group-II self-splicing introns. J Mol Biol. 1990, 213: 437-447. 10.1016/S0022-2836(05)80206-2.
Chanfreau G, Jacquier A: Interaction of intronic boundaries is required for the 2nd splicing step efficiency of a group-II intron. Embo J. 1993, 12: 5173-5180.
Padgett RA, Podar M, Boulanger SC, Perlman PS: The stereochemical course of group II intron self-splicing. Science. 1994, 266: 1685-1688. 10.1126/science.7527587.
Stahley MR, Adams PL, Wang J, Strobel SA: Structural metals in the group I intron: a ribozyme with a multiple metal ion core. J Mol Biol. 2007, 372: 89-102. 10.1016/j.jmb.2007.06.026.
Ida R, Wu G: Solid-state 87Rb NMR signatures for rubidium cations bound to a G-quadruplex. 2005 Sep 14. 2005, 4294-4296. 10.1039/B505674H. Epub 2005 Aug 2, 34
Turner GL, Hinton JF, Millett FS: Thallium-205 nuclear magnetic resonance study of the thallium(I)-gramicidin A association in trifluoroethanol. Biochemistry. 1982, 21: 646-651. 10.1021/bi00533a008.
Feig AL, Uhlenbeck OC: The role of metal ions in RNA biochemistry. The RNA World. Edited by: Gesteland RF, Cech TR, Atkins JF. 1999, New York: Cold Spring Harbor Laboratory Press, 287-320. 2
Cousineau B, Lawrence S, Smith D, Belfort M: Retrotransposition of a bacterial group II intron. Nature. 2000, 404: 1018-1021. 10.1038/35010029.
Podar M, Perlman PS, Padgett RA: The two steps of group II intron self-splicing are mechanistically distinguishable. RNA. 1998, 4: 890-900. 10.1017/S1355838298971643.
Steitz TA, Steitz JA: A general two-metal-ion mechanism for catalytic RNA. Proc Natl Acad Sci USA. 1993, 90: 6498-6502. 10.1073/pnas.90.14.6498.
Keating KS, Toor N, Perlman PS, Pyle AM: A structural analysis of the group II intron active site and implications for the spliceosome. RNA. 2010, 16: 1-9. 10.1261/rna.1791310.
Madhani HD, Guthrie C: A novel base-pairing interaction between U2 and U6 snRNAs suggests a mechanism for the catalytic activation of the spliceosome. Cell. 1992, 71: 803-817. 10.1016/0092-8674(92)90556-R.
Yean SL, Wuenschell G, Termini J, Lin RJ: Metal-ion coordination by U6 small nuclear RNA contributes to catalysis in the spliceosome. Nature. 2000, 408: 881-884. 10.1038/35048617.
Mefford MA, Staley JP: Evidence that U2/U6 helix I promotes both catalytic steps of pre-mRNA splicing and rearranges in between these steps. RNA. 2009, 15: 1386-1397. 10.1261/rna.1582609.
Fabrizio P, Abelson J: Two domains of yeast U6 small nuclear RNA required for both steps of nuclear precursor messenger RNA splicing. Science. 1990, 250: 404-409. 10.1126/science.2145630.
Wolff T, Menssen R, Hammel J, Bindereif A: Splicing function of mammalian U6 small nuclear RNA: conserved positions in central domain and helix I are essential during the first and second step of pre-mRNA splicing. Proc Natl Acad Sci USA. 1994, 91: 903-907. 10.1073/pnas.91.3.903.
Yu YT, Maroney PA, Darzynkiwicz E, Nilsen TW: U6 snRNA function in nuclear pre-mRNA splicing: a phosphorothioate interference analysis of the U6 phosphate backbone. RNA. 1995, 1: 46-54.
Villa T, Pleiss JA, Guthrie C: Spliceosomal snRNAs: Mg2+-dependent chemistry at the catalytic core?. Cell. 2002, 109: 149-152. 10.1016/S0092-8674(02)00726-2.
Tseng CK, Cheng SC: Both catalytic steps of nuclear pre-mRNA splicing are reversible. Science. 2008, 320: 1782-1784. 10.1126/science.1158993.
Smith DJ, Konarska MM: Mechanistic insights from reversible splicing catalysis. RNA. 2008, 14: 1975-1978. 10.1261/rna.1289808.
Galej WP, Oubridge C, Newman AJ, Nagai K: Crystal structure of Prp8 reveals active site cavity of the spliceosome. Nature. 2013, advance online publication
We acknowledge the beamline scientists at 24-ID-C and E, NE-CAT, APS, Argonne (IL), USA, and Dr. Maximilian Bailor, Dr. Isabel Chillón Gázquez, Dr. Olga Fedorova, Nathan Pirakitikulr, and all members of the Pyle lab for constructive discussion and critical reading of the manuscript. This project was supported by the National Institute of Health (RO1GM50313).
The authors declare that they have no competing interests.
MM analyzed the structures, designed the model of the 3′-splice junction and wrote the manuscript incorporating contributions from all authors. SS designed the models of the branch site and the 3′-splice junction. AMP conceived and coordinated the study. All authors read and approved the final manuscript.