- Open Access
A proposed mechanism for IS607-family serine transposases
Mobile DNA volume 4, Article number: 24 (2013)
The transposases encoded by the IS607 family of mobile elements are unusual serine recombinases with an inverted domain order and minimal specificity for target DNA.
Structural genomics groups have determined three crystal structures of the catalytic domains of IS607 family transposases. The dimers formed by these catalytic domains are very different from those seen for other serine recombinases and include interactions that usually only occur upon formation of a synaptic tetramer.
Based on these structures, we propose a model for how IS607-family transposases could form a synaptic tetramer. The model suggests that, unlike other serine recombinases, these enzymes carry out sequence-specific DNA binding and catalysis in trans: the DNA binding and catalytic domains of each subunit are proposed to interact with different DNA duplexes. The model also suggests an explanation for the minimal target DNA specificity.
The IS607 family of insertion sequences (ISs) was first described in the human pathogens M. tuberculosis and H. pylori, and members of this family have now been identified in all three kingdoms of life and in eukaryotic viruses [1–6]. They usually encode two proteins: TnpA (sometimes called TnpA2), an unusual serine recombinase, and TnpB, a protein of completely unknown structure (Figure 1). Similar TnpB proteins are also encoded by IS200/IS605-type elements, but these encode a different TnpA that belongs to the Y1 (one tyrosine) family of transposases [7, 8]. Although TnpB is conserved in both these IS families, it is not required for IS607 transposition, and it inhibits transposition of the IS200/IS605-family member ISDra2 [2, 9]. TnpA is therefore the transposase of the IS607 family elements. Analysis of IS607 insertions in an E. coli system and genomic analysis of other family members showed that they insert with very little target sequence specificity [2, 5]. This is very unusual for reactions catalysed by serine recombinases, which usually display extensive specificity for all recombining partners [10, 11].
The serine transposases encoded by IS607-type insertion sequences represent a poorly understood branch of the serine recombinase family. Serine recombinases all share a common catalytic domain that includes the eponymous serine that is the active site nucleophile. The most intensively studied branch of this family, and the only one for which extensive structural information is available, comprises the “canonical” resolvase/invertase group ; these catalyse resolution of transposition cointegrates and replicon dimers, or inversion of DNA segments. A second branch that has also been biochemically characterized comprises the large serine recombinases, which include a number of bacteriophage integrases and some transposases. For both of the characterized groups, the catalytic domain is always found at the N-terminus of the protein and is followed by a sequence-specific DNA binding domain: a simple helix-turn-helix for the resolvase/invertase group, or a much larger bipartite domain in the large serine recombinases (hence their name) . However, the IS607-family serine transposases carry a predicted helix-turn-helix DNA binding domain at the N-terminus, with the catalytic domain at the C-terminus (Figure 1c) [2, 5, 10].
Recombination by the characterized serine recombinases proceeds within a tetrameric complex that synapses the two DNA partners (Figure 2; reviewed in ). DNA is cleaved by attack of the conserved serine on a particular phosphodiester bond in the DNA, displacing a 3′ hydroxyl and creating a covalent protein-DNA intermediate. Once the DNA is cleaved, two subunits are thought to rotate 180° relative to the other two, carrying the broken DNA ends with them [13–19]. Religation then occurs by the chemical reversal of the cleavage reaction: the 3′ hydroxyls attack the phosphoserine linkages of their new partners. The recombination reaction is chemically reversible and these systems rely on additional features to determine when and in what direction it occurs. The tetramer can be subdivided into dimers in two different ways: “cutting dimers” in which both subunits bind (and cut/religate) the same duplex and “rotating dimers” in which the two subunits bind different duplexes but rotate together during the strand exchange process (Figure 2).
For the well-characterized serine recombinases, activation entails a pair of inactive cutting dimers undergoing large conformational changes as they come together to form a catalytically active tetramer. In the resolvase and invertase systems, activation is triggered by formation of a large synaptic complex that has a defined topology and includes additional copies of the recombinase and/or other DNA bending proteins [20, 21]. This requirement can be bypassed by mutations that tip the conformational balance from the inactive dimer normally favoured by the isolated wild type (WT) protein to the catalytically active tetramer [10, 22, 23]. Integrases of the large serine recombinase family can form synaptic complexes without accessory factors, but usually only if they involve certain pairs of DNA sites (e.g., attP, and attB, the attachment sites found in the phage and bacterial DNA, respectively) .
Structural studies have revealed similar cutting dimers formed by the WT catalytic domains of enzymes from both the canonical and the large serine recombinases, in which the active site is not fully assembled (e.g., Figure 3a) [24–26] and unpublished structures with protein database identifications (PDBids) 3g13, 1guv]. The one partial exception to this rule is the catalytic domain of TP901-1 integrase, which crystallized as a tetramer . However, further biophysical data showed that it is dimeric in solution . Tetrameric structures have also been determined for activated mutants of 3 canonical serine recombinases (Figure 3b) [19, 28–30]. These show that the conformational changes that accompany activation create a remarkably flat central interface about which subunit rotation could occur. The conformational changes include disruption of the cutting dimer interface contacts, rotation and repositioning of the “core” of the catalytic domain relative to the last helix (“E”), and repacking of the 4 copies of helix E in the centre of the tetramer. Rotating dimer interactions are formed primarily by the antiparallel packing of the E helices of subunits bound to different duplexes.
It has been unclear how to apply the lessons learned from the well-characterized serine recombinases to the IS607-family serine transposases. For instance, in the canonical serine recombinases, the catalytic and DNA binding domains of each protomer interact with the same DNA half-site [26, 31], but this is difficult to model for enzymes such as IS607 transposase where the DNA binding domain is N-terminal to the catalytic one, and attached by a rather short linker (~6aa for ISC1904). This geometry places the DNA binding domain (DBD) on the opposite side of the catalytic domain from the active site. Additionally, the current paradigm in which each of the four subunits binds one copy of a specific sequence motif (a “half site”) is hard to reconcile with the lack of target specificity shown by IS607-family transposases. Recently, structures of the catalytic domains of three different archaeal IS607-family serine transposases, including that from ISC1904 , have been determined by the Midwest Center for Structural Genomics (PDBids 3ilx, 3lhk and 3lhf). These reveal a very different dimer architecture than previously observed for other serine recombinases and suggest a different pathway for formation of an active tetramer.
The IS607-family dimer is a “rotating” rather than a “cutting” pair
The structure of the catalytic domain of ISC1904 transposase (from Sulfolobus Solfataricus P2; PDBid 3ilx) is shown in Figure 4 and is nearly identical to the dimers seen in the other two serine transposase catalytic domain structures. The packing of the E-helices in the serine transposase dimers more closely resembles that seen in activated tetramers rather than inactive dimers of canonical serine recombinases [19, 25, 26, 28–30, 33]. In contrast to other (inactive) serine recombinase dimer structures, the E helices pack against one another in an antiparallel manner and make sparse contacts with the rest of the catalytic domains. The C-terminal portions of the transposase E helices fold back on themselves at a point that is flexible in other structures [25, 26]. The antiparallel packing of the E helices closely resembles that seen previously in activated tetramers between subunits that form rotating dimers (Figure 4c and d). Thus, we propose that the two subunits forming the dimer in the IS607-family transposases are those that will become a rotating dimer within an active tetramer. In contrast, for other serine recombinases the two subunits of the inactive dimer become a cutting dimer within the tetramer.
If the resolvase tetramer were to be split into two rotating dimers, a large hydrophobic surface would be exposed (Figure 3). In the serine transposase dimer, the equivalent surface is covered by the C-terminal portions of the E helices that fold back against it (Figure 4a). These observations suggested a model for the transposase in which a full tetramer is assembled on a single DNA crossover site. As described below, such a tetramer can be assembled by maintaining the packing between E helices within each dimer and by rotating about two flexible points within each subunit.
Modelling an active IS607-family transposase tetramer
We anticipate that all IS607-family elements use a 'standard’ serine recombinase strand exchange mechanism (Figure 2; ) and transpose via a circular intermediate, similar to the circular forms of bacteriophages that use a large serine recombinase for integration/excision. In the circular form, the two ends of the mobile element would be joined to form a new crossover site through a specific 'overlap’ dinucleotide ('GG’ for IS607 ). Evidence for a circular form of IS607 was obtained by PCR in an E. coli-based transposition assay (NDF Grindley, personal communication). Recombination between the crossover site in the circular intermediate and a matching dinucleotide in the target DNA would insert the element into a new genomic location. Here, we propose a pathway for integration that can easily be extrapolated to the excision step.
To find a good model for the DBD, which was not included in any of the deposited serine transposase structures, we used the PROF routine of PredictProtein to predict its secondary structure . This was consistent with a winged helix-turn-helix, with two short helices followed by a β-hairpin wing and a third short helix. The SoxR repressor begins with just such a DNA-binding motif and is also the top hit found by the Phyre2 threading server [35, 36]. We therefore used a truncated version of the SoxR structure, with the DNA it was co-crystallized with, to model the DBD of the full-length ISC1904 transposase-DNA complex. The third helix of the transposase’s DBD is predicted to end at residue 47; this implies that there is a 6-residue linker before the catalytic domain, which becomes ordered in the crystal at residue 54.
The model for tetramer assembly (Figures 5 and 6), begins with the ISC1904 transposase dimer structure, with two SoxR-based DBDs attached by short flexible linkers to the N-termini. Straightening helix E of one subunit to more closely resemble known resolvase structures brings it into close proximity to the DBD of the opposite subunit. Since the C-terminal segment of helix E interacts with the minor groove of DNA in γδ and Sin resolvase-DNA structures, we predict that these structural motifs collaborate in binding a half-site (Figure 5b). The DNA was aligned with that in the γδ resolvase tetramer structure, although the central 2 bp, which are unpaired in the resolvase structure, were omitted from our model. The position of the DBD relative to the centre of the complex was based on the fact that the ISC1904 left and right ends both have a short sequence motif (TTG) that might comprise a specific binding site 2–4 bp from each 5′ end (Figure 1b; similar short repeats and conserved motifs can be found near the ends of many, but not all IS607-family elements listed in the ISFINDER database [2, 5, 6, 37]). When the catalytic domain cores of each subunit are rotated about the β4-helix E hinge to match the position of those in the γδ resolvase tetramer, an interesting prediction appears: unlike well-characterized serine recombinases, this one will cleave the DNA in trans. That is, the DNA half-site that interacts with the green subunit’s DBD will become covalently linked to the blue subunit’s catalytic domain (Figure 5b).
Straightening the E helices of the dimer exposes a hydrophobic surface, which we propose interacts with that of a similar dimer (Figure 5b, third panel), triggering tetramer assembly. The order of the conformational changes proposed in Figure 5 is arbitrary. However, it is plausible that sequence-specific major groove binding by one subunit’s DBD would cause the C-terminal part of its partner’s E-helix to flip into the adjacent minor groove. Synergistic binding of a second dimer would result in a full tetramer bound to one DNA segment. Tetramer formation would force the other pair of E-helices (green in Figure 5b) into the extended conformation, ready to bind target DNA. Since the second set of DNA-binding moieties is thus pre-assembled, the tetramer’s affinity for target DNA of nearly-random sequence would be greatly increased over that of a single inactive transposase dimer. Note that all DNA binding proteins have some affinity for random-sequence DNA, although it can be orders of magnitude weaker than that for specific sequences. Our model implies that the serine transposase’s affinity for specific vs. non-specific DNA is tuned such that the affinity of a single DBD for non-specific DNA is too weak to be physiologically significant, but, due to cooperativity, a pre-assembled array of two DBDs plus two E helices binds non-specific target DNA tightly enough to be functionally relevant.
Figure 6 shows a ribbon drawing of the final model and a superposition of it onto the γδ resolvase tetramer structure. The catalytic domains and E-helices overlap quite well, with the only major difference lying in the placement of the DBDs. Note that the E-helix interactions of the initial dimer were maintained throughout the modelling. There is precedent for the type, if not the scale, of the inter-domain motions needed to construct the model. In several other structures, the E-helix bends and/or becomes disordered at the position where it folds back on itself in the IS607-family dimers [24–26] and PDBid 3 g13. Rotation of the catalytic domain core relative to the E-helix occurs in the transition from inactive dimer to active tetramer for both γδ and Sin resolvases and triggers assembly of the active site [19, 28, 29].
In the serine transposase case, an extra level of regulation may keep the dimer inactive until the proper complex is assembled: the active sites within the catalytic domain cores are physically occluded by the E-helices. The inhibitory interaction between the cores and E-helices is stabilized by two negatively charged side chains that interact with the conserved arginines of the active site (Figure 4a). This pair of negatively charged residues is highly conserved within the serine transposases but not within the larger serine recombinase family.
Another question is whether or not a tetramer assembled on one crossover site would repeatedly cleave and religate that site even in the absence of target DNA. Kinetic experiments suggested that Sin resolvase tetramers were catalytically active even when only bound to one DNA segment. However, those experiments bypassed the natural assembly pathway for Sin . The serine transposases may have evolved an additional regulatory mechanism to avoid making double strand breaks until both DNA partners are present, which would be an interesting question to address experimentally. Preassembly of an active complex that captures a target site (of varying specificity) has good precedent in otherwise unrelated recombination systems, e.g., phage lambda integrase and the DDE family of transposases and retroviral integrases [39, 40].
The assembly pathway cartooned in Figure 5b regards the integration reaction, where one crossover site contains specific recombinase binding sites derived from the left and right ends of the IS element, and the other (“target DNA”) is non-specifically captured by a pre-assembled tetramer. How could this model accommodate the excision reaction, where only one arm of each crossover site contains any specific DNA sequences? Perhaps a tetramer could be nucleated by a single dimer binding to the sequence-specific half of each crossover site duplex. Unfolding of that dimer’s E-helices as they dock into the minor groove would expose a hydrophobic surface that would favour addition of the second dimer even though its DNA contacts would be non-specific. However, it is unclear how this assembly pathway would lead to proper synapsis of the two element ends and how recombination of one end with a random target would be avoided. Alternatively, the specific half-site at each IS end may bind one subunit of a rotating dimer and synapsis of the two ends may be mediated by two such dimers docking together (Figure 7).
Our modelling exercise demonstrates that while the assembly pathway may be very different, the final activated tetramer formed by IS607-family serine transposases may be very similar to that formed by canonical serine recombinases. However, it would differ in that each subunit would act in trans – that is, the catalytic and DNA binding domains would interact with different DNA duplexes.
Modelling was carried out by manipulating the relevant structures manually in Pymol (The PyMOL Molecular Graphics System, Version 1.3 Schrödinger, LLC).
DNA binding domain
Protein database identification
Blount ZD, Grogan DW: New insertion sequences of sulfolobus: functional properties and implications for genome evolution in hyperthermophilic archaea. Mol Microbiol 2005, 55: 312-325.
Kersulyte D, Mukhopadhyay AK, Shirai M, Nakazawa T, Berg DE: Functional organization and insertion specificity of IS607, a chimeric element of Helicobacter pylori . J Bacteriol 2000, 182: 5300-5308. 10.1128/JB.182.19.5300-5308.2000
Rolland T, Neuveglise C, Sacerdot C, Dujon B: Insertion of horizontally transferred genes within conserved syntenic regions of yeast genomes. PLoS One 2009, 4: e6515. 10.1371/journal.pone.0006515
Gilbert C, Cordaux R: Horizontal transfer and evolution of prokaryote transposable elements in eukaryotes. Genome Biol Evol 2013,5(5):822-832. 10.1093/gbe/evt057
Kersulyte D, Kalia A, Zhang M, Lee HK, Subramaniam D, Kiuduliene L, Chalkauskas H, Berg DE: Sequence organization and insertion specificity of the novel chimeric ISHp609 transposable element of Helicobacter pylori . J Bacteriol 2004, 186: 7521-7528. 10.1128/JB.186.22.7521-7528.2004
Gordon SV, Heym B, Parkhill J, Barrell B, Cole ST: New insertion sequences and a novel repeated sequence in the genome of Mycobacterium tuberculosis H37Rv. Microbiology 1999,145(Pt 4):881-892.
Ronning DR, Guynet C, Ton-Hoang B, Perez ZN, Ghirlando R, Chandler M, Dyda F: Active site sharing and subterminal hairpin recognition in a new class of DNA transposases. Mol Cell 2005, 20: 143-154. 10.1016/j.molcel.2005.07.026
Bao W, Jurka J: Homologues of bacterial TnpB_IS605 are widespread in diverse eukaryotic transposable elements. Mob DNA 2013, 4: 12. 10.1186/1759-8753-4-12
Pasternak C, Dulermo R, Ton-Hoang B, Debuchy R, Siguier P, Coste G, Chandler M, Sommer S: ISDra2 transposition in Deinococcus radiodurans is downregulated by TnpB. Mol Microbiol 2013, 88: 443-455. 10.1111/mmi.12194
Grindley ND, Whiteson KL, Rice PA: Mechanisms of site-specific recombination. Annu Rev Biochem 2006, 75: 567-605. 10.1146/annurev.biochem.73.011303.073908
Smith MC, Brown WR, McEwan AR, Rowley PA: Site-specific recombination by phiC31 integrase and other large serine recombinases. Biochem Soc Trans 2010, 38: 388-394. 10.1042/BST0380388
Rutherford K, Yuan P, Perry K, Sharp R, Van Duyne GD: Attachment site recognition and regulation of directionality by the serine integrases. Nucleic acids research 2013, 41: 8341-8356. 10.1093/nar/gkt580
Cozzarelli NR, Krasnow MA, Gerrard SP, White JH: A topological treatment of recombination and topoisomerases. Cold Spring Harb Symp Quant Biol 1984, 49: 383-400. 10.1101/SQB.1984.049.01.045
Heichman KA, Moskowitz IP, Johnson RC: Configuration of DNA strands and mechanism of strand exchange in the Hin invertasome as revealed by analysis of recombinant knots. Genes Dev 1991, 5: 1622-1634. 10.1101/gad.5.9.1622
McIlwraith MJ, Boocock MR, Stark WM: Site-specific recombination by Tn3 resolvase, photocrosslinked to its supercoiled DNA substrate. J Molec Biol 1996, 260: 299-303. 10.1006/jmbi.1996.0400
Sherratt D, Dyson P, Boocock M, Brown L, Summers D, Stewart G, Chan P: Site-specific recombination in transposition and plasmid stability. Cold Spring Harb Symp Quant Biol 1984, 49: 227-233. 10.1101/SQB.1984.049.01.026
Stark WM, Sherratt DJ, Boocock MR: Site-specific recombination by Tn3 resolvase: topological changes in the forward and reverse reactions. Cell 1989, 58: 779-790. 10.1016/0092-8674(89)90111-6
Wasserman SA, Dungan JM, Cozzarelli NR: Discovery of a predicted DNA knot substantiates a model for site-specific recombination. Science 1985, 229: 171-174. 10.1126/science.2990045
Li W, Kamtekar S, Xiong Y, Sarkis GJ, Grindley ND, Steitz TA: Structure of a synaptic gammadelta resolvase tetramer covalently linked to two cleaved DNAs. Science 2005, 309: 1210-1215. 10.1126/science.1112064
Stark WM, Boocock MR, Olorunniji FJ, Rowland SJ: Intermediates in serine recombinase-mediated site-specific recombination. Biochem Soc Trans 2011, 39: 617-622. 10.1042/BST0390617
Dhar G, Heiss JK, Johnson RC: Mechanical constraints on Hin subunit rotation imposed by the Fis/enhancer system and DNA supercoiling during site-specific recombination. Mol Cell 2009, 34: 746-759. 10.1016/j.molcel.2009.05.020
Dhar G, McLean MM, Heiss JK, Johnson RC: The Hin recombinase assembles a tetrameric protein swivel that exchanges DNA strands. Nucleic Acids Res 2009, 37: 4743-4756. 10.1093/nar/gkp466
Rowland SJ, Boocock MR, McPherson AL, Mouw KW, Rice PA, Stark WM: Regulatory mutations in Sin recombinase support a structure-based model of the synaptosome. Mol Microbiol 2009, 74: 282-298. 10.1111/j.1365-2958.2009.06756.x
Sanderson MR, Freemont PS, Rice PA, Goldman A, Hatfull GF, Grindley ND, Steitz TA: The crystal structure of the catalytic domain of the site-specific recombination enzyme gamma delta resolvase at 2.7 a resolution. Cell 1990, 63: 1323-1329. 10.1016/0092-8674(90)90427-G
Mouw KW, Rowland SJ, Gajjar MM, Boocock MR, Stark WM, Rice PA: Architecture of a serine recombinase-DNA regulatory complex. Mol Cell 2008, 30: 145-155. 10.1016/j.molcel.2008.02.023
Yang W, Steitz TA: Crystal structure of the site-specific recombinase gamma delta resolvase complexed with a 34 bp cleavage site. Cell 1995, 82: 193-207. 10.1016/0092-8674(95)90307-0
Yuan P, Gupta K, Van Duyne GD: Tetrameric structure of a serine integrase catalytic domain. Structure 2008, 16: 1275-1286. 10.1016/j.str.2008.04.018
Kamtekar S, Ho RS, Cocco MJ, Li W, Wenwieser SV, Boocock MR, Grindley ND, Steitz TA: Implications of structures of synaptic tetramers of gamma delta resolvase for the mechanism of recombination. Proc Natl Acad Sci USA 2006, 103: 10642-10647. 10.1073/pnas.0604062103
Keenholtz RA, Rowland SJ, Boocock MR, Stark WM, Rice PA: Structural basis for catalytic activation of a serine recombinase. Structure 2011, 19: 799-809. 10.1016/j.str.2011.03.017
Ritacco CJ, Kamtekar S, Wang J, Steitz TA: Crystal structure of an intermediate of rotating dimers within the synaptic tetramer of the G-segment invertase. Nucleic Acids Res 2013, 41: 2673-2682. 10.1093/nar/gks1303
Boocock MR, Zhu X, Grindley ND: Catalytic residues of gamma delta resolvase act in cis. EMBO J 1995, 14: 5129-5140.
Filee J, Siguier P, Chandler M: Insertion sequence diversity in archaea. Microbiol Mol Biol Rev 2007, 71: 121-157. 10.1128/MMBR.00031-06
Rice PA, Steitz TA: Refinement of gamma delta resolvase reveals a strikingly flexible molecule. Structure 1994, 2: 371-384. 10.1016/S0969-2126(00)00039-3
Rost B, Yachdav G, Liu J: The PredictProtein server. Nucleic Acids Res 2004, 32: W321-W326. 10.1093/nar/gkh377
Watanabe S, Kita A, Kobayashi K, Miki K: Crystal structure of the [2Fe-2S] oxidative-stress sensor SoxR bound to DNA. Proc Natl Acad Sci USA 2008, 105: 4121-4126. 10.1073/pnas.0709188105
Kelley LA, Sternberg MJ: Protein structure prediction on the web: a case study using the phyre server. Nat Protoc 2009, 4: 363-371.
Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M: ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res 2006, 34: D32-D36. 10.1093/nar/gkj014
Mouw KW, Steiner AM, Ghirlando R, Li NS, Rowland SJ, Boocock MR, Stark WM, Piccirilli JA, Rice PA: Sin resolvase catalytic activity and oligomerization state are tightly coupled. J Mol Biol 2010, 404: 16-33. 10.1016/j.jmb.2010.08.057
Dyda F, Chandler M, Hickman AB: The emerging diversity of transpososome architectures. Q Rev Biophys 2012, 45: 493-521. 10.1017/S0033583512000145
Landy A: Dynamic, structural, and regulatory aspects of lambda site-specific recombination. Annu Rev Biochem 1989, 58: 913-949. 10.1146/annurev.bi.58.070189.004405
This work was funded by a Functional Supplement to NIH R01 GM086826 (to PAR) and by the Wellcome Trust (MRB). We thank S-J Rowland and James Fuller for comments on the manuscript, and NDF Grindley for communicating unpublished data.
The authors have no competing interests.
MRB noticed that IS607-family transposase catalytic domain structures had appeared in the protein data bank and discussed the significance of their unusual dimer architecture with PAR. PAR constructed the model and wrote the manuscript, which MRB edited and approved. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Boocock, M.R., Rice, P.A. A proposed mechanism for IS607-family serine transposases. Mobile DNA 4, 24 (2013) doi:10.1186/1759-8753-4-24
- Serine recombinase
- Site-specific recombinase
- Transposition mechanisms