CRISPR-Cas immunity and mobile DNA: a new superfamily of DNA transposons encoding a Cas1 endonuclease
© Hickman and Dyda; licensee BioMed Central Ltd. 2014
Received: 18 July 2014
Accepted: 11 August 2014
Published: 26 August 2014
Mobile genetic elements such as DNA transposons are a feature of most genomes. The existence of novel DNA transposons can be inferred when whole genome sequencing reveals the presence of hallmarks of mobile elements such as terminal inverted repeats (TIRs) flanked by target site duplications (TSDs). A recent report describes a new superfamily of DNA transposons in the genomes of a few bacteria and archaea that possess TIRs and TSDs, and encode several conserved genes including a cas1 endonuclease gene, previously associated only with CRISPR-Cas adaptive immune systems. The data strongly suggests that these elements, designated ‘casposons’, are likely to be bona fide DNA transposons and that their Cas1 nucleases act as transposases and are possibly still active.
KeywordsDNA transposition V(D)J recombination CRISPR-Cas Adaptive immunity Casposon Transposable elements Cas1 nuclease protein
Mobile genetic elements can modify the genomes of the organisms that harbor them, and their mobility is believed to be an important factor in evolution (reviewed in [1–5]). Mobile elements can affect their host by disrupting genes, modifying control regions, and by introducing new proteins or protein domains into novel genomic locations. One of the best known examples is the RAG1 protein of jawed vertebrates which is a key protein required for the functioning of the adaptive immune system , and whose catalytic domain originated from the transposase associated with Transib transposons .
One of the most exciting recent advances in microbiology has been the discovery that an adaptive immune system also exists in many bacteria and archaea (reviewed in [8–11]). CRISPR-Cas systems provide a mechanism for prokaryotes to incorporate short stretches of foreign DNA (‘spacers’) into their genomes to archive sequence information on ‘non-self’ DNA they have encountered, such as that of viruses or plasmids. This is called the adaptation stage of the immune process. Once integrated, these spacers serve as templates for the synthesis of RNA which then directs Cas nucleases to specific foreign nucleic acids in order to degrade them. Several different types of CRISPR systems have been identified, and each is associated with a distinct set of Cas proteins. Only two proteins, Cas1 and Cas2, appear to be strictly conserved among the various CRISPR systems, and they are both metal-dependent nucleases. The structure of the Cas1-Cas2 complex from E. coli strain MG1655 has been determined .
A recent report by Krupovic et al. presents data suggesting that Cas1 proteins of CRISPR systems originated from a newly identified superfamily of DNA transposons that the authors call ‘casposons’. If true, an elegant symmetry emerges in the evolutionary history of the establishment of adaptive immune systems in higher eukaryotes and in bacteria and archaea. Furthermore, the discovery of a novel family of DNA transposases would be a significant addition to the known repertoire of mechanisms by which mobile elements are moved .
The parallels between the proposed mechanism of the adaptation step of the CRISPR immune system (reviewed in ) and DNA transposition are striking. Cas proteins are responsible for excising a short spacer segment from foreign DNA (typically 32 to 38 bp , preceded by a 2 to 5 bp ‘protospacer adjacent motif’, or PAM) and site-specifically integrating it into a particular genomic location at the leader end of a CRISPR locus. Spacer integration is accompanied by the generation of direct repeats on either side of the spacer that can vary in size from 23 to 55 bp . Thus, if the Cas1 nucleases associated with casposons are involved in catalyzing transposition, they presumably can sequence-specifically recognize their TIRs which for most DNA transposons are longer than 10 bp [2, 15]. They also appear to exhibit relaxed target DNA recognition properties relative to CRISPR-Cas systems: whereas spacer integration mediated by Cas proteins is site-specific, the genomic locations of casposons suggests that their integration sites are not highly conserved (in line with the integration properties of most RNase H-like DNA transposons with a few notable exceptions, such as the bacterial Tn7 transposon ).
One of the main ways that transposon superfamilies are grouped is by the conservation of TIR sequences located at their transposon ends. At first glance, the 19 putative casposon TIR sequences identified and analyzed by Krupovic et al. appear disconcertingly variable both in length and in sequence. However, we find that it is possible to align the TIRs of the sequences corresponding to casposon family 2 members (the most populous casposon family defined in Krupovic et al.) such that a pattern of conserved base pairs emerges within the terminal approximately 20 bp (Figure 1B). This suggests that transposon-specific end recognition by a casposon-encoded protein is reasonable. (Casposon families 1 and 3 TIRs can also be aligned to reveal conserved TIR motifs but have fewer representatives than family 2.)
The alignment in Figure 1 also suggests a resolution of a second unusual feature of the sequences presented by Krupovic et al., which is that the TSDs are reported to vary in size from 1 to 27 nucleotides (nt). TSD size is typically highly conserved in Insertion Sequences and DNA transposon superfamilies, rarely varying by more than one or two nt [15, 2]. This is because TSD size is a direct consequence of the spacing of the staggered cuts generated by a transpososome assembled on target DNA, and it reflects properties of the distinct architecture - in particular the distance between and the orientation of two catalytic sites - of these multimeric protein-DNA complexes. When the TIRs of casposon family 2 are aligned as in Figure 1B, the TSD size (as they are usually defined which does not include any overlap with the TIRs) now converges on 14 bp. This is relatively large when compared to TSDs of most characterized transposons, but is substantially less than the range of 23 to 55 nt for the repeat size of CRISPR systems. The thus-aligned TSD sequences also hint at yet another feature of many characterized DNA transposons which is a preferred palindromic target site motif .
Finally, it should be noted that all of the casposon-associated Cas1 proteins identified by Krupovic et al. possess the four conserved catalytic residues expected for an active Cas1 nuclease (Supplemental Figure 1 in their report).
The evidence is compelling that casposons possess some of the expected properties of active DNA transposons. However, as we are only beginning to understand how the multiple Cas proteins in different CRISPR systems mediate immunity, the evolutionary link between the CRISPR-associated Cas1 proteins and the casposon-associated Cas1 proteins provides only limited insight into the possible mechanism of casposon mobility. Many intriguing questions have been raised by the report of Krupovic et al. Since two types of nuclease are often associated with casposons, the Cas1 proteins and usually an HNH nuclease, does the latter have a role? If so, do these nucleases work together and interdependently to catalyze excision and integration? How might Cas1 and a B family polymerase collaborate to generate the proposed intermediate of the reaction, an excised transposon flanked by double-strand breaks? How is this related to the transposition mechanism of the superfamily of self-synthesizing Polinton/Mavericks found in eukaryotes [20, 21], to which casposons are proposed to be mechanistically related albeit not evolutionarily ? Do the recurrent HTH domains identified within casposons (for example, all the Cas1 proteins of casposon family 2 have a conserved HTH appended to their C-termini) play a role in the recognition of transposon ends or a target site? Clearly, experimental biochemistry is needed to answer these questions.
Clustered Regularly Interspaced Short Palindromic Repeats
protospacer adjacent motif
terminal inverted repeat
target site duplication.
This work was supported by the Intramural Research Program of the NIH, The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK).
- Deininger PL, Moran JV, Batzer MA, Kazazian HH Jr: Mobile elements and mammalian genome evolution. Curr Opin Genet Dev 2003, 13: 651-658.View ArticlePubMedGoogle Scholar
- Feschotte C, Pritham EJ: DNA transposons and the evolution of eukaryotic genomes. Annu Rev Genet 2007, 41: 331-368.PubMed CentralView ArticlePubMedGoogle Scholar
- Feschotte C: Transposable elements and the evolution of regulatory networks. Nature Rev Genet 2008, 9: 397-405.PubMed CentralView ArticlePubMedGoogle Scholar
- Biémont C: A brief history of the status of transposable elements: from junk DNA to major players in evolution. Genetics 2010, 186: 1085-1093.PubMed CentralView ArticlePubMedGoogle Scholar
- Janicki M, Rooke R, Yang G: Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes. Chromosome Res 2011, 19: 787-808.View ArticlePubMedGoogle Scholar
- Schatz DG, Swanson PC: V(D)J recombination: mechanisms of initiation. Annu Rev Genet 2011, 45: 167-202.View ArticlePubMedGoogle Scholar
- Kapitonov VV, Jurka J: RAG1 core and V(D)J recombination signal sequences were derived from Transib transposons. PLoS Biol 2005, 3: e181.PubMed CentralView ArticlePubMedGoogle Scholar
- Westra ER, Swarts DC, Staals RHJ, Jore MM, Brouns SJJ, van der Oost J: The CRISPRs, they are a-changin’: how prokaryotes generate adaptive immunity. Annu Rev Genet 2012, 46: 311-339.View ArticlePubMedGoogle Scholar
- Makarova KS, Wolf YI, Koonin EV: The basic building blocks and evolution of CRISPR-Cas systems. Biochem Soc Trans 2013, 41: 1392-1400.View ArticlePubMedGoogle Scholar
- Sorek R, Lawrence CM, Wiedenheft B: CRISPR-mediated adaptive immune systems in bacteria and archaea. Annu Rev Biochem 2013, 82: 237-266.View ArticlePubMedGoogle Scholar
- Barrangou R, Marraffini LA: CRISPR-Cas systems: prokaryotes upgrade to adaptive immunity. Mol Cell 2014, 54: 234-244.PubMed CentralView ArticlePubMedGoogle Scholar
- Nuñez JK, Kranzusch PJ, Noeske J, Wright AV, Davies CW, Doudna JA: Cas1-Cas2 complex formation mediates spacer acquisition during CRISPR-Cas adaptive immunity. Nature Struct Mol Biol 2014, 21: 528-534.View ArticleGoogle Scholar
- Krupovic M, Makarova KS, Forterre P, Prangishvili D, Koonin EV: Casposons: a new superfamily of self-synthesizing DNA transposons at the origin of prokaryotic CRISPR-Cas immunity. BMC Biol 2014, 12: 36.PubMed CentralView ArticlePubMedGoogle Scholar
- Curcio MJ, Derbyshire KM: The outs and ins of transposition: from Mu to kangaroo. Nature Rev Mol Cell Biol 2003, 4: 865-877.View ArticleGoogle Scholar
- Chandler M, Mahillon J: Insertion sequences revisited. In Mobile DNA, Volume 2. Edited by: Craig NL, Craigie R, Gellert M, Lambowitz A. Washington DC: ASM Press; 2002:305-366.View ArticleGoogle Scholar
- Yuan YW, Wessler SR: The catalytic domain of all eukaryotic cut-and-paste transposase superfamilies. Proc Natl Acad Sci U S A 2011, 108: 7884-7889.PubMed CentralView ArticlePubMedGoogle Scholar
- Fineran PC, Charpentier E: Memory of viral infections by CRISPR-Cas adaptive immune systems: acquisition of new information. Virol 2012, 434: 202-209.View ArticleGoogle Scholar
- Peters JE, Craig NL: Tn7: smarter than we thought. Nature Rev Mol Cell Biol 2001, 2: 806-814.View ArticleGoogle Scholar
- Linheiro RS, Bergman CM: Testing the palindromic target site model for DNA transposon insertion using the Drosophila melanogaster P-element. Nucl Acids Res 2008, 36: 6199-6208.PubMed CentralView ArticlePubMedGoogle Scholar
- Kapitonov VV, Jurka J: Self-synthesizing DNA transposons in eukaryotes. Proc Natl Acad Sci U S A 2006, 103: 4540-4545.PubMed CentralView ArticlePubMedGoogle Scholar
- Pritham EJ, Putliwala T, Feschotte C: Mavericks , a novel class of giant transposable elements widespread in eukaryotes and related to DNA viruses. Gene 2007, 390: 3-17.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.