Tos17 rice element: incomplete but effective
Mobile DNA volume 5, Article number: 10 (2014)
Tos17 was the first LTR retrotransposon (Copia) described as active in cultivated rice, and is present in two copies in the genome of the sequenced Nipponbare variety. Only the chromosome 7 copy is active and able to retrotranspose, at least during in vitro culture, and this ability was widely used in insertional mutagenesis assays.
Here the structure of the active Tos17 was thoroughly annotated using a set of bioinformatic analyses.
Unexpectedly, Tos17 appears to be a non-autonomous LTR retrotransposon, lacking the gag sequence and thus unable to transpose by itself.
The long terminal repeat (LTR) retrotransposon life cycle involves a cytosolic reverse-transcription step within a multiprotein core called the virus-like particle (VLP), formed by the polymerization of the Group-specific antigen (GAG) proteins, which are normally encoded by the element itself; for a recent review, see . This GAG protein classically harbors three domains, from external to internal:
the matrix domain (MA), for membrane targeting and capsid assembly;
the capsid hydrophobic region (CA), the most conserved part of GAG, in charge of polymerization; and
the nucleocapsid (NC), which targets the specific mRNA through the PSI region .
In addition, a CCHC zinc-finger motif is located at the C-terminus of the protein, present once, twice or even three times, and is in charge of the protein-nucleic acid interactions . This protein is theoretically specific to its own RNA, and is an essential component of the retrotransposition of LTR retrotransposons. A second open reading frame (ORF), pol, encodes the reverse transcriptase-RNaseH (RT-RNaseH), which drives the synthesis of a double-stranded cDNA from two RNA templates, and the integrase (INT), which allows the insertion of the new cDNA copy. However, in some cases, non-autonomous elements have been shown to be capable of hijacking the GAG from other elements .
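As an illustration of the CCHC zinc-finger motif mentioned above, the following minimal sketch scans a protein sequence for the spacing commonly written C-X2-C-X4-H-X4-C. The regular expression, the function name find_cchc and the example sequence are assumptions made for illustration only; a fixed pattern of this kind is much cruder than the profile-based Pfam search used in this study.

```python
import re

# Assumed simplified spacing for the CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
# The lookahead lets overlapping candidate motifs be reported as well.
CCHC_PATTERN = re.compile(r"(?=(C.{2}C.{4}H.{4}C))")


def find_cchc(protein):
    """Return (1-based start, matched text) for every CCHC-like motif."""
    return [(m.start() + 1, m.group(1))
            for m in CCHC_PATTERN.finditer(protein.upper())]


# Synthetic example sequence (hypothetical, not taken from any real element).
example = "MKTAACFNCGKPGHIARDCPQ"
print(find_cchc(example))
```

A sequence without any hit returns an empty list, which is the outcome such a scan would be expected to give on a gag-less ORF.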
In cultivated Asian rice (Oryza sativa L.), LTR retrotransposons compose at least 20% of the genome (MSUv7.0 reference genome , http://rice.plantbiology.msu.edu/index.shtml). The Copia Tos17 element (for Transposon of Oryza sativa 17) was the first identified as active  and able to transpose in this genome. Moreover, Tos17 seems to be the most transpositionally competent one in regenerated plants .
Two almost identical genomic copies of Tos17 reside in the reference genome (on chromosomes 7 and 10; Figure 1). Only the chromosome-7 copy is transpositionally active (during in vitro culture at least), whereas the other, located on chromosome 10, is inactive, heavily methylated and contains several stop codons and indels in its predicted coding region . This last copy can, however, be reactivated (transcriptionally) in methylation-defective mutants . In the whole Oryza genus, the copy number as well as the location of active copies (if there are any) may differ .
Activation of Tos17 during in vitro culture has been widely used in mutagenesis assays, allowing reverse genetics analyses through the generation of insertional mutants without transformation [8–10]. In the present study, a detailed functional analysis of Tos17 was performed, showing that both genomic Tos17 copies lack a gag ORF, making Tos17 a non-autonomous element that requires an active autonomous element to ensure its transposition.
Results and discussion
The two Tos17 genomic copies were extracted from their respective locations in the rice MSUv7.0 genome, and manually annotated using a series of basic local alignment search tool (BLAST), ProSite and Protein families (Pfam) analyses. A predicted long ORF of 1,058 residues (from position 659 to 3835, Figure 2A; annotated as the gag-pol ORF ) can be detected on the active copy (chromosome 7), whereas no apparent ORF (that is, more than 100 residues starting with Met) exists on the inactive copy. On this long ORF, INT (gag_pre-integrase and rve) and RT (RVT_2) Pfam-A motifs can be easily identified (see Table 1), which suggests that this ORF encodes the polyprotein (POL). However, none of the truly GAG-related motifs, such as the CCHC zinc-finger (18 residues) or the UBN2 group (100 to 150 residues), could be identified, and the first confidently identified motif in the Pfam database, the gag_pre-integrase motif related to INT (and thus to the pol ORF), starts at residue 79 of this ORF (Figure 2A; base 757 of the internal sequence).
The Pfam analysis was performed on the largest Tos17 ORF.
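The ORF coordinates and length given above can be cross-checked with simple arithmetic. The sketch below assumes that positions 659 and 3835 are 1-based, inclusive nucleotide coordinates and that the span includes the stop codon; both points are assumptions, as the text does not state them, and the function name orf_residue_count is hypothetical.

```python
def orf_residue_count(start, end, includes_stop=True):
    """Residues encoded between two 1-based, inclusive nucleotide positions.

    Assumes the span is in frame; raises if it is not a whole number of codons.
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = length_nt // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


# 3835 - 659 + 1 = 3177 nt = 1059 codons, that is 1058 residues plus the stop.
print(orf_residue_count(659, 3835))
```

Under these assumptions the result agrees with the 1,058 residues reported for the long ORF of the chromosome-7 copy.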
This ORF was then compared with the ORFs of two active Copia elements: RIRE1 from Oryza australiensis [11, 12] [BAA22288; EMBL/GB] (Figure 2B), and Houba from O. sativa (known to be one of the most recently retrotransposed Copia elements in rice; ). As shown in Figure 3, in the pairwise comparisons the ORFs of all elements aligned over the whole POL part; the Tos17 ORF, however, lacked the GAG region, whereas the ORFs from RIRE1 and Houba also aligned over the GAG part. No GAG-related region could be detected in the whole Tos17 genomic sequences by BLASTx against the nr and protein databases (data not shown). Various tBLASTn (protein query versus nucleotide database) analyses against the rice EST databases from NCBI were performed, and no ESTs corresponding to an ORF larger than the known ones were detected. Finally, no other Tos17 gag-like sequence could be amplified by PCR on Nipponbare genomic DNA (data not shown).
RT-sequence phylogenetic analysis showed that only RN304 and Lullaby are closely related to Tos17. Interestingly, RN304, the closest element to Tos17, is itself a non-autonomous element lacking the gag sequence, but no information about its transpositional activity is available. The closest complete element to Tos17 (that is, one that harbors a complete gag-pol ORF) is the Lullaby element, recently shown to be transpositionally active in only some of the regenerated lines in which its expression was detected . Lullaby is a 5,142-bp-long element, and has two copies in the Nipponbare genome, on chromosomes 6 and 9, with only the chromosome-6 copy active . The similarity between Tos17 and Lullaby is 57% at the DNA level (whole element sequence), and 64% at the protein level (gag-pol region sequence). At the DNA level, the similarity is limited to the internal sequence, whereas at the protein level the two POL sequences aligned well. Moreover, the primer binding site (PBS) region, located immediately after the 5′ LTR, and involved in RNA-GAG recognition , is almost identical between the two elements (5′-TGGTATCAGAGC(a/t)A(t/-)GGT-3′), starting at positions 126 and 139 for Lullaby and Tos17 respectively. However, no common INT signal (at the 3′-end of the 3′ LTR ) is shared between Lullaby and Tos17, suggesting that Tos17 uses only the GAG of Lullaby.
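The shared PBS consensus quoted above can be expressed as a regular expression. The sketch below reads (a/t) as "A or T" and (t/-) as "T or a one-base gap"; this reading of the notation is an assumption, as are the function names. The text does not say which variant belongs to which element, so the example uses a synthetic sequence.

```python
import re
from itertools import product

# Consensus from the text: 5'-TGGTATCAGAGC(a/t)A(t/-)GGT-3'
# Assumed reading: (a/t) = A or T; (t/-) = T present or absent.
PBS_REGEX = re.compile(r"TGGTATCAGAGC[AT]AT?GGT")


def pbs_variants():
    """Enumerate the four sequences covered by the consensus."""
    return ["TGGTATCAGAGC" + x + "A" + y + "GGT"
            for x, y in product("AT", ("T", ""))]


def find_pbs(dna):
    """Return 1-based start positions of PBS-like matches (forward strand)."""
    return [m.start() + 1 for m in PBS_REGEX.finditer(dna.upper())]


# Synthetic test sequence (hypothetical): 8 filler bases, one variant, 3 more.
synthetic = "ACGTACGT" + "TGGTATCAGAGCTAGGT" + "CCC"
print(pbs_variants())
print(find_pbs(synthetic))
```

Applied to the real element sequences, such a scan would be expected to report a hit near the end of the 5′ LTR, consistent with the start positions given in the text.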
Tos17, the most active LTR retrotransposon in cultivated rice and the element most commonly used as an insertional mutagenesis tool , is thus a non-autonomous element: no Tos17 gag sequence exists in the Oryza sativa genome, even though Tos17 is able to retrotranspose in this species. The simplest explanation is that Tos17 depends on an active LTR retrotransposon for its mobility, and that the former is able to use the GAG (and VLP) of the latter. Such hitchhiking implies a structural (same GAG-recognition signals) as well as translational (same time of expression) relationship between Tos17 and its autonomous partner. This association is probably long-standing, as the structural annotation of the Tos17 elements (Figure 2A) reveals a complete removal of the gag region, without any identifiable remnants, but without damage to any other structural feature of the element (LTR, PBS or polypurine tract (PPT)). Such a clean elimination might have occurred during Tos17 evolution, with only elements carrying this precise deletion being selected (that is, able to be correctly expressed and mobilized by their partner), as no other Tos17-like element with gag remnants has been detected.
The use of Tos17 as an insertional tool for reverse genetics is not affected by this non-autonomous state, as long as the required functional and complementation analyses are performed to validate or invalidate the insertion as the real cause of the observed phenotype. The fact that Tos17 is not able to retrotranspose by itself may help to explain the high rate (almost 90%) of morpho-physiological variations untagged by Tos17 (or the transferred T-DNA) observed among regenerated lines (; M Lorieux, unpublished data; B Hsingh, personal communication), which is probably also due to the transposition of other elements, as shown previously .
Analyses, such as the one described here, highlight the need for a better knowledge of transposable elements (TEs), in order to ensure a better understanding of their effects upon the host genome. In particular, it may be of interest to further study the details of the relationships between the non-autonomous elements and their autonomous counterparts, because existing data suggest that the former are more active than the latter, as shown for BARE2 and Tos17.
Methods
The nucleotide sequences of the genomic copies of each element were loaded into Artemis , and the ORFs longer than 100 residues were automatically extracted from the element sequences. The ORFs were then scanned online using a combination of Pfam, ProSite and BLASTp analyses  with standard parameters. The results were then reported in Artemis, in order to manually reconstruct the complete structure of each element. The LTRs were identified using Dotter , and the PBS and PPT were manually determined. The comparison between putative GAG-POL sequences was performed through a BLASTp analysis using the Align2Sequence graphical tool from the NCBI, for a better presentation. The identity/similarity levels were calculated using the Stretcher program from the EMBOSS suite.
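The ORF extraction step described above was carried out in Artemis; the following is only a minimal stand-alone sketch of the same idea, not the procedure actually used. It reports ATG-to-stop ORFs encoding more than a given number of residues on both strands. The function names, the 0-based coordinate convention and the synthetic test sequence are assumptions made for illustration.

```python
STOPS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse-complement an upper-case DNA string (other letters kept)."""
    return dna.translate(_COMPLEMENT)[::-1]


def orfs_in_strand(dna, threshold):
    """Find ATG..stop ORFs in the three frames of one strand.

    Returns (start, end, residues) with 0-based start and exclusive end
    (stop codon included); keeps ORFs with more than `threshold` residues.
    """
    found = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first Met after the previous stop
            elif start is not None and codon in STOPS:
                residues = (i - start) // 3
                if residues > threshold:
                    found.append((start, i + 3, residues))
                start = None
    return found


def find_orfs(dna, threshold=100):
    """Return (strand, start, end, residues) on both strands, longest first."""
    dna = dna.upper()
    size = len(dna)
    hits = [("+", s, e, n) for s, e, n in orfs_in_strand(dna, threshold)]
    # Reverse-strand hits are mapped back to forward-strand coordinates.
    hits += [("-", size - e, size - s, n)
             for s, e, n in orfs_in_strand(reverse_complement(dna), threshold)]
    return sorted(hits, key=lambda h: h[3], reverse=True)


# Synthetic sequence (hypothetical): one 121-residue ORF on the forward strand.
synthetic = "CC" + "ATG" + "GCT" * 120 + "TAA" + "GG"
print(find_orfs(synthetic))
```

The first entry of the returned list is the longest ORF, which corresponds to the ORF that would then be submitted to the Pfam, ProSite and BLASTp scans.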
Abbreviations
BLAST: basic local alignment search tool
LTR: long terminal repeats
ORF: open reading frame
PBS: primer binding site
Tos: transposon of Oryza sativa
References
1. Sabot F, Schulman AH: Parasitism and the retrotransposon life cycle in plants: a hitchhiker’s guide to the genome. Heredity 2006, 97: 381-388.
2. Tanskanen JA, Sabot F, Vicient C, Schulman AH: Life without GAG: the BARE-2 retrotransposon as a parasite’s parasite. Gene 2007, 390: 166-174. 10.1016/j.gene.2006.09.009
3. Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie WR, Ouyang S, Schwartz DC, Tanaka T, Wu J, Zhou S, Childs KL, Davidson RM, Lin H, Quesada-Ocampo L, Vaillancourt B, Sakai H, Lee SS, Kim J, Numa H, Itoh T, Buell CR, Matsumoto T: Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice 2013, 6: 4. 10.1186/1939-8433-6-4
4. Hirochika H, Sugimoto K, Otsuki Y, Tsugawa H, Kanda M: Retrotransposons of rice involved in mutations induced by tissue culture. Proc Natl Acad Sci USA 1996, 93: 7783-7788. 10.1073/pnas.93.15.7783
5. Sabot F, Picault N, ElBaidouri M, Llauro C, Chaparro C, Piegu B, Roulin A, Guiderdoni E, Delabastide M, McCombie R, Panaud O: Transpositional landscape of the rice genome revealed by paired-end mapping of high-throughput re-sequencing data. Plant J 2011, 66: 241-246. 10.1111/j.1365-313X.2011.04492.x
6. Hirochika H: Contribution of the Tos17 retrotransposon to rice functional genomics. Curr Opin Plant Biol 2001, 4: 118-122. 10.1016/S1369-5266(00)00146-1
7. Petit J, Bourgeois E, Stenger W, Bès M, Droc G, Meynard D, Courtois B, Ghesquière A, Sabot F, Panaud O, Guiderdoni E: Diversity of the Ty1-copia retrotransposon Tos17 in rice (Oryza sativa L.) and the AA genome of the Oryza genus. Mol Genet Genom 2009, 282: 633-652. 10.1007/s00438-009-0493-z
8. Hirochika H, Guiderdoni E, An G, Hsing Y-I, Eun MY, Han C-D, Upadhyaya N, Ramachandran S, Zhang Q, Pereira A, Sundaresan V, Leung H: Rice mutant resources for gene discovery. Plant Mol Biol 2004, 54: 325-334.
9. Miyao A, Iwasaki Y, Kitano H, Itoh J-I, Maekawa M, Murata K, Yatou O, Nagato Y, Hirochika H: A large-scale collection of phenotypic data describing an insertional mutant population to facilitate functional analysis of rice genes. Plant Mol Biol 2007, 63: 625-635. 10.1007/s11103-006-9118-7
10. Piffanelli P, Droc G, Mieulet D, Lanau N, Bès M, Bourgeois E, Rouvière C, Gavory F, Cruaud C, Ghesquière A, Guiderdoni E: Large-scale characterization of Tos17 insertion sites in a rice T-DNA mutant library. Plant Mol Biol 2007, 65: 587-601. 10.1007/s11103-007-9222-3
11. Noma K, Nakajima R, Ohtsubo H, Ohtsubo E: RIRE1, a retrotransposon from wild rice Oryza australiensis. Genes Genet Syst 1997, 72: 131-140. 10.1266/ggs.72.131
12. Piegu B, Guyot R, Picault N, Roulin A, Sanyal A, Saniyal A, Kim H, Collura K, Brar DS, Jackson S, Wing RA, Panaud O: Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. Genome Res 2006, 16: 1262-1269. 10.1101/gr.5290206
13. Vitte C, Ishii T, Lamy F, Brar D, Panaud O: Genomic paleontology provides evidence for two distinct origins of Asian rice (Oryza sativa L.). Mol Genet Genom 2004, 272: 504-511. 10.1007/s00438-004-1069-6
14. Picault N, Chaparro C, Piegu B, Stenger W, Formey D, Llauro C, Descombin J, Sabot F, Lasserre E, Meynard D, Guiderdoni E, Panaud O: Identification of an active LTR retrotransposon in rice. Plant J 2009, 58: 754-765. 10.1111/j.1365-313X.2009.03813.x
15. Rutherford K, Parkhill J, Crook J, Horsnell T, Barrell B, Rice P: Artemis: sequence visualization and annotation. Bioinformatics 2000, 16: 944-945. 10.1093/bioinformatics/16.10.944
16. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215: 403-410.
17. Sonnhammer ELL, Durbin R: A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene 1996, 167: 1-10.
The author thanks Cristian Chaparro and Benoit Piegu for their comments on the analyses, and Dr Timothy Tranberger for his help with English corrections.
The author declares having no competing interests.