- Open Access
Blood from ‘junk’: the LTR chimeric transcript Pu.2 promotes erythropoiesis
Mobile DNA volume 5, Article number: 15 (2014)
Transposable elements (TEs) are a prominent feature of most eukaryotic genomes. Despite rapidly accumulating evidence for the role of TE-driven insertional mutagenesis and structural variation in genome evolution, few clear examples of individual TEs impacting biology via perturbed gene regulation are available. A recent report describes the discovery of an alternative promoter for the murine erythroid transcription factor Pu.1. This promoter is located in an ORR1A0 long terminal repeat (LTR) retrotransposon intronic to Pu.1 and is regulated by the Krüppel-like factors KLF1 and KLF3. Expression of the resultant chimeric transcript, called Pu.2, spontaneously induces erythroid differentiation in vitro. These experiments illustrate how transcription factor binding sites spread by retrotransposition have the potential to impact networks encoding key biological processes in the host genome.
Transposable elements (TEs) have in the past been maligned as ‘junk’ , ‘selfish’, and ‘parasitic’ [2, 3]. These descriptions are likely apt for many if not most sequences derived from TEs. However, a more complete view is that the majority of DNA generated by TE activity evolves neutrally under selection, while a small yet important minority of TE-derived sequences continues to drive genome evolution and innovation. The potential importance of TEs to the host is suggested by their near ubiquitous presence in eukaryotes, often accounting for half or more of genome sequence content [4–6] and, more convincingly, their provision of regulatory or otherwise functional genetic elements . In primates, for example, most order-specific regulatory sequences are derived from TEs . Many of these regions are dynamically regulated during development [9, 10] and incorporate internal binding sites for suppressor and activator complexes .
As a TE proliferates in its host genome, the number of loci subject to regulation by DNA binding proteins specific to that TE, such as transcription factors, also increases. During evolution, TEs have often contributed transcription factor binding sites to promoter, enhancer, and boundary elements [9–14]. In some cases, entire developmental pathways have been rewired as a result of TE mobilization [15–17]. Novel TE insertions can result in alternative splicing , exonization [19, 20], altered mRNA translational efficiency [21, 22], as well as the provision of distal enhancers . Thus, an abundance of TE-derived sequences, including ready-made promoters, enhancers, and other regulatory units, points to a major role for TEs in shaping the regulatory landscape of the eukaryotic genome (see  for a recent review).
Discovery and characterization of Pu.2, an LTR-driven chimeric mRNA
Despite extensive evidence for gene regulatory and structural innovation produced by TEs, examples of phenotypic change due to this variation are comparatively limited in mammals. TE-derived alternative promoters, which generate a chimeric mRNA with an adjacent gene, are arguably one of the more straightforward scenarios to link a TE with a functional product, particularly when that gene encodes a protein of known function. In recent work, Mak et al. report the discovery and functional characterization of a long terminal repeat (LTR) promoted chimeric mRNA of Pu.1, regulated in turn by Krüppel-like factors 1 (KLF1) and 3 (KLF3). PU.1, KLF1, and KLF3 are transcription factors active during hematopoiesis, where PU.1 favors myeloid differentiation and KLF1 and KLF3 drive erythroid maturation [26, 27]. In wild-type mice, KLF1 and KLF3 recognize similar sequence motifs in an antagonistic manner. KLF1 generally acts as a transcriptional activator , while KLF3 recruits a repressive complex including epigenetic modifiers .
Analyzing a microarray screen of Klf3-/- knockout mice, Mak et al. first identified major de-repression of Pu.1 that, oddly, excluded the initial two exons of the gene. To resolve this discrepancy, the authors performed 5′ RACE primed from the third exon of Pu.1. The results of this approach indicated an alternative promoter in an ORR1A0 LTR  located in the second intron of Pu.1 (Figure 1). Subsequent qRT-PCR assays confirmed that the ORR1A0-Pu.1 chimeric transcript, named Pu.2, was upregulated in Klf3-/- fetal liver tissue, while KLF1 and KLF3 were found to bind sequence motifs contained in ORR1A0, based on electrophoretic mobility shift assay (EMSA) and luciferase reporter experiments. Taken together, these data suggest opposing roles for KLF1 and KLF3 in regulating Pu.2 expression.
Intriguingly, Pu.2 was predicted to encode an N-terminal truncated isoform of PU.1 that retained a DNA binding ETS domain (Figure 1). By EMSA, Mak et al. demonstrated that PU.2 could bind to a predicted PU.1 target site. Moreover, PU.2 was found to counter the activity of PU.1 in a dose-dependent manner, and promoted spontaneous erythroid differentiation in human K562 cells. These assays clearly demonstrated, in vitro, an erythropoietic function for Pu.2. Extending their results to an in vivo setting, the authors detected Pu.2 mRNA and protein in wild-type fetal liver, with overexpression observed in Klf3-/- animals, and also found that in vitro KLF1 overexpression activated PU.2. These experiments provide evidence for PU.2 function in murine erythroid differentiation, albeit heavily repressed by KLF3 in vivo.
Finally, via RNA-seq and additional qRT-PCR, Mak et al. identified several other ORR1A0 alternative promoters differentially regulated by KLF3 and producing chimeric transcripts with adjacent protein-coding genes. This important, though preliminary, observation suggested that ORR1A0 may play a broader role in regulating erythroid differentiation beyond the highlighted example of Pu.2, and provides insight into the co-evolution of TE subfamilies, transcription factors, and core biological processes, as discussed elsewhere . One reasonable conclusion from this work is that the amplification of the ORR1A0 LTR family in rodents generated a ready-made network of genetic material subject to control by KLF1 and KLF3, and capable of changing how erythroid maturation was regulated during development. Notably, the ORR1A0 LTR family is rodent-specific and almost certainly incapable of further mobilization . As such, Pu.2 is not found in human cells, though it remains to be determined whether other TEs present in the human Pu.1 locus generate mRNAs functionally analogous to mouse Pu.2. Future experiments involving genome-wide chromatin immunoprecipitation sequencing (ChIP-seq) to elucidate KLF bound sites in vivo may reveal human TE families dynamically regulated hematopoiesis, in the Pu.1 locus and elsewhere.
Among a host of alternative promoters derived from mammalian TEs and driving protein-coding and non-coding gene expression [9, 31–33], we consider three reports as landmark examples of TE insertions having a clear functional impact upon biology: an epigenetically regulated LTR upstream of the Agouti gene in rodents , an LTR alternative promoter for the colony stimulating factor 1 receptor (CSF1R) proto-oncogene in lymphoma , and the convergent evolution of multiple LTRs to act as promoters for the neuronal apoptosis inhibitory protein (NAIP) gene . In each case, an LTR produces a chimeric mRNA with the adjacent protein-coding gene. The discovery by Mak et al. that the LTR-initiated Pu.2 transcript can promote erythroid maturation in the absence of KLF3 is a valuable addition to this literature, and will likely increase future attention to the role of TEs in regulating various developmental processes, including hematopoiesis.
Breast cancer 2, early onset
Chromatin immunoprecipitation sequencing
Colony stimulating factor 1 receptor
Electrophoretic mobility shift assay
Long terminal repeat
Neuronal apoptosis inhibitory protein
quantitative real time PCR
Rapid amplification of cDNA ends
Ohno S: So much “junk” DNA in our genome. Brookhaven Symp Biol 1972, 23: 366-370.
Doolittle WF, Sapienza C: Selfish genes, the phenotype paradigm and genome evolution. Nature 1980, 284: 601-603. 10.1038/284601a0
Orgel LE, Crick FH: Selfish DNA: the ultimate parasite. Nature 1980, 284: 604-607. 10.1038/284604a0
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, et al.: Initial sequencing and analysis of the human genome. Nature 2001, 409: 860-921. 10.1038/35057062
Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, Mouse Genome Sequencing Consortium, et al.: Initial sequencing and comparative analysis of the mouse genome. Nature 2002, 420: 520-562. 10.1038/nature01262
Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA, Minx P, Reilly AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du F, Kim K, Abbott RM, Cotton M, Levy A, Marchetto P, Ochoa K, Jackson SM, Gillam B, et al.: The B73 maize genome: complexity, diversity, and dynamics. Science 2009, 326: 1112-1115. 10.1126/science.1178534
Britten RJ, Davidson EH: Gene regulation for higher cells: a theory. Science 1969, 165: 349-357. 10.1126/science.165.3891.349
Jacques PE, Jeyakani J, Bourque G: The majority of primate-specific regulatory sequences are derived from transposable elements. PLoS Genet 2013, 9: e1003504. 10.1371/journal.pgen.1003504
Faulkner GJ, Kimura Y, Daub CO, Wani S, Plessy C, Irvine KM, Schroder K, Cloonan N, Steptoe AL, Lassmann T, Waki K, Hornig N, Arakawa T, Takahashi H, Kawai J, Forrest AR, Suzuki H, Hayashizaki Y, Hume DA, Orlando V, Grimmond SM, Carninci P: The regulated retrotransposon transcriptome of mammalian cells. Nat Genet 2009, 41: 563-571. 10.1038/ng.368
Lu X, Sachs F, Ramsay L, Jacques PE, Goke J, Bourque G, Ng HH: The retrovirus HERVH is a long noncoding RNA required for human embryonic stem cell identity. Nat Struct Mol Biol 2014, 21: 423-425. 10.1038/nsmb.2799
Bourque G, Leong B, Vega VB, Chen X, Lee YL, Srinivasan KG, Chew JL, Ruan Y, Wei CL, Ng HH, Liu ET: Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res 2008, 18: 1752-1762. 10.1101/gr.080663.108
Lunyak VV, Prefontaine GG, Nunez E, Cramer T, Ju BG, Ohgi KA, Hutt K, Roy R, Garcia-Diaz A, Zhu X, Yung Y, Montoliu L, Glass CK, Rosenfeld MG: Developmentally regulated activation of a SINE B2 repeat as a domain boundary in organogenesis. Science 2007, 317: 248-251. 10.1126/science.1140871
Schmidt D, Schwalie PC, Wilson MD, Ballester B, Goncalves A, Kutter C, Brown GD, Marshall A, Flicek P, Odom DT: Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages. Cell 2012, 148: 335-348. 10.1016/j.cell.2011.11.058
Wang T, Zeng J, Lowe CB, Sellers RG, Salama SR, Yang M, Burgess SM, Brachmann RK, Haussler D: Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53. Proc Natl Acad Sci U S A 2007, 104: 18613-18618. 10.1073/pnas.0703637104
Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, Ng HH, Bourque G: Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet 2010, 42: 631-634. 10.1038/ng.600
Lynch VJ, Leclerc RD, May G, Wagner GP: Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals. Nat Genet 2011, 43: 1154-1159. 10.1038/ng.917
Chuong EB, Rumi MA, Soares MJ, Baker JC: Endogenous retroviruses function as species-specific enhancer elements in the placenta. Nat Genet 2013, 45: 325-329. 10.1038/ng.2553
Ni JZ, Grate L, Donohue JP, Preston C, Nobida N, O’Brien G, Shiue L, Clark TA, Blume JE, Ares M Jr: Ultraconserved elements are associated with homeostatic control of splicing regulators by alternative splicing and nonsense-mediated decay. Genes Dev 2007, 21: 708-718. 10.1101/gad.1525507
Sorek R, Ast G, Graur D: Alu-containing exons are alternatively spliced. Genome Res 2002, 12: 1060-1067. 10.1101/gr.229302
Nekrutenko A, Li WH: Transposable elements are found in a large number of human protein-coding genes. Trends Genet 2001, 17: 619-621. 10.1016/S0168-9525(01)02445-3
Carrieri C, Cimatti L, Biagioli M, Beugnet A, Zucchelli S, Fedele S, Pesce E, Ferrer I, Collavin L, Santoro C, Forrest AR, Carninci P, Biffo S, Stupka E, Gustincich S: Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature 2012, 491: 454-457. 10.1038/nature11508
Smalheiser NR, Torvik VI: Alu elements within human mRNAs are probable microRNA targets. Trends Genet 2006, 22: 532-536. 10.1016/j.tig.2006.08.007
Sasaki T, Nishihara H, Hirakawa M, Fujimura K, Tanaka M, Kokubo N, Kimura-Yoshida C, Matsuo I, Sumiyama K, Saitou N, Shimogori T, Okada N: Possible involvement of SINEs in mammalian-specific brain formation. Proc Natl Acad Sci U S A 2008, 105: 4220-4225. 10.1073/pnas.0709398105
Rebollo R, Romanish MT, Mager DL: Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu Rev Genet 2012, 46: 21-42. 10.1146/annurev-genet-110711-155621
Mak KS, Burdach J, Norton LJ, Pearson RCM, Crossley M, Funnell APW: Repression of chimeric transcripts emanating from endogenous retrotransposons by a sequence-specific transcription factor. Genome Biol 2014. In Press
Pearson R, Fleetwood J, Eaton S, Crossley M, Bao S: Kruppel-like transcription factors: a functional family. Int J Biochem Cell Biol 2008, 40: 1996-2001. 10.1016/j.biocel.2007.07.018
Tallack MR, Whitington T, Yuen WS, Wainwright EN, Keys JR, Gardiner BB, Nourbakhsh E, Cloonan N, Grimmond SM, Bailey TL, Perkins AC: A global role for KLF1 in erythropoiesis revealed by ChIP-seq in primary erythroid cells. Genome Res 2010, 20: 1052-1063. 10.1101/gr.106575.110
Turner J, Crossley M: Cloning and characterization of mCtBP2, a co-repressor that associates with basic Kruppel-like factor and other mammalian transcriptional regulators. EMBO J 1998, 17: 5129-5140. 10.1093/emboj/17.17.5129
Smit AF: Identification of a new, abundant superfamily of mammalian LTR-transposons. Nucleic Acids Res 1993, 21: 1863-1872. 10.1093/nar/21.8.1863
Thomas JH, Schneider S: Coevolution of retroelements and tandem zinc finger genes. Genome Res 2011, 21: 1800-1812. 10.1101/gr.121749.111
Huda A, Bowen NJ, Conley AB, Jordan IK: Epigenetic regulation of transposable element derived human gene promoters. Gene 2011, 475: 39-48. 10.1016/j.gene.2010.12.010
Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay L, Bourque G, Yandell M, Feschotte C: Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 2013, 9: e1003470. 10.1371/journal.pgen.1003470
Speek M: Antisense promoter of human L1 retrotransposon drives transcription of adjacent cellular genes. Mol Cell Biol 2001, 21: 1973-1985. 10.1128/MCB.21.6.1973-1985.2001
Morgan HD, Sutherland HG, Martin DI, Whitelaw E: Epigenetic inheritance at the agouti locus in the mouse. Nat Genet 1999, 23: 314-318. 10.1038/15490
Lamprecht B, Walter K, Kreher S, Kumar R, Hummel M, Lenze D, Kochert K, Bouhlel MA, Richter J, Soler E, Stadhouders R, Johrens K, Wurster KD, Callen DF, Harte MF, Giefing M, Barlow R, Stein H, Anagnostopoulos I, Janz M, Cockerill PN, Siebert R, Dorken B, Bonifer C, Mathas S: Derepression of an endogenous long terminal repeat activates the CSF1R proto-oncogene in human lymphoma. Nat Med 2010, 16: 571-579. 1p following 579 10.1038/nm.2129
Romanish MT, Lock WM, van de Lagemaat LN, Dunn CA, Mager DL: Repeated recruitment of LTR retrotransposons as promoters by the anti-apoptotic locus NAIP during mammalian evolution. PLoS Genet 2007, 3: e10. 10.1371/journal.pgen.0030010
We thank Sandra Richardson and Adam Ewing for helpful editorial comments. GJF acknowledges the support of an Australian NHMRC Career Development Fellowship (GNT1045237), NHMRC Project Grants GNT1042449 and GNT1068789 and the European Union’s Seventh Framework Programme (FP7/2007-2013) under grant agreement No. 259743 underpinning the MODHEP consortium. KRU and GJF are additionally supported by NHMRC Project Grants GNT1045991 and GNT1067983. Funding bodies played no role in the preparation of, or decision to submit, the manuscript.
The authors declare that they have no competing interests.
GJF wrote the manuscript. KRU provided edits and prepared Figure 1. Both authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Upton, K.R., Faulkner, G.J. Blood from ‘junk’: the LTR chimeric transcript Pu.2 promotes erythropoiesis. Mobile DNA 5, 15 (2014). https://doi.org/10.1186/1759-8753-5-15
- LTR retrotransposon
- Transposable element