Back

Lifestyles of Gypsy-family transposons shape their regulatory mechanisms

Papameletiou, A.-M.; Czech Nicholson, B.; Bornelöv, S.; Hannon, G. J.

2026-05-21 genomics
10.64898/2026.05.19.726053 bioRxiv
Show abstract

Transposable elements are a highly diverse group of selfish genomic elements, prevalent across the tree of life, whose uncontrolled propagation poses a threat to genome stability. Recent studies have explored the evolution of Drosophila melanogaster transposable elements, their co-evolution with the host genome, and mechanisms that regulate their activity. However, little is known about their cross-species evolutionary patterns. Long terminal repeat (LTR) retrotransposons are the most active group of transposable elements in Drosophila. They are broadly separated into retroelements, which are active in the germline, and insect endogenous retroviruses that are active in the soma. Somatic elements are hypothesised to infect the germline through their acquisition of virus-derived proteins such as Envelope and sORF2, thus multiplying through successive generations. In this study, we curated the sequences of LTR retrotransposons in 249 drosophilid genomes, allowing us to study their evolution across these species and highlight their varying degrees of conservation. Furthermore, we reveal multiple instances of Envelope protein loss or inactivation that suggest shifts in the expression pattern of these transposons, likely accompanied by adopting different transcriptional control mechanisms. We contrast this with the evolutionary history of sORF2, which we found to be much more stable. Lastly, we examined variations in transposon LTR regions responsible for transcriptional regulation and use predictive modelling to suggest six transcription factors likely involved in their tissue-specific expression. Altogether, we reveal complex, interspecies evolutionary patterns of Gypsy-family LTR retrotransposons and highlight examples of their co-evolution with their host genome.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Molecular Biology and Evolution
488 papers in training set
Top 0.3%
12.3%
2
Genome Biology and Evolution
280 papers in training set
Top 0.1%
10.1%
3
PLOS Genetics
756 papers in training set
Top 2%
6.4%
4
Nucleic Acids Research
1128 papers in training set
Top 4%
4.8%
5
Mobile DNA
27 papers in training set
Top 0.1%
4.3%
6
Nature Communications
4913 papers in training set
Top 36%
4.3%
7
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 1.0%
4.2%
8
Nature Ecology & Evolution
113 papers in training set
Top 1%
4.0%
50% of probability mass above
9
Peer Community Journal
254 papers in training set
Top 0.8%
3.6%
10
Frontiers in Genetics
197 papers in training set
Top 2%
3.6%
11
BMC Biology
248 papers in training set
Top 0.5%
2.6%
12
eLife
5422 papers in training set
Top 32%
2.6%
13
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 3%
2.1%
14
PLOS Biology
408 papers in training set
Top 7%
2.1%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
16
Cell Genomics
162 papers in training set
Top 3%
1.7%
17
Genome Research
409 papers in training set
Top 2%
1.7%
18
Genome Biology
555 papers in training set
Top 5%
1.5%
19
PLOS Pathogens
721 papers in training set
Top 6%
1.5%
20
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
21
Cell Reports
1338 papers in training set
Top 28%
1.2%
22
Science
429 papers in training set
Top 17%
1.1%
23
BMC Genomics
328 papers in training set
Top 4%
0.9%
24
Cell
370 papers in training set
Top 15%
0.9%
25
Genes
126 papers in training set
Top 2%
0.9%
26
Genetics
225 papers in training set
Top 4%
0.8%
27
Current Biology
596 papers in training set
Top 13%
0.8%
28
iScience
1063 papers in training set
Top 32%
0.7%
29
Molecular Ecology
304 papers in training set
Top 4%
0.7%
30
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%