Back

Transposable element-host genome evolutionary arms race revealed by multi-modal epigenomic profiling in a telomere-to-telomere human genome reference

Nikitin, D.

2026-03-23 genomics
10.64898/2026.03.19.712972 bioRxiv
Show abstract

For a quarter of a century transposable elements have been recognized as a major component of the human genome, comprising 46.1% according to recent estimates, and as key drivers of regulatory innovation as well as participants in an ongoing evolutionary arms race with host defense systems. Using the newly released T2T ENCODE dataset, we quantified the epigenetic impact of 3.7 million transposable elements across evolutionary time by analyzing seven epigenomic modalities in twelve human cell lines, spanning six transposon classes, 44 families, and 1,122 subfamilies. We show that SVA elements exhibit the strongest signatures of the arms race, characterized by progressive escape from H3K9me3-mediated heterochromatinization accompanied by increased acquisition of CTCF binding and enhancer-associated chromatin marks. Among Alu elements, the AluYb8 and AluYb9 subfamilies display age-dependent accumulation of CTCF binding, while seven LTR subfamilies (HERV16-int, MER11C, LTR43-int, HERVE-int, LTR22C, LTR5_Hs, HERVIP10FH-int) demonstrate dynamic evolutionary behavior within active chromatin, H3K9me3 chromatin and CTCF contexts. We further evaluated the relative contribution of distinct epigenomic modalities to the host-transposable element conflict and found that transposon-driven evolution is dominated by evasion of host-imposed heterochromatinization primarily at H3K9me3 and secondarily at H3K27me3, together with progressive invasion into CTCF-rich regions. In contrast, enhancer, promoter, and H3K36me3 marks appear to play more limited roles. Collectively, these findings deepen our insight into the coevolutionary epigenomic dynamics between human genome and transposable elements and the associated processes driving regulatory innovation.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Cell Genomics
162 papers in training set
Top 0.1%
18.0%
2
Science
429 papers in training set
Top 3%
9.8%
3
Nature Communications
4913 papers in training set
Top 20%
9.8%
4
Nature Genetics
240 papers in training set
Top 0.9%
8.1%
5
Cell
370 papers in training set
Top 3%
6.1%
50% of probability mass above
6
Nature
575 papers in training set
Top 6%
4.2%
7
Nucleic Acids Research
1128 papers in training set
Top 5%
3.8%
8
Genome Biology
555 papers in training set
Top 3%
3.5%
9
Nature Ecology & Evolution
113 papers in training set
Top 2%
3.5%
10
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.8%
11
Molecular Cell
308 papers in training set
Top 5%
2.5%
12
Nature Biotechnology
147 papers in training set
Top 4%
2.3%
13
Cell Reports
1338 papers in training set
Top 20%
2.3%
14
Science Advances
1098 papers in training set
Top 14%
2.0%
15
Developmental Cell
168 papers in training set
Top 9%
1.6%
16
eLife
5422 papers in training set
Top 44%
1.6%
17
Nature Cell Biology
99 papers in training set
Top 3%
1.4%
18
Genome Research
409 papers in training set
Top 3%
1.3%
19
Nature Structural & Molecular Biology
218 papers in training set
Top 4%
1.3%
20
Advanced Science
249 papers in training set
Top 17%
0.9%
21
Nature Plants
84 papers in training set
Top 2%
0.9%
22
Molecular Plant
36 papers in training set
Top 1%
0.8%
23
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.7%
24
Genome Medicine
154 papers in training set
Top 8%
0.7%
25
Scientific Reports
3102 papers in training set
Top 77%
0.7%
26
Cell Systems
167 papers in training set
Top 14%
0.6%