Back

TEDEdb: a large-scale resource and multi-cohort analysis of transposable element differential expression in cancer

Calendo, G.; Chaunzwa, M.; Dehzangi, I.; Madzo, J.; Issa, J.-P. J.

2026-04-29 cancer biology
10.64898/2026.04.27.721042 bioRxiv
Show abstract

The human genome consists of nearly 50% repetitive DNA, referred to for decades as "junk DNA". These repetitive sequences, usually under the strict control of epigenetic silencing, have been observed to be aberrantly expressed in cancer. Some of these expressed sequences, e.g., transposable elements (TEs), can induce innate immune responses when de-repressed following treatment with epigenetic therapies. As a result, epigenetic therapy has been suggested to augment cancer therapies. TEs are traditionally ignored in most RNA-seq studies and their expression is often excluded from publicly available data sources. Thus, the vast amount of publicly available RNA-seq data is an untapped resource for exploring the role of TE expression in cancer and cancer treatment. Here, we present a uniform re-analysis of over 7,000 RNA-seq samples, encompassing more than 2,000 differential expression experiments across 220 cancer cell lines and 700 drug treatments. We observed that TE expression is more prone to batch effects than gene expression alone, necessitating the use of meta-analysis techniques to probe the dataset for global trends. We confirm that DNMTi and HDACis are powerful inducers of TEs. We also show that non-epigenetic compounds such as CDK and topoisomerase inhibitors can also induce robust up-regulation of transposable elements and confirm that this TE induction is consistent with viral mimicry response. We make all of the reprocessed data, web application, and database publicly available at: https://dataexplorer.coriell.org/TEDEdb/

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Mobile DNA
27 papers in training set
Top 0.1%
14.3%
2
Nucleic Acids Research
1128 papers in training set
Top 2%
8.2%
3
Genome Biology
555 papers in training set
Top 1.0%
6.6%
4
Genome Research
409 papers in training set
Top 0.5%
6.2%
5
Nature Communications
4913 papers in training set
Top 31%
6.2%
6
Genome Medicine
154 papers in training set
Top 1%
6.1%
7
Nature
575 papers in training set
Top 6%
4.7%
50% of probability mass above
8
Nature Genetics
240 papers in training set
Top 2%
4.7%
9
Cell Genomics
162 papers in training set
Top 2%
3.5%
10
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
11
iScience
1063 papers in training set
Top 9%
2.3%
12
Cell Systems
167 papers in training set
Top 7%
1.8%
13
PLOS ONE
4510 papers in training set
Top 51%
1.8%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 34%
1.6%
15
Bioinformatics Advances
184 papers in training set
Top 3%
1.6%
16
Science
429 papers in training set
Top 16%
1.4%
17
Cell Reports
1338 papers in training set
Top 27%
1.4%
18
Scientific Reports
3102 papers in training set
Top 62%
1.4%
19
NAR Cancer
36 papers in training set
Top 0.1%
1.3%
20
Cell
370 papers in training set
Top 14%
1.2%
21
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
22
PLOS Genetics
756 papers in training set
Top 11%
1.2%
23
eLife
5422 papers in training set
Top 51%
1.1%
24
Cancer Research
116 papers in training set
Top 3%
0.9%
25
Genome Biology and Evolution
280 papers in training set
Top 2%
0.8%
26
Science Advances
1098 papers in training set
Top 29%
0.8%
27
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 6%
0.7%
28
Bioinformatics
1061 papers in training set
Top 10%
0.7%
29
Nature Methods
336 papers in training set
Top 6%
0.7%
30
Molecular Cancer
14 papers in training set
Top 1%
0.7%