Back

An improved epigenetic age estimation with TFMethyl Clock reveals DNA methylation changes during aging in transcription factor binding sites

Patel, T.; Schwarz, R.; Riege, K.; Varshavsky, M.; Kestler, H. A.; Kaplan, T.; Hoffman, S.; Bommel, A. v.

2025-10-07 bioinformatics
10.1101/2025.10.07.680024 bioRxiv
Show abstract

Methylation-based epigenetic clocks are among the most accurate tools for predicting chronological age. Although DNA methylation (DNAm) at genomic CpG sites is linked to various regulatory mechanisms, the biological interpretability of epigenetic clocks remains surprisingly limited. One primary mechanism by which DNAm is thought to influence gene regulation is by modulating transcription factor binding activity. In this study, we examine established epigenetic clocks to assess the regulatory potential of their predictive CpGs during the aging process. Our analysis reveals that generally most CpG sites used by epigenetic clocks do not overlap known transcription factor binding sites (TFBSs), indicating that changes in TFBS dynamics may not account for prediction accuracy of these models. On the other hand, by identifying age-associated CpGs that overlap TFBSs, we identified transcription factors that may be involved in the aging process. Specifically, the TFBSs of ZBED1, NFE2, CEBPB, FOXP1, EGR1, SP1, PAX5, and MAZ were particularly enriched for age-associated CpGs, while RBPJ, NFIC, RELA, IKZF1, STAT3, and USF2 were significantly protected against methylation changes. By focusing on TFBS-associated CpGs, combined with additional feature selection and engineering steps, we developed an alternative, TFMethyl Clock model, outperforming several existing approaches. Target genes of model-selected, age-predictive CpGs are enriched in the interleukin-1b production and long- chain fatty acid metabolism pathways. In contrast, these CpGs themselves are enriched mainly at binding sites of NR2C2 TF. Furthermore, approximately three-fourths of the target genes downstream of age- predictive CpGs exhibit significant age-related changes, suggesting that our approach captures deeper insights into possible methylation-driven biological aging processes. Our findings demonstrate that incorporating regulatory loci into the design of epigenetic predictors may provide mechanistic insights into the aging process while maintaining or even improving the predictive power.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Aging
69 papers in training set
Top 0.1%
35.4%
2
Aging Cell
144 papers in training set
Top 0.7%
8.7%
3
npj Aging
15 papers in training set
Top 0.1%
5.0%
4
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.7%
5.0%
50% of probability mass above
5
GeroScience
97 papers in training set
Top 0.4%
4.3%
6
Scientific Reports
3102 papers in training set
Top 34%
3.7%
7
Neurobiology of Aging
95 papers in training set
Top 0.9%
2.8%
8
Epigenetics
43 papers in training set
Top 0.3%
2.1%
9
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.1%
10
Clinical Epigenetics
53 papers in training set
Top 0.4%
1.9%
11
Frontiers in Genetics
197 papers in training set
Top 4%
1.8%
12
The Journals of Gerontology: Series A
25 papers in training set
Top 0.5%
1.7%
13
eLife
5422 papers in training set
Top 45%
1.5%
14
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
15
Frontiers in Aging Neuroscience
67 papers in training set
Top 2%
1.3%
16
Advanced Science
249 papers in training set
Top 14%
1.3%
17
International Journal of Molecular Sciences
453 papers in training set
Top 11%
1.0%
18
Nature Communications
4913 papers in training set
Top 60%
0.9%
19
PLOS ONE
4510 papers in training set
Top 65%
0.8%
20
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 9%
0.8%
21
Bioinformatics
1061 papers in training set
Top 9%
0.8%
22
Genetic Epidemiology
46 papers in training set
Top 0.8%
0.8%
23
iScience
1063 papers in training set
Top 30%
0.8%
24
Science Advances
1098 papers in training set
Top 30%
0.7%
25
Epigenetics & Chromatin
42 papers in training set
Top 0.3%
0.7%
26
Chaos, Solitons & Fractals
32 papers in training set
Top 2%
0.7%
27
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.7%