Back

Testing the mutation accumulation hypothesis in aging with AlphaGenome

Fischbach, A.

2026-05-15 bioinformatics
10.64898/2026.05.10.724136 bioRxiv
Show abstract

The mutation accumulation (MA) hypothesis posits that somatic mutations progressively escape selection and degrade tissue function during aging. Direct tests of this idea have been limited by the difficulty of predicting, at scale, the molecular consequences of individual somatic variants. Here I use AlphaGenome, a sequence-to-function deep learning model, to systematically score the predicted transcriptional impact of somatic mutations under a nested series of designs spanning individual variants, co-occurring variant bundles, and real mutation catalogues. First, I characterize the genome-wide effect-size baseline by scoring 4,000 random single-nucleotide variants (SNVs) in colon tissue, together with 1-Mb-window combined-effect tests. Second, I extend this baseline to gene-body resolution with a 60-cell x 4,000-SNV simulation and pseudobulk RNA-seq aggregation. Third, I analyze the real somatic mutation catalogue of Cagan et al. (Nature, 2022), scoring 54,158 substitutions and 9,799 indels from 54 mouse colonic crypts plus three human samples, together with region- and gene-level enrichment tests against GENCODE. Across all analyses, both random and real somatic variants, including single-nucleotide variants and indels, produce predicted expression changes whose distributions lie three to four orders of magnitude below the tissues endogenous aging transcriptional program. These results argue against a simple, direct mutation-accumulation explanation for the age-associated transcriptional signature of colonic epithelium and redirect attention to epigenetic and regulatory mechanisms.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature
575 papers in training set
Top 1%
18.6%
2
Nature Communications
4913 papers in training set
Top 13%
12.7%
3
Nature Aging
51 papers in training set
Top 0.1%
10.4%
4
Cell Systems
167 papers in training set
Top 2%
6.8%
5
Science
429 papers in training set
Top 4%
6.8%
50% of probability mass above
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 11%
6.4%
7
Nature Metabolism
56 papers in training set
Top 0.6%
3.6%
8
Science Advances
1098 papers in training set
Top 8%
3.1%
9
Nature Medicine
117 papers in training set
Top 1%
2.9%
10
Nature Genetics
240 papers in training set
Top 3%
2.1%
11
Advanced Science
249 papers in training set
Top 11%
1.7%
12
Nature Biotechnology
147 papers in training set
Top 5%
1.7%
13
Cell
370 papers in training set
Top 13%
1.5%
14
Cell Genomics
162 papers in training set
Top 4%
1.3%
15
Aging Cell
144 papers in training set
Top 2%
1.2%
16
Cell Reports
1338 papers in training set
Top 28%
1.2%
17
Nature Cell Biology
99 papers in training set
Top 4%
0.9%
18
Nature Structural & Molecular Biology
218 papers in training set
Top 5%
0.8%
19
Nature Machine Intelligence
61 papers in training set
Top 3%
0.8%
20
eLife
5422 papers in training set
Top 58%
0.7%
21
Cell Host & Microbe
113 papers in training set
Top 5%
0.7%
22
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.7%
23
Communications Biology
886 papers in training set
Top 29%
0.6%
24
Nature Chemical Biology
104 papers in training set
Top 4%
0.6%
25
Genome Biology
555 papers in training set
Top 9%
0.6%
26
Molecular Cell
308 papers in training set
Top 11%
0.6%