Back

Mapping the mammalian dark metabolome by in vivo isotope tracing

MacArthur, M. R.; Raeber, J.; Lu, W.; Qiang, H.; Schueppert, A. V.; Ayres, L. B.; Cordova, R. A.; Neinast, M. D.; Leiva, E.; Pham, V. N.; AbuSalim, J. E.; Jankowski, C. S. R.; Samarah, L. Z.; Roichman, A.; Peace, C. G.; Ivanov, D. G.; Renzo, G. L.; Oschmann, A. M.; Ayroles, J. F.; Mitchell, S. J.; Xing, X.; Olszewski, K.; Kim, H.; Rabinowitz, J.; Skinnider, M.

2026-04-02 biochemistry
10.64898/2026.03.31.713900 bioRxiv
Show abstract

Despite decades of biochemical study, a comprehensive map of the mammalian metabolome remains elusive. Mass spectrometry-based metabolomics detects thousands of small molecule-associated signals in mammalian tissues, but it is currently unclear how many of these reflect products of endogenous metabolism. Here, we leverage systematic in vivo isotope tracing to infer the biosynthetic origins of unidentified metabolites. We administered 26 different isotopically labelled nutrients to mice, measured circulating and tissue metabolite labelling by mass spectrometry, and developed a statistical framework to infer the number of carbon atoms incorporated from each of these precursors into more than 4,000 putative metabolites. We show this information can be harnessed for biosynthesis-aware structure elucidation using a multimodal AI model that co-embeds isotopic labelling patterns with chemical structures. This approach revealed several previously unrecognized families of mammalian metabolites, including cysteine-derived alkylthiazolidines, dithioacetal mercapturic acid derivatives, short-chain N-acyltaurines, acylglycyltaurines, and N-oxidized taurines. It further uncovered a family of mevalonate-derived isoprenoid metabolites that includes 2,3-dihydrofarnesoic acid, which is markedly depleted in both mouse and human aging. Age-related depletion of these isoprenoids is driven by impaired coenzyme A synthesis. Our work establishes the biosynthetic precursors for thousands of unidentified metabolites and reveals multiple previously unrecognized branches of mammalian metabolism.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 10%
14.5%
2
Cell Metabolism
49 papers in training set
Top 0.1%
14.1%
3
Nature Chemical Biology
104 papers in training set
Top 0.2%
6.7%
4
Journal of the American Chemical Society
199 papers in training set
Top 1%
6.2%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 14%
4.8%
6
Nature
575 papers in training set
Top 5%
4.8%
50% of probability mass above
7
eLife
5422 papers in training set
Top 18%
4.8%
8
Cell Reports
1338 papers in training set
Top 13%
3.9%
9
Science
429 papers in training set
Top 8%
3.6%
10
Cell
370 papers in training set
Top 6%
3.5%
11
Molecular Cell
308 papers in training set
Top 4%
3.5%
12
Science Advances
1098 papers in training set
Top 8%
3.0%
13
Nucleic Acids Research
1128 papers in training set
Top 8%
2.6%
14
Nature Genetics
240 papers in training set
Top 3%
2.3%
15
Cell Systems
167 papers in training set
Top 6%
1.9%
16
Nature Metabolism
56 papers in training set
Top 1%
1.7%
17
Aging Cell
144 papers in training set
Top 2%
1.5%
18
Nature Biotechnology
147 papers in training set
Top 6%
1.2%
19
Cell Chemical Biology
81 papers in training set
Top 2%
1.2%
20
Nature Methods
336 papers in training set
Top 5%
0.9%
21
Cell Genomics
162 papers in training set
Top 5%
0.9%
22
Advanced Science
249 papers in training set
Top 18%
0.8%
23
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
24
Nature Cancer
35 papers in training set
Top 2%
0.6%