Back

Comprehensive metabolite ratio QTL mapping reveals disease relevant enzyme biology

Rizi, S.; Goss, N.; Kutalik, Z.; van der Graaf, A.

2025-12-05 epidemiology
10.64898/2025.12.04.25341616 medRxiv
Show abstract

Metabolite ratios are valuable proxies for enzyme- and pathway activity, which are otherwise hard to capture. The genetic basis of metabolite ratios has not been systematically studied as it implies expanding the search space quadratically, leading to increased computational burden. Here, we present efficient statistical methodology to identify ratio quantitative trait loci (rQTLs), requiring only association summary statistics of metabolite measurements. In validations the methodology shows strong correlation with classically estimated ratios (median R2 = 0.94). Across all pairwise metabolite comparisons, 5,095 metabolite pairs contain one or more significant rQTL that exhibit far stronger associations than their constituents. The genes to which these rQTLs map are strongly enriched for enzymes (Odds ratio ranges between 4.3 and 20, depending on gene mapping strategy). Furthermore, metabolites whose ratios have rQTLs have shorter reaction distance (median=4) compared to random pairs of metabolites (median=6) (P = 8.0{middle dot}10-13). We identified many otherwise missed loci: 1,249 rQTLs across 53 independent loci were novel, meaning that the individual metabolites did not pass significance in the source study, highlighting that our methodology increases the number of QTLs by 21%. rQTLs often reveal enzyme activity: capturing long-chain polyunsaturated fatty acid desaturation at the FADS2 locus (177 metabolites across 1,258 rQTLs), as well as elongation through ELOVL2 and ELOVL5 (23 metabolites across 27 rQTLs). Importantly, 72% of genes mapped to rQTLs were not available in pQTL studies. Furthermore, tissue-specific eQTLs confirmed that some blood rQTL associations (e.g. ones mapped to ETFDH in muscle tissue and SCD in adipose tissue) can capture processes taking place in other tissues. We further identified metabolite ratios that are likely causal biomarkers for malignant bladder neoplasms and ischaemic heart disease. Finally, we identified a novel rQTL for the cAMP-to-PFOS ratio, mapping to a mis-sense variant in ABCG2, suggesting that ABCG2 is involved in the excretion of PFOS in humans. In summary, our method is able to systematically map rQTLs which can serve as key disease biomarkers, proxy for unmeasured proteins and identify novel biology. We offer an interactive browser to explore the rQTL and metabolite ratios identified in this study: metabolite-ratio-app.athirtyone.com/

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Metabolites
50 papers in training set
Top 0.1%
26.9%
2
Nature Communications
4913 papers in training set
Top 16%
10.9%
3
eLife
5422 papers in training set
Top 10%
7.1%
4
PLOS Computational Biology
1633 papers in training set
Top 6%
5.0%
5
Scientific Reports
3102 papers in training set
Top 21%
5.0%
50% of probability mass above
6
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.5%
4.1%
7
Bioinformatics
1061 papers in training set
Top 5%
4.1%
8
Cell Reports Methods
141 papers in training set
Top 0.7%
3.7%
9
Genome Medicine
154 papers in training set
Top 3%
3.0%
10
EMBO Molecular Medicine
85 papers in training set
Top 0.8%
2.8%
11
eBioMedicine
130 papers in training set
Top 0.7%
2.2%
12
The American Journal of Human Genetics
206 papers in training set
Top 2%
2.0%
13
Molecular Systems Biology
142 papers in training set
Top 0.5%
1.9%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.5%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
16
iScience
1063 papers in training set
Top 21%
1.3%
17
PLOS Genetics
756 papers in training set
Top 12%
1.0%
18
Communications Biology
886 papers in training set
Top 18%
0.9%
19
Genome Biology
555 papers in training set
Top 6%
0.9%
20
International Journal of Epidemiology
74 papers in training set
Top 2%
0.9%
21
Genetic Epidemiology
46 papers in training set
Top 0.7%
0.8%
22
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
23
Nature
575 papers in training set
Top 17%
0.7%
24
PLOS ONE
4510 papers in training set
Top 72%
0.5%
25
BMC Genomics
328 papers in training set
Top 7%
0.5%
26
Nature Biotechnology
147 papers in training set
Top 9%
0.5%
27
Genome Research
409 papers in training set
Top 5%
0.5%