Back

Metatranscriptomics-Derived Disease Risk Scores as a Preventive, Diagnostic, and Treatment Support Tool

Hu, L.; Bass, M.; Patridge, E.; Molusky, M.; Antoine, G.; Vuyisich, M.; Banavar, G.

2026-06-06 genetic and genomic medicine
10.64898/2026.05.29.26354333 medRxiv
Show abstract

Background: Chronic diseases and symptom syndromes often develop after prolonged biological changes that may precede formal diagnosis. RNA-based metatranscriptomics captures active microbial and human gene expression and may provide a functional layer for disease risk evaluation. To address this translational gap, we developed and validated a Disease Risk Score (DRS) framework that integrates metatranscriptome-derived pathway activity scores from stool, saliva, and blood samples, and evaluated its potential clinical utility as an adjunct risk-evaluation tool. Methods: DRS uses disease-specific sets of pathway activity scores derived from stool and saliva microbial functions, stool and saliva microbial taxa, and blood human gene expression. For each disease, 'not optimal' pathway scores are aggregated into a normalized cumulative odds ratio, or cOR, using score-level odds ratios, statistical significance, and literature-supported biological relevance derived from a Development Cohort of 22,369 individuals. A cOR [≥] 5 is defined as high risk. Performance is evaluated in an independent Validation Cohort of 15,908 individuals using self-reported diseases as the reference. Disease support requires both significant cOR separation between self-reported and not-reported (Cohen's d [≥] 0.2) and risk ratio enrichment of self-reported disease among individuals classified as high risk (95% CI of Risk Ratio > 1). Results: Of 20 initially evaluated diseases, 15 meet the prespecified validation criteria on the independent validation cohort: ADHD, anxiety, chronic fatigue syndrome, depression, GERD, hypertension, inflammatory bowel disease, IBS-C, IBS-D, insomnia, MASLD, obesity, obstructive sleep apnea, Sjogren's syndrome, and type 2 diabetes. Five selected clinical scenarios illustrate how DRS can support clinician-mediated decision making, including IBS subtype reclassification, improved diagnostic acceptance in IBS-D, personalized lifestyle counseling in MASLD and early type 2 diabetes, and diagnostic uncertainty in atypical GERD. Conclusions: DRS is a metatranscriptomics-based risk-stratification framework that aggregates active microbial and human pathway signals into interpretable disease-specific risk estimates across a wide range of disease conditions. Validation against self-reported disease labels in an independent cohort shows significant risk enrichment for each of 15 diseases. DRS is intended as an adjunct to clinical evaluation: a decision support tool in situations where routine care encounters uncertainty, delay, or low patient engagement. Future prospective studies using clinically adjudicated endpoints are needed to assess calibration and clinical outcomes.

Matching journals

The top 11 journals account for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.4%
12.9%
2
Genome Medicine
154 papers in training set
Top 0.7%
7.4%
3
Genetics in Medicine
69 papers in training set
Top 0.3%
6.6%
4
Translational Psychiatry
219 papers in training set
Top 1%
5.0%
5
Scientific Reports
3102 papers in training set
Top 21%
5.0%
6
British Journal of Anaesthesia
14 papers in training set
Top 0.2%
3.7%
7
Med
38 papers in training set
Top 0.1%
2.1%
8
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
2.1%
9
Journal of Biomedical Informatics
45 papers in training set
Top 0.7%
1.9%
10
BMC Medicine
163 papers in training set
Top 3%
1.9%
11
eBioMedicine
130 papers in training set
Top 0.9%
1.9%
50% of probability mass above
12
PLOS ONE
4510 papers in training set
Top 49%
1.9%
13
Biological Psychiatry
119 papers in training set
Top 2%
1.7%
14
Sleep
26 papers in training set
Top 0.3%
1.7%
15
PLOS Biology
408 papers in training set
Top 11%
1.5%
16
Nature Communications
4913 papers in training set
Top 54%
1.5%
17
Nature Human Behaviour
85 papers in training set
Top 3%
1.4%
18
Journal of Translational Medicine
46 papers in training set
Top 1%
1.4%
19
Bioinformatics Advances
184 papers in training set
Top 4%
1.3%
20
Psychological Medicine
74 papers in training set
Top 1%
1.1%
21
The Lancet Digital Health
25 papers in training set
Top 0.7%
1.0%
22
Nature Medicine
117 papers in training set
Top 4%
0.9%
23
Journal of Clinical Medicine
91 papers in training set
Top 5%
0.9%
24
Journal of Allergy and Clinical Immunology
25 papers in training set
Top 0.6%
0.9%
25
Genetic Epidemiology
46 papers in training set
Top 0.7%
0.8%
26
Brain, Behavior, and Immunity
105 papers in training set
Top 2%
0.8%
27
Bioinformatics
1061 papers in training set
Top 9%
0.8%
28
Cell Genomics
162 papers in training set
Top 6%
0.8%
29
JAMA Psychiatry
13 papers in training set
Top 0.5%
0.8%
30
Molecular Psychiatry
242 papers in training set
Top 3%
0.8%