Back

Machine Learning to Identify Molecular Markers for Metabolic Disease Development Using Mouse Models

Yang, G.; Liu, R.; Rezaei, S.; Liu, X.; Wan, Y.-J. Y.

2023-03-12 bioinformatics
10.1101/2023.03.11.532149 bioRxiv
Show abstract

BackgroundAging, Western diet (WD) intake, and bile acid (BA) receptor farnesoid X receptor (FXR) inactivation are risk factors for metabolic disease development including nonalcoholic fatty liver disease (NAFLD) and chronic inflammation-related health issues such as dementia. The progression of the metabolic disease can be escalated when those risks are combined. Inactivation of FXR is cancer prone in both humans and mice. The current study used omics data generated within the gut-liver axis to classify those risks using bioinformatics and machine learning approaches. MethodsDifferent ages (5, 10, and 15 months) of wild-type (WT) and FXR knockout (KO) male mice were fed with either a healthy control diet (CD) or a WD since weaning. Hepatic transcripts, liver, serum, and urine metabolites, hepatic bile acids (BAs), as well as gut microbiota were used for risk prediction. A linear support vector machine with K-fold cross-validation was used for classification and feature selection. ResultsIncreased urine sucrose alone achieved 91% accuracy in predicting WD intake. Hepatic lithocholic acid (LCA) and serum pyruvate had 100% and 95% accuracy, respectively to classify age. Association analyses showed hepatic LCA was positively associated with serum concentrations of acetone, a ketone body, and 1,3-dihydroxyacetone (DHA), but negatively correlated with serum pyruvate. Urine metabolites (decreased creatinine and taurine as well as increased succinate) or gut microbiota (increased Dorea, Dehalobacterium, and Oscillospira) could predict FXR functional status with greater than 90% accuracy. Integrated pathway analyses revealed that the predictors for diet and FXR expression were implicated in the central carbon metabolism in cancer. To assess the translational relevance, mouse hepatic transcripts were crosschecked with human NAFLD and hepatocellular carcinoma (HCC) datasets. WD-affected hepatic Cyp39a1 and Gramd1b expression were associated with human HCC and NAFLD, respectively. The metabolites and diseases interaction analyses uncovered that the identified features are implicated in human metabolic diseases, mental disorders, and cancer. ConclusionThe risk prediction using mouse models contributes to the identification of noninvasive biomarkers for early diagnosis of metabolic disease development.

Matching journals

The top 12 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 17%
10.4%
2
Metabolomics
11 papers in training set
Top 0.1%
10.4%
3
The Journal of Nutrition
21 papers in training set
Top 0.1%
6.5%
4
Metabolites
50 papers in training set
Top 0.1%
5.0%
5
Scientific Reports
3102 papers in training set
Top 26%
4.5%
6
PeerJ
261 papers in training set
Top 4%
2.7%
7
BMC Bioinformatics
383 papers in training set
Top 4%
2.2%
8
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.9%
2.1%
9
Bioinformatics
1061 papers in training set
Top 7%
2.0%
10
Frontiers in Genetics
197 papers in training set
Top 4%
1.8%
11
Aging
69 papers in training set
Top 1%
1.7%
12
Current Developments in Nutrition
15 papers in training set
Top 0.6%
1.4%
50% of probability mass above
13
Obesity
19 papers in training set
Top 0.3%
1.4%
14
Life Sciences
25 papers in training set
Top 0.8%
1.3%
15
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 0.9%
1.0%
16
BMC Microbiology
35 papers in training set
Top 1%
1.0%
17
BioMed Research International
25 papers in training set
Top 2%
1.0%
18
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.0%
19
Frontiers in Medicine
113 papers in training set
Top 5%
0.9%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
0.9%
21
Journal of Proteome Research
215 papers in training set
Top 2%
0.9%
22
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
23
International Journal of Obesity
25 papers in training set
Top 0.5%
0.8%
24
Frontiers in Nutrition
23 papers in training set
Top 1%
0.8%
25
Hepatology Communications
21 papers in training set
Top 0.3%
0.8%
26
BMC Medicine
163 papers in training set
Top 6%
0.8%
27
iScience
1063 papers in training set
Top 28%
0.8%
28
Frontiers in Neuroscience
223 papers in training set
Top 7%
0.8%
29
Biomedicines
66 papers in training set
Top 3%
0.8%
30
International Journal of Molecular Sciences
453 papers in training set
Top 15%
0.8%