
Identification of Shared and Unique Key Biomarkers of Alcohol Liver Cirrhosis and Non-Alcoholic Steatohepatitis Through Machine Learning Network-Based Algorithms

Hajihosseini, M.; Talarico, F.; Zhao, C.; Campbell, S.; Udenze, D.; Hajizadeh Bastani, N.; Ahmed, M.; Ghasemi, E.; Tonoyan, L.; Guirguis, M.; Mayo, P.; Campanella, C.

2024-10-18 gastroenterology
10.1101/2024.10.17.24315623 medRxiv

Introduction: Liver fibrosis can progress to cirrhosis, liver failure, or hepatocellular carcinoma, which often requires transplantation and burdens healthcare systems around the world. Advances in single-cell RNA sequencing and machine learning have enhanced the understanding of immune responses in many liver diseases, particularly alcohol liver cirrhosis (ALC) and non-alcoholic steatohepatitis (NASH). This study aims to identify key biomarkers involved in these conditions and to assess their potential as non-invasive diagnostic tools.

Methods: Two gene expression profiles, GSE136103 and GSE115469, were used to conduct differentially expressed gene (DEG) analysis. Using the results of the DEG analysis, we then applied two machine learning network-based algorithms, master regulator analysis (MRA) and weighted key driver analysis (wKDA), to identify potential biomarker genes for NASH and ALC.

Results: A total of 1,435 and 5,074 DEGs were identified for ALC and NASH, respectively, compared to healthy controls, including 1,077 DEGs shared between the two diseases. The MRA identified HLA-DPA1, HLA-DRB1, IFI44L, ISG15, and CD74 as the potential master regulators of ALC, and HLA-DPB1, HLA-DQB1, HLA-DRB5, PFN1, and TMSB4X as the potential master regulators of NASH. In addition, wKDA indicated CD300A, FCGR2A, RGS1, HLA-DMB, and C1QA as the key drivers of ALC, and INPP5D, NCKAP1L, RAC2, PTPRC, and TYROBP as the key drivers of NASH.

Conclusion: This study presents a comprehensive framework for analyzing single-cell RNA-seq data, demonstrating the potential of combining advanced network-based machine learning techniques with conventional DEG analysis to uncover actionable prognostic markers for ALC and NASH, with potential use as target biomarkers in drug development.
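The DEG counts reported in the Results imply how many DEGs are unique to each disease. The following is a minimal arithmetic sketch, assuming (as the word "including" suggests, but the abstract does not state outright) that the 1,077 shared DEGs are counted inside both the ALC and NASH totals. The variable names are illustrative only.

```python
# Counts as reported in the abstract.
alc_degs = 1435      # DEGs for ALC vs. healthy controls
nash_degs = 5074     # DEGs for NASH vs. healthy controls
shared_degs = 1077   # DEGs found in both comparisons

# Assumption: the shared DEGs are a subset of each disease's total.
alc_only = alc_degs - shared_degs
nash_only = nash_degs - shared_degs

# Number of distinct DEGs across both diseases under that assumption.
total_distinct = alc_only + nash_only + shared_degs

print(f"ALC-only DEGs:  {alc_only}")
print(f"NASH-only DEGs: {nash_only}")
print(f"Distinct DEGs:  {total_distinct}")
```

Under that assumption, roughly three quarters of the ALC DEGs are also NASH DEGs, while most NASH DEGs are not shared with ALC.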

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1. Biomedicines | 66 papers in training set | Top 0.1% | 10.6%
2. Hepatology | 18 papers in training set | Top 0.1% | 6.9%
3. Frontiers in Pharmacology | 100 papers in training set | Top 0.4% | 6.5%
4. Journal of Translational Medicine | 46 papers in training set | Top 0.1% | 6.5%
5. PLOS ONE | 4510 papers in training set | Top 27% | 6.4%
6. Hepatology Communications | 21 papers in training set | Top 0.1% | 4.0%
7. Heliyon | 146 papers in training set | Top 0.4% | 3.6%
8. Metabolomics | 11 papers in training set | Top 0.1% | 3.6%
9. Scientific Reports | 3102 papers in training set | Top 35% | 3.6%
50% of probability mass above
10. Frontiers in Medicine | 113 papers in training set | Top 2% | 2.6%
11. BMC Medicine | 163 papers in training set | Top 2% | 2.4%
12. Frontiers in Cell and Developmental Biology | 218 papers in training set | Top 3% | 2.4%
13. Genes | 126 papers in training set | Top 0.8% | 1.8%
14. Biomolecules | 95 papers in training set | Top 0.3% | 1.8%
15. PeerJ | 261 papers in training set | Top 7% | 1.7%
16. Journal of Clinical Medicine | 91 papers in training set | Top 3% | 1.7%
17. Frontiers in Oncology | 95 papers in training set | Top 2% | 1.7%
18. Frontiers in Physiology | 93 papers in training set | Top 3% | 1.5%
19. Computers in Biology and Medicine | 120 papers in training set | Top 3% | 1.3%
20. Journal of Personalized Medicine | 28 papers in training set | Top 0.5% | 1.3%
21. Frontiers in Cellular and Infection Microbiology | 98 papers in training set | Top 4% | 1.3%
22. Genomics, Proteomics & Bioinformatics | 171 papers in training set | Top 4% | 1.3%
23. Frontiers in Genetics | 197 papers in training set | Top 7% | 1.1%
24. Gastroenterology | 40 papers in training set | Top 1% | 1.0%
25. Metabolites | 50 papers in training set | Top 0.8% | 1.0%
26. Biochemistry and Biophysics Reports | 28 papers in training set | Top 1% | 0.9%
27. PLOS Neglected Tropical Diseases | 378 papers in training set | Top 4% | 0.9%
28. Communications Medicine | 85 papers in training set | Top 0.7% | 0.9%
29. Diabetes, Obesity and Metabolism | 17 papers in training set | Top 0.5% | 0.8%
30. Medicine | 30 papers in training set | Top 2% | 0.8%
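
The "top 9 journals account for 50%" statement can be checked against the list above with a running total. The following is a minimal sketch, assuming the displayed percentages (which are rounded to one decimal place) are close enough to the underlying values; the function name `journals_to_reach` is hypothetical and not part of the site.

```python
from itertools import accumulate

# Predicted probabilities (%) for ranks 1-12, copied from the list above.
probabilities = [10.6, 6.9, 6.5, 6.5, 6.4, 4.0, 3.6, 3.6, 3.6, 2.6, 2.4, 2.4]


def journals_to_reach(probs, threshold=50.0):
    """Return how many top-ranked entries are needed for the
    running total to reach the threshold, or None if it never does."""
    for count, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return count
    return None


cutoff = journals_to_reach(probabilities)
print(f"Journals needed to reach 50%: {cutoff}")
```

With the displayed values, the first eight journals sum to about 48.1% and the ninth brings the total to about 51.7%, which agrees with the position of the 50% marker.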