Back

A Deep Learning Framework for Predicting Gut Microbe-Host Receptor Interactions

li, H.; Zhao, R.; Zhu, C.; Jiang, R.; Chen, T.; li, X.; Yang, Y.

2026-03-23 microbiology
10.64898/2026.03.19.713066 bioRxiv
Show abstract

MotivationGut microbiota regulates host health through complex protein-protein interactions. However, deciphering this specific interactions between microbiota and human receptors remains a significant challenge due to the lack of specialized computational tools. ResultsLeveraging the hypothesis of cell communication and relevant data, HMI-Pred initially builds an ensemble classifier to screen for potential ligand sequences within microbial genomes. It then jointly evaluates sequence semantics and molecular docking to predict potential microbe-host receptor interactions.HMI-Pred achieved robust performance with F1-scores of 0.901 for microbial ligand identification and 0.883 for interaction prediction. Application to 332,381 microbial proteins revealed distinct interaction patterns: histone deacetylases (HDACs) served as broad-spectrum targets (mean score > 0.80), while G protein-coupled receptors (GPCRs) exhibited high specificity (scores 0.42-0.61). Furthermore, literature mining validated over 47% of the functional predictions, and specific immunomodulatory interactions were confirmed in Akkermansia muciniphila.HMI-Pred provides a valuable computational tool for decoding host-microbe signaling networks and facilitating the discovery of microbiome-based therapeutic targets. AvailabilityThe source code and documentation are available at https://github.com/YangLab-BUPT/HMI-Pred. Contactlihm@bupt.edu.cn

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Science China Life Sciences
26 papers in training set
Top 0.1%
14.4%
2
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.5%
10.4%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.3%
7.2%
4
PLOS Computational Biology
1633 papers in training set
Top 6%
6.4%
5
Cell Discovery
54 papers in training set
Top 0.9%
4.9%
6
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.9%
7
Gut Microbes
70 papers in training set
Top 0.2%
4.2%
50% of probability mass above
8
Bioinformatics
1061 papers in training set
Top 5%
3.6%
9
Advanced Science
249 papers in training set
Top 5%
3.6%
10
mSystems
361 papers in training set
Top 3%
3.1%
11
Microbiome
139 papers in training set
Top 1%
2.9%
12
Nature Communications
4913 papers in training set
Top 44%
2.6%
13
Genome Medicine
154 papers in training set
Top 4%
1.9%
14
GigaScience
172 papers in training set
Top 1%
1.7%
15
Scientific Reports
3102 papers in training set
Top 60%
1.7%
16
Frontiers in Microbiology
375 papers in training set
Top 6%
1.5%
17
Communications Biology
886 papers in training set
Top 12%
1.3%
18
Journal of Genetics and Genomics
36 papers in training set
Top 1%
1.2%
19
Bioinformatics Advances
184 papers in training set
Top 4%
0.9%
20
mBio
750 papers in training set
Top 10%
0.9%
21
iScience
1063 papers in training set
Top 27%
0.9%
22
Nucleic Acids Research
1128 papers in training set
Top 16%
0.8%
23
Genome Biology
555 papers in training set
Top 7%
0.8%
24
ISME Communications
103 papers in training set
Top 2%
0.8%
25
National Science Review
22 papers in training set
Top 2%
0.8%
26
Cell Systems
167 papers in training set
Top 12%
0.7%
27
eLife
5422 papers in training set
Top 58%
0.7%
28
Cell Reports Methods
141 papers in training set
Top 6%
0.7%
29
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 6%
0.7%
30
Patterns
70 papers in training set
Top 3%
0.6%