Back

De novo multi-mechanism antimicrobial peptide design via multimodal deep learning

Wang, Y.; Gong, H.; Li, X.; Li, L.; Zhao, Y.; Bao, P.; Kong, Q.; Wan, B.; Zhang, Y.; Zhang, J.; Ni, J.; Han, Z.; Nan, X.; Ju, K.; Sun, L.; Chang, H.; Zheng, M.; Yu, Y.; Yang, X.; Zuo, X.; Li, Y.

2024-01-02 bioengineering
10.1101/2024.01.02.573846 bioRxiv
Show abstract

Artificial intelligence (AI)-driven discovery of antimicrobial peptides (AMPs) is yet to fully utilise their three-dimensional (3D) structural characteristics, microbial specie-specific antimicrobial activities and mechanisms. Here, we constructed a QLAPD database comprising the sequence, structures and antimicrobial properties of 12,914 AMPs. QLAPD underlies a multimodal, multitask, multilabel, and conditionally controlled AMP discovery (M3-CAD) pipeline, which is proposed for the de novo design of multi-mechanism AMPs to combat multidrug-resistant organisms (MDROs). This pipeline integrates the generation, regression, and classification modules, using a innovative 3D voxel coloring method to capture the nuanced physicochemical context of amino acids, significantly enhancing structural characterizations. QL-AMP-1, discovered by M3-CAD, which possesses four antimicrobial mechanisms, exhibited low toxicity and significant activity against MDROs. The skin wound infection model demonstrates its considerable antimicrobial effects and negligible toxicity. Altogether, integrating 3D features, specie-specific antimicrobial activities and mechanisms enhanced AI-driven AMP discovery, making the M3-CAD pipeline a viable tool for de novo AMP design.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
Advanced Science
249 papers in training set
Top 0.1%
51.6%
50% of probability mass above
2
Nature Communications
4913 papers in training set
Top 29%
6.3%
3
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.9%
6.3%
4
Angewandte Chemie International Edition
81 papers in training set
Top 0.8%
3.9%
5
Chemical Science
71 papers in training set
Top 0.7%
2.1%
6
Nucleic Acids Research
1128 papers in training set
Top 9%
1.9%
7
Nature Machine Intelligence
61 papers in training set
Top 2%
1.7%
8
ACS Nano
99 papers in training set
Top 3%
1.5%
9
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.2%
10
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
11
Cell Systems
167 papers in training set
Top 9%
1.2%
12
Communications Biology
886 papers in training set
Top 16%
1.1%
13
iScience
1063 papers in training set
Top 23%
1.1%
14
Journal of the American Chemical Society
199 papers in training set
Top 4%
0.9%
15
eLife
5422 papers in training set
Top 54%
0.9%
16
Scientific Reports
3102 papers in training set
Top 73%
0.8%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.8%
18
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.7%
19
Cell Reports Physical Science
18 papers in training set
Top 0.8%
0.7%
20
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%
21
Cell Reports
1338 papers in training set
Top 35%
0.6%
22
PLOS ONE
4510 papers in training set
Top 71%
0.6%