Back

A PLUM Job: Peptide modeLs for Understanding and engineering antiMicrobial therapeutics

Banerjee, P.; Friedberg, I.; Rued, B. E.; Eulenstein, O.

2026-02-23 bioinformatics
10.64898/2026.02.21.707214 bioRxiv
Show abstract

MotivationAntibiotic-resistant infections in humans and animals are rising, creating an urgent need for new antimicrobial strategies. This challenge extends beyond clinical settings to food production systems; the Centers for Disease Control and Prevention estimates that foodborne pathogens cause over 48 million illnesses annually in the U.S. alone. Antimicrobial peptides (AMPs) are a promising alternative, with broad activity and lower risk of resistance. However, rational design remains challenging, especially when simultaneously controlling sequence, function, and peptide length. ResultsWe introduce Peptide modeLs for Understanding and engineering antiMicrobial therapeutics (PLUM), a structured conditional Variational Autoencoder for controlled AMP generation. PLUM disentangles sequence, function, and length in its latent space, enabling de novo and prototype-conditioned generation of peptides 5-35 amino acids long, allowing capture of larger functional domains. Across 45,000 generated peptides, PLUM achieved the highest AMP yield (0.885, 7% higher than HydrAMP) and increased AMP diversity (14% higher than HydrAMP), while maintaining the highest non-AMP sequence yield 0.895 (19% higher than HydrAMP). For prototype-conditioned generation, PLUM produced 37% more AMPs than HydrAMP, generating sequences that closely matched real peptide compositions with low predicted toxicity. Integrated AMP classifiers enabled robust evaluation of identity and potency across diverse bacteria. These results establish PLUM as a scalable, versatile platform for designing AMPs and next-generation therapeutics. Availabilityhttps://github.com/priyamayur/PLUM Contactpb11@iastate.edu, idoerg@iastate.edu, brued@iastate.edu, oeulen@iastate.edu

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Machine Intelligence
61 papers in training set
Top 0.1%
14.5%
2
Advanced Science
249 papers in training set
Top 1.0%
10.6%
3
Nature Communications
4913 papers in training set
Top 16%
10.6%
4
Bioinformatics
1061 papers in training set
Top 4%
4.9%
5
Nature Biotechnology
147 papers in training set
Top 2%
4.4%
6
Briefings in Bioinformatics
326 papers in training set
Top 2%
4.0%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 19%
3.7%
50% of probability mass above
8
Cell Systems
167 papers in training set
Top 4%
3.1%
9
Bioinformatics Advances
184 papers in training set
Top 2%
2.8%
10
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
2.6%
11
Nature Biomedical Engineering
42 papers in training set
Top 0.5%
2.1%
12
ACS Synthetic Biology
256 papers in training set
Top 1%
2.1%
13
Nature Methods
336 papers in training set
Top 4%
1.9%
14
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
16
Cell Reports Methods
141 papers in training set
Top 2%
1.7%
17
Nucleic Acids Research
1128 papers in training set
Top 10%
1.7%
18
Scientific Reports
3102 papers in training set
Top 62%
1.5%
19
PLOS ONE
4510 papers in training set
Top 58%
1.3%
20
Cell Reports Medicine
140 papers in training set
Top 5%
1.2%
21
Communications Biology
886 papers in training set
Top 21%
0.8%
22
Chemical Science
71 papers in training set
Top 2%
0.8%
23
mAbs
28 papers in training set
Top 0.3%
0.8%
24
Genome Medicine
154 papers in training set
Top 8%
0.7%
25
BMC Bioinformatics
383 papers in training set
Top 8%
0.7%
26
Patterns
70 papers in training set
Top 3%
0.7%
27
iScience
1063 papers in training set
Top 37%
0.7%
28
Cell Genomics
162 papers in training set
Top 8%
0.5%