Back

PepHammer - a lightweight web-based tool for bioactive peptide matching and identification

Gronning, A. G. B.; Scheele, C.

2026-04-15 bioinformatics
10.64898/2026.04.13.718252 bioRxiv
Show abstract

Peptides are gaining increasing attention as therapeutic agents. Already, peptide-based therapeutics play a key role in the treatment of diverse diseases, including diabetes, obesity, and other complex disorders, and their clinical relevance is expected to expand further in the coming years. Technological and computational advances have substantially enriched peptidomics, massively increasing the scale and depth of peptide identification. As a result, increasingly large and information-rich datasets are now available for downstream analysis and experimental validation. However, the rapid expansion of peptidomics datasets also leads to a corresponding increase in search space, complicating the efficient identification of peptides relevant to specific biological or clinical questions. To address this challenge, we present PepHammer, a lightweight web-based tool for bioactive peptide matching and identification. PepHammer allows users to input up to 10000 peptides (2-150 amino acids in length) and compare them against extensive databases of peptides with predicted or experimentally validated bioactivities and tissue associations using Hamming distance, Grantham distance, as well as partial or exact matching strategies. Via an example study of human milk peptidomics, we demonstrate that PepHammer rapidly provides an overview of the bioactivity and tissue-relational landscape, serving as a starting point for downstream analyses. PepHammer thus enables efficient exploration of large-scale peptidomics datasets and facilitates the identification of biologically relevant peptides.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Briefings in Bioinformatics
326 papers in training set
Top 0.3%
10.4%
2
Bioinformatics
1061 papers in training set
Top 3%
10.0%
3
Journal of Proteome Research
215 papers in training set
Top 0.3%
10.0%
4
Bioinformatics Advances
184 papers in training set
Top 0.2%
8.3%
5
Nucleic Acids Research
1128 papers in training set
Top 2%
7.1%
6
Cell Reports Methods
141 papers in training set
Top 0.5%
4.8%
50% of probability mass above
7
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
4.8%
8
Advanced Science
249 papers in training set
Top 6%
3.6%
9
Nature Communications
4913 papers in training set
Top 40%
3.6%
10
Molecular & Cellular Proteomics
158 papers in training set
Top 0.8%
2.7%
11
Nature Machine Intelligence
61 papers in training set
Top 1%
2.6%
12
Nature Methods
336 papers in training set
Top 3%
2.4%
13
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.1%
14
Scientific Reports
3102 papers in training set
Top 54%
1.9%
15
Nature Biotechnology
147 papers in training set
Top 4%
1.9%
16
Metabolites
50 papers in training set
Top 0.5%
1.7%
17
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
18
BMC Bioinformatics
383 papers in training set
Top 5%
1.7%
19
PLOS ONE
4510 papers in training set
Top 57%
1.5%
20
Genome Biology
555 papers in training set
Top 6%
0.9%
21
Genome Medicine
154 papers in training set
Top 7%
0.9%
22
Communications Chemistry
39 papers in training set
Top 1.0%
0.8%
23
iScience
1063 papers in training set
Top 29%
0.8%
24
Cell Systems
167 papers in training set
Top 11%
0.8%
25
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.7%