Back

A framework for peptide identification on commercial nanopore sequencing platforms

Beslic, D.; Kucklick, M.; Graap, E.; Sedaghatjoo, S.; Renard, B. Y.; Fuchs, S.; Engelmann, S.; Koerber, N.

2026-05-21 bioinformatics
10.64898/2026.05.19.726067 bioRxiv
Show abstract

Direct single-molecule peptide analysis could in principle enable rapid and sensitive identification of pathogen-derived or disease-associated biomarkers without reliance on mass spectrometry. However, existing nanopore peptide sensing methods are typically constrained by limited throughput and lack of accessibility beyond specialized setups. Here, we present an integrated experimental-computational framework for DNA-linked peptide translocation on a commercially available, high-throughput nanopore sequencing platform, the MinION. Synthetic peptides were covalently bound to oligonucleotides at both termini. The resulting peptide-DNA constructs were then translocated through the CsgG-CsgF pores using a DNA motor protein. Current traces were segmented using the known DNA sequences to extract peptide-associated signal regions. From these segments, we extracted signal features and trained feature-based and deep-learning classifiers to distinguish peptides, balancing interpretability and classification performance. We establish a framework for peptide identification using standard nanopore sequencing hardware. Across a diverse panel of synthetic peptides, our approach resolves single-amino-acid substitutions, maintains performance across independent sequencing runs, and correctly identifies peptides in blind mixtures. Interpretable model analyses connect classifier decisions and common errors to specific signal motifs. By combining commercially available instrumentation with a reproducible experimental and computational workflow, this framework lowers the barrier to nanopore-based proteomics and enables broader adoption across laboratories. It provides a foundation for future developments in amino acid modification detection and sequence analysis.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 15%
12.1%
2
Analytical Chemistry
205 papers in training set
Top 0.3%
9.9%
3
Journal of Proteome Research
215 papers in training set
Top 0.4%
9.9%
4
Molecular & Cellular Proteomics
158 papers in training set
Top 0.4%
6.7%
5
ACS Nano
99 papers in training set
Top 0.6%
6.3%
6
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
7
Advanced Science
249 papers in training set
Top 6%
3.5%
50% of probability mass above
8
Nano Letters
63 papers in training set
Top 0.9%
3.5%
9
Cell Systems
167 papers in training set
Top 4%
3.2%
10
Nature Methods
336 papers in training set
Top 3%
2.7%
11
PROTEOMICS
35 papers in training set
Top 0.2%
2.6%
12
Nature Machine Intelligence
61 papers in training set
Top 2%
1.7%
13
PLOS ONE
4510 papers in training set
Top 55%
1.6%
14
JACS Au
35 papers in training set
Top 0.5%
1.6%
15
Cancer Research Communications
46 papers in training set
Top 0.6%
1.5%
16
Bioinformatics
1061 papers in training set
Top 8%
1.3%
17
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.3%
1.2%
18
Cell Reports Methods
141 papers in training set
Top 3%
1.2%
19
Nature Chemical Biology
104 papers in training set
Top 3%
1.1%
20
Genome Biology
555 papers in training set
Top 6%
0.9%
21
ACS Synthetic Biology
256 papers in training set
Top 3%
0.9%
22
Communications Chemistry
39 papers in training set
Top 0.8%
0.9%
23
Small Methods
26 papers in training set
Top 0.8%
0.9%
24
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
25
Scientific Reports
3102 papers in training set
Top 73%
0.8%
26
Communications Biology
886 papers in training set
Top 22%
0.8%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 45%
0.7%
28
Journal of Molecular and Cellular Cardiology
39 papers in training set
Top 0.8%
0.7%
29
Analytica Chimica Acta
17 papers in training set
Top 0.6%
0.7%
30
Science Advances
1098 papers in training set
Top 34%
0.6%