Back

NAP: an open-source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data

Jones, L. B.; Bagby, S.

2026-05-26 bioinformatics
10.64898/2026.05.22.727110 bioRxiv
Show abstract

BackgroundNanopore sequencing offers a cost-effective and portable platform for microbiome analysis, but amplicon-based approaches remain limited by higher sequencing error rates and a lack of workflows tailored to mixed domain ribosomal RNA profiling. While short-read technologies dominate microbial community analysis, their portability and flexibility are constrained. There is therefore a need for robust pipelines designed specifically for cross-domain Nanopore amplicon data. ResultsWe introduce the Nanopore sequencing-based Amplicon Pipeline (NAP; https://github.com/Luke-B-Jones/NAP), an open-source workflow optimised for flexible mixed domain primer sets such as 515Y/926R. NAP performs adaptive quality filtering, chimera removal, centroid generation, BLAST-based taxonomic classification, hierarchical consensus correction, and domain-aware post-processing, outputting decontaminated abundance tables suitable for downstream analysis. Initial validation against two complementary commercial mock communities showed that NAP achieved strong genus-level performance across both low complexity logarithmic and more compositionally complex gut mock communities. Detection was most reliable above ca. 1% relative abundance, and replicate outputs showed strong agreement with expected composition under Bray-Curtis, Jaccard, agreement-plot, and Bland-Altman analyses. Benchmarking of NAPs internal filtering modes showed that the default adaptive setting provided the most robust balance of read quality, retained depth, and downstream taxonomic fidelity across heterogeneous inputs. Direct comparison against QIIME2 and Kraken2/Bracken further showed that NAP most accurately preserved expected community structure, with markedly fewer false positive assignments at genus level and substantially stronger species-level behaviour under the tested conditions. Species-level assignments were informative for some taxa, but remained less robust than genus-level outputs with the default V4-V5 amplicon. ConclusionsNAP provides a robust and flexible workflow for cross-domain Nanopore amplicon profiling, with strongest performance at genus level and competitive species-level behaviour for well resolved taxa. Although analysis of field-derived data was not assessed here, NAP compatibility with portable Nanopore sequencing supports accurate mixed domain microbiome profiling under the tested conditions.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Microbiome
139 papers in training set
Top 0.3%
8.5%
2
Bioinformatics
1061 papers in training set
Top 4%
6.9%
3
Microbial Genomics
204 papers in training set
Top 0.3%
6.9%
4
Nature Communications
4913 papers in training set
Top 26%
6.9%
5
mSystems
361 papers in training set
Top 1%
6.4%
6
Methods in Ecology and Evolution
160 papers in training set
Top 0.6%
6.4%
7
BMC Bioinformatics
383 papers in training set
Top 2%
6.4%
8
mSphere
281 papers in training set
Top 0.8%
4.9%
50% of probability mass above
9
PLOS ONE
4510 papers in training set
Top 35%
4.0%
10
Scientific Reports
3102 papers in training set
Top 40%
3.1%
11
Microbiology Spectrum
435 papers in training set
Top 1%
2.9%
12
Nucleic Acids Research
1128 papers in training set
Top 9%
2.1%
13
GigaScience
172 papers in training set
Top 1%
1.9%
14
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.8%
15
Cell Reports Methods
141 papers in training set
Top 2%
1.7%
16
PLOS Computational Biology
1633 papers in training set
Top 18%
1.5%
17
BMC Microbiology
35 papers in training set
Top 0.8%
1.3%
18
Genome Biology
555 papers in training set
Top 5%
1.3%
19
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.0%
20
Microbiology Resource Announcements
22 papers in training set
Top 0.6%
0.9%
21
Nature Biotechnology
147 papers in training set
Top 7%
0.8%
22
Molecular Ecology Resources
161 papers in training set
Top 1.0%
0.8%
23
PeerJ
261 papers in training set
Top 13%
0.8%
24
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
25
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
26
Frontiers in Microbiology
375 papers in training set
Top 8%
0.8%
27
Scientific Data
174 papers in training set
Top 2%
0.8%
28
Microorganisms
101 papers in training set
Top 2%
0.7%
29
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.7%
30
Wellcome Open Research
57 papers in training set
Top 3%
0.7%