Back

Nanopore sequencing reaches amplicon sequence variant (ASV) resolution

Riisgaard-Jensen, M.; Villanelo, S. A. R.; Andersen, K. S.; Kirkegaard, R.; Hansen, S. H.; Jiang, C.; Stefansen, A. V.; Thomsen, J. H. D.; Nielsen, P. H.; Dueholm, M. K. D.

2026-02-28 bioinformatics
10.64898/2026.02.26.708165 bioRxiv
Show abstract

Sequencing of ribosomal marker genes remains a cornerstone for profiling complex microbial communities. In recent years, there has been a shift from Illumina to long-read technologies, including PacBio and Oxford Nanopore Technologies (ONT). ONT is attractive due to its low startup cost and portability; however, historically high error rates have prevented direct amplicon sequencing variant (ASV) generation from raw nanopore reads. This has forced most workflows to rely on mapping raw reads against reference databases constraining analyses to taxa covered by these. With recent improvements in ONT sequencing accuracy, we sought to challenge this view by sequencing samples of increasing complexity using primer sets targeting amplicons of different lengths, and by sequencing the exact same PCR libraries on both PacBio and ONT. We demonstrate that error-free ASVs can now be generated directly from raw nanopore reads using standard denoising algorithms originally developed for Illumina data. Current ONT read quality enables reliable reconstruction of amplicons spanning [~]250 bp to [~]4,200 bp and allows resolution of intragenomic rRNA gene variants. These results extend beyond simple mock communities to complex fecal, anaerobic digester, activated sludge, and soil samples. When sequencing depth is sufficient, ONT accurately recovers all or nearly all intra-genomic 16S rRNA gene copy variants, showing perfect sequence identity to curated reference sequences in mock communities and to ASVs inferred from PacBio data in complex communities. Across the primer sets, ONT required higher sequencing depth than PacBio to fully resolve the communities, with this requirement increasing with amplicon length. For complex samples, ONT required approximately 2-3x more reads for V4 ([~]250 bp) and V1-V3 ([~]500 bp), 4.1-5.6x more reads for V1-V8 ([~]1400 bp), and 25-42x more reads for rRNA operon (OPR) amplicons ([~]4200 bp). Consequently, sequencing complex communities with OPR primers on ONT is currently not feasible due to the unrealistically high read depth required. This study provides evidence that ONT amplicon sequencing has matured to the point where true ASV-resolved profiling is practically and economically feasible, moving ONT amplicon analysis beyond reliance on OTU clustering or reference alignment to enable application in environments lacking comprehensive reference databases. Key FindingsO_LIIt is now straightforward to generate ASVs on ONT platforms (250-4200 bp) C_LIO_LIONT can resolve intragenomic 16S rRNA gene variants C_LIO_LIASV recovery is successful in both simple and complex communities C_LI

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 12%
14.8%
2
mSphere
281 papers in training set
Top 0.1%
12.6%
3
mSystems
361 papers in training set
Top 0.9%
8.5%
4
Microbial Genomics
204 papers in training set
Top 0.2%
8.5%
5
Molecular Ecology Resources
161 papers in training set
Top 0.2%
4.4%
6
Microbiome
139 papers in training set
Top 0.7%
4.3%
50% of probability mass above
7
Microbiology Spectrum
435 papers in training set
Top 0.8%
3.6%
8
Scientific Reports
3102 papers in training set
Top 47%
2.4%
9
Nature Communications
4913 papers in training set
Top 46%
2.1%
10
BMC Bioinformatics
383 papers in training set
Top 4%
2.1%
11
Methods in Ecology and Evolution
160 papers in training set
Top 1%
1.7%
12
Environmental DNA
49 papers in training set
Top 0.2%
1.7%
13
PeerJ
261 papers in training set
Top 7%
1.7%
14
Limnology and Oceanography: Methods
11 papers in training set
Top 0.2%
1.7%
15
mBio
750 papers in training set
Top 8%
1.5%
16
Frontiers in Microbiology
375 papers in training set
Top 6%
1.5%
17
Cell Reports Methods
141 papers in training set
Top 3%
1.5%
18
PLOS Computational Biology
1633 papers in training set
Top 18%
1.3%
19
Water Research
74 papers in training set
Top 1%
1.2%
20
Environmental Microbiome
26 papers in training set
Top 0.3%
1.2%
21
Genome Biology
555 papers in training set
Top 6%
0.9%
22
BMC Genomics
328 papers in training set
Top 5%
0.8%
23
Environmental Science & Technology Letters
22 papers in training set
Top 0.3%
0.8%
24
Metabarcoding and Metagenomics
12 papers in training set
Top 0.1%
0.8%
25
GigaScience
172 papers in training set
Top 3%
0.8%
26
Frontiers in Bioinformatics
45 papers in training set
Top 0.9%
0.8%
27
Bioinformatics
1061 papers in training set
Top 10%
0.7%
28
Gigabyte
60 papers in training set
Top 2%
0.6%
29
Microbiology Resource Announcements
22 papers in training set
Top 1%
0.6%
30
Journal of Clinical Microbiology
120 papers in training set
Top 2%
0.6%