Back

NANOTAXI: A Shiny-Based GUI for Real-Time Classification and Analysis of 16S rRNA Nanopore Reads

Mahar, N. S.; Chouhan, K.; Gupta, I.

2026-05-20 bioinformatics
10.64898/2026.05.17.725747 bioRxiv
Show abstract

Real-time taxonomic classification of nanopore amplicon sequencing data enables rapid insights into microbial communities, with applications in clinical diagnostics, environmental monitoring, and outbreak surveillance. However, bridging the gap between long-read data and interpretable results often requires specialised bioinformatics expertise. There remains a need for integrated, user-friendly software that combines live data acquisition with downstream microbiome analysis. Here we present NANOTAXI, a fully automated Shiny-based GUI for the classification of barcoded 16S rRNA gene sequences generated by Oxford Nanopore sequencing. The platform supports four taxonomic classifiers, integrated with five reference databases, enabling flexible selection of classification strategies based on user requirements and available computational resources. In addition to real-time monitoring, NANOTAXI performs cohort-level analyses, including alpha and beta diversity, ordination, differential abundance testing, and functional inference using PICRUSt2. Validation using barcoded synthetic communities comprising pooled genomic DNA from clinically relevant bacterial species and the ZymoBIOMICS mock community demonstrated that NANOTAXI generated biologically coherent taxonomic and functional profiles. Benchmarking revealed clear trade-offs between computational performance and taxonomic specificity. Emu provided the lowest observed species-level false-positive rate, whereas Kraken2 offered the fastest classification and enabled continuous near-real-time monitoring across all tested databases. NANOTAXI is open source and freely available at https://github.com/Nirmal2310/NANOTAXI under the GPL version 3 license.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Biotechnology
147 papers in training set
Top 0.2%
18.7%
2
Nature Communications
4913 papers in training set
Top 11%
14.4%
3
Microbiome
139 papers in training set
Top 0.3%
9.2%
4
Nucleic Acids Research
1128 papers in training set
Top 3%
6.8%
5
Bioinformatics
1061 papers in training set
Top 4%
4.9%
50% of probability mass above
6
mSystems
361 papers in training set
Top 2%
4.2%
7
Genome Biology
555 papers in training set
Top 2%
4.0%
8
Nature Methods
336 papers in training set
Top 3%
2.7%
9
Cell Reports Methods
141 papers in training set
Top 1%
2.4%
10
GigaScience
172 papers in training set
Top 0.9%
2.1%
11
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
12
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
13
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
14
Scientific Reports
3102 papers in training set
Top 59%
1.7%
15
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.7%
16
PLOS ONE
4510 papers in training set
Top 58%
1.3%
17
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.3%
18
Genome Medicine
154 papers in training set
Top 6%
1.2%
19
Microbial Genomics
204 papers in training set
Top 2%
1.0%
20
Nature Microbiology
133 papers in training set
Top 4%
0.9%
21
Communications Biology
886 papers in training set
Top 19%
0.9%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.9%
23
mSphere
281 papers in training set
Top 5%
0.9%
24
Nature Protocols
30 papers in training set
Top 0.2%
0.7%
25
Advanced Science
249 papers in training set
Top 19%
0.7%
26
Cell Systems
167 papers in training set
Top 12%
0.7%
27
Patterns
70 papers in training set
Top 2%
0.7%
28
Methods in Ecology and Evolution
160 papers in training set
Top 2%
0.7%
29
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.7%
30
Nature Computational Science
50 papers in training set
Top 2%
0.7%