Back

RAPID: an interactive R/Shiny platform for end-to-end 16S rRNA and ITS amplicon sequence analysis using DADA2

Kapoor, B.; Cregger, M. A.; Ranjan, P.

2026-05-08 bioinformatics
10.64898/2026.05.05.723040 bioRxiv
Show abstract

MotivationAmplicon sequencing of 16S rRNA and internal transcribed spacer (ITS) gene regions is the most widely used approach for characterizing bacterial and fungal communities, respectively. The DADA2 pipeline has become a standard for inferring amplicon sequence variants (ASVs), offering single-nucleotide resolution over traditional OTU clustering. However, executing the full DADA2 workflow requires proficiency in R programming and manual coordination of multiple sequential steps, presenting a substantial barrier for researchers in clinical, environmental, and agricultural sciences who lack computational training. ResultsWe present RAPID (R-based Amplicon Pipeline for Interactive DADA2), a pair of R/Shiny applications providing complete graphical user interfaces for 16S rRNA and ITS amplicon sequence analysis. The 16S application implements a 10-step guided workflow from raw paired-end FASTQ files through quality filtering, error learning, dereplication, paired-read merging, chimera removal, taxonomy assignment (SILVA), phyloseq construction with data transformation (rarefaction, relative abundance, or CLR), interactive visualization (rarefaction curves, alpha diversity, NMDS, PCoA, taxonomic abundance), PERMANOVA, and ANCOM-BC2 differential abundance analysis. The ITS application extends this to an 11-step workflow, adding an automated primer removal step using cutadapt with support for multiple primers and length-variable amplicons, and uses the UNITE database for fungal taxonomy. Both applications feature asynchronous background processing, session persistence, real-time progress monitoring, publication-ready figure export, and comprehensive result downloads. AvailabilityRAPID is freely available at https://github.com/beantkapoor786/RAPID. Both applications can be installed locally on any system with R (version 4.0 or higher) and run as local web applications accessible through a standard browser.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 0.6%
32.8%
2
BMC Bioinformatics
383 papers in training set
Top 0.3%
18.5%
50% of probability mass above
3
GigaScience
172 papers in training set
Top 0.3%
4.3%
4
Microbial Genomics
204 papers in training set
Top 0.5%
4.1%
5
Nucleic Acids Research
1128 papers in training set
Top 6%
3.6%
6
Cell Reports Methods
141 papers in training set
Top 0.9%
3.6%
7
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.7%
3.6%
8
Nature Communications
4913 papers in training set
Top 41%
3.6%
9
Microbiome
139 papers in training set
Top 1%
2.1%
10
Genome Biology
555 papers in training set
Top 4%
2.1%
11
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
12
PLOS ONE
4510 papers in training set
Top 51%
1.8%
13
Methods in Ecology and Evolution
160 papers in training set
Top 1%
1.7%
14
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
15
Scientific Reports
3102 papers in training set
Top 64%
1.3%
16
PLOS Computational Biology
1633 papers in training set
Top 20%
1.1%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
0.9%
18
Journal of Open Source Software
22 papers in training set
Top 0.2%
0.9%
19
mSystems
361 papers in training set
Top 7%
0.8%
20
Microbiology Spectrum
435 papers in training set
Top 5%
0.7%
21
Nature Protocols
30 papers in training set
Top 0.2%
0.7%
22
mSphere
281 papers in training set
Top 6%
0.7%
23
Molecular Ecology Resources
161 papers in training set
Top 1%
0.7%
24
PeerJ
261 papers in training set
Top 17%
0.6%
25
Frontiers in Microbiology
375 papers in training set
Top 10%
0.6%