Back

OTUs clustering should be avoided for defining oral microbiome

Regueira-Iglesias, A.; Vazquez-Gonzalez, L.; Balsa-Castro, C.; Blanco-Pintos, T.; Arce, V. M.; Carreira, M. J.; Tomas, I.

2021-08-09 bioinformatics
10.1101/2021.08.09.455616 bioRxiv
Show abstract

This in silico investigation aimed to: 1) evaluate a set of primer pairs with high coverage, including those most commonly used in the literature, to find the different oral species with 16S rRNA gene amplicon similarity/identity (ASI) values [≥]97%; and 2) identify oral species that may be erroneously clustered in the same operational taxonomic unit (OTU) and ascertain whether they belong to distinct genera or other higher taxonomic ranks. Thirty-nine primer pairs were employed to obtain amplicon sequence variants (ASVs) from the complete genomes of 186 bacterial and 135 archaeal species. For each primer, ASVs without mismatches were aligned using BLASTN and their similarity values were obtained. Finally, we selected ASVs from different species with an ASI value [≥]97% that were covered 100% by the query sequences. For each primer, the percentage of species-level coverage with no ASI[≥]97% (SC-NASI[≥]97%) was calculated. Based on the SC-NASI[≥]97% values, the best primer pairs were OP_F053-KP_R020 for bacteria (65.05%), KP_F018-KP_R002 for archaea (51.11%), and OP_F114-KP_R031 for bacteria and archaea together (52.02%). Eighty percent of the oral-bacteria and oralarchaea species shared an ASI[≥]97% with at least one other taxa, including Campylobacter, Rothia, Streptococcus, and Tannerella, which played conflicting roles in the oral microbiota. Moreover, around a quarter and a third of these two-by-two similarity relationships were between species from different bacteria and archaea genera, respectively. Furthermore, even taxa from distinct families, orders, and classes could be grouped in the same cluster. Consequently, irrespective of the primer pair used, OTUs constructed with a 97% similarity provide an inaccurate description of oral-bacterial and oral-archaeal species, greatly affecting microbial diversity parameters. As a result, clustering by OTUs impacts the credibility of the associations between some oral species and certain health and disease conditions. This limits significantly the comparability of the microbial diversity findings reported in oral microbiome literature.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Frontiers in Microbiology
375 papers in training set
Top 0.2%
12.7%
2
Scientific Reports
3102 papers in training set
Top 3%
12.7%
3
PLOS ONE
4510 papers in training set
Top 18%
10.3%
4
BMC Microbiology
35 papers in training set
Top 0.1%
7.3%
5
PeerJ
261 papers in training set
Top 1%
4.4%
6
mSystems
361 papers in training set
Top 3%
3.7%
50% of probability mass above
7
mSphere
281 papers in training set
Top 1%
3.7%
8
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 1%
3.7%
9
Microbiology Spectrum
435 papers in training set
Top 1%
2.6%
10
Microorganisms
101 papers in training set
Top 0.3%
2.5%
11
Microbiome
139 papers in training set
Top 1%
2.4%
12
npj Biofilms and Microbiomes
56 papers in training set
Top 0.8%
2.1%
13
Antibiotics
32 papers in training set
Top 0.7%
1.7%
14
Environmental Microbiome
26 papers in training set
Top 0.3%
1.2%
15
International Journal of Molecular Sciences
453 papers in training set
Top 12%
1.0%
16
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
0.9%
17
Microbial Genomics
204 papers in training set
Top 2%
0.8%
18
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
19
Science of The Total Environment
179 papers in training set
Top 4%
0.8%
20
Water Research
74 papers in training set
Top 1%
0.8%
21
Scientific Data
174 papers in training set
Top 2%
0.8%
22
BioTechniques
24 papers in training set
Top 0.3%
0.7%
23
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
24
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
25
Applied Sciences
24 papers in training set
Top 1%
0.7%
26
Communications Biology
886 papers in training set
Top 28%
0.7%
27
Microbiology Resource Announcements
22 papers in training set
Top 1%
0.7%
28
Journal of Medical Virology
137 papers in training set
Top 5%
0.7%
29
Nature Communications
4913 papers in training set
Top 67%
0.5%
30
International Journal of Environmental Research and Public Health
124 papers in training set
Top 8%
0.5%