Back

The elusive resistome: a global comparison reveals large discrepancies among detection pipelines

Inda-Diaz, J. S.; Adegoke, F.; Löber, U.; Jarquin-Diaz, V. H.; Duan, Y.; Bengtsson-Palme, J.; Ugarcina Perovic, S.; Coelho, L. P.

2026-05-12 bioinformatics
10.64898/2026.05.11.724158 bioRxiv
Show abstract

Identifying antibiotic resistance genes (ARGs) from metagenomic data is critical for studying antimicrobial resistance across microbial communities and pathogens. However, there is no standardized methodology for ARG annotation. Here, we compare ten commonly used ARG detection pipelines by analysing over 270 million prokaryotic genes from the Global Microbial Gene Catalogue across 13 distinct habitats. We observed up to a 45-fold difference in the number of reported ARGs, with a mean Jaccard index of only 16% between pipelines. Pipeline selection profoundly impacted downstream biological interpretations, with drastic changes to estimates of ARG relative abundance and richness, to the characterization of pan- and core-resistomes, and to the class-level composition of the inferred resistome. ARG detection pipelines make different, defensible trade-offs, and no single approach should be treated as authoritative. Therefore, users should justify and communicate choices carefully, as our analyses show that, taken uncritically, the same data can support conflicting biological and ecological interpretations.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 11%
14.2%
2
Microbiome
139 papers in training set
Top 0.1%
13.9%
3
Nature Microbiology
133 papers in training set
Top 0.1%
13.9%
4
Nucleic Acids Research
1128 papers in training set
Top 3%
6.6%
5
mSystems
361 papers in training set
Top 2%
4.2%
50% of probability mass above
6
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
7
eLife
5422 papers in training set
Top 28%
3.5%
8
Cell Systems
167 papers in training set
Top 5%
3.0%
9
mBio
750 papers in training set
Top 6%
2.6%
10
The Lancet Microbe
43 papers in training set
Top 0.5%
1.8%
11
Cell Host & Microbe
113 papers in training set
Top 3%
1.8%
12
Genome Biology
555 papers in training set
Top 5%
1.6%
13
Cell
370 papers in training set
Top 12%
1.6%
14
Genome Medicine
154 papers in training set
Top 6%
1.3%
15
mSphere
281 papers in training set
Top 4%
1.3%
16
Molecular Biology and Evolution
488 papers in training set
Top 3%
1.3%
17
Scientific Reports
3102 papers in training set
Top 67%
1.2%
18
Frontiers in Microbiology
375 papers in training set
Top 7%
0.9%
19
PLOS Computational Biology
1633 papers in training set
Top 21%
0.9%
20
Communications Biology
886 papers in training set
Top 18%
0.9%
21
Microbial Genomics
204 papers in training set
Top 2%
0.9%
22
Cell Reports
1338 papers in training set
Top 33%
0.8%
23
Gut Microbes
70 papers in training set
Top 1%
0.8%
24
PLOS Biology
408 papers in training set
Top 19%
0.8%
25
GigaScience
172 papers in training set
Top 3%
0.7%
26
ISME Communications
103 papers in training set
Top 2%
0.7%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%
28
Genome Research
409 papers in training set
Top 4%
0.7%
29
Nature
575 papers in training set
Top 16%
0.7%
30
The ISME Journal
194 papers in training set
Top 3%
0.7%