Back

ARGus: A Co-assembly workflow for MAG generation, ARG detection, and virulence analysis

Kelley, S. T.; Subramanian, N. P.; Krutkin, D. D.

2026-05-26 bioinformatics
10.64898/2026.05.22.727233 bioRxiv
Show abstract

The emergence of antibiotic resistance among pathogenic bacteria is a significant global health challenge with multidrug resistance becoming increasingly common. Moreover, since antibiotic resistance genes (ARGs) can be transferred horizontally more bacteria are rapidly evolving resistance. In addition, emerging bacterial pathogens continue to arise from a combination of urbanization, animal agriculture, global movements of people, and inadequate sewage infrastructure. Researchers have begun applying deep sequencing and shotgun metagenomics to detect known and unknown pathogenic organisms and ARGs directly from environmental samples. Here, we describe a bioinformatics workflow that uses a co-assembly approach to assemble contigs across metagenomes and bin them into high coverage metagenomic assembled genomes (MAGs), while segregating out unbinned contigs that includes mobile elements (e.g., plasmids). The workflow includes annotation of coding sequences and differential determination of ARGs and virulence factors (VF) within the sets of both MAG genome bins and unbinned contigs and allows quantification of MAG, ARG and VF abundances for ecological (alpha and beta diversity) and network analyses. Workflow analysis of metagenomic samples collected from the heavily polluted Tijuana River identified hundreds of MAGs, including many high-quality bins and many novel potential pathogens, and found the vast majority of ARG sequence matches in the unbinned contigs. A combined network analysis found strong correlations (r > 0.90) between ARGs and specific MAGs, indicating which bacterial species is likely to contain the ARG. This workflow provides a powerful approach for public health metagenomics studies of emerging pathogens and ARGs.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Microbiome
139 papers in training set
Top 0.1%
22.2%
2
mSystems
361 papers in training set
Top 0.4%
14.1%
3
PLOS ONE
4510 papers in training set
Top 29%
6.2%
4
Microbial Genomics
204 papers in training set
Top 0.6%
3.8%
5
Genome Biology
555 papers in training set
Top 2%
3.5%
6
mSphere
281 papers in training set
Top 2%
3.5%
50% of probability mass above
7
Water Research
74 papers in training set
Top 0.7%
2.7%
8
Microbiology Spectrum
435 papers in training set
Top 1%
2.7%
9
BMC Bioinformatics
383 papers in training set
Top 3%
2.6%
10
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.1%
11
Bioinformatics
1061 papers in training set
Top 7%
2.1%
12
Frontiers in Microbiology
375 papers in training set
Top 5%
1.9%
13
Scientific Reports
3102 papers in training set
Top 54%
1.9%
14
GigaScience
172 papers in training set
Top 1%
1.9%
15
Nature Communications
4913 papers in training set
Top 51%
1.8%
16
Nucleic Acids Research
1128 papers in training set
Top 10%
1.8%
17
Bioinformatics Advances
184 papers in training set
Top 3%
1.5%
18
PLOS Computational Biology
1633 papers in training set
Top 18%
1.5%
19
Cell Reports Methods
141 papers in training set
Top 3%
1.3%
20
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.2%
21
Microbiology Resource Announcements
22 papers in training set
Top 0.5%
1.2%
22
iScience
1063 papers in training set
Top 23%
1.1%
23
mBio
750 papers in training set
Top 10%
0.9%
24
Genome Medicine
154 papers in training set
Top 7%
0.9%
25
Environmental Science & Technology Letters
22 papers in training set
Top 0.4%
0.8%
26
PeerJ
261 papers in training set
Top 14%
0.8%
27
Frontiers in Bioinformatics
45 papers in training set
Top 1.0%
0.7%
28
Viruses
318 papers in training set
Top 5%
0.7%
29
Microorganisms
101 papers in training set
Top 2%
0.7%
30
FEMS Microbiology Ecology
47 papers in training set
Top 0.6%
0.7%