Deep untargeted wastewater metagenomic sequencing from sewersheds across the United States

Justen, L. J.; Rushford, C.; Hershey, O. S.; Floyd-O'Sullivan, R.; Grimm, S. L.; Bradshaw, W. J.; Bhasin, H.; Rice, D. P.; Stansifer, K.; Faraguna, J. D.; McLaren, M. R.; Zulli, A.; Tovar-Mendez, A.; Copen, E.; Shelton, K. K.; Amirali, A.; Kannoly, S.; Pesantez, S.; Stanciu, A.; Quiroga, I. C.; Silvera, L.; Greenwood, N.; Bongiovi, B.; Walkins, A.; Love, R.; Lening, S.; Patterson, K.; Johnston, T.; Hernandez, S.; Benitez, A.; McCarley, B. J.; Engelage, S.; Pillay, S.; Calender, C.; Herring, B.; Robinson, C.; Monett Wastewater Treatment Plant; Columbia Missouri Wastewater Treatment Plant

2026-03-06 public and global health
10.64898/2026.03.05.26345726 medRxiv

Wastewater monitoring enables non-invasive, population-scale tracking of community infections independent of healthcare-seeking behavior and clinical diagnosis. Metagenomic sequencing extends this capability by enabling broad, pathogen-agnostic detection, genomic characterization, and identification of novel or unexpected threats. Here, we present data from CASPER (the Coalition for Agnostic Sequencing of Pathogens from Environmental Reservoirs), a U.S.-based wastewater metagenomic sequencing network designed for deep, untargeted pathogen monitoring at national scale. This release includes 1,206 samples collected between December 2023 and December 2025 from 27 sites across nine states, covering 13 million people. Deep sequencing (~1 billion read pairs per sample) generated 1.2 trillion read pairs (357 terabases), enabling detection of even rare taxa, with CASPER representing 67% of all untargeted wastewater sequencing data currently available on the NCBI Sequence Read Archive. Virus abundance trends correlate with nationwide wastewater PCR and clinical data for SARS-CoV-2, influenza A, and respiratory syncytial virus, while the pathogen-agnostic approach captures emerging threats, including avian influenza H5N1 during initial dairy cattle outbreaks, West Nile virus, and measles, among hundreds of viral taxa. As the largest publicly available untargeted wastewater sequencing dataset to date, CASPER provides a shared and growing resource for pathogen surveillance and microbial ecology.
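The sequencing totals quoted in the abstract can be cross-checked with simple arithmetic. The sketch below uses only the rounded figures stated above (1,206 samples, 1.2 trillion read pairs, 357 terabases); the variable names are illustrative, and because the inputs are rounded the results are approximate.

```python
# Rounded figures as quoted in the abstract
samples = 1_206
total_read_pairs = 1.2e12   # 1.2 trillion read pairs
total_bases = 357e12        # 357 terabases

# Average sequencing depth per sample
pairs_per_sample = total_read_pairs / samples

# Average number of bases contributed by each read pair
bases_per_pair = total_bases / total_read_pairs

print(f"{pairs_per_sample:.2e} read pairs per sample")
print(f"{bases_per_pair:.1f} bases per read pair")
```

The per-sample depth comes out close to the stated ~1 billion read pairs, and the average of roughly 300 bases per pair would be consistent with paired reads of about 150 bases each, although the abstract does not state the read length.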

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Environmental Science & Technology | 64 papers in training set | Top 0.2% | 12.8%
2. Nature Communications | 4913 papers in training set | Top 14% | 12.4%
3. Environmental Science & Technology Letters | 22 papers in training set | Top 0.1% | 10.5%
4. Water Research | 74 papers in training set | Top 0.3% | 8.5%
5. mSystems | 361 papers in training set | Top 2% | 4.9%
6. Microbiology Resource Announcements | 22 papers in training set | Top 0.1% | 4.3%
50% of probability mass above
7. Science of The Total Environment | 179 papers in training set | Top 2% | 4.0%
8. PLOS ONE | 4510 papers in training set | Top 41% | 3.3%
9. Environmental Health Perspectives | 17 papers in training set | Top 0.2% | 2.1%
10. Nature Microbiology | 133 papers in training set | Top 2% | 1.9%
11. The Journal of Infectious Diseases | 182 papers in training set | Top 2% | 1.7%
12. Emerging Infectious Diseases | 103 papers in training set | Top 1% | 1.7%
13. Med | 38 papers in training set | Top 0.3% | 1.7%
14. mBio | 750 papers in training set | Top 8% | 1.5%
15. Nature Medicine | 117 papers in training set | Top 3% | 1.5%
16. Cell | 370 papers in training set | Top 14% | 1.2%
17. Microbiome | 139 papers in training set | Top 2% | 1.2%
18. Cell Reports Medicine | 140 papers in training set | Top 6% | 1.0%
19. Scientific Reports | 3102 papers in training set | Top 69% | 1.0%
20. Genome Medicine | 154 papers in training set | Top 7% | 0.9%
21. Microbiology Spectrum | 435 papers in training set | Top 4% | 0.9%
22. Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 41% | 0.9%
23. FEMS Microbes | 14 papers in training set | Top 0.3% | 0.9%
24. Clinical Infectious Diseases | 231 papers in training set | Top 4% | 0.8%
25. PLOS Water | 11 papers in training set | Top 0.3% | 0.8%
26. Environmental Science: Water Research & Technology | 13 papers in training set | Top 0.3% | 0.8%
27. The Lancet Microbe | 43 papers in training set | Top 1% | 0.8%
28. Applied and Environmental Microbiology | 301 papers in training set | Top 3% | 0.7%
29. Genome Biology | 555 papers in training set | Top 8% | 0.6%
30. Nature | 575 papers in training set | Top 17% | 0.6%
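
The 50% cutoff in the listing above can be reproduced by accumulating the predicted probabilities in rank order. This is a minimal sketch using the ten highest-ranked values as listed; the function name `journals_needed` is hypothetical and is not part of the page or of any tool behind it.

```python
from itertools import accumulate

# Predicted probabilities (percent) for the ten top-ranked journals, as listed
probs = [12.8, 12.4, 10.5, 8.5, 4.9, 4.3, 4.0, 3.3, 2.1, 1.9]

def journals_needed(probabilities, threshold=50.0):
    """Return the smallest rank at which the running total reaches the threshold."""
    for rank, total in enumerate(accumulate(probabilities), start=1):
        if total >= threshold:
            return rank
    return None  # threshold never reached with the values supplied

print(journals_needed(probs))
```

The running total passes 50% only at the sixth journal (the first five sum to about 49.1%), which matches the statement that the top 6 journals account for 50% of the predicted probability mass.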