Back

Predictive and Seasonal Dynamics of the Human Wastewater Virome

Vahdat, Z.; Grimm, S. L.; Gandhi, T.; Tisza, M.; Javornik-Cregeen, S.; Bel Rhali, S.; Clark, J.; Prakash, H.; Petrosino, J. F.; Ayvaz, T.; Ross, M. C.; Deegan, J.; Bauer, C.; Boerwinkle, E.; Coarfa, C.; Maresso, A. W.

2026-03-23 epidemiology
10.64898/2026.03.19.26348845 medRxiv
Show abstract

Wastewater-based epidemiology provides a scalable, noninvasive framework for population-level infectious disease monitoring, but traditional assays limit detection breadth and genomic insight. To address these constraints, we conducted targeted hybrid capture virome sequencing across 15 Texas cities over three years, from 2023 to 2025, generating [~]3 billion viral reads and identifying more than 900 strains across 374 species. Comprehensive temporal and spatial analysis revealed that the wastewater virome exhibits strong, predictable seasonal patterns, which grouped into three dominant seasonal clusters encompassing human, animal, and plant pathogens. Correlation network analysis revealed numerous positive co-occurrence patterns, including seasonal viral pairings, suggesting that the virome functions as a structured and interconnected ecological system. Leveraging this structure, we developed machine learning models using site-specific historical data to forecast individual viral species one month in advance. Of the 159 species modeled, approximately half achieved prediction performance of Pearsons Correlation Coefficient R{superscript 2} [≥] 0.50, and many exceeded R{superscript 2} [≥] 0.75. Classification models accurately inferred the month and season of sample collection (AUROC > 0.85 and > 0.95, respectively). Predictive features frequently included other viruses and temporal indicators, highlighting networked, seasonal virome dynamics. Sentinel pathogens (e.g., Norovirus, SARS-CoV-2) could be forecast accurately even with limited historical data. Together, these findings demonstrate that the wastewater virome is highly seasonal, interconnected, and forecastable, providing a foundation for proactive, metagenomics-based monitoring and early outbreak detection.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Water Research
74 papers in training set
Top 0.1%
22.2%
2
Science of The Total Environment
179 papers in training set
Top 0.7%
10.3%
3
mSystems
361 papers in training set
Top 0.8%
10.0%
4
Environmental Science & Technology
64 papers in training set
Top 0.3%
8.3%
50% of probability mass above
5
Nature Communications
4913 papers in training set
Top 36%
4.1%
6
Environmental Health Perspectives
17 papers in training set
Top 0.2%
3.5%
7
Epidemics
104 papers in training set
Top 0.7%
2.3%
8
GeoHealth
10 papers in training set
Top 0.2%
2.0%
9
ACS ES&T Water
18 papers in training set
Top 0.2%
1.9%
10
Cell
370 papers in training set
Top 12%
1.6%
11
PLOS Computational Biology
1633 papers in training set
Top 18%
1.5%
12
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.3%
13
Science Advances
1098 papers in training set
Top 23%
1.2%
14
PLOS ONE
4510 papers in training set
Top 61%
1.2%
15
Scientific Reports
3102 papers in training set
Top 67%
1.2%
16
Science China Life Sciences
26 papers in training set
Top 2%
0.9%
17
mBio
750 papers in training set
Top 10%
0.9%
18
Advanced Science
249 papers in training set
Top 17%
0.9%
19
Environmental Microbiology
119 papers in training set
Top 3%
0.9%
20
eLife
5422 papers in training set
Top 54%
0.9%
21
Nature Biotechnology
147 papers in training set
Top 7%
0.8%
22
Environment International
42 papers in training set
Top 1%
0.8%
23
Environmental Research
46 papers in training set
Top 1%
0.8%
24
Cell Reports Medicine
140 papers in training set
Top 8%
0.7%
25
FEMS Microbes
14 papers in training set
Top 0.4%
0.7%
26
Microbiology Spectrum
435 papers in training set
Top 6%
0.7%
27
Med
38 papers in training set
Top 0.9%
0.7%
28
PNAS Nexus
147 papers in training set
Top 2%
0.7%
29
The Journal of Infectious Diseases
182 papers in training set
Top 5%
0.7%
30
Applied and Environmental Microbiology
301 papers in training set
Top 3%
0.7%