Back

Inferring the sensitivity of wastewater metagenomic sequencing for pathogen early detection

Grimm, S. L.; Kaufman, J. T.; Rice, D. P.; Whittaker, C.; Bradshaw, W. J.; McLaren, M. M.

2023-12-26 epidemiology
10.1101/2023.12.22.23300450 medRxiv
Show abstract

BackgroundMetagenomic sequencing of wastewater (W-MGS) can in principle detect any known or novel pathogen in a population. We quantify the sensitivity and cost of W-MGS for viral pathogen detection by jointly analysing W-MGS and epidemiological data for a range of human-infecting viruses. MethodsSequencing data from four studies were analysed to estimate the relative abundance (RA) of 11 human-infecting viruses. Corresponding prevalence and incidence estimates were obtained or calculated from academic and public-health reports. These estimates were combined using a hierarchical Bayesian model to predict RA at set prevalence or incidence values, allowing comparison across studies and viruses. These predictions were then used to estimate the sequencing depth and concomitant cost required for pathogen detection using W-MGS with or without use of a hybridization-capture enrichment panel. FindingsAfter controlling for variation in local infection rates, relative abundance varied by orders of magnitude across studies for a given virus. For instance, a local SARS-CoV-2 weekly incidence of 1% corresponds to predicted SARS-CoV-2 relative abundance ranging from 3.8 x 10-10 to 2.4 x 10-7 across studies, translating to orders-of-magnitude variation in the cost of operating a system able to detect a SARS-CoV-2-like pathogen at a given sensitivity. Use of a respiratory virus enrichment panel in two studies dramatically increased predicted relative abundance of SARS-CoV-2, lowering yearly costs by 24-to 29-fold for a system able to detect a SARS-CoV-2-like pathogen before reaching 0.01% cumulative incidence. InterpretationThe large variation in viral relative abundance after controlling for epidemiological factors indicates that other sources of inter-study variation, such as differences in sewershed hydrology and lab protocols, have a substantial impact on the sensitivity and cost of W-MGS. Well-chosen hybridization capture panels can dramatically increase sensitivity and reduce cost for viruses in the panel, but may reduce sensitivity to unknown or unexpected pathogens. FundingWellcome Trust; Open Philanthropy; Musk Foundation Research In ContextO_ST_ABSEvidence before this studyC_ST_ABSNumerous other studies have performed wastewater metagenomic sequencing (W-MGS), with a range of objectives. However, few have explicitly examined the performance of W-MGS as a monitoring tool. We searched PubMed between database inception and September 2024, using the search terms "MGS OR Metagenomic sequencing OR Metagenomics OR Shotgun sequencing" AND "Performance OR Precision OR Sensitivity OR Cost-effectiveness OR Effectiveness" AND "Virus OR Viral OR Virome" AND "Wastewater OR Sewage". Among the 88 resulting studies, 17 focused specifically on viruses in wastewater. A 2023 UK study by Child and colleagues assessed hybridization-capture and untargeted sequencing of wastewater for genomic epidemiology, concluding that the former but not the latter provided sufficient coverage for effective variant tracking. However, they did find untargeted sequencing sufficient for presence/absence calls of human pathogens in wastewater, a finding supported by numerous other W-MGS studies. While several studies examined the effect of different W-MGS protocols on viral abundance and composition, none accounted for epidemiological or study effects, and none explicitly quantified the sensitivity and cost of W-MGS for viral detection. Added value of this studyTo our knowledge, this study provides the first quantitative assessment of the sensitivity and cost of untargeted and hybridization-capture W-MGS for pathogen surveillance. Linking a large corpus of public wastewater metagenomic sequencing with epidemiological data in a Bayesian model, we predict pathogen relative abundance in W-MGS data at set infection prevalence or incidence, and estimate concomitant read-depth and cost requirements for effective detection across different studies and viruses. Our flexible modelling framework provides a valuable tool for evaluation of sequencing-based surveillance in other contexts. Implications of all the available evidenceThe sensitivity of untargeted W-MGS varies greatly with pathogen and study design, and large gaps in our understanding remain for pathogens not present in our data. As untargeted W-MGS protocols undergo further improvements, our Bayesian modelling framework is an effective tool for evaluating the sensitivity of new protocols under different epidemiological conditions. While less pathogen-agnostic, hybridization capture can dramatically increase the sensitivity of W-MGS-based pathogen monitoring, and our findings support piloting it as a tool for biosurveillance of known viruses.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
ACS ES&T Water
18 papers in training set
Top 0.1%
15.1%
2
Science of The Total Environment
179 papers in training set
Top 0.6%
10.7%
3
Environmental Health Perspectives
17 papers in training set
Top 0.1%
8.6%
4
Environmental Science: Water Research & Technology
13 papers in training set
Top 0.1%
7.4%
5
Environmental Science & Technology
64 papers in training set
Top 0.4%
7.0%
6
Water Research
74 papers in training set
Top 0.4%
5.0%
50% of probability mass above
7
mSystems
361 papers in training set
Top 2%
5.0%
8
FEMS Microbes
14 papers in training set
Top 0.1%
4.5%
9
PLOS Water
11 papers in training set
Top 0.1%
3.7%
10
Environment International
42 papers in training set
Top 0.5%
3.0%
11
PLOS ONE
4510 papers in training set
Top 43%
3.0%
12
Infection Control & Hospital Epidemiology
17 papers in training set
Top 0.1%
2.4%
13
GeoHealth
10 papers in training set
Top 0.2%
1.8%
14
Nature Communications
4913 papers in training set
Top 54%
1.4%
15
Environmental Science & Technology Letters
22 papers in training set
Top 0.2%
1.3%
16
Scientific Reports
3102 papers in training set
Top 68%
1.1%
17
International Journal of Environmental Research and Public Health
124 papers in training set
Top 5%
1.0%
18
Emerging Infectious Diseases
103 papers in training set
Top 2%
1.0%
19
Epidemics
104 papers in training set
Top 1%
0.9%
20
Environmental Research
46 papers in training set
Top 1%
0.9%
21
One Health
29 papers in training set
Top 1.0%
0.9%
22
The Lancet Microbe
43 papers in training set
Top 1%
0.8%
23
Microbial Genomics
204 papers in training set
Top 2%
0.8%
24
Applied and Environmental Microbiology
301 papers in training set
Top 3%
0.8%
25
PeerJ
261 papers in training set
Top 17%
0.7%
26
Chemosphere
15 papers in training set
Top 0.6%
0.7%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.7%
28
BMC Public Health
147 papers in training set
Top 7%
0.5%
29
The Journal of Infectious Diseases
182 papers in training set
Top 6%
0.5%
30
Environmental Pollution
35 papers in training set
Top 3%
0.5%