Back

Inferring the sensitivity of wastewater metagenomic sequencing for pathogen early detection

Grimm, S. L.; Kaufman, J. T.; Rice, D. P.; Whittaker, C.; Bradshaw, W. J.; McLaren, M. M.

2023-12-26 epidemiology
10.1101/2023.12.22.23300450
Show abstract

BackgroundMetagenomic sequencing of wastewater (W-MGS) can in principle detect any known or novel pathogen in a population. We quantify the sensitivity and cost of W-MGS for viral pathogen detection by jointly analysing W-MGS and epidemiological data for a range of human-infecting viruses. MethodsSequencing data from four studies were analysed to estimate the relative abundance (RA) of 11 human-infecting viruses. Corresponding prevalence and incidence estimates were obtained or calculated from academic and public-health reports. These estimates were combined using a hierarchical Bayesian model to predict RA at set prevalence or incidence values, allowing comparison across studies and viruses. These predictions were then used to estimate the sequencing depth and concomitant cost required for pathogen detection using W-MGS with or without use of a hybridization-capture enrichment panel. FindingsAfter controlling for variation in local infection rates, relative abundance varied by orders of magnitude across studies for a given virus. For instance, a local SARS-CoV-2 weekly incidence of 1% corresponds to predicted SARS-CoV-2 relative abundance ranging from 3.8 x 10-10 to 2.4 x 10-7 across studies, translating to orders-of-magnitude variation in the cost of operating a system able to detect a SARS-CoV-2-like pathogen at a given sensitivity. Use of a respiratory virus enrichment panel in two studies dramatically increased predicted relative abundance of SARS-CoV-2, lowering yearly costs by 24-to 29-fold for a system able to detect a SARS-CoV-2-like pathogen before reaching 0.01% cumulative incidence. InterpretationThe large variation in viral relative abundance after controlling for epidemiological factors indicates that other sources of inter-study variation, such as differences in sewershed hydrology and lab protocols, have a substantial impact on the sensitivity and cost of W-MGS. Well-chosen hybridization capture panels can dramatically increase sensitivity and reduce cost for viruses in the panel, but may reduce sensitivity to unknown or unexpected pathogens. FundingWellcome Trust; Open Philanthropy; Musk Foundation Research In ContextO_ST_ABSEvidence before this studyC_ST_ABSNumerous other studies have performed wastewater metagenomic sequencing (W-MGS), with a range of objectives. However, few have explicitly examined the performance of W-MGS as a monitoring tool. We searched PubMed between database inception and September 2024, using the search terms "MGS OR Metagenomic sequencing OR Metagenomics OR Shotgun sequencing" AND "Performance OR Precision OR Sensitivity OR Cost-effectiveness OR Effectiveness" AND "Virus OR Viral OR Virome" AND "Wastewater OR Sewage". Among the 88 resulting studies, 17 focused specifically on viruses in wastewater. A 2023 UK study by Child and colleagues assessed hybridization-capture and untargeted sequencing of wastewater for genomic epidemiology, concluding that the former but not the latter provided sufficient coverage for effective variant tracking. However, they did find untargeted sequencing sufficient for presence/absence calls of human pathogens in wastewater, a finding supported by numerous other W-MGS studies. While several studies examined the effect of different W-MGS protocols on viral abundance and composition, none accounted for epidemiological or study effects, and none explicitly quantified the sensitivity and cost of W-MGS for viral detection. Added value of this studyTo our knowledge, this study provides the first quantitative assessment of the sensitivity and cost of untargeted and hybridization-capture W-MGS for pathogen surveillance. Linking a large corpus of public wastewater metagenomic sequencing with epidemiological data in a Bayesian model, we predict pathogen relative abundance in W-MGS data at set infection prevalence or incidence, and estimate concomitant read-depth and cost requirements for effective detection across different studies and viruses. Our flexible modelling framework provides a valuable tool for evaluation of sequencing-based surveillance in other contexts. Implications of all the available evidenceThe sensitivity of untargeted W-MGS varies greatly with pathogen and study design, and large gaps in our understanding remain for pathogens not present in our data. As untargeted W-MGS protocols undergo further improvements, our Bayesian modelling framework is an effective tool for evaluating the sensitivity of new protocols under different epidemiological conditions. While less pathogen-agnostic, hybridization capture can dramatically increase the sensitivity of W-MGS-based pathogen monitoring, and our findings support piloting it as a tool for biosurveillance of known viruses.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Science of The Total Environment
based on 80 papers
Top 0.7%
13.0%
2
Environmental Science & Technology
based on 16 papers
Top 0.2%
7.5%
3
ACS ES&T Water
based on 11 papers
Top 0.1%
7.5%
4
mSystems
based on 16 papers
Top 0.2%
5.3%
5
Water Research
based on 26 papers
Top 0.4%
5.3%
6
Environmental Research
based on 36 papers
Top 1%
4.5%
7
Scientific Reports
based on 701 papers
Top 52%
2.9%
8
American Journal of Epidemiology
based on 54 papers
Top 2%
2.8%
9
Nature Communications
based on 483 papers
Top 22%
2.8%
50% of probability mass above
10
Epidemics
based on 96 papers
Top 4%
2.4%
11
mSphere
based on 27 papers
Top 0.4%
2.4%
12
Environmental Health Perspectives
based on 11 papers
Top 0.5%
2.4%
13
Environmental Science & Technology Letters
based on 16 papers
Top 0.6%
2.4%
14
PLOS ONE
based on 1737 papers
Top 85%
2.3%
15
Proceedings of the National Academy of Sciences
based on 100 papers
Top 7%
1.8%
16
Environment International
based on 22 papers
Top 1%
1.6%
17
One Health
based on 15 papers
Top 0.7%
1.3%
18
International Journal of Environmental Research and Public Health
based on 116 papers
Top 17%
1.3%
19
BMC Public Health
based on 148 papers
Top 19%
1.2%
20
Nature
based on 58 papers
Top 9%
0.8%
21
The Lancet Microbe
based on 33 papers
Top 4%
0.8%
22
PNAS Nexus
based on 22 papers
Top 0.8%
0.8%
23
Microbial Genomics
based on 24 papers
Top 2%
0.7%
24
PLOS Global Public Health
based on 287 papers
Top 23%
0.7%
25
BMC Infectious Diseases
based on 110 papers
Top 21%
0.7%
26
Cell Reports Medicine
based on 49 papers
Top 6%
0.7%
27
BMC Medicine
based on 155 papers
Top 26%
0.7%
28
International Journal of Epidemiology
based on 65 papers
Top 10%
0.7%