Back

Library preparation strategy critically impacts RNA virus sensitivity in clinical metagenomics

Stepniak, D.; Constantinides, B.; Weaver, M.; Treagus, S.; Wilkinson, S. A.; Quarton, S.; Behruznia, M.; Cumley, N.; Tyson, J.; McNally, A.; Loman, N. J.; Pullan, S.; Quick, J.

2026-05-21 genetic and genomic medicine
10.64898/2026.05.18.26353500 medRxiv
Show abstract

Clinical metagenomics uses sequencing for culture-independent identification of pathogens directly from clinical specimens. While a number of protocols claim to be pathogen agnostic, sensitivity for RNA viruses is likely lower than for bacteria or fungi, as it requires additional processing steps including conversion to cDNA. Sequence-independent, single-primer amplification (SISPA) was first described in 1991, yet how it preferentially enriches viral molecules has never been described. Here we propose that single-primer amplification exploits the PCR suppression effect, which selectively amplifies longer viral molecules over shorter host-derived cDNA fragments on the basis of size. This model predicts that any upstream processing step that disrupts fragment length will prevent this enrichment occurring. To test this, we systematically compared two adapter introduction strategies - during cDNA synthesis and via tagmentation - followed by single primer amplification, using the ZeptoMetrix Respiratory Panel 2.1 containing 16 RNA and 3 DNA virus strains. SISPA-based approaches recovered all of the viral genomes in the control, whereas using tagmentation to amplify cDNA recovered none. We then spiked the controls into extracted clinical samples and found that SISPA-based methods performed best in all background settings, however in high-background settings no viral genomes were recovered by any approach. Finally, using a modified SMART-9N protocol, we demonstrated that single-primer PCR is critical to overall performance, indicating that direct tagmentation of cDNA and dual-primer PCR should be avoided in protocols for clinical metagenomics where high sensitivity for RNA viruses is critical. These findings demonstrate that library preparation strategy fundamentally determines RNA virus sensitivity and offer mechanistic insights for protocol optimisation with direct relevance to clinical metagenomics.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 19%
10.0%
2
Nature Microbiology
133 papers in training set
Top 0.2%
8.3%
3
Genome Biology
555 papers in training set
Top 0.8%
7.1%
4
Microbiome
139 papers in training set
Top 0.5%
6.2%
5
BMC Genomics
328 papers in training set
Top 0.4%
4.8%
6
Scientific Reports
3102 papers in training set
Top 31%
3.9%
7
iScience
1063 papers in training set
Top 5%
3.5%
8
PLOS Biology
408 papers in training set
Top 3%
3.5%
9
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
50% of probability mass above
10
Frontiers in Microbiology
375 papers in training set
Top 3%
3.2%
11
Nucleic Acids Research
1128 papers in training set
Top 7%
2.8%
12
PLOS ONE
4510 papers in training set
Top 44%
2.7%
13
Genome Medicine
154 papers in training set
Top 3%
2.6%
14
eLife
5422 papers in training set
Top 34%
2.3%
15
Cell Reports Methods
141 papers in training set
Top 2%
1.9%
16
Open Research Europe
14 papers in training set
Top 0.1%
1.7%
17
Cancer Research Communications
46 papers in training set
Top 0.5%
1.7%
18
Microbiology Spectrum
435 papers in training set
Top 3%
1.7%
19
mSystems
361 papers in training set
Top 5%
1.6%
20
Pathogens
53 papers in training set
Top 0.7%
1.5%
21
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.3%
22
Frontiers in Bioinformatics
45 papers in training set
Top 0.7%
0.9%
23
Environmental Science & Technology
64 papers in training set
Top 2%
0.9%
24
Med
38 papers in training set
Top 0.6%
0.9%
25
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
26
GigaScience
172 papers in training set
Top 3%
0.8%
27
Genome Research
409 papers in training set
Top 4%
0.7%
28
Cell Systems
167 papers in training set
Top 12%
0.7%
29
BMC Biology
248 papers in training set
Top 5%
0.7%
30
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%