Back

Hunting for Helminths: short- and long-read shotgun metagenomics for parasite detection in faecal samples

O'Brien, K.; Elamaran, A.; Dayi, M.; Keeling, G.; Nevin, W. D.; Liu, Y.; Viney, M.; Reynolds, K.; Bishop, C.; Sripa, B.; Woubshete, M.; Sachs Nique, P.; Wright, R.; Younger, J.; Hunt, V. L.

2026-03-10 molecular biology
10.64898/2026.03.09.710549 bioRxiv
Show abstract

Soil-transmitted helminths (STHs) pose significant challenges to public health in endemic areas, necessitating reliable methods for their detection. Shotgun metagenomics enables simultaneous detection of STHs and microbes in a sample without prior knowledge of what is present. However, validation of shotgun metagenomics with known infection intensity or across different sequencing platforms has not been carried out for eukaryote parasites including STHs, and false positives remain a pervasive issue. We validated shotgun metagenomics as a method of STH detection in faecal samples. Using the Strongyloides ratti laboratory model of a STH infection we investigated how analytical methods (nucleotide-nucleotide matching, nucleotide-protein matching, marker gene detection, mitochondrial mapping), infection intensity and sequencing technology (short-read vs. long-read) affects sensitivity and specificity of detection. S. ratti was accurately detected at a standard laboratory dose, but low intensity infections were more difficult to detect. Only mitochondrial sequence mapping was 100% accurate at identifying S. ratti with no false positives. Overall, short-read outperformed long-read sequencing methods. We applied the same analytical methods to human faecal samples with confirmed infections for at least one of four STHs. Mitochondrial sequence mapping was also the most effective method for detecting STHs in human faecal samples, detecting 100% of Necator americanus and 92% of Ascaris spp. infections, but could not reliably detect STHs where DNA levels are expected to be low or variable. In conclusion, mitochondrial mapping was the most effective method of detection for sensitivity and specificity in both the laboratory system and human faecal samples. Our findings indicate that shotgun metagenomics should be approached cautiously using validated methods, particularly when infection intensity or DNA levels are expected to be low. Author SummarySoil-transmitted helminths (STH) such as the parasite Strongyloides, are important gastrointestinal parasites of humans and livestock. Accurate methods of detection for diagnostics and monitoring are important to implement suitable control and treatment strategies. Here we validate a shotgun metagenomics approach, where all DNA in a sample is sequenced, for detecting STH in faecal samples using a Strongyloides laboratory model for infection. Strongyloides was reliability detected in faecal samples at higher infection levels, but mitochondrial genome mapping of the sequences was the only analytical method that reliably detected Strongyloides at lower infections levels. These results were reflected in stool samples from humans infected with STH, where mitochondrial mapping was also the most reliable method. However, species that were associated with low levels of parasite material or DNA in the faeces including Strongyloides stercoralis, were more difficult to detect. We compared two sequencing methods: short-read Illumina and long-read Oxford Nanopore Technologies, but short-read outperformed long-read shotgun metagenomics. Contamination of bacteria sequences in parasite genome assemblies was problematic for analysis and contributed to false positive results. Future work should focus on specific targeting of eukaryote DNA either at the laboratory or bioinformatic stage to improve STH detection further.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
PLOS Neglected Tropical Diseases
378 papers in training set
Top 0.6%
18.7%
2
PLOS ONE
4510 papers in training set
Top 24%
7.2%
3
Parasites & Vectors
57 papers in training set
Top 0.2%
6.9%
4
International Journal for Parasitology
21 papers in training set
Top 0.1%
6.4%
5
Pathogens
53 papers in training set
Top 0.1%
4.4%
6
Scientific Reports
3102 papers in training set
Top 27%
4.3%
7
PeerJ
261 papers in training set
Top 2%
4.0%
50% of probability mass above
8
Malaria Journal
48 papers in training set
Top 0.5%
4.0%
9
Molecular Ecology Resources
161 papers in training set
Top 0.3%
3.6%
10
Wellcome Open Research
57 papers in training set
Top 0.3%
3.6%
11
mSphere
281 papers in training set
Top 1%
3.6%
12
Microorganisms
101 papers in training set
Top 0.3%
2.6%
13
Journal of Microbiological Methods
11 papers in training set
Top 0.2%
1.9%
14
Ecology and Evolution
232 papers in training set
Top 2%
1.8%
15
One Health
29 papers in training set
Top 0.5%
1.8%
16
Microbiology Spectrum
435 papers in training set
Top 3%
1.5%
17
Applied and Environmental Microbiology
301 papers in training set
Top 2%
1.3%
18
Acta Tropica
13 papers in training set
Top 0.4%
1.3%
19
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 5%
0.8%
20
Metabarcoding and Metagenomics
12 papers in training set
Top 0.1%
0.8%
21
The Journal of Infectious Diseases
182 papers in training set
Top 5%
0.8%
22
Peer Community Journal
254 papers in training set
Top 4%
0.7%
23
BMC Genomics
328 papers in training set
Top 6%
0.7%
24
Frontiers in Veterinary Science
30 papers in training set
Top 1.0%
0.7%
25
Frontiers in Microbiology
375 papers in training set
Top 10%
0.6%
26
Environmental DNA
49 papers in training set
Top 0.4%
0.6%
27
Nature Communications
4913 papers in training set
Top 67%
0.5%
28
Journal of Clinical Microbiology
120 papers in training set
Top 2%
0.5%
29
Microbial Genomics
204 papers in training set
Top 3%
0.5%
30
BMC Biology
248 papers in training set
Top 7%
0.5%