Resolving eukaryotic river biofilm communities using long-read sequencing for biomonitoring

Anderson, M. A. J.; Read, D. S.; Thorpe, A. C.; Bhanu Busi, S.; Warren, J.; Walsh, K.

2026-02-20 molecular biology
10.64898/2026.02.20.706759 bioRxiv

Freshwater biofilms host diverse microbial eukaryotic communities that are central to ecosystem functioning and serve as key indicators of water quality. Molecular biomonitoring approaches based on environmental DNA (eDNA) sequencing are increasingly used to characterise these communities, offering scalable alternatives to traditional microscopy-based assessments. Understanding how DNA sequencing methods influence the observed community composition and diversity is essential for ensuring accurate ecological interpretation. Here, we compared short-read Illumina and long-read Pacific Biosciences sequencing of the 18S rRNA gene, alongside a trimmed long-read dataset (restricted to the Illumina-primed region), to evaluate how read length and sequencing platform affect community profiling in river biofilms from seven English rivers sampled across three timepoints. Distinct community patterns were observed between the sequencing approaches, with PERMANOVA revealing significant differences in beta diversity (p = 0.001) and modest effect sizes (R² = 3.8–8.3%). While the long and trimmed datasets produced nearly identical community structures, both diverged strongly from the short-read data, suggesting that short-read sequencing captures a systematically different subset of taxa than long-read sequencing. Long-read sequencing significantly improved taxonomic resolution of the 18S rRNA gene, particularly at the genus and species levels, enabling detection of lineages that were unresolvable in short-read data. However, comparisons of paired long- and trimmed-read ASVs indicated that trimming can increase taxonomic mismatches at finer ranks, likely due to reduced sequence length rather than sequencing platform bias. Collectively, our results demonstrate that sequencing strategy significantly influences inferred community composition and taxonomic precision. Long-read sequencing provides a more robust representation of community diversity, whereas trimmed analyses reveal how shorter amplicons may contribute to misidentification. These findings emphasise the importance of considering read length when interpreting eDNA-based assessments using the 18S rRNA gene and support the adoption of long-read sequencing for high-resolution biomonitoring applications.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Probability |
|---|---|---|---|---|
| 1 | Water Research | 74 | Top 0.1% | 22.2% |
| 2 | Applied and Environmental Microbiology | 301 | Top 0.2% | 10.0% |
| 3 | Environmental Science: Water Research & Technology | 13 | Top 0.1% | 6.7% |
| 4 | PLOS ONE | 4510 | Top 29% | 6.3% |
| 5 | Environmental DNA | 49 | Top 0.1% | 6.3% |
| | (50% of probability mass above) | | | |
| 6 | Molecular Ecology Resources | 161 | Top 0.2% | 6.2% |
| 7 | Molecular Ecology | 304 | Top 2% | 3.5% |
| 8 | Scientific Reports | 3102 | Top 44% | 2.7% |
| 9 | mSystems | 361 | Top 4% | 2.3% |
| 10 | Environmental Microbiology | 119 | Top 2% | 1.9% |
| 11 | Frontiers in Marine Science | 55 | Top 0.7% | 1.7% |
| 12 | Frontiers in Microbiology | 375 | Top 5% | 1.7% |
| 13 | Microbiology Spectrum | 435 | Top 3% | 1.7% |
| 14 | Environmental Microbiology Reports | 27 | Top 0.4% | 1.5% |
| 15 | Communications Earth & Environment | 14 | Top 0.6% | 1.2% |
| 16 | PeerJ | 261 | Top 10% | 1.2% |
| 17 | mSphere | 281 | Top 5% | 1.1% |
| 18 | Evolutionary Applications | 91 | Top 0.9% | 0.9% |
| 19 | Science of The Total Environment | 179 | Top 4% | 0.9% |
| 20 | mBio | 750 | Top 10% | 0.9% |
| 21 | Freshwater Biology | 11 | Top 0.1% | 0.9% |
| 22 | Environmental Science & Technology | 64 | Top 2% | 0.9% |
| 23 | Environmental Microbiome | 26 | Top 0.4% | 0.9% |
| 24 | eLife | 5422 | Top 56% | 0.8% |
| 25 | Metabarcoding and Metagenomics | 12 | Top 0.1% | 0.8% |
| 26 | ISME Communications | 103 | Top 2% | 0.7% |
| 27 | The ISME Journal | 194 | Top 3% | 0.7% |
| 28 | BMC Genomics | 328 | Top 6% | 0.7% |
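
The statement that the top 5 journals account for 50% of the predicted probability mass can be checked by accumulating the listed probabilities in rank order. The following is a minimal sketch of that arithmetic, using only the five top-ranked probabilities from the list above; the function name `rank_reaching` is illustrative and not part of any tool named in this listing.

```python
# Predicted probabilities (percent) for the five top-ranked journals,
# copied from the ranking above.
top_probabilities = [22.2, 10.0, 6.7, 6.3, 6.3]


def rank_reaching(probabilities, threshold):
    """Return (rank, running_total) at the first rank whose running total
    reaches the threshold, or (None, total) if it is never reached."""
    cumulative = 0.0
    for rank, p in enumerate(probabilities, start=1):
        cumulative += p
        if cumulative >= threshold:
            return rank, cumulative
    return None, cumulative


rank, cumulative = rank_reaching(top_probabilities, 50.0)
print(rank, round(cumulative, 1))
```

With these values the running total is 45.2% after rank 4 and first passes 50% at rank 5, which is consistent with where the "50% of probability mass above" marker sits.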