How many phage species remain undiscovered? Species sampling approaches to inform phage discovery

Cavallaro, M.; Kinsella, A.; Megremis, S.; Morozov, A.; Millard, A. D.; Freund, F.

2026-02-17 genomics
10.64898/2026.02.15.704868 bioRxiv

The emergence of antimicrobial-resistant bacteria has been identified as one of the most serious public health and development threats for the near future. The use of bacteriophages (phages), the natural viral predators of bacterial pathogens, is a promising solution for the sustainable control of these pathogens. However, due to the variability and adaptability of bacteria, developing effective and sustainable phage treatments requires drawing from a wide variety of different phage species. This study applies specialised mathematical and computational estimation approaches to the problem of sampling and discovering species of phages in microbiological communities. We show that classical non-parametric estimators give robust results and, for existing phage data settings, outperform model-based approaches. We then show how efficient the continuation of current phage collection and isolation effort is expected to be for discovering new phage species in various relevant bacterial host genera, a prerequisite for phage applications to provide sustainable control of pathogens in human and environmental settings. Our results have the potential to inform and optimise the hunt for and isolation of novel phages from the natural environment.
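The abstract does not name the non-parametric estimators that were compared. As an illustration only, the sketch below implements the bias-corrected Chao1 estimator, one widely used classical non-parametric estimator of species richness. It is not taken from the paper, and the function name `chao1` and the example counts are hypothetical.

```python
def chao1(abundances):
    """Bias-corrected Chao1 estimate of total species richness.

    `abundances` holds one count per species: the number of sampled
    individuals (here, hypothetically, isolated phage genomes) assigned
    to that species. Zero counts are ignored.
    """
    counts = [c for c in abundances if c > 0]
    s_obs = len(counts)                      # species observed at least once
    f1 = sum(1 for c in counts if c == 1)    # singletons: seen exactly once
    f2 = sum(1 for c in counts if c == 2)    # doubletons: seen exactly twice
    # Many singletons relative to doubletons suggest that many species
    # are still unseen, so the correction term grows with f1 and shrinks with f2.
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


# Hypothetical sample: 6 observed species, 3 singletons, 2 doubletons.
estimate = chao1([1, 1, 1, 2, 2, 5])
print(estimate)
```

The difference between the estimate and the number of observed species gives a rough lower bound on how many species remain undiscovered in the sampled community, under the assumptions of the estimator.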

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1. PLOS Computational Biology: 1633 papers in training set, Top 0.2%, 33.4%
2. Journal of Theoretical Biology: 144 papers in training set, Top 0.1%, 10.2%
3. PLOS ONE: 4510 papers in training set, Top 27%, 6.4%
4. Royal Society Open Science: 193 papers in training set, Top 0.5%, 4.0%

50% of probability mass above

5. mSystems: 361 papers in training set, Top 3%, 2.8%
6. Physical Review E: 95 papers in training set, Top 0.4%, 2.8%
7. Scientific Reports: 3102 papers in training set, Top 45%, 2.6%
8. Journal of The Royal Society Interface: 189 papers in training set, Top 2%, 2.1%
9. Microbial Genomics: 204 papers in training set, Top 0.9%, 2.1%
10. Frontiers in Genetics: 197 papers in training set, Top 5%, 1.7%
11. Evolutionary Applications: 91 papers in training set, Top 0.6%, 1.7%
12. PeerJ: 261 papers in training set, Top 7%, 1.7%
13. Antibiotics: 32 papers in training set, Top 0.8%, 1.5%
14. BMC Bioinformatics: 383 papers in training set, Top 5%, 1.3%
15. Viruses: 318 papers in training set, Top 4%, 1.1%
16. Philosophical Transactions of the Royal Society B: 51 papers in training set, Top 4%, 1.0%
17. Frontiers in Microbiology: 375 papers in training set, Top 8%, 0.8%
18. Ecology and Evolution: 232 papers in training set, Top 4%, 0.8%
19. Microbiology: 57 papers in training set, Top 1%, 0.8%
20. Computational and Structural Biotechnology Journal: 216 papers in training set, Top 10%, 0.7%
21. BMC Genomics: 328 papers in training set, Top 6%, 0.7%
22. F1000Research: 79 papers in training set, Top 6%, 0.7%
23. G3 Genes|Genomes|Genetics: 351 papers in training set, Top 3%, 0.5%
24. Environmental Science & Technology: 64 papers in training set, Top 3%, 0.5%
25. Journal of Bioinformatics and Systems Biology: 14 papers in training set, Top 1%, 0.5%
26. PLOS Global Public Health: 293 papers in training set, Top 7%, 0.5%
27. Bioinformatics: 1061 papers in training set, Top 11%, 0.5%