Back

Large-scale insights into the biosynthetic potential of the Bacillus cereus group

Blom, J.; Wambui, J.; Gourle, H.; Larralde, M.; Ramnath, V.; Henriksson, J.; Carroll, L. M.

2025-10-16 microbiology
10.1101/2025.10.16.682773 bioRxiv
Show abstract

Bacterial secondary metabolites (SMs) are a critical source of natural product-derived drugs. However, SM discovery efforts have focused overwhelmingly on Actinomycetes, potentially overlooking other key producers. Here, we explore the biosynthetic potential of the Bacillus cereus group, an underexplored complex of SM producers. Using a combined rule- and machine learning-based approach, we mine an unprecedented number of B. cereus group genomes (n = 9,744) for SM-producing biosynthetic gene clusters (BGCs; n = 200,196). Notably, 158,678 B. cereus group BGCs (78.2%) did not cluster with previously described BGCs, suggesting new chemical scaffolds to be explored. B. pseudomycoides was particularly prolific in terms of its SM production potential (30.8 BGC families/genome, Kruskal-Wallis p < 0.0001), and we identify a previously uncharacterized, B. pseudomycoides-unique peptide. Overall, our study represents the largest survey of B. cereus group biosynthetic potential to date and posits the complex as an under-queried SM resource.

Matching journals

The top 11 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 25%
7.2%
2
Journal of Natural Products
11 papers in training set
Top 0.1%
6.9%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 9%
6.9%
4
eLife
5422 papers in training set
Top 13%
6.4%
5
Cell Chemical Biology
81 papers in training set
Top 0.6%
4.0%
6
Scientific Reports
3102 papers in training set
Top 32%
3.9%
7
Journal of the American Chemical Society
199 papers in training set
Top 2%
3.6%
8
Chemical Science
71 papers in training set
Top 0.3%
3.6%
9
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
3.6%
10
Advanced Science
249 papers in training set
Top 7%
2.6%
11
Science
429 papers in training set
Top 12%
2.1%
50% of probability mass above
12
Angewandte Chemie International Edition
81 papers in training set
Top 2%
1.9%
13
Cell Discovery
54 papers in training set
Top 3%
1.7%
14
Metabolites
50 papers in training set
Top 0.5%
1.7%
15
Cell Host & Microbe
113 papers in training set
Top 3%
1.7%
16
iScience
1063 papers in training set
Top 14%
1.7%
17
Nature
575 papers in training set
Top 10%
1.7%
18
Communications Biology
886 papers in training set
Top 9%
1.7%
19
ACS Chemical Biology
150 papers in training set
Top 1%
1.5%
20
PLOS ONE
4510 papers in training set
Top 57%
1.5%
21
Nature Chemistry
34 papers in training set
Top 0.5%
1.3%
22
Cell Reports Medicine
140 papers in training set
Top 5%
1.3%
23
Cell
370 papers in training set
Top 14%
1.2%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.0%
25
mSystems
361 papers in training set
Top 6%
1.0%
26
Communications Chemistry
39 papers in training set
Top 0.9%
0.8%
27
ACS Synthetic Biology
256 papers in training set
Top 3%
0.8%
28
RSC Advances
18 papers in training set
Top 1%
0.8%
29
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
30
mBio
750 papers in training set
Top 11%
0.8%