Genomic instability and biofilm determinants in Streptococcus mutans: insights from a sequence-defined arrayed transposon library

Solano Morales, A. K.; Cazano, E.; Pirani, C.; Jones, G.; Goode, A.; Riveros Walker, A.; Sperduto, A.; Dwivedi, B.; Bantha, P.; Peter, S.; McLellan, L. K.; Alam, M. A.; Shields, R. C.

2026-03-26 microbiology
10.64898/2026.03.25.714184 bioRxiv
Streptococcus mutans is a primary architect of dental caries, utilizing complex genetic networks to build resilient, acid-producing biofilms. While pooled screens (Tn-seq) have identified important fitness factors, they often fail to capture extracellular or moderate-effect determinants due to community-level masking. Therefore, to study biofilm phenotypes, we constructed a comprehensive arrayed library of 9,216 mutants and used Cartesian Pooling-Coordinate Sequencing (CP-CSeq) to establish a sequence-defined resource covering 51% of non-essential genes. By screening the entire collection in isolation, we identified several novel biofilm determinants, including the putative metal transporter SMU_635 and the glycosylation-associated protein SMU_2160. However, systematic whole-genome sequencing (WGS) of our hits revealed an interesting level of genomic instability: 25% of biofilm-defective mutants had undergone spontaneous recombination at the gtfBC locus, while 7% had lost the TnSmu1 element, an excision rate 1,000-fold higher than previously reported. While targeted mutagenesis confirmed that TnSmu1 loss does not impact biofilm integrity, the gtfBC deletions directly accounted for the most severe phenotypes, highlighting a systemic risk of misattributing gene functions to primary transposon insertions. Our findings provide a powerful new genetic resource for the S. mutans community while establishing a critical new standard: an arrayed library is only as defined as its underlying genome, making systematic genomic verification an essential prerequisite for accurate functional genomics.

Importance

Streptococcus mutans is a major human pathogen responsible for dental caries, a global public health challenge driven in part by the organism's ability to form resilient, acidogenic biofilms. While traditional pooled genetic screens have identified many fitness factors, they often fail to capture extracellular or moderate-effect determinants because neighboring healthy bacteria can mask these defects. This work provides the scientific community with a sequence-defined arrayed mutant library, an essential resource for dissecting the individual contributions of genes to biofilm integrity in isolation. Beyond identifying well-known machinery, this study uncovers novel determinants, including the putative metal transporter SMU_635 and the putative glycosylation-associated protein SMU_2160. Crucially, the discovery of pervasive genomic instability within the library, specifically at the gtfBC and TnSmu1 loci, reveals a systemic risk in functional genomics: the potential to misattribute phenotypes to primary mutations when the underlying background has undergone large-scale rearrangements. By establishing systematic whole-genome verification as a necessary standard, this research ensures that the identification of future therapeutic targets is built upon a verified genetic foundation.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. Nature Communications: 4913 papers in training set, Top 20%, 9.8%
2. PLOS Genetics: 756 papers in training set, Top 1%, 8.8%
3. PLOS Pathogens: 721 papers in training set, Top 2%, 8.8%
4. mBio: 750 papers in training set, Top 2%, 8.1%
5. Cell: 370 papers in training set, Top 3%, 6.2%
6. Cell Host & Microbe: 113 papers in training set, Top 0.9%, 6.1%
7. PLOS Biology: 408 papers in training set, Top 1%, 6.1%

50% of probability mass above

8. eLife: 5422 papers in training set, Top 19%, 4.7%
9. Proceedings of the National Academy of Sciences: 2130 papers in training set, Top 17%, 4.2%
10. mSystems: 361 papers in training set, Top 2%, 4.0%
11. npj Biofilms and Microbiomes: 56 papers in training set, Top 0.6%, 2.8%
12. Nature Microbiology: 133 papers in training set, Top 2%, 2.4%
13. Nucleic Acids Research: 1128 papers in training set, Top 8%, 2.3%
14. Cell Reports: 1338 papers in training set, Top 21%, 2.0%
15. The ISME Journal: 194 papers in training set, Top 1%, 2.0%
16. Microbiome: 139 papers in training set, Top 2%, 1.3%
17. Genome Medicine: 154 papers in training set, Top 6%, 1.3%
18. Cell Systems: 167 papers in training set, Top 9%, 1.2%
19. Cell Genomics: 162 papers in training set, Top 5%, 1.2%
20. Science Advances: 1098 papers in training set, Top 27%, 0.9%
21. Nature Ecology & Evolution: 113 papers in training set, Top 4%, 0.8%
22. Nature: 575 papers in training set, Top 15%, 0.8%
23. Frontiers in Cellular and Infection Microbiology: 98 papers in training set, Top 6%, 0.7%
24. Microbial Genomics: 204 papers in training set, Top 3%, 0.6%
25. mSphere: 281 papers in training set, Top 7%, 0.6%
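
The 50% cutoff in the ranked list above can be checked with a running total of the listed probabilities. The following is a minimal sketch, not the site's actual method: the function name `rank_reaching` and the variable `probs` are illustrative assumptions, and only the first ten listed values are included.

```python
from itertools import accumulate

# Predicted probabilities (%) for ranks 1-10, copied from the list above.
probs = [9.8, 8.8, 8.8, 8.1, 6.2, 6.1, 6.1, 4.7, 4.2, 4.0]

def rank_reaching(values, threshold=50.0):
    """Return the 1-based rank at which the running total first reaches threshold."""
    for rank, total in enumerate(accumulate(values), start=1):
        if total >= threshold:
            return rank
    return None  # threshold never reached with the values given

print(rank_reaching(probs))
```

With the values as listed, the running total first passes 50% at rank 7, which agrees with the stated cutoff.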