Back

Distribution of prophage-encoded Pas sRNAs across pathogenic Escherichia coli

Zhu, D. X.; Shabalina, S. A.; Storz, G.

2026-05-14 evolutionary biology
10.64898/2026.05.13.724894 bioRxiv
Show abstract

Numerous base pairing small RNAs (sRNAs), which are an integral part of regulatory networks in bacteria, are encoded on mobile genetic elements (MGEs) in pathogenic strains of Escherichia coli. These sRNAs help coordinate the expression of MGE-encoded virulence factors with core genome-encoded cellular pathways. To investigate the evolution of MGE-encoded sRNAs, we queried public databases to characterize the distribution of PasA, PasB, PasC, PasD1, and PasD2, five prophage-encoded sRNAs discovered in enteropathogenic E. coli. We find that while the Pas sRNAs are largely restricted to pathogenic lineages of Escherichia and Shigella, they exhibit diversity in sequence, genomic presence, and copy number across strains. Based on phylogenetic analysis, the Pas sRNAs originate from multiple ancestral lineages and associate with specific E. coli pathovars, consistent with horizontal acquisition followed by retention. Syntenic analysis suggests a phage origin for the Pas sRNAs, likely from Shiga-toxin encoding phages, but the sRNAs appear to have diverged substantially following their integration into bacterial chromosomes. Comparative and structural analyses further suggest that the PasA and PasC sRNAs share a common ancestor as is the case for PasD and STnc100, another prophage-encoded sRNA. These findings add to our understanding of how accessory genome-encoded sRNAs emerge and evolve.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 4%
12.2%
2
eLife
5422 papers in training set
Top 12%
6.6%
3
Molecular Biology and Evolution
488 papers in training set
Top 0.8%
6.2%
4
Nature Communications
4913 papers in training set
Top 34%
4.7%
5
PLOS Pathogens
721 papers in training set
Top 3%
4.7%
6
Science
429 papers in training set
Top 8%
4.1%
7
mBio
750 papers in training set
Top 4%
3.9%
8
Nucleic Acids Research
1128 papers in training set
Top 5%
3.9%
9
Cell Host & Microbe
113 papers in training set
Top 1%
3.9%
50% of probability mass above
10
PLOS Biology
408 papers in training set
Top 3%
3.8%
11
Current Biology
596 papers in training set
Top 6%
3.5%
12
Genetics
225 papers in training set
Top 1%
3.5%
13
PLOS Genetics
756 papers in training set
Top 5%
3.5%
14
Nature
575 papers in training set
Top 8%
2.7%
15
Science Advances
1098 papers in training set
Top 12%
2.3%
16
Cell
370 papers in training set
Top 10%
2.0%
17
Nature Ecology & Evolution
113 papers in training set
Top 2%
2.0%
18
Cell Reports
1338 papers in training set
Top 21%
2.0%
19
Molecular Cell
308 papers in training set
Top 7%
1.7%
20
Genome Biology
555 papers in training set
Top 5%
1.6%
21
mSystems
361 papers in training set
Top 5%
1.6%
22
Mobile DNA
27 papers in training set
Top 0.1%
1.6%
23
Microbiome
139 papers in training set
Top 2%
1.6%
24
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 4%
1.4%
25
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
26
Genome Biology and Evolution
280 papers in training set
Top 2%
0.8%
27
Developmental Cell
168 papers in training set
Top 12%
0.7%
28
Journal of Virology
456 papers in training set
Top 4%
0.7%
29
PLOS ONE
4510 papers in training set
Top 70%
0.7%