Back

Categorization of prophage genes in Bacillus subtilis 168 and assessing their relative importance through RNA-seq gene expression analysis

Ng, W.

2021-10-28 bioinformatics
10.1101/2021.10.26.466030 bioRxiv
Show abstract

Bacteriophage evolves to control the population of fast-growing bacterial cells, without which explosion in bacterial population may induce unimaginable harm to diverse ecosystems. But, bacteriophage also "hide" in bacterial genomes when nutritional and environmental circumstances are unfavourable. This involves the integration of phage genome into the host genome at appropriate genomic loci in a process known as lysogeny. This work sought to delineate the prophages present in the annotated genome of Bacillus subtilis 168, and assess their relative importance through RNA-seq expression analysis. Firstly, examination of the annotated genome of the model Gram-positive bacterium revealed five distinct prophage regions: SPBeta, prophage 6, PBSX, prophage 3 region, and prophage 1 region. All prophage regions contain host genes, which suggests that host transposase activity have swapped in host genes for phage genes in the prophage genome. Given the significant number of phage genes that have been swapped into each of the prophage genome, all prophage regions are deemed to be defective. BLAST analysis further highlighted that many of the prophages in B. subtilis are extinct given that they do not have ancestral or daughter brethren. However, RNA-seq transcriptome analysis of B. subtilis turned out an interesting paradox indicative of the important role that host transposase have in swapping in host promoters for prophage genes. Specifically, a significant number of prophage genes are highly expressed, which is implausible given that phage genes should be transcriptionally silent. The result and phenomenon further suggests the relative facile nature in which host promoters could be swapped in for phage genes, which is indicative of presence of genomic motifs in prophage genome recognizable by host transposase. Existence of such sequence motifs is thus indicative of possible co-evolution of transposase and phages where transposases were originally a part of the phage genome, which latter "jumped out" into the host genome to aid the swapping in of host genes into the prophage genome for augmenting prophage genetic repertoire in the face of changing environmental conditions. Overall, it is not uncommon for bacterial species to harbour multiple prophages. But, lysogeny may not be a viable option for long-term preservation of prophage genetic repertoire given that host transposase would inevitable swap in host genes at random locations in the prophage genome.

Matching journals

The top 11 journals account for 50% of the predicted probability mass.

1
PLOS Genetics
756 papers in training set
Top 2%
6.9%
2
Scientific Reports
3102 papers in training set
Top 17%
6.4%
3
PLOS ONE
4510 papers in training set
Top 27%
6.4%
4
PeerJ
261 papers in training set
Top 1%
4.9%
5
Frontiers in Genetics
197 papers in training set
Top 1%
4.9%
6
International Journal of Molecular Sciences
453 papers in training set
Top 2%
4.3%
7
Microbial Genomics
204 papers in training set
Top 0.5%
4.3%
8
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.1%
4.0%
9
F1000Research
79 papers in training set
Top 0.4%
3.9%
10
mSystems
361 papers in training set
Top 3%
3.6%
11
BMC Genomics
328 papers in training set
Top 0.8%
3.6%
50% of probability mass above
12
Frontiers in Microbiology
375 papers in training set
Top 3%
3.6%
13
BMC Microbiology
35 papers in training set
Top 0.3%
2.5%
14
Genes
126 papers in training set
Top 0.6%
2.1%
15
Biology
43 papers in training set
Top 0.5%
1.9%
16
Genomics
60 papers in training set
Top 0.9%
1.8%
17
Environmental Microbiology
119 papers in training set
Top 2%
1.7%
18
Microorganisms
101 papers in training set
Top 0.7%
1.7%
19
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 3%
1.5%
20
Viruses
318 papers in training set
Top 3%
1.5%
21
PLOS Pathogens
721 papers in training set
Top 7%
1.3%
22
Journal of Virology
456 papers in training set
Top 3%
1.0%
23
Biosystems
18 papers in training set
Top 0.3%
0.9%
24
Access Microbiology
22 papers in training set
Top 0.5%
0.9%
25
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.8%
26
Biology Direct
10 papers in training set
Top 0.1%
0.8%
27
Genome Biology and Evolution
280 papers in training set
Top 2%
0.8%
28
DNA Research
23 papers in training set
Top 0.5%
0.7%
29
mSphere
281 papers in training set
Top 6%
0.7%
30
Life Science Alliance
263 papers in training set
Top 2%
0.6%