Back

A comprehensive computational analysis investigating the relationships between phage codon usage, infection style, and number of tRNA genes

Ross, N. D.; Doore, S. M.

2026-03-20 microbiology
10.64898/2026.03.19.712862 bioRxiv
Show abstract

It has been known for decades that bacteriophages encode tRNA genes, but their function and the factors contributing to their acquisition and retention are unclear. Although tRNAs are found in a variety of phages infecting a variety of bacteria, many large-scale computational studies investigating tRNA acquisition and retention in phages are specific to Mycobacterium phages; however, these findings may not be representative of other phages or bacteria. This work uses a broader sampling of phages and hosts to investigate the relationships between codon usage bias, infection cycle, and tRNA gene numbers in phage genomes. We analyzed 154 phages infecting 7 host genera, including Gram-negative (Escherichia, Shigella, Salmonella) and Gram-positive (Bacillus, Lactobacillus, Staphylococcus, Mycobacterium) bacteria. Phages included temperate and virulent representatives, plus a range of tRNA numbers and morphologies. All phages and hosts were analyzed using four metrics: GC content, Effective Number of Codons, Relative Synonymous Codon Usage, and tRNA Adaptation Index. On a global scale, virulent phages with many tRNA genes show greater differences in codon usage and codon adaptation compared to their respective hosts. Gram-negative bacteria and their phages generally exhibit greater differences in codon usage compared to Gram-positive bacteria and their phages. Phages infecting Gram-negative hosts also tend to encode more tRNA genes. In nearly all genus-level comparisons, Mycobacterium phages were different from any other host and from global patterns. This suggests previous computational studies performed in Mycobacterium phages are likely not applicable on a global scale or to phages infecting other host genera. AUTHOR SUMMARYBacteriophages, or phages, are viruses infecting bacteria. They are abundant in all environments, yet how they interact with their bacterial hosts is still not well-understood. Like other viruses, phages must rely on the host translational components to replicate and form new phage particles; and similarly to other parasites, phages have genomes that differ significantly from their hosts in terms of composition. In this work, we explore the relationship between phage lifestyle, number of tRNA genes encoded, and genome differences from the host using a variety of phages and their associated hosts. Phages can be either virulent (do not integrate into the host genome) or temperate (capable of integrating into the host genome), with differences from the host genome more pronounced in virulent phages. There are many phages that also carry tRNA genes, and having higher numbers of tRNAs is associated with larger differences from the host genome. The findings here indicate that virulent phages carrying large numbers of tRNAs diverge the most from host genome composition.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Viruses
318 papers in training set
Top 0.2%
14.2%
2
mSphere
281 papers in training set
Top 0.5%
6.7%
3
Microbiology
57 papers in training set
Top 0.1%
4.8%
4
mSystems
361 papers in training set
Top 2%
4.5%
5
mBio
750 papers in training set
Top 4%
4.3%
6
Virus Evolution
140 papers in training set
Top 0.3%
4.3%
7
PeerJ
261 papers in training set
Top 1%
4.3%
8
Access Microbiology
22 papers in training set
Top 0.1%
3.9%
9
F1000Research
79 papers in training set
Top 0.4%
3.9%
50% of probability mass above
10
Microbial Genomics
204 papers in training set
Top 0.6%
3.6%
11
Journal of Virology
456 papers in training set
Top 2%
3.2%
12
Genome Biology and Evolution
280 papers in training set
Top 0.6%
3.0%
13
PLOS Computational Biology
1633 papers in training set
Top 11%
3.0%
14
PLOS ONE
4510 papers in training set
Top 46%
2.4%
15
Scientific Reports
3102 papers in training set
Top 51%
2.1%
16
Virology
56 papers in training set
Top 0.3%
1.7%
17
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 5%
1.3%
18
Microorganisms
101 papers in training set
Top 1%
1.2%
19
Frontiers in Microbiology
375 papers in training set
Top 7%
1.1%
20
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 5%
0.9%
21
Journal of General Virology
46 papers in training set
Top 0.6%
0.9%
22
GigaScience
172 papers in training set
Top 2%
0.9%
23
BMC Genomics
328 papers in training set
Top 5%
0.9%
24
Microbial Pathogenesis
13 papers in training set
Top 0.5%
0.9%
25
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.9%
26
PLOS Genetics
756 papers in training set
Top 13%
0.9%
27
PLOS Biology
408 papers in training set
Top 18%
0.8%
28
Infection, Genetics and Evolution
43 papers in training set
Top 1.0%
0.7%
29
Environmental Microbiology
119 papers in training set
Top 3%
0.7%
30
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.7%