Back

Extreme GC3 Codon Bias in a Novel Brown Seaweed Virus Results in Pseudoambigrammatic Characteristics

Dekker, R. J.; de Leeuw, W. C.; Rauwerda, H.; van Olst, M.; Ensink, W. A.; van Leeuwen, S.; Cohen, J.; Timmermans, K. R.; Breit, T. M.; Jonker, M. J.

2025-03-17 genomics
10.1101/2025.03.14.643323 bioRxiv
Show abstract

Although viruses are often studied in relation to disease, they can also offer valuable insights into RNA functionality. Due to their vast range of hosts and high mutation rates, viruses can quickly adapt to changing environments. It is therefore expected that viruses exploit all possible RNA functionalities to develop strategies for host infection and propagation. Our aim is to discover RNA viruses with survival strategies that reveal unknown RNA characteristics and functionalities. We have chosen seaweed as host organism because relatively little is known about RNA viruses in seaweed. Additionally, seaweeds are evolutionary distant from terrestrial plants and inhabit environments with entirely different conditions, increasing the likelihood of discovering new RNA viruses with unknown survival strategies. Here we report a new, unclassified negative-stranded single-strand RNA virus from seaweed Saccharina latissima that showed a unique genome characteristic. The novel virus, named SalaUV-NL1, contains a large segment of 7.065 nucleotides coding for a 2.296 amino acids protein and a small segment of 2.120 nucleotides coding for a 328 plus a potential 315+ amino acids protein. The large segment also appeared to have an additional large open reading frame (ORF) on the negative strand, suggesting it could be an ambigrammatic virus. However, the virus displayed an extreme GC3 codon bias (98%) in the (+)ORF, which differs from the typical avoidance of just the reverse-complement stop codons seen in ambigrammatic viruses. Consequently, the (-)ORF appears to occur accidental and is unlikely to encode a functional protein. Therefore, we have termed our virus to be pseudoambigrammatic. Several other viruses with the same pseudogrammatic high GC3 codon bias were identified in Genbank, most without a (-)ORF. The SalaUV-NL1 virus codon bias surpasses the high GC3 codon bias of the S. latissima host genes, however other species that host pseudoambigrammatic viruses do not display a high GC3 codon bias. Comparison of several variants of SalaUV-NL1 showed that the relative nucleotide change rate, particularly for silent changes from A/T to G/C, was extremely high (up to 81%). This novel virus demonstrates a new survival strategy of RNA viruses that is not yet understood.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Viruses
318 papers in training set
Top 0.1%
18.4%
2
Virus Research
36 papers in training set
Top 0.1%
17.3%
3
Archives of Virology
14 papers in training set
Top 0.1%
6.3%
4
Frontiers in Microbiology
375 papers in training set
Top 2%
3.9%
5
Scientific Reports
3102 papers in training set
Top 38%
3.5%
6
PLOS ONE
4510 papers in training set
Top 43%
3.0%
50% of probability mass above
7
Journal of General Virology
46 papers in training set
Top 0.2%
3.0%
8
Frontiers in Genetics
197 papers in training set
Top 3%
2.8%
9
Virus Evolution
140 papers in training set
Top 0.5%
2.7%
10
Journal of Virology
456 papers in training set
Top 2%
2.3%
11
Peer Community Journal
254 papers in training set
Top 1%
2.1%
12
Microbiology Resource Announcements
22 papers in training set
Top 0.2%
2.1%
13
BMC Genomics
328 papers in training set
Top 2%
1.9%
14
PeerJ
261 papers in training set
Top 6%
1.8%
15
Gene Reports
13 papers in training set
Top 0.3%
1.7%
16
Virology Journal
25 papers in training set
Top 0.1%
1.6%
17
Heliyon
146 papers in training set
Top 5%
0.9%
18
Microbiology Spectrum
435 papers in training set
Top 5%
0.9%
19
Genes
126 papers in training set
Top 2%
0.9%
20
Virology
56 papers in training set
Top 0.6%
0.9%
21
Antiviral Research
49 papers in training set
Top 0.4%
0.8%
22
Pathogens
53 papers in training set
Top 1%
0.8%
23
Journal of Medical Virology
137 papers in training set
Top 4%
0.7%
24
F1000Research
79 papers in training set
Top 5%
0.7%
25
Frontiers in Public Health
140 papers in training set
Top 9%
0.7%
26
Frontiers in Cellular and Infection Microbiology
98 papers in training set
Top 7%
0.6%
27
mSystems
361 papers in training set
Top 8%
0.6%