Back

Understanding evolutionary and functional relationships of RNA polymerases in plant and fungal viruses through structural modeling and divergence date estimations

Gracy, J.; Ghafari, M.; Labesse, G.; Fargette, D.; Hebrard, E.

2024-11-15 evolutionary biology
10.1101/2024.11.13.623411 bioRxiv
Show abstract

RNA-dependent RNA polymerases (RdRps) are crucial for replication of RNA viruses and serve as key marker genes for defining deep taxonomic ranks and for understanding viral evolutionary history. Despite shared functions and conserved amino acid motifs, the high genetic diversity of RdRps complicates precise sequence comparisons across viral families, hindering accurate taxonomic classification of new species - especially important at the age of metagenomics. When available, three-dimensional (3D) RdRp structures can help address these challenges through structure-based alignments. However, such structures are scarce for understudied viruses infecting fungi and plants, preventing the investigation of their ecological and evolutionary links. In this study, we focused on the highy divergent order Sobelivirales. Using deep{-}learning structural modeling, we generated highly reliable 3D models on 20 representative viral species. Multiple structural alignment enabled the reconstruction of a robust phylogeny with improved quality and length. Based on this phylogeny, we proposed revisions of existing viral families and reclassified genera. Clade divergence dates were then estimated using the Prisoner of War model, which has previously revealed the ancient origin of the genus Sobemovirus. We provided here the first divergence time estimation between these plant and fungal viruses, dating back to 26.6 million years before present - significantly more recent than the divergence between their respective hosts. Our amino acid conservation analysis, validated on 99 other viral species, also identified molecular signatures of sobeliviral families and genera, which could help in future taxonomic assignment and diagnostic tools development. This interdisciplinary approach integrating structure modeling and date estimations offers new insights into the evolutionary divergence between fungus and plant viruses, with potential applications to other viral orders and families. Author summaryRNA-dependent RNA polymerases (RdRps) are essential for RNA virus replication and serve as important markers for classifying viruses and understanding their evolution. However, with the rising popularity of metagenomics and discovery of viruses with high genetic diversity in RdRps, it is difficult to compare viral families and accurately classify new species. When available, 3D structures of RdRps can help overcome this challenge through structural alignments. In this study, we focused on the highly divergent order of the Sobelivirales, using deep{-}learning models to generate reliable 3D structures for 20 representative viral species. These structural alignments allowed us to build a more accurate viral phylogeny. Based on this finding, we proposed updates to existing viral families and genera within the Sobelivirales order. We also estimated divergence dates using a model that previously uncovered the ancient origins of Sobemovirus. Notably, we provided the first estimate of when these plant and fungal viruses diverged - around 26.6 million years ago, which is much more recent than the separation of their hosts. We also identified molecular signatures that are useful for future virus classification and diagnosis, with potential applications to other viral groups.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Virus Evolution
140 papers in training set
Top 0.1%
41.9%
2
PLOS Biology
408 papers in training set
Top 0.8%
6.7%
3
Viruses
318 papers in training set
Top 0.9%
5.1%
50% of probability mass above
4
PLOS Computational Biology
1633 papers in training set
Top 9%
3.8%
5
PLOS Pathogens
721 papers in training set
Top 4%
3.2%
6
Journal of Virology
456 papers in training set
Top 2%
3.0%
7
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.5%
8
Scientific Reports
3102 papers in training set
Top 53%
1.9%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 34%
1.6%
10
mBio
750 papers in training set
Top 8%
1.6%
11
PLOS ONE
4510 papers in training set
Top 57%
1.4%
12
iScience
1063 papers in training set
Top 19%
1.4%
13
Nature Communications
4913 papers in training set
Top 58%
0.9%
14
mSphere
281 papers in training set
Top 5%
0.9%
15
Bioinformatics
1061 papers in training set
Top 8%
0.9%
16
Communications Biology
886 papers in training set
Top 17%
0.9%
17
BMC Ecology and Evolution
49 papers in training set
Top 2%
0.9%
18
Frontiers in Microbiology
375 papers in training set
Top 7%
0.9%
19
Molecular Biology and Evolution
488 papers in training set
Top 4%
0.8%
20
mSystems
361 papers in training set
Top 7%
0.8%
21
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
22
BMC Biology
248 papers in training set
Top 4%
0.8%
23
Genome Biology and Evolution
280 papers in training set
Top 2%
0.8%
24
Microbiome
139 papers in training set
Top 3%
0.8%
25
eLife
5422 papers in training set
Top 60%
0.7%
26
Virology Journal
25 papers in training set
Top 0.5%
0.5%
27
Computers in Biology and Medicine
120 papers in training set
Top 6%
0.5%
28
Microbiology Spectrum
435 papers in training set
Top 7%
0.5%
29
Patterns
70 papers in training set
Top 3%
0.5%