Back

Insights into goatpox virus and sheeppox virus genomes from pangenome graphs

Downing, T.

2026-03-31 genomics
10.64898/2026.03.28.714820 bioRxiv
Show abstract

The Capripoxviruses (CaPV) comprise three species: goatpox virus (GTPV), sheeppox virus (SPPV) and lumpy skin disease virus (LSDV). They are large double-stranded DNA viruses with highly conserved core genomes and variable terminal regions. Previous studies have described variation in CaPV gene content, their broader population structure and the contribution of non-coding and structural variation remains opaque. This study investigated the genomic diversity and evolutionary history of GTPV and SPPV using an integrative framework combining phylogenetics, pangenome variation graphs (PVGs), and gene-specific analyses. We found marked differences in population structure between the two viruses. GTPV comprised three deeply divergent and genetically stable lineages with limited evidence of recent gene flow, whereas SPPV had weaker clade separation consistent with an ancestral bottleneck followed by recent population expansion. PVG-based analyses indicated that GTPV has a comparatively closed pangenome, while SPPV remains open, particularly at the genome termini. Structural and haplotypic variation was concentrated at the inverted terminal repeats (ITRs), which moderate host immunity and specificity. In several lineages, extended putative ORFs spanning adjacent terminal genes were observed, indicating recurrent structural plasticity at the genome ends. Patterns of gene-specific conservation and divergence highlighted loci under strong constraint and lineage-specific structural changes that may contribute to host specificity. Together, these results demonstrate how graph-based genome models complement gene-based analyses in resolving poxvirus genome evolution and provide a resource for improved comparative and population genomic studies of large DNA viruses. SignificanceCapripoxviruses are economically important livestock pathogens, yet the genomic mechanisms underlying their diversification and host specificity remain poorly resolved. By applying pangenome variation graphs alongside phylogenetic and gene-level analyses, this study reveals fundamental differences in how goatpox and sheeppox viruses have evolved. Goatpox virus had a deeper, more stable lineage structure, whereas sheeppox virus was more recent and diverse. Importantly, structural variation at the inverted terminal repeats emerged as a major driver of genomic diversity, including lineage-specific haplotypes and variable gene structures. These findings demonstrated the value of graph-based genome representations for resolving complex variation in large DNA viruses and provides a framework for improving genomic surveillance, comparative analyses, and future investigations into host range, virulence and tropism.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Virology
456 papers in training set
Top 0.1%
28.5%
2
Virus Evolution
140 papers in training set
Top 0.1%
19.2%
3
PLOS Pathogens
721 papers in training set
Top 2%
7.0%
50% of probability mass above
4
Viruses
318 papers in training set
Top 1.0%
5.0%
5
Microbial Genomics
204 papers in training set
Top 0.8%
2.5%
6
Frontiers in Genetics
197 papers in training set
Top 3%
2.4%
7
BMC Genomics
328 papers in training set
Top 1%
2.4%
8
PLOS Computational Biology
1633 papers in training set
Top 13%
2.1%
9
mBio
750 papers in training set
Top 6%
2.1%
10
PLOS Genetics
756 papers in training set
Top 8%
1.7%
11
Molecular Ecology
304 papers in training set
Top 3%
1.7%
12
mSystems
361 papers in training set
Top 5%
1.7%
13
Scientific Reports
3102 papers in training set
Top 57%
1.7%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
15
Nature Communications
4913 papers in training set
Top 57%
1.1%
16
PLOS Biology
408 papers in training set
Top 15%
1.1%
17
Molecular Biology and Evolution
488 papers in training set
Top 4%
0.9%
18
Genome Biology
555 papers in training set
Top 7%
0.7%
19
BMC Biology
248 papers in training set
Top 4%
0.7%
20
PLOS ONE
4510 papers in training set
Top 71%
0.7%
21
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
22
Virus Research
36 papers in training set
Top 1%
0.7%
23
Virology
56 papers in training set
Top 0.9%
0.5%
24
Genome Biology and Evolution
280 papers in training set
Top 2%
0.5%
25
Frontiers in Microbiology
375 papers in training set
Top 11%
0.5%