
Unraveling viral identity: Avoiding the trap of endogenous sequences for viral surveillance of small ruminant oncogenic retroviruses

Riocreux-Verney, B.; Verneret, M.; Dolmazon, C.; Ashraf, S.; Atim, S.; Navratil, V.; Leroux, C.; Turpin, J.

2026-03-05 microbiology
10.64898/2026.03.05.709768 bioRxiv

Small ruminants (sheep and goats) are among the few mammals in which an exogenous retrovirus (XRV) and closely related endogenous retroviral elements (ERV) coexist within the same host genome. The betaretroviruses Jaagsiekte sheep retrovirus (JSRV) and enzootic nasal tumor virus (ENTV) cause pulmonary and nasal adenocarcinomas, respectively, and share extensive sequence similarity with their endogenous counterparts. Consequently, molecular surveillance must rely on assays that can unequivocally distinguish true exogenous infection from ERV-derived templates; failure to do so compromises diagnosis, phylogenetic inference, and epidemiological conclusions. We retrieved all complete JSRV, ENTV-1/2, and related ERV genomes deposited in public repositories and performed a comprehensive alignment. Only a limited number of genomic segments were capable of distinguishing exogenous from endogenous sequences. We refer to these as discriminating regions (DRs). Phylogenies built using DRs revealed that several entries annotated as XRV are, in fact, ERV-derived or chimeric artefacts generated by short-amplicon reconstruction. A systematic literature review of over 100 articles identified 286 distinct primers and probes used for XRV amplification. In silico mapping of each oligonucleotide onto the full alignment showed that only 28% reliably differentiate XRV from ERV. We experimentally validated the predictive power of this approach for 17 primer/probe sets, confirming that non-discriminating assays produce false-positive signals from endogenous templates. The misannotation of ERV sequences as exogenous viruses has resulted in the population of databases with dubious entries, fostering erroneous hypotheses such as vector-borne transmission of JSRV and ENTV. To address this issue, we propose a concise set of criteria for assay design, validation, and database annotation emphasizing DR targeting, specificity testing against endogenous templates, and transparent reporting.
Although this framework was developed for small ruminants, it is readily applicable to any host-virus system in which exogenous viruses coexist with endogenous viral elements. Its adoption will strengthen viral surveillance, phylogenetics, and One Health initiatives.
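
The in silico mapping step described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the mismatch threshold, and the toy sequences below are all hypothetical, and the check ignores reverse complements, degenerate bases, alignment gaps, and 3'-end effects. It only shows the idea that an oligonucleotide is discriminating when it matches every exogenous sequence but no endogenous one.

```python
def count_mismatches(oligo: str, target: str) -> int:
    """Fewest mismatches between the oligo and any same-length window of target."""
    n = len(oligo)
    if len(target) < n:
        return n  # oligo cannot fit; treat as fully mismatched
    return min(
        sum(a != b for a, b in zip(oligo, target[i:i + n]))
        for i in range(len(target) - n + 1)
    )


def is_discriminating(oligo, xrv_seqs, erv_seqs, max_mismatches=1):
    """True if the oligo matches all XRV sequences and no ERV sequence.

    max_mismatches is an assumed tolerance, not a value from the paper.
    """
    binds_xrv = all(count_mismatches(oligo, s) <= max_mismatches for s in xrv_seqs)
    binds_erv = any(count_mismatches(oligo, s) <= max_mismatches for s in erv_seqs)
    return binds_xrv and not binds_erv


# Toy data (invented, NOT real JSRV/ENTV/ERV sequences):
# a variable 10-mer followed by a 10-mer shared by all sequences.
SHARED = "ACCGGTTACG"
xrv_seqs = ["TGCAAGGCTT" + SHARED, "TGCAAGGCTA" + SHARED]
erv_seqs = ["AGCTAGGCAT" + SHARED]

oligo_in_variable_region = "TGCAAGGCTT"  # hypothetical DR-targeting oligo
oligo_in_shared_region = SHARED          # hypothetical non-discriminating oligo

print(is_discriminating(oligo_in_variable_region, xrv_seqs, erv_seqs))
print(is_discriminating(oligo_in_shared_region, xrv_seqs, erv_seqs))
```

In this toy setup the oligo placed in the variable segment is flagged as discriminating, while the one placed in the shared segment is not, which mirrors the false-positive risk the abstract attributes to non-discriminating assays.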

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. PLOS Biology: 408 papers in training set; Top 0.2%; 10.0%
2. Journal of Virology: 456 papers in training set; Top 0.6%; 10.0%
3. PLOS Pathogens: 721 papers in training set; Top 2%; 9.1%
4. Viruses: 318 papers in training set; Top 0.7%; 6.8%
5. Virus Evolution: 140 papers in training set; Top 0.2%; 6.3%
6. Journal of General Virology: 46 papers in training set; Top 0.1%; 4.1%
7. PLOS ONE: 4510 papers in training set; Top 36%; 3.9%
50% of probability mass above
8. Journal of Medical Virology: 137 papers in training set; Top 1%; 3.2%
9. Frontiers in Microbiology: 375 papers in training set; Top 4%; 2.1%
10. Genome Medicine: 154 papers in training set; Top 3%; 2.1%
11. Scientific Reports: 3102 papers in training set; Top 54%; 1.9%
12. Nucleic Acids Research: 1128 papers in training set; Top 10%; 1.8%
13. Virology Journal: 25 papers in training set; Top 0.1%; 1.7%
14. Microbiology Spectrum: 435 papers in training set; Top 3%; 1.7%
15. Nature Communications: 4913 papers in training set; Top 52%; 1.6%
16. Emerging Microbes & Infections: 74 papers in training set; Top 0.9%; 1.6%
17. Frontiers in Cellular and Infection Microbiology: 98 papers in training set; Top 3%; 1.5%
18. eLife: 5422 papers in training set; Top 47%; 1.3%
19. Archives of Virology: 14 papers in training set; Top 0.4%; 1.2%
20. Virus Research: 36 papers in training set; Top 0.8%; 1.2%
21. Mobile DNA: 27 papers in training set; Top 0.2%; 1.2%
22. Antiviral Research: 49 papers in training set; Top 0.3%; 0.9%
23. Journal of Clinical Microbiology: 120 papers in training set; Top 1%; 0.9%
24. Genomics, Proteomics & Bioinformatics: 171 papers in training set; Top 5%; 0.9%
25. Peer Community Journal: 254 papers in training set; Top 3%; 0.8%
26. Journal of Virological Methods: 36 papers in training set; Top 0.6%; 0.8%
27. Pathogens: 53 papers in training set; Top 2%; 0.7%
28. Frontiers in Immunology: 586 papers in training set; Top 8%; 0.7%
29. Emerging Infectious Diseases: 103 papers in training set; Top 3%; 0.7%
30. mBio: 750 papers in training set; Top 13%; 0.6%
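
The 50% cutoff quoted for this list can be checked from the displayed percentages with a short cumulative sum. This is only a sketch: the function name is hypothetical, and the inputs are the rounded values shown for the ten highest-ranked journals, so the running total may differ slightly from whatever unrounded values the site uses.

```python
def journals_to_reach(probabilities, threshold):
    """Smallest number of top-ranked entries whose summed probability
    reaches the threshold, or None if the threshold is never reached."""
    total = 0.0
    for rank, p in enumerate(sorted(probabilities, reverse=True), start=1):
        total += p
        if total >= threshold:
            return rank
    return None


# Displayed (rounded) percentages for ranks 1 to 10 in the list above.
top_probs = [10.0, 10.0, 9.1, 6.8, 6.3, 4.1, 3.9, 3.2, 2.1, 2.1]

print(journals_to_reach(top_probs, 50.0))  # prints 7
```

The first six entries sum to about 46.3%, and adding the seventh brings the total to about 50.2%, consistent with the statement that the top 7 journals account for 50% of the predicted probability mass.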