Back

Population-scale discovery and analysis of non-reference endogenous retrovirus insertions in wild house mice

Yano, T.; Takada, T.; Fujiwara, K.; Watabe, D.; Hirose, S.; Masuya, H.; Endo, T.; Osada, N.

2026-02-20 evolutionary biology
10.1101/2025.09.23.678169 bioRxiv
Show abstract

Endogenous retroviruses (ERVs) represent a major source of structural variation in mammalian genomes, yet their diversity in wild populations remains poorly understood. Here, we conduct a comprehensive genome-wide survey of non-reference ERV insertions in wild house mice (Mus musculus) to characterize their distribution and evolutionary dynamics. Using a newly developed bioinformatics pipeline, we detected and annotated over 100,000 non-reference ERV insertions from short-read sequencing data across 163 wild mouse genomes. Our analyses revealed marked differences in ERV insertion patterns among subspecies and populations, including variation in genomic localization and population-specific polymorphisms. These heterogeneous patterns suggest distinct evolutionary histories and host-retrovirus interactions across populations. For instance, we describe the distribution of the ERV-derived Fv4 locus, which shows subspecies-restricted occurrence and confers resistance to murine leukemia viruses (MLVs). Several lines of evidence showed that the spread of Fv4 insertions in Korean population has been driven by adaptive introgression from neighboring populations. Our study provides the first large-scale population genomic scan of ERV diversity in wild house mice. By cataloguing extensive polymorphism in non-reference ERV insertions, our results highlight the role of ERVs as dynamic genomic elements that contribute to structural variation and adaptive evolution. Article SummaryEndogenous retroviruses (ERVs) are viral sequences embedded in animal genomes that can create structural genetic variation. In this study, we conducted a genome-wide survey of non-reference ERV insertions in 163 wild house mice using short-read sequencing data and a newly developed computational pipeline. We identified more than 100,000 polymorphic ERV insertions and found substantial differences among subspecies and geographic populations. One example, the ERV-derived Fv4 locus, illustrates how ERV variation can influence the genetic pattern of polymorphisms in the species. These results demonstrate that ERVs are dynamic genomic elements that contribute to population divergence and adaptive evolution.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Virology
456 papers in training set
Top 0.3%
18.4%
2
Mobile DNA
27 papers in training set
Top 0.1%
12.2%
3
Virus Evolution
140 papers in training set
Top 0.1%
10.0%
4
Molecular Biology and Evolution
488 papers in training set
Top 0.8%
6.3%
5
Frontiers in Genetics
197 papers in training set
Top 1%
4.3%
50% of probability mass above
6
BMC Genomics
328 papers in training set
Top 0.6%
4.1%
7
Viruses
318 papers in training set
Top 1%
4.1%
8
PLOS Pathogens
721 papers in training set
Top 3%
3.9%
9
Genome Biology and Evolution
280 papers in training set
Top 0.8%
2.1%
10
eLife
5422 papers in training set
Top 36%
2.1%
11
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.9%
12
PLOS ONE
4510 papers in training set
Top 52%
1.8%
13
Scientific Reports
3102 papers in training set
Top 59%
1.7%
14
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.5%
15
PLOS Genetics
756 papers in training set
Top 10%
1.5%
16
Genes
126 papers in training set
Top 2%
1.3%
17
mBio
750 papers in training set
Top 9%
1.2%
18
mSystems
361 papers in training set
Top 6%
1.1%
19
Journal of Genetics and Genomics
36 papers in training set
Top 1%
1.1%
20
PLOS Biology
408 papers in training set
Top 16%
0.9%
21
Genome Biology
555 papers in training set
Top 7%
0.8%
22
iScience
1063 papers in training set
Top 30%
0.8%
23
Nucleic Acids Research
1128 papers in training set
Top 18%
0.7%
24
Science China Life Sciences
26 papers in training set
Top 2%
0.7%
25
Microbiology Spectrum
435 papers in training set
Top 6%
0.6%