Back

Ancient persistence and recurrent emergence of structural variants across divergent Atlantic salmon lineages

Diblasi, C.; Kwak, J. S.; Manousi, D.; Arnyasi, M.; de Leon, A. V.-P.; Barson, N. J.; Saitou, M.

2026-05-28 genomics
10.64898/2026.05.25.727466 bioRxiv
Show abstract

Structural variants (SVs) are a major source of genomic diversity, yet the evolutionary origins of SVs shared across divergent populations remain difficult to resolve. Shared SVs may reflect ancient polymorphism, recurrent mutation, introgression, or subsequent lineage-specific frequency change, but the relative contribution of these processes often remains difficult to distinguish. Here, we investigated SV evolution across four Atlantic salmon (Salmo salar) lineages differing in geography, Europe versus North America, and domestication status, wild versus farmed. Using sensitive SV discovery, stringent genotyping, local PCA, haplotype-distance analyses, and forward simulations, we tested whether broadly shared SVs behave as a single class of variation or separate into distinct evolutionary categories. We generated a high-confidence SV map and found that SVs were enriched in repetitive regions, particularly segmental duplications and LTR retrotransposons, consistent with genome architecture shaping SV formation. Nearly half of high-confidence SVs were shared across all four lineages despite deep continental divergence, and simulations showed that this broad sharing is more consistent with ancient persistence than recurrent mutation alone. In contrast, a small subset of large SVs exhibited complex PCA clustering and multimodal haplotype-distance distributions, consistent with recurrent formation at structurally unstable loci. Large SVs also showed contrasting frequency trajectories between continents, and one immune gene-rich copy-number variable region showed a marked frequency increase in domesticated European salmon. Together, these results show that shared SVs comprise distinct evolutionary categories shaped by ancient persistence, recurrent mutation, and lineage-specific frequency change.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Cell Genomics
162 papers in training set
Top 0.2%
10.2%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.5%
8.3%
3
eLife
5422 papers in training set
Top 9%
8.3%
4
Genome Biology
555 papers in training set
Top 0.8%
7.0%
5
Cell
370 papers in training set
Top 3%
6.3%
6
Nature Ecology & Evolution
113 papers in training set
Top 0.8%
6.3%
7
Science
429 papers in training set
Top 5%
6.2%
50% of probability mass above
8
Nature Communications
4913 papers in training set
Top 34%
4.8%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 17%
4.2%
10
Genome Research
409 papers in training set
Top 0.7%
4.2%
11
Cell Reports
1338 papers in training set
Top 12%
4.1%
12
Nucleic Acids Research
1128 papers in training set
Top 7%
2.8%
13
Genome Biology and Evolution
280 papers in training set
Top 0.6%
2.7%
14
Science Advances
1098 papers in training set
Top 11%
2.6%
15
Nature Biotechnology
147 papers in training set
Top 4%
2.0%
16
PLOS Genetics
756 papers in training set
Top 8%
1.9%
17
Molecular Cell
308 papers in training set
Top 7%
1.8%
18
Genetics
225 papers in training set
Top 2%
1.7%
19
The American Journal of Human Genetics
206 papers in training set
Top 3%
0.9%
20
Genome Medicine
154 papers in training set
Top 7%
0.9%
21
Nature
575 papers in training set
Top 15%
0.8%
22
Developmental Cell
168 papers in training set
Top 12%
0.7%
23
Frontiers in Genetics
197 papers in training set
Top 10%
0.7%
24
Nature Genetics
240 papers in training set
Top 8%
0.7%
25
Molecular Ecology
304 papers in training set
Top 5%
0.6%
26
BMC Biology
248 papers in training set
Top 6%
0.6%