Back

Haplotype-based models improve sweep detection in ancient populations with complex demography

Sequeira, A. N.; Szpiech, Z. A.; Huber, C. D.

2026-05-11 genetics
10.64898/2026.05.08.723766 bioRxiv
Show abstract

Identifying signatures of positive selection in humans is complicated by demographic processes such as bottlenecks, migration and admixture, all of which can distort or obscure the genomic patterns produced by selective sweeps. Ancient DNA offers a direct window into past allele and haplotype frequencies, yet most sweep scans in ancient populations rely on allele-frequency or site frequency spectrum (SFS) summaries, with limited use of haplotype-based approaches. Here, we evaluate the performance of haplotype and SFS-based methods for detecting selective sweeps under demographic scenarios that reflect the complex history of ancient and modern Europeans. We extend the haplotype-based likelihood framework saltiLASSI to accommodate pseudohaploid ancient genomes, enabling the use of truncated haplotype frequency spectra and their spatial decay to detect sweeps without requiring phased data. Using forward-in-time simulations, we examine sweeps of varying ages, two pulses of admixture with different source proportions, and cases where selection continues or ceases after admixture. We compare saltiLASSI to a widely used SFS-based approach (SweepFinder2). Our results show that haplotype-based likelihood models retain higher power than SFS methods in admixed populations, particularly when sweep haplotypes are introduced through migration or when selection has not had sufficient time to regenerate a clear SFS signature after admixture. These findings highlight the promise of haplotype-based inference for ancient DNA and demonstrate how model-based approaches can improve the detection of historical selective sweeps in populations with complex demographic histories.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genetics
225 papers in training set
Top 0.2%
17.0%
2
PLOS Genetics
756 papers in training set
Top 0.7%
13.9%
3
Molecular Biology and Evolution
488 papers in training set
Top 0.4%
9.8%
4
The American Journal of Human Genetics
206 papers in training set
Top 0.7%
6.9%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 15%
4.7%
50% of probability mass above
6
eLife
5422 papers in training set
Top 23%
3.8%
7
Nature Communications
4913 papers in training set
Top 42%
3.5%
8
Genome Biology
555 papers in training set
Top 3%
3.1%
9
PLOS Computational Biology
1633 papers in training set
Top 11%
3.0%
10
Bioinformatics
1061 papers in training set
Top 6%
3.0%
11
Molecular Ecology Resources
161 papers in training set
Top 0.6%
1.8%
12
Frontiers in Genetics
197 papers in training set
Top 5%
1.6%
13
GENETICS
189 papers in training set
Top 0.7%
1.6%
14
Nucleic Acids Research
1128 papers in training set
Top 11%
1.6%
15
Nature Genetics
240 papers in training set
Top 5%
1.6%
16
Science
429 papers in training set
Top 16%
1.3%
17
Genome Research
409 papers in training set
Top 3%
1.3%
18
Human Genetics and Genomics Advances
70 papers in training set
Top 0.5%
1.1%
19
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.9%
20
Scientific Reports
3102 papers in training set
Top 71%
0.9%
21
Cell Genomics
162 papers in training set
Top 6%
0.9%
22
PLOS ONE
4510 papers in training set
Top 67%
0.8%
23
Cell Reports
1338 papers in training set
Top 34%
0.7%
24
iScience
1063 papers in training set
Top 34%
0.7%
25
Genome Biology and Evolution
280 papers in training set
Top 2%
0.7%
26
BMC Genomics
328 papers in training set
Top 6%
0.7%
27
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
28
European Journal of Human Genetics
49 papers in training set
Top 1%
0.7%
29
American Journal of Biological Anthropology
11 papers in training set
Top 0.3%
0.7%
30
Science Advances
1098 papers in training set
Top 34%
0.6%