Back

Inference of population demographic history captures differing evolutionary signals based on the number of individuals in the dataset

Mah, J. C.; Lohmueller, K. E.

2026-04-08 evolutionary biology
10.64898/2026.04.07.716740 bioRxiv
Show abstract

Accurate estimation of population demographic history is central to population genetics yet remains challenging due to the sensitivity of inference methods to the number of individuals and the demographic scenario assumed in inference. The site-frequency spectrum (SFS) of neutral variants, a widely used summary statistic of genetic variation, is particularly sensitive to demographic processes, but studies have shown that qualitative results from demographic inference, i.e., population expansion vs. contraction, can depend strongly on the number of individuals in the dataset. Here, we analyzed two simulated datasets and one empirical dataset characterized by an ancient population bottleneck followed by a recent population expansion. Fitting a two-epoch demographic model across a range of sample sizes, we found that inference shifted from signals of ancient population contraction at small sample sizes to signals of recent population expansion at large sample sizes. Other summary statistics, including Tajimas D and the proportion of singletons, also changed with sample size. We found that these changes of inferred evolutionary signals under a two-epoch model can be explained by the epoch which contributes the highest mean proportion of coalescent branch lengths. Our results highlight that demographic inference depends critically on the number of individuals analyzed and suggest that analyzing datasets at multiple sample sizes can reveal complementary aspects of population history.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Molecular Biology and Evolution
488 papers in training set
Top 0.2%
14.5%
2
Genetics
225 papers in training set
Top 0.3%
12.3%
3
PLOS Genetics
756 papers in training set
Top 1%
10.3%
4
Genome Biology and Evolution
280 papers in training set
Top 0.1%
10.0%
5
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 10%
6.7%
50% of probability mass above
6
Molecular Ecology
304 papers in training set
Top 1%
6.2%
7
PLOS Computational Biology
1633 papers in training set
Top 9%
3.9%
8
Molecular Ecology Resources
161 papers in training set
Top 0.4%
3.0%
9
Evolution
199 papers in training set
Top 1%
2.3%
10
GENETICS
189 papers in training set
Top 0.5%
2.1%
11
eLife
5422 papers in training set
Top 36%
2.1%
12
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 3%
1.5%
13
BMC Ecology and Evolution
49 papers in training set
Top 1%
1.3%
14
Nature Communications
4913 papers in training set
Top 55%
1.3%
15
Evolution Letters
71 papers in training set
Top 1%
1.3%
16
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
1.2%
17
Journal of Evolutionary Biology
98 papers in training set
Top 0.7%
1.2%
18
Virus Evolution
140 papers in training set
Top 1.0%
1.2%
19
New Phytologist
309 papers in training set
Top 4%
0.9%
20
PLOS Biology
408 papers in training set
Top 17%
0.9%
21
The American Journal of Human Genetics
206 papers in training set
Top 3%
0.9%
22
Systematic Biology
121 papers in training set
Top 0.4%
0.9%
23
Peer Community Journal
254 papers in training set
Top 3%
0.9%
24
Scientific Reports
3102 papers in training set
Top 73%
0.8%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
26
Methods in Ecology and Evolution
160 papers in training set
Top 3%
0.6%
27
Frontiers in Ecology and Evolution
60 papers in training set
Top 4%
0.6%