Back

Identifying adaptive variation in spatially structured populations using low-coverage whole-genome sequencing data

Goel, N.; Bossu, C. M.; Yi, S.; Robertson, E. C. N.; Brown, T. M.; Bolton, P. E.; Vernasco, B. J.; Zavaleta, E.; Ruegg, K. C.; Hooten, M. B.

2026-01-21 genomics
10.64898/2026.01.19.700474 bioRxiv
Show abstract

Successful implementation of evolutionary management programs to rescue climatically threatened species requires identification of adaptive genetic variation. Although current genotype-environment association methods have been successful in identifying adaptive variation, they can be improved in two important aspects. First, most existing methods do not account for genotype uncertainty in widely available low-coverage whole-genome sequencing data. Researchers often restrict analysis to loci for which genotypes can be inferred reliably or call the most probable genotype, allowing the use of traditional genotype-based methods, such as BayeScEnv and Bayenv. However, discarding data and false genotype calls increases the uncertainty in estimates of genetic variation and introduces systematic biases. Second, most methods use phenomenological approaches, such as logistic regression, to partition estimated genetic variation into adaptive and non-adaptive components. Consequently, current approaches may inadvertently fail to account for evolutionary processes, such as migration-selection balance. Structured migration between climatically disparate locations can produce deviations from a smooth S-shape response curve, which can be difficult to accommodate using generalized linear regression models. To overcome these challenges, we developed a method that accounts for genotype uncertainty in sequencing data and propagates this uncertainty to inform the parameters of a model of evolution. A key feature of this evolutionary model is that it mechanistically describes how genetic variation arises from joint interactions between local adaptation, structured migration, mutation, and drift. We apply our approach to analyze multiple synthetic datasets and a real dataset of North American rosy-finches (3.7 million SNPs), a high-alpine, climatically threatened clade of bird species.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Methods in Ecology and Evolution
160 papers in training set
Top 0.3%
11.9%
2
Molecular Biology and Evolution
488 papers in training set
Top 0.4%
9.8%
3
The American Journal of Human Genetics
206 papers in training set
Top 0.7%
6.6%
4
PLOS Genetics
756 papers in training set
Top 3%
6.2%
5
Molecular Ecology
304 papers in training set
Top 1%
6.2%
6
Molecular Ecology Resources
161 papers in training set
Top 0.2%
6.1%
7
GENETICS
189 papers in training set
Top 0.2%
4.2%
50% of probability mass above
8
eLife
5422 papers in training set
Top 28%
3.5%
9
PLOS Computational Biology
1633 papers in training set
Top 11%
3.5%
10
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 2%
2.8%
11
Bioinformatics
1061 papers in training set
Top 6%
2.6%
12
Nature Communications
4913 papers in training set
Top 45%
2.4%
13
Genetics
225 papers in training set
Top 2%
2.3%
14
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.3%
2.3%
15
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
2.0%
16
Conservation Biology
14 papers in training set
Top 0.1%
2.0%
17
Global Change Biology
69 papers in training set
Top 0.8%
1.8%
18
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
20
New Phytologist
309 papers in training set
Top 4%
1.2%
21
Genome Research
409 papers in training set
Top 3%
1.1%
22
Science
429 papers in training set
Top 18%
0.9%
23
Evolutionary Applications
91 papers in training set
Top 1%
0.9%
24
Genome Biology and Evolution
280 papers in training set
Top 2%
0.7%
25
PLOS Biology
408 papers in training set
Top 21%
0.7%
26
Scientific Reports
3102 papers in training set
Top 76%
0.7%
27
Journal of Heredity
35 papers in training set
Top 0.2%
0.7%
28
Communications Biology
886 papers in training set
Top 27%
0.7%
29
PLOS ONE
4510 papers in training set
Top 70%
0.7%
30
Ecology and Evolution
232 papers in training set
Top 4%
0.7%