Back

Evaluating genotyping strategies for a small managed population with simulation

Martin, A. A. A.; Schoenebeck, J.; Clements, D. N.; Lewis, T.; Wiener, P.; Gorjanc, G.

2025-01-25 genomics
10.1101/2025.01.23.634495 bioRxiv
Show abstract

BackgroundCollecting genomic information is crucial to advance breeding for complex traits such as health, welfare, and behaviour in domesticated populations. For that purpose, different data collection scenarios can be envisioned based on the number of individuals, the number of markers, and the genotyping technology. This study developed a simulation framework, based on a service dog population, aiming to identify an optimal and cost-effective genotyping strategy that would support the implementation of genomic selection, investigation of the genetic architecture of traits of interest, and track loci of interest. MethodsWe simulated a population based on the existing pedigree, using the gene drop method in AlphaSimR. The existing pedigree was extended with additional progeny generations to evaluate the outcomes of different genotyping strategies in the future. We generated genotype data based on existing high-coverage whole-genome sequences (WGS) for the current breeding dogs and evaluated different scenarios for genotyping the progeny. The genotyping options considered SNP arrays of various densities and WGS callsets produced from different sequencing depths. We then phased and imputed the genotype data to high-coverage WGS using AlphaPeel. ResultsAll scenarios were compared based on individual imputation accuracy against the simulated true whole-genome genotype. Averaged over five generations of simulated progeny, low-pass sequencing (0.5 to 2X depth) achieved accuracies of 0.998 to 0.999. The accuracy of SNP array genotyping (25K to 710K markers) was lower, with means of 0.911 to 0.938. ConclusionsOur simulation was tailored to identify the most cost-effective and efficient strategy for downstream use in genomic selection and genetic research into traits and loci of interest. Low-pass sequencing outperformed SNP array genotyping in imputation accuracy of whole-genome genotypes as expected. Additionally, low-pass sequencing technology was the most affordable genotyping approach currently available for dogs. Thus, it appears to be the optimal choice for balancing the goals of regimented breeding programmes such as those that produce service dogs. This simulation framework could also be adapted to address other objectives for breeding organisations working with small populations.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genetics Selection Evolution
33 papers in training set
Top 0.1%
37.2%
2
Gigabyte
60 papers in training set
Top 0.1%
12.2%
3
PLOS ONE
4510 papers in training set
Top 20%
10.0%
50% of probability mass above
4
BMC Genomics
328 papers in training set
Top 0.3%
6.2%
5
Scientific Reports
3102 papers in training set
Top 38%
3.5%
6
Aquaculture
29 papers in training set
Top 0.2%
2.7%
7
Methods in Ecology and Evolution
160 papers in training set
Top 1%
2.0%
8
Peer Community Journal
254 papers in training set
Top 2%
1.9%
9
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
10
GigaScience
172 papers in training set
Top 2%
1.6%
11
Journal of Dairy Science
11 papers in training set
Top 0.1%
1.3%
12
PeerJ
261 papers in training set
Top 9%
1.3%
13
Genes
126 papers in training set
Top 2%
1.2%
14
Global Ecology and Conservation
25 papers in training set
Top 0.9%
1.1%
15
Molecular Ecology Resources
161 papers in training set
Top 0.9%
0.9%
16
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.7%
0.9%
17
Journal of Heredity
35 papers in training set
Top 0.2%
0.8%
18
Conservation Genetics
15 papers in training set
Top 0.2%
0.8%
19
Frontiers in Veterinary Science
30 papers in training set
Top 0.8%
0.8%
20
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
21
Ecology and Evolution
232 papers in training set
Top 4%
0.6%
22
Evolutionary Applications
91 papers in training set
Top 2%
0.6%
23
G3 Genes|Genomes|Genetics
351 papers in training set
Top 3%
0.6%