Back

Importance of genetic architecture in marker selection decisions for genomic prediction

Della Coletta, R.; Fernandes, S.; Monnahan, P.; Mikel, M.; Bohn, M. O.; Lipka, A. E.; Hirsch, C.

2023-03-01 plant biology
10.1101/2023.02.28.530521 bioRxiv
Show abstract

Breeders commonly use genetic markers to predict the performance of untested individuals as a way to improve the efficiency of breeding programs. These genomic prediction models have almost exclusively used single nucleotide polymorphisms (SNPs) as their source of genetic information, even though other types of markers exist, such as structural variants (SVs). Given that SVs are associated with environmental adaptation and not all of them are in linkage disequilibrium to SNPs, SVs have the potential to bring additional information to multi-environment prediction models that are not captured by SNPs alone. Here, we evaluated different marker types (SNPs and/or SVs) on prediction accuracy across a range of genetic architectures for simulated traits across multiple environments. Our results show that SVs can improve prediction accuracy by up to 19%, but it is highly dependent on the genetic architecture of the trait. Differences in prediction accuracy across marker types were more pronounced for traits with high heritability, high number of QTLs, and SVs as causative variants. In these scenarios, using SV markers resulted in better prediction accuracies than SNP markers, especially when predicting untested genotypes across environments, likely due to more predictors being in linkage disequilibrium with causative variants. The simulations revealed little impact of different effect sizes between SNPs and SVs as causative variants on prediction accuracy. This study demonstrates the importance of knowing the genetic architecture of a trait in deciding what markers and marker types to use in large scale genomic prediction modeling in a breeding program. Key messageWe demonstrate potential for improved multi-environment genomic prediction accuracy using structural variant markers. However, the degree of observed improvement is highly dependent on the genetic architecture of the trait.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
The Plant Genome
53 papers in training set
Top 0.1%
18.8%
2
Crop Science
18 papers in training set
Top 0.1%
18.7%
3
BMC Genomics
328 papers in training set
Top 0.1%
10.2%
4
Theoretical and Applied Genetics
46 papers in training set
Top 0.1%
6.9%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 27%
6.4%
6
Frontiers in Plant Science
240 papers in training set
Top 2%
6.4%
7
Frontiers in Genetics
197 papers in training set
Top 1%
4.2%
8
The Plant Phenome Journal
14 papers in training set
Top 0.1%
2.6%
9
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.3%
2.1%
10
Scientific Reports
3102 papers in training set
Top 53%
1.9%
11
G3 Genes|Genomes|Genetics
351 papers in training set
Top 1%
1.5%
12
Journal of Heredity
35 papers in training set
Top 0.1%
1.1%
13
Peer Community Journal
254 papers in training set
Top 4%
0.8%
14
Evolutionary Applications
91 papers in training set
Top 1%
0.8%
15
AoB PLANTS
11 papers in training set
Top 0.2%
0.8%
16
Biology Methods and Protocols
53 papers in training set
Top 2%
0.8%
17
New Phytologist
309 papers in training set
Top 5%
0.7%
18
in silico Plants
24 papers in training set
Top 0.4%
0.6%
19
BMC Plant Biology
47 papers in training set
Top 1%
0.6%
20
PLANTS, PEOPLE, PLANET
21 papers in training set
Top 0.9%
0.6%
21
Journal of Dairy Science
11 papers in training set
Top 0.1%
0.6%
22
Aquaculture
29 papers in training set
Top 0.9%
0.5%
23
PeerJ
261 papers in training set
Top 19%
0.5%