Back

Efficient genomic prediction at reduced training size and moderate marker density in an expanded aus-NAM population of rice

Kitony, J. K.; Reyes, V. P.; Sunohara, H.; Tasaki, M.; Yamasaki, M.; Mori, J.-i.; Shimazu, A.; Nishiuchi, S.; Michael, T. P.; Doi, K.

2026-05-01 plant biology
10.64898/2026.04.28.721500 bioRxiv
Show abstract

Genomic selection (GS) can accelerate genetic gain in crops, but its effectiveness depends on training population design and marker density. Nested association mapping (NAM) populations provide a structured framework that captures broad allelic diversity within a controlled genetic background. Here, we evaluated genomic prediction (GP) and genome-wide association study (GWAS) performance in an expanded aus-NAM population of rice comprising 1,818 recombinant inbred lines across 14 families and 11 agronomic traits, using genotyping-by-sequencing (GBS) markers and projected whole-genome sequence variants. Prediction accuracy plateaued at moderate marker densities ([~]20k SNPs) and with training populations of [~]500 lines ([~]40-60% of the available pool), with trait heritability emerging as the strongest determinant of predictive performance rather than model choice or marker density. In contrast, GWAS resolution continued to improve with increasing marker density, enabling detection of additional loci, including a chromosome 12 locus associated with heading date, while consistently recovering well-characterized genes such as EARLY HEADING DATE 1 (Ehd1) and SEMIDWARF 1 (SD1). These contrasting patterns indicate that GP reaches near-optimal performance once genome-wide variation is adequately represented, whereas GWAS benefits from higher marker density through improved locus resolution. The present study establishes a benchmark for implementing breeding programs involving japonica/indica crosses using GP in a single environment.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
The Plant Genome
53 papers in training set
Top 0.1%
26.3%
2
Theoretical and Applied Genetics
46 papers in training set
Top 0.1%
10.6%
3
The Plant Journal
197 papers in training set
Top 0.6%
6.9%
4
Frontiers in Plant Science
240 papers in training set
Top 1%
6.4%
50% of probability mass above
5
New Phytologist
309 papers in training set
Top 1%
4.9%
6
Plant Biotechnology Journal
56 papers in training set
Top 0.2%
4.9%
7
The Plant Phenome Journal
14 papers in training set
Top 0.1%
4.4%
8
Horticulture Research
43 papers in training set
Top 0.5%
3.6%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 23%
3.1%
10
PLOS ONE
4510 papers in training set
Top 47%
2.1%
11
Scientific Reports
3102 papers in training set
Top 49%
2.1%
12
Plant Communications
35 papers in training set
Top 0.7%
1.9%
13
Crop Science
18 papers in training set
Top 0.2%
1.9%
14
BMC Plant Biology
47 papers in training set
Top 0.3%
1.8%
15
Nature Communications
4913 papers in training set
Top 50%
1.8%
16
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
17
Journal of Experimental Botany
195 papers in training set
Top 2%
1.7%
18
BMC Genomics
328 papers in training set
Top 3%
1.5%
19
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
1.0%
20
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.8%
0.8%
21
Genome Biology
555 papers in training set
Top 7%
0.8%
22
in silico Plants
24 papers in training set
Top 0.3%
0.8%
23
Plant Direct
81 papers in training set
Top 2%
0.8%
24
Plant Phenomics
17 papers in training set
Top 0.3%
0.8%
25
Genetics
225 papers in training set
Top 4%
0.7%
26
Plant Physiology
217 papers in training set
Top 3%
0.7%
27
PLANTS, PEOPLE, PLANET
21 papers in training set
Top 1.0%
0.5%