Identifying genomic regions and candidate genes selected during the breeding of rice in Vietnam

Higgins, J.; Santos, B.; Khanh, T. D.; Trung, K. H.; Duong, T. D.; Doai, N. T. P.; Hall, A.; Dyer, S.; Ham, L. H.; Caccamo, M.; De Vega, J. J.

2021-08-05 plant biology
10.1101/2021.08.04.455072 bioRxiv

Background and aims: Vietnam harbours a rich diversity of rice landraces adapted to a broad range of conditions, which constitute a largely untapped source of genetic diversity for the continuous improvement of rice cultivars. We previously identified a strong population structure in Vietnamese rice, which is captured in five Indica and four Japonica subpopulations, including an outlying Indica-5 group. Here, we leveraged that strong differentiation, and the 672 rice genomes generated, to identify genes within genomic regions putatively selected during domestication and breeding of rice in Vietnam.

Methodology: We identified significantly distorted patterns in allele frequency (XP-CLR method) and population differentiation scores (FST), resulting from differential selective pressures between native subpopulations, and compared them with QTLs previously identified by GWAS in the same panel. We particularly focused on the outlying Indica-5 subpopulation because of its likely novelty and differential evolution.

Results: We identified selection signatures in each of the Vietnamese subpopulations and carried out a comprehensive annotation of the 52 regions selected in Indica-5, which represented 8.1% of the rice genome. We annotated the 4,576 genes in these regions, checked their overlap with QTLs identified in the same diversity panel, and compared them with an FST analysis between subpopulations, to select sixty-five candidate genes as promising breeding targets, several of which harboured alleles with non-synonymous substitutions.

Conclusions: Our results highlight genomic differences between traditional Vietnamese landraces, which are likely the product of adaptation to multiple environmental conditions and regional culinary preferences in a very diverse country. We also verified the applicability of this genome scanning approach to identify potential regions harbouring novel loci and alleles to breed a new generation of sustainable and resilient rice.
Key message: We localised regions in the rice genome selected during breeding by comparing allele frequency patterns among Vietnamese rice subpopulations. We characterised candidate genes in the Indica-5 subpopulation with breeding potential.
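
As a rough illustration of the population differentiation score (FST) mentioned in the methodology, the sketch below computes a textbook Wright-style FST for a single biallelic site from the allele frequencies of two equally weighted subpopulations. The abstract does not state which FST estimator or software the authors used, so the function name `wright_fst` and the equal-weight formula are assumptions made for illustration only, not the method of the preprint.

```python
def wright_fst(p1: float, p2: float) -> float:
    """Wright-style FST at one biallelic site for two equally weighted populations.

    p1, p2: frequency of the same allele in subpopulation 1 and 2.
    Hypothetical helper for illustration; not taken from the preprint.
    """
    for p in (p1, p2):
        if not 0.0 <= p <= 1.0:
            raise ValueError("allele frequencies must lie in [0, 1]")

    # Mean expected heterozygosity within subpopulations (H_S).
    h_s = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2

    # Expected heterozygosity of the pooled population (H_T).
    p_bar = (p1 + p2) / 2
    h_t = 2 * p_bar * (1 - p_bar)

    # FST is undefined when the site is monomorphic across both populations.
    if h_t == 0.0:
        return float("nan")

    return (h_t - h_s) / h_t


if __name__ == "__main__":
    # Identical frequencies give no differentiation; opposite fixation gives full differentiation.
    print(wright_fst(0.3, 0.3))
    print(wright_fst(1.0, 0.0))
```

Genome scans of this kind usually summarise such per-site values over windows and flag the most extreme windows as candidate selected regions; the windowing and thresholds are analysis choices that the abstract does not specify.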

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. The Plant Genome (53 papers in training set; Top 0.1%; 14.8%)
2. PLOS ONE (4510 papers in training set; Top 17%; 10.5%)
3. Theoretical and Applied Genetics (46 papers in training set; Top 0.1%; 8.5%)
4. Frontiers in Plant Science (240 papers in training set; Top 1%; 7.2%)
5. Scientific Reports (3102 papers in training set; Top 14%; 6.9%)
6. BMC Plant Biology (47 papers in training set; Top 0.1%; 4.0%)

50% of probability mass above

7. PLANTS, PEOPLE, PLANET (21 papers in training set; Top 0.1%; 3.7%)
8. GigaScience (172 papers in training set; Top 0.6%; 3.1%)
9. Crop Science (18 papers in training set; Top 0.1%; 2.8%)
10. The Plant Journal (197 papers in training set; Top 2%; 2.4%)
11. The Plant Phenome Journal (14 papers in training set; Top 0.1%; 2.1%)
12. BMC Genomics (328 papers in training set; Top 2%; 1.9%)
13. New Phytologist (309 papers in training set; Top 3%; 1.8%)
14. Plant Biotechnology Journal (56 papers in training set; Top 0.7%; 1.7%)
15. Plant Direct (81 papers in training set; Top 1%; 1.5%)
16. G3: Genes, Genomes, Genetics (222 papers in training set; Top 0.5%; 1.3%)
17. Horticulture Research (43 papers in training set; Top 1%; 1.3%)
18. Frontiers in Genetics (197 papers in training set; Top 6%; 1.3%)
19. G3 Genes|Genomes|Genetics (351 papers in training set; Top 2%; 1.2%)
20. Gigabyte (60 papers in training set; Top 0.9%; 1.2%)
21. Plant Physiology (217 papers in training set; Top 2%; 1.2%)
22. Evolutionary Applications (91 papers in training set; Top 0.8%; 1.2%)
23. Agronomy (18 papers in training set; Top 0.6%; 1.0%)
24. Plant Science (25 papers in training set; Top 0.9%; 0.9%)
25. Proceedings of the National Academy of Sciences (2130 papers in training set; Top 41%; 0.9%)
26. G3 (33 papers in training set; Top 0.5%; 0.8%)
27. Nature Communications (4913 papers in training set; Top 65%; 0.6%)
28. PeerJ (261 papers in training set; Top 17%; 0.6%)
29. Plant Phenomics (17 papers in training set; Top 0.4%; 0.6%)
30. Gene (41 papers in training set; Top 3%; 0.6%)
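
The statement that the top 6 journals account for 50% of the predicted probability mass can be checked from the percentages listed above with a running sum. The sketch below is a minimal illustration: the helper name `journals_to_reach` is hypothetical, and the input is simply the first seven listed percentages copied from the page.

```python
from itertools import accumulate

# Predicted probabilities (in percent) of the first seven journals, in listed order.
top_probabilities = [14.8, 10.5, 8.5, 7.2, 6.9, 4.0, 3.7]


def journals_to_reach(probabilities, threshold=50.0):
    """Return (rank, cumulative total) at which the running sum first reaches threshold.

    Hypothetical helper for illustration; returns None if the threshold is never reached.
    """
    for rank, total in enumerate(accumulate(probabilities), start=1):
        if total >= threshold:
            return rank, total
    return None


if __name__ == "__main__":
    rank, total = journals_to_reach(top_probabilities)
    print(f"{rank} journals cover {total:.1f}% of the probability mass")
```

The running sum is 47.9% after five journals and 51.9% after six, which is why the 50% marker sits between ranks 6 and 7.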