Back

Deciphering the Genetic Architecture of Sorghum Grain Oil Content via Lipidome-Integrated Genome-Wide Association Analysis

Jiao, Y.; Nigam, D.; Metwally, S.; Chen, F.

2026-03-16 bioinformatics
10.64898/2026.03.12.711187 bioRxiv
Show abstract

Grain oil content and composition are complex quantitative traits that shape cereal grain quality and nutritional value. Sorghum (Sorghum bicolor), a heat- and drought-adapted C crop essential for global food and feed security, remains insufficiently characterized with respect to grain lipidome diversity and its genetic architecture. Here, we integrated population-scale whole-grain lipidomics with genome-wide association studies (GWAS) in 266 sorghum accessions. Lipidome profiling revealed extensive natural variation in triacylglycerols (TAGs), accompanied by coordinated shifts in phosphatidylcholines (PCs) and phosphatidylethanolamines (PEs), explaining 87% of population-level differences in total grain oil. Lipidome-wide GWAS identified approximately 1.6 million significant variant-trait associations and resolved 55 loci linked to plastidial fatty acid synthesis, TAG assembly, lipid transport, and membrane remodeling. These loci, many undetected in previous GWAS of bulk oil content, demonstrated the increased mapping resolution achieved through lipidomics. Integration with metabolic gene clusters revealed significant enrichment of lipid-associated variants within terpene and saccharide-terpene biosynthetic clusters, indicating coordinated genetic regulation between central lipid metabolism and specialized metabolic pathways. Variants within these clusters explained more than 50% of the variance in measured grain oil content and exhibited additive effects of favorable alleles. Haplotype analyses further identified 27 elite sorghum accessions and 12 linked markers for marker-assisted improvement of sorghum grain oil. These findings elucidate the multilayered genetic architecture of sorghum grain lipid diversity and showcase the value of large-scale lipidomics integrated with GWAS for accelerating C crop grain quality improvement.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 0.3%
39.7%
2
Molecular Plant
36 papers in training set
Top 0.1%
10.2%
3
Advanced Science
249 papers in training set
Top 4%
4.3%
50% of probability mass above
4
Science Advances
1098 papers in training set
Top 4%
3.9%
5
Plant Biotechnology Journal
56 papers in training set
Top 0.3%
3.6%
6
Nature Metabolism
56 papers in training set
Top 0.7%
3.3%
7
Nature Plants
84 papers in training set
Top 0.8%
2.4%
8
Communications Biology
886 papers in training set
Top 6%
1.9%
9
Cell Genomics
162 papers in training set
Top 3%
1.9%
10
Plant Communications
35 papers in training set
Top 0.7%
1.8%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 32%
1.7%
12
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
1.7%
13
Genome Biology
555 papers in training set
Top 4%
1.7%
14
Nature Genetics
240 papers in training set
Top 5%
1.5%
15
Science
429 papers in training set
Top 16%
1.3%
16
Horticulture Research
43 papers in training set
Top 1%
1.2%
17
Nature
575 papers in training set
Top 13%
1.1%
18
Cell Reports
1338 papers in training set
Top 29%
1.0%
19
Nature Chemical Biology
104 papers in training set
Top 3%
1.0%
20
Cell Systems
167 papers in training set
Top 10%
0.9%
21
eLife
5422 papers in training set
Top 53%
0.9%
22
Nucleic Acids Research
1128 papers in training set
Top 19%
0.6%
23
New Phytologist
309 papers in training set
Top 5%
0.6%
24
Cell
370 papers in training set
Top 18%
0.6%
25
Nature Microbiology
133 papers in training set
Top 6%
0.5%
26
Plant Physiology
217 papers in training set
Top 3%
0.5%
27
The American Journal of Human Genetics
206 papers in training set
Top 5%
0.5%