Machine learning-based prediction of dynamic height heterosis with pathway biomarkers in rice

Dan, Z.; Chen, Y.; Huang, W.

2025-04-14 systems biology
10.1101/2024.11.09.622823 bioRxiv

The development of robust biomarkers enables accurate prediction of complex phenotypes. However, the dynamic nature of biomarkers is often underestimated, even though their quantitative changes during development are directly connected to phenotypic transformations that influence both crop agronomic traits and human diseases. Here, we performed network analysis of untargeted metabolite profiles to investigate height heterosis in rice, which varies during development and is a key determinant of yield heterosis. We found that the levels of pyruvaldehyde were predictive of height heterosis specifically at the seedling stage, while 4-hydroxycinnamic acid was positively correlated with height heterosis across four developmental stages. We identified metabolic pathways associated with height heterosis and found that metabolomic changes during the elongation stage had a greater impact than those in other stages. Finally, 11 heterosis-associated pathways were developed into metabolomic biomarkers through random forest analysis, successfully predicting height heterosis in an independent population under different growth conditions. This study elucidates the metabolomic landscape of dynamic height heterosis in rice and develops pathway biomarkers for complex phenotypes, demonstrating robustness across diverse populations, environments, and developmental stages.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Plant Communications: 35 papers in training set, Top 0.1%, 22.2%
2. Nature Communications: 4913 papers in training set, Top 14%, 12.2%
3. Scientific Reports: 3102 papers in training set, Top 19%, 6.3%
4. Advanced Science: 249 papers in training set, Top 4%, 4.8%
5. Genomics, Proteomics & Bioinformatics: 171 papers in training set, Top 2%, 3.9%
6. Frontiers in Plant Science: 240 papers in training set, Top 2%, 3.5%
50% of probability mass above
7. Cell Reports: 1338 papers in training set, Top 19%, 2.6%
8. Horticulture Research: 43 papers in training set, Top 0.8%, 2.0%
9. Communications Biology: 886 papers in training set, Top 5%, 2.0%
10. Journal of Agricultural and Food Chemistry: 14 papers in training set, Top 0.5%, 1.9%
11. iScience: 1063 papers in training set, Top 12%, 1.9%
12. Computational and Structural Biotechnology Journal: 216 papers in training set, Top 4%, 1.9%
13. Synthetic and Systems Biotechnology: 10 papers in training set, Top 0.2%, 1.8%
14. eLife: 5422 papers in training set, Top 40%, 1.8%
15. npj Systems Biology and Applications: 99 papers in training set, Top 1%, 1.7%
16. Metabolic Engineering: 68 papers in training set, Top 0.4%, 1.7%
17. BMC Plant Biology: 47 papers in training set, Top 0.4%, 1.7%
18. Journal of Genetics and Genomics: 36 papers in training set, Top 1.0%, 1.7%
19. Plant Biotechnology Journal: 56 papers in training set, Top 0.7%, 1.6%
20. PLOS ONE: 4510 papers in training set, Top 57%, 1.5%
21. Plant Physiology: 217 papers in training set, Top 2%, 1.3%
22. Science China Life Sciences: 26 papers in training set, Top 1%, 1.2%
23. The Plant Journal: 197 papers in training set, Top 3%, 1.2%
24. Science Advances: 1098 papers in training set, Top 25%, 0.9%
25. Journal of Experimental Botany: 195 papers in training set, Top 3%, 0.9%
26. Genome Biology: 555 papers in training set, Top 7%, 0.8%
27. in silico Plants: 24 papers in training set, Top 0.3%, 0.7%
28. Proceedings of the National Academy of Sciences: 2130 papers in training set, Top 46%, 0.7%
29. New Phytologist: 309 papers in training set, Top 5%, 0.6%
30. Frontiers in Genetics: 197 papers in training set, Top 12%, 0.6%
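
The statement that the top 6 journals account for 50% of the predicted probability mass can be checked from the percentages listed above. The following is a minimal sketch, not the site's own method: the function name `journals_to_reach` is hypothetical, and the values are the listed percentages copied in rank order.

```python
# Predicted probabilities (percent) of the 30 listed journals, in rank order.
# Values are copied from the list above; nothing else is assumed about the model.
probs = [
    22.2, 12.2, 6.3, 4.8, 3.9, 3.5, 2.6, 2.0, 2.0, 1.9,
    1.9, 1.9, 1.8, 1.8, 1.7, 1.7, 1.7, 1.7, 1.6, 1.5,
    1.3, 1.2, 1.2, 0.9, 0.9, 0.8, 0.7, 0.7, 0.6, 0.6,
]


def journals_to_reach(probabilities, threshold):
    """Return (rank, cumulative mass) at the first rank whose running
    total meets the threshold, or (None, total) if it is never met.

    Hypothetical helper written for this illustration only.
    """
    total = 0.0
    for rank, p in enumerate(probabilities, start=1):
        total += p
        if total >= threshold:
            return rank, total
    return None, total


rank, mass = journals_to_reach(probs, 50.0)
print(rank, round(mass, 1))  # 6 52.9
```

The running total after five journals is 49.4%, so the sixth entry is the first to bring the cumulative mass past 50%.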