Back

Multi-Substrate Specificity of Isoflavone hydroxylases (GmIFH) Drive Isoflavonoid Diversification in Soybean

Khatri, P.; McDowell, T.; Marsolais, F.; Renaud, J.; Dhaubhadel, S.

2026-05-08 biochemistry
10.64898/2026.05.05.722824 bioRxiv
Show abstract

Isoflavone hydroxylases (IFHs, CYP81E) convert isoflavone aglycones into their respective hydroxylated intermediates, which direct legume isoflavones into specialized defense pathways. In soybean, their functions have been studied mostly in the context of the daidzein-derived glyceollin biosynthesis. Here we combine metabolomics-guided feature mining, phylogenetic analysis, heterologous enzymology, structural elucidation, and in planta metabolite validation to determine the functional landscape of the soybean IFH family. Analysis of a soybean isoflavonoid-enriched metabolomic dataset revealed unidentified hydroxyisoflavone features that co-accumulated with glyceollins, indicating branch chemistry that is not well-recognized. The systematic characterization of the repertoire of soybean CYP81E has demonstrated that 9 out of 11 GmIFHs are catalytically active and collectively span both 2'- and 3'- hydroxylation of the major soybean isoflavone aglycones. Among them, GmIFH9A showed broad substrate scope and regioselectivity, yielding canonical and previously unknown hydroxylated isoflavone products. NMR and LC-MS/MS were used to identify and validate the hydroxylated isoflavone products as 2'-hydroxyglycitein and 2'-hydroxyformononetin, whose presence was also confirmed in soybean roots, thus confirming two of the hidden soybean isoflavonoid network metabolites. Kinetic studies also indicated that, although the majority of GmIFHs prefer daidzein and genistein as substrates, a few isoforms are active towards methoxylated isoflavones as well, indicating functional divergence in this expanded family. Our findings collectively redefine soybean IFHs as a multi-functional enzyme module that expands the hydroxyisoflavone chemical space and reveals new biosynthetic entry points beyond canonical glyceollin pathway.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 5%
19.2%
2
Cell Reports
1338 papers in training set
Top 4%
8.6%
3
Cell Chemical Biology
81 papers in training set
Top 0.3%
7.4%
4
Horticulture Research
43 papers in training set
Top 0.3%
7.4%
5
Plant Communications
35 papers in training set
Top 0.1%
7.0%
6
Nature Chemical Biology
104 papers in training set
Top 0.3%
6.5%
50% of probability mass above
7
Journal of the American Chemical Society
199 papers in training set
Top 1%
4.4%
8
eLife
5422 papers in training set
Top 20%
4.3%
9
Molecular Plant
36 papers in training set
Top 0.5%
3.1%
10
Advanced Science
249 papers in training set
Top 8%
2.1%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 31%
1.8%
12
Acta Pharmaceutica Sinica B
11 papers in training set
Top 0.3%
1.8%
13
Angewandte Chemie International Edition
81 papers in training set
Top 2%
1.7%
14
New Phytologist
309 papers in training set
Top 4%
1.4%
15
Science Advances
1098 papers in training set
Top 26%
0.9%
16
Communications Biology
886 papers in training set
Top 18%
0.9%
17
Science
429 papers in training set
Top 18%
0.9%
18
ACS Chemical Biology
150 papers in training set
Top 2%
0.8%
19
ACS Catalysis
16 papers in training set
Top 0.2%
0.8%
20
Plant Physiology
217 papers in training set
Top 3%
0.8%
21
Cell Discovery
54 papers in training set
Top 5%
0.8%
22
Redox Biology
64 papers in training set
Top 1%
0.7%
23
Nature Chemistry
34 papers in training set
Top 0.9%
0.7%
24
Nucleic Acids Research
1128 papers in training set
Top 19%
0.7%
25
Scientific Reports
3102 papers in training set
Top 77%
0.7%
26
Journal of Natural Products
11 papers in training set
Top 0.4%
0.7%
27
iScience
1063 papers in training set
Top 36%
0.7%
28
Cell
370 papers in training set
Top 19%
0.5%