Back

PlantP450Dock: an Automated Molecular Docking Pipeline of Plant Cytochrome P450s

Feng, L.; Niu, C.; Qing, X.; Zhang, C.; Li, C.

2026-05-15 bioinformatics
10.64898/2026.05.12.724510 bioRxiv
Show abstract

Cytochrome P450 enzymes (CYPs) are the primary drivers of chemical diversification in plant secondary metabolism; however, fewer than 10% of the superfamily members have been functionally characterized. Computational docking provides a scalable strategy to prioritize candidates for experimental validation, yet prevailing workflows are poorly adapted to plant P450s because AlphaFold-predicted structures lack the essential heme cofactor and conventional flexible-residue selection relies on subjective geometric cutoffs. To address these limitations, we developed an automated pipeline--PlantP450Dock--that unifies heme cofactor implantation, molecular dynamics-based conformational sampling, data-driven flexible residue selection, and semi-flexible docking within a single integrated workflow. The heme is transferred from a crystallographic template to the AlphaFold model via a local coordinate transformation algorithm, achieving a positional deviation of less than 0.2 [A] relative to the experimentally determined CYP73A33 structure (PDB: 6VBY). Subsequent 100 ns molecular dynamics simulations confirmed faithful preservation of the Fe-S coordination geometry (2.61 {+/-} 0.08 [A]) across all trajectory frames. A singular value decomposition-based heme-plane filtering strategy objectively identified distal active-site residues for flexible treatment, eliminating user-dependent subjectivity. Cross-family validation against four phylogenetically distinct P450s (CYP73, CYP711, CYP706, and CYP701) generated catalytically competent binding poses with substrate-to-heme-iron distances of 2.8-4.4 [A] without enzyme-specific parameterization. Released as an open-source tool, this pipeline furnishes the plant science community with a standardized, reproducible computational framework to accelerate functional annotation of the largely unexplored plant P450 families.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.9%
2
Nature Communications
4913 papers in training set
Top 13%
12.8%
3
PLOS Computational Biology
1633 papers in training set
Top 5%
6.9%
4
Plant Communications
35 papers in training set
Top 0.2%
6.4%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
4.0%
6
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
4.0%
7
eLife
5422 papers in training set
Top 31%
2.8%
50% of probability mass above
8
Communications Biology
886 papers in training set
Top 4%
2.5%
9
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
2.4%
10
BMC Bioinformatics
383 papers in training set
Top 4%
2.1%
11
Bioinformatics Advances
184 papers in training set
Top 2%
2.1%
12
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
1.9%
13
Plant Physiology
217 papers in training set
Top 2%
1.7%
14
Molecular Plant
36 papers in training set
Top 0.9%
1.5%
15
PLOS ONE
4510 papers in training set
Top 58%
1.3%
16
Advanced Science
249 papers in training set
Top 13%
1.3%
17
Genome Medicine
154 papers in training set
Top 5%
1.3%
18
Scientific Reports
3102 papers in training set
Top 65%
1.2%
19
New Phytologist
309 papers in training set
Top 4%
1.2%
20
Cell Systems
167 papers in training set
Top 9%
1.2%
21
The Plant Journal
197 papers in training set
Top 3%
1.1%
22
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.0%
23
Journal of Molecular Biology
217 papers in training set
Top 3%
0.9%
24
Metabolites
50 papers in training set
Top 0.9%
0.9%
25
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.8%
26
Protein Science
221 papers in training set
Top 2%
0.8%
27
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
28
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.7%
29
Scientific Data
174 papers in training set
Top 3%
0.7%
30
Horticulture Research
43 papers in training set
Top 2%
0.5%