Back

Whole-Genome sequencing of Indigenous Withania somnifera accession and comparative cytochrome P450 phylogenomics

gupta, S.; Misra, P.; Singh, R.; Dhar, M. K.

2026-05-01 genomics
10.64898/2026.04.28.721529 bioRxiv
Show abstract

Cytochrome P450 monooxygenases (CYP450s) are key oxidative enzymes that diversify plant specialized metabolites and play a central role in the biosynthesis of bioactive withanolides in Withania somnifera (L.) Dunal. Despite their importance, genome-wide information on CYP450s in W. somnifera has remained elusive. Herein, the first high-quality genome assembly (2.2 Gb, scaffold N50: 47.4 kb) of an Indian W. somnifera cultivar was generated using a hybrid Oxford Nanopore-Illumina sequencing strategy. Comparative analysis with the NCBI reference genome revealed moderate SNP and indel variations, reflecting intraspecific genetic diversity. A comprehensive CYP450 catalog was established and analyzed phylogenomically across nine plant genomes, encompassing both withanolide-producing and non-producing Solanaceae and non-Solanaceae species. Unique CYP families (CYP450A, CYP1194, and CYP705A) were detected exclusively in W. somnifera, suggesting lineage-specific metabolic innovations, while Solanaceae-restricted (CYP82E/M) and absent (CYP81B, CYP6) lineages highlight taxonomic divergence. Across all analyzed genomes, 36 conserved CYP450 subfamilies, including triterpenoid-associated members, were identified, suggesting a shared oxidative framework adaptable to specialized metabolism. Moreover, potential candidate genes in the triterpenoid pathway, including CYP72A692_1, CYP72A560_4, CYP716A48, CYP724B2, and CYP51G1, were identified through phylogenetic integration with functionally validated triterpenoid-modifying enzymes from other plant species. Gene family evolution analysis further revealed contraction of monoterpenoid-related subfamilies (CYP76A), implying a metabolic shift toward triterpenoid specialization. The comprehensive genome assembly and CYPome of W. somnifera offer a valuable resource for functional characterization, evolutionary analysis, and the identification of genes underlying its specialized metabolism. Furthermore, the study advances our understanding of CYP450 diversity and evolution, revealing lineage-specific innovations, conserved subfamilies, and key candidate genes involved in triterpenoid biosynthesis. Together, these findings lay a foundation for future functional studies and pathway engineering aimed at optimizing the metabolic potential of this important medicinal plant.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Frontiers in Plant Science
240 papers in training set
Top 0.9%
10.1%
2
Scientific Reports
3102 papers in training set
Top 6%
10.1%
3
The Plant Journal
197 papers in training set
Top 0.5%
8.4%
4
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.9%
6.8%
5
Horticulture Research
43 papers in training set
Top 0.4%
4.9%
6
Plant Communications
35 papers in training set
Top 0.2%
4.9%
7
New Phytologist
309 papers in training set
Top 1%
4.9%
50% of probability mass above
8
Nature Communications
4913 papers in training set
Top 33%
4.9%
9
Scientific Data
174 papers in training set
Top 0.4%
4.0%
10
PLOS ONE
4510 papers in training set
Top 39%
3.6%
11
Plant Biotechnology Journal
56 papers in training set
Top 0.5%
2.5%
12
Communications Biology
886 papers in training set
Top 4%
2.5%
13
Plant Direct
81 papers in training set
Top 0.8%
2.4%
14
Genomics
60 papers in training set
Top 1%
1.7%
15
BMC Genomics
328 papers in training set
Top 2%
1.7%
16
Journal of Agricultural and Food Chemistry
14 papers in training set
Top 0.6%
1.7%
17
eLife
5422 papers in training set
Top 43%
1.7%
18
DNA Research
23 papers in training set
Top 0.3%
1.2%
19
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.2%
20
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
21
Frontiers in Microbiology
375 papers in training set
Top 8%
0.9%
22
Molecular Plant
36 papers in training set
Top 1%
0.8%
23
Plant Molecular Biology
18 papers in training set
Top 0.3%
0.7%
24
Free Radical Biology and Medicine
33 papers in training set
Top 0.5%
0.7%
25
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
26
Journal of Genetics and Genomics
36 papers in training set
Top 2%
0.7%
27
BMC Plant Biology
47 papers in training set
Top 1%
0.7%
28
Journal of Hazardous Materials
19 papers in training set
Top 0.9%
0.7%
29
Genes
126 papers in training set
Top 4%
0.6%
30
Metabolites
50 papers in training set
Top 1%
0.6%