Back

The Chromosome-Scale Genome of Phyllanthus niruri Reveals Candidate Genes and a Putative Biosynthetic Framework for Phyllanthin Formation.

Khushi, K.; Ganesh, A.; Sharma, A.; Ravindran, F.; Srinivasan, S.; Choudhary, B.

2026-05-16 genomics
10.64898/2026.05.15.725390 bioRxiv
Show abstract

Phyllanthus niruri (Phyllanthaceae) is a medicinally important herb known for producing phyllanthin, a bioactive dibenzylbutane lignan with reported hepatoprotective and antioxidant properties. However, the biosynthetic basis of phyllanthin production remains unresolved, largely due to the absence of a reference genome for the species. We report a Chromosome-Scale Assembly of P. niruri generated by integrating PacBio HiFi long reads and Illumina short reads, followed by reference-guided scaffolding against Phyllanthus cochinchinensis. The assembly has an L50 of 7 and 97.6% BUSCO completeness. Annotation predicted 19,254 protein-coding genes (91.1% functionally annotated), with phenylpropanoid biosynthesis emerging as the most enriched specialized-metabolism pathway in the genome. Using pathway-guided genome mining, structural similarity analysis, and comparative metabolic reconstruction, we propose a putative biosynthetic pathway for phyllanthin originating from the phenylpropanoid-lignan branch through secoisolariciresinol-like intermediates, followed by terminal O-methylation reactions. A total of 305 unique candidate genes associated with the proposed pathway were identified, including expanded families of dirigent proteins, peroxidases, secoisolariciresinol dehydrogenases, and O-methyltransferases. Comparative transcriptomic analyses across related Phyllanthus species further supported the proposed pathway through coordinated expression of lignan-associated genes and tissue-specific enrichment of O-methyltransferases. This work provides the first reference genome for P. niruri and a prioritized candidate gene set for functional characterization of phyllanthin biosynthesis.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 1%
29.3%
2
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.9%
6.8%
3
Communications Biology
886 papers in training set
Top 1.0%
4.2%
4
Horticulture Research
43 papers in training set
Top 0.7%
2.9%
5
Scientific Reports
3102 papers in training set
Top 42%
2.9%
6
Plant Communications
35 papers in training set
Top 0.6%
2.2%
7
Cell Reports
1338 papers in training set
Top 20%
2.2%
50% of probability mass above
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 27%
2.2%
9
Molecular Plant
36 papers in training set
Top 0.6%
2.2%
10
Science
429 papers in training set
Top 12%
2.2%
11
Advanced Science
249 papers in training set
Top 10%
1.9%
12
The Plant Journal
197 papers in training set
Top 2%
1.8%
13
New Phytologist
309 papers in training set
Top 3%
1.8%
14
Science Advances
1098 papers in training set
Top 16%
1.8%
15
Journal of the American Chemical Society
199 papers in training set
Top 3%
1.6%
16
Cell Discovery
54 papers in training set
Top 3%
1.6%
17
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.4%
18
Cell Chemical Biology
81 papers in training set
Top 2%
1.3%
19
Plant Biotechnology Journal
56 papers in training set
Top 0.8%
1.3%
20
Cell
370 papers in training set
Top 15%
1.0%
21
eLife
5422 papers in training set
Top 51%
1.0%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.0%
23
Nature Plants
84 papers in training set
Top 2%
0.8%
24
Scientific Data
174 papers in training set
Top 2%
0.8%
25
Nature Genetics
240 papers in training set
Top 7%
0.8%
26
Protein & Cell
25 papers in training set
Top 2%
0.8%
27
PLOS ONE
4510 papers in training set
Top 65%
0.8%
28
Nature
575 papers in training set
Top 15%
0.8%
29
BMC Genomics
328 papers in training set
Top 5%
0.8%
30
Frontiers in Genetics
197 papers in training set
Top 10%
0.7%