Back

EpiReasoner: An Integrated Artificial Intelligence Framework for Phenotype-to-Genotype Reasoning in Plant Epidermal Development

Zhang, H.; Feng, X.

2026-05-18 plant biology
10.64898/2026.05.13.724792 bioRxiv
Show abstract

Achieving high-throughput and precise phenotypic quantification and imaging modalities of stomatal and epidermal cells across diverse species remains a primary bottleneck in elucidating the mechanisms of stomatal dynamics, epidermal patterning, and environmental adaptation of plants. Here, we developed EpiReasoner, an artificial intelligence framework comprising a vision module, EpiVision, and a knowledge-based reasoning module, EpiBrain, for the quantitative phenotypic analysis and domain-specific knowledge reasoning of stomatal complexes and pavement cells in plants. Operating across bright-field, scanning electron microscopy, and differential interference contrast modalities, EpiVision achieves precise instance segmentation in various monocotyledonous, dicotyledonous, and fern species. Its performance significantly surpasses current state-of-the-art models. Moreover, we defined 23 quantitative indices describing stomatal cell morphology and spatial distribution. For domain-specific tasks such as phenotype prediction, genotype deduction, and molecular mechanism reasoning, EpiBrain demonstrates a human preference rate significantly higher than that of general-purpose large language models, including GPT-5 and Claude Sonnet 4. The application of EpiReasoner to phenotypic data of stomatal density derived from a tomato natural population of 170 accessions successfully identified a major quantitative trait locus on chromosome 8. The candidate gene, SKP1-interaction partner 19L (SKIP19L), encoding an F-box family protein, exhibited severe allele frequency drift during tomato domestication, which is highly consistent with the adaptive trend of reduced stomatal density under artificial selection. EpiReasoner provides a novel paradigm that unifies visual phenomics and knowledge-driven reasoning for the biology of stomata and pavement cells, thereby significantly accelerating scientific discovery in plant science.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 14%
12.3%
2
Plant Communications
35 papers in training set
Top 0.1%
10.0%
3
Horticulture Research
43 papers in training set
Top 0.3%
6.7%
4
Molecular Plant
36 papers in training set
Top 0.2%
6.7%
5
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
6.3%
6
Advanced Science
249 papers in training set
Top 3%
6.3%
7
Science China Life Sciences
26 papers in training set
Top 0.2%
4.8%
50% of probability mass above
8
Plant Physiology
217 papers in training set
Top 1%
3.5%
9
New Phytologist
309 papers in training set
Top 2%
3.5%
10
Journal of Genetics and Genomics
36 papers in training set
Top 0.4%
3.5%
11
The Plant Journal
197 papers in training set
Top 2%
2.7%
12
Nature Plants
84 papers in training set
Top 0.9%
2.1%
13
Plant Phenomics
17 papers in training set
Top 0.1%
2.1%
14
Genome Biology
555 papers in training set
Top 4%
1.9%
15
PLOS ONE
4510 papers in training set
Top 50%
1.9%
16
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 33%
1.7%
17
Cell Systems
167 papers in training set
Top 8%
1.6%
18
Scientific Reports
3102 papers in training set
Top 64%
1.3%
19
Communications Biology
886 papers in training set
Top 13%
1.3%
20
GigaScience
172 papers in training set
Top 2%
1.2%
21
PLOS Computational Biology
1633 papers in training set
Top 21%
0.9%
22
Development
440 papers in training set
Top 3%
0.9%
23
Plant Biotechnology Journal
56 papers in training set
Top 1%
0.9%
24
eLife
5422 papers in training set
Top 56%
0.8%
25
Genome Research
409 papers in training set
Top 4%
0.7%
26
iScience
1063 papers in training set
Top 33%
0.7%
27
Frontiers in Plant Science
240 papers in training set
Top 6%
0.6%