Back

What Large Language Models Know About Plant Molecular Biology

Fernandez Burda, M.; Ferrero, L.; Gaggion, N.; Fonouni-Farde, C.; Iglesias, M. J.; Fragkostefanakis, S.; Tonelli, M. L.; Zanetti, M. E.; Krapp, A.; Mencia, R.; Romani, F.; Muschietti, J. P.; Mansilla, N.; Casal, J.; Pagnussat, L. A.; Ballare, C. L.; Mammarella, M. F.; Blanco, F. A.; Roy, S.; Maroniche, G. A.; Rivarola, M.; Fiol, D. F.; Cubas, P.; Dezar, C.; Casati, P.; Ibanez, F.; Fernanda, d. C.-N.; Staiger, D.; Fusari, C. M.; Auge, G.; Arana, M. V.; Parmar, R.; Zhang, W.; Mathur, S.; Verslues, P. E. V.; Manavella, P. A.; Mateos, J. L.; Bouche, N.; Lucero, L. E.; Drincovich, M. F.; Traubenik,

2025-09-04 plant biology
10.1101/2025.08.31.672925 bioRxiv
Show abstract

Large language models (LLMs) are rapidly permeating scientific research, yet their capabilities in plant molecular biology remain largely uncharacterized. Here, we present MO_SCPLOWOC_SCPLOWBO_SCPLOWIC_SCPLOWPO_SCPLOWLANTC_SCPLOW, the first comprehensive benchmark for evaluating LLMs in this domain, developed by a consortium of 112 plant scientists across 19 countries. MO_SCPLOWOC_SCPLOWBO_SCPLOWIC_SCPLOWPO_SCPLOWLANTC_SCPLOW comprises 565 expert-curated multiple-choice questions and 1,075 synthetically generated questions, spanning core topics from gene regulation to plant-environment interactions. We benchmarked seven leading chat-based LLMs using both automated scoring and human evaluation of open-ended answers. Models performed well on multiple-choice tasks (exceeding 75% accuracy), although most of them exhibited a consistent bias towards option A. In contrast, expert reviews exposed persistent limitations, including factual misalignment, hallucinations, and low self-awareness. Critically, we found that model performance strongly correlated with the citation frequency of source literature, suggesting that LLMs do not simply encode plant biology knowledge uniformly, but are instead shaped by the visibility and frequency of information in their training corpora. This understanding is key to guiding both the development of next-generation models and the informed use of current tools in the everyday work of plant researchers. MO_SCPLOWOC_SCPLOWBO_SCPLOWIC_SCPLOWPO_SCPLOWLANTC_SCPLOW is publicly available online in this link.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Plants
84 papers in training set
Top 0.1%
22.1%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 8%
8.3%
3
Science
429 papers in training set
Top 6%
4.8%
4
Nature
575 papers in training set
Top 5%
4.8%
5
Development
440 papers in training set
Top 0.6%
4.2%
6
Genome Biology
555 papers in training set
Top 2%
4.1%
7
Applications in Plant Sciences
21 papers in training set
Top 0.1%
3.0%
50% of probability mass above
8
Bioinformatics
1061 papers in training set
Top 6%
3.0%
9
GigaScience
172 papers in training set
Top 0.7%
2.7%
10
Cell Systems
167 papers in training set
Top 6%
2.1%
11
Plant Physiology
217 papers in training set
Top 2%
2.0%
12
Nature Communications
4913 papers in training set
Top 49%
1.9%
13
The Plant Journal
197 papers in training set
Top 2%
1.9%
14
Database
51 papers in training set
Top 0.3%
1.9%
15
PLOS Biology
408 papers in training set
Top 10%
1.7%
16
PLOS ONE
4510 papers in training set
Top 56%
1.6%
17
PLOS Computational Biology
1633 papers in training set
Top 17%
1.6%
18
Nature Methods
336 papers in training set
Top 5%
1.5%
19
Plant Communications
35 papers in training set
Top 0.9%
1.5%
20
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.3%
21
The Plant Cell
141 papers in training set
Top 2%
1.2%
22
Molecular Systems Biology
142 papers in training set
Top 1.0%
1.2%
23
eLife
5422 papers in training set
Top 51%
1.1%
24
Cell
370 papers in training set
Top 15%
0.9%
25
BMC Biology
248 papers in training set
Top 3%
0.9%
26
in silico Plants
24 papers in training set
Top 0.3%
0.9%
27
Molecular Plant
36 papers in training set
Top 1%
0.8%
28
New Phytologist
309 papers in training set
Top 4%
0.8%
29
Nature Biotechnology
147 papers in training set
Top 8%
0.7%
30
Nature Genetics
240 papers in training set
Top 8%
0.7%