
Evolution of research topics and paradigms in plant sciences

Shiu, S.-H.; Lehti-Shiu, M. D.

2023-10-03 plant biology
10.1101/2023.10.02.560457 bioRxiv

Scientific advances due to conceptual or technological innovations can be revealed by examining how research topics have evolved. But such topical evolution is difficult to uncover and quantify because of the large body of literature and the need for expert knowledge across a wide range of areas in any field. Here we used machine learning and language models to classify plant science citations into topics representing interconnected, evolving subfields. The changes in prevalence of topical records over the last 50 years reflect major research paradigm shifts and recent radiation of new topics, as well as turnovers of model species and vastly different plant science research trajectories among countries. Our approaches readily summarize the topical diversity and evolution of a scientific field with hundreds of thousands of relevant papers, and they can be applied broadly to other fields.

Significance statement

Changes in scientific paradigms are foundational for the advancement of science, but such changes are difficult to summarize, quantify, and illustrate. These challenges are exacerbated by the rapid, exponential growth of literature. Applying a combination of machine learning and language modeling to hundreds of thousands of published abstracts, we demonstrate that a scientific field (i.e., plant science) can be summarized as interconnected subfields evolving from one another. We also reveal insights into major research trends and the rise and decline in the use of model organisms in different countries. Our study demonstrates how artificial intelligence and language models can be broadly applied to understand scientific advances that inform science policy and funding decisions.

Matching journals

The top-ranked journal alone accounts for more than 50% of the predicted probability mass (51.5%).

1. Proceedings of the National Academy of Sciences; 2130 papers in training set; Top 0.1%; 51.5%
(50% of probability mass above)
2. Nature Plants; 84 papers in training set; Top 0.1%; 8.3%
3. eLife; 5422 papers in training set; Top 20%; 4.3%
4. Science; 429 papers in training set; Top 8%; 3.9%
5. New Phytologist; 309 papers in training set; Top 2%; 2.7%
6. PLOS Biology; 408 papers in training set; Top 5%; 2.6%
7. Genome Biology; 555 papers in training set; Top 3%; 2.1%
8. PLOS Computational Biology; 1633 papers in training set; Top 16%; 1.7%
9. Applications in Plant Sciences; 21 papers in training set; Top 0.2%; 1.7%
10. Cell Systems; 167 papers in training set; Top 8%; 1.5%
11. Nature Communications; 4913 papers in training set; Top 54%; 1.5%
12. Science Advances; 1098 papers in training set; Top 25%; 0.9%
13. Molecular Systems Biology; 142 papers in training set; Top 1%; 0.8%
14. Molecular Biology and Evolution; 488 papers in training set; Top 4%; 0.7%
15. BMC Biology; 248 papers in training set; Top 4%; 0.7%
16. Global Change Biology; 69 papers in training set; Top 2%; 0.7%
17. PLOS ONE; 4510 papers in training set; Top 68%; 0.7%
18. BMC Medicine; 163 papers in training set; Top 8%; 0.7%
19. NAR Genomics and Bioinformatics; 214 papers in training set; Top 4%; 0.7%
20. Nature Methods; 336 papers in training set; Top 6%; 0.7%
21. Patterns; 70 papers in training set; Top 3%; 0.7%
22. Briefings in Bioinformatics; 326 papers in training set; Top 7%; 0.7%
23. PLANTS, PEOPLE, PLANET; 21 papers in training set; Top 0.8%; 0.7%
24. mSystems; 361 papers in training set; Top 8%; 0.7%
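
The "50% of probability mass" cutoff in the list above can be read as a cumulative-sum threshold over the ranked probabilities. The following is a minimal sketch of that arithmetic, not the site's actual method, which is not described here. The function name `journals_to_reach_mass` and the default threshold are assumptions made for illustration; the input values are the predicted probabilities of the first five journals listed above, written as fractions.

```python
from itertools import accumulate


def journals_to_reach_mass(probabilities, threshold=0.5):
    """Return how many top-ranked entries are needed for the cumulative
    probability to reach `threshold`, or None if it is never reached.

    Hypothetical helper for illustration only.
    """
    # Rank from most to least probable before accumulating.
    ranked = sorted(probabilities, reverse=True)
    for count, cumulative in enumerate(accumulate(ranked), start=1):
        if cumulative >= threshold:
            return count
    return None


# Predicted probabilities of the first five journals in the list, as fractions.
top_five = [0.515, 0.083, 0.043, 0.039, 0.027]
print(journals_to_reach_mass(top_five))  # prints 1
```

Because the first entry (51.5%) already exceeds the threshold on its own, the count is 1, which matches the placement of the cutoff marker directly after the first journal.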