
Science should be machine-readable

Booeshaghi, A. S.; Luebbert, L.; Pachter, L.

2026-02-02 scientific communication and education
10.64898/2026.01.30.702911 bioRxiv

We develop a machine-automated approach for extracting results from papers, which we assess via a comprehensive review of the entire eLife corpus. Our method facilitates a direct comparison of machine and peer review, and sheds light on key challenges that must be overcome in order to facilitate AI-assisted science. In particular, the results point the way towards a machine-readable framework for disseminating scientific information. We therefore argue that publication systems should optimize separately for the dissemination of data and results versus the conveying of novel ideas, and the former should be machine-readable.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

Rank  Probability  Percentile  Papers in training set  Journal
1     18.3%        Top 10%     4510                    PLOS ONE
2     10.1%        Top 6%      5422                    eLife
3     10.1%        Top 0.2%    408                     PLOS Biology
4     7.2%         Top 0.4%    51                      Philosophical Transactions of the Royal Society B
5     6.4%         Top 6%      1633                    PLOS Computational Biology
(50% of probability mass above this line)
6     4.8%         Top 2%      216                     Nature Neuroscience
7     4.3%         Top 0.8%    333                     Journal of Cell Biology
8     4.3%         Top 0.6%    85                      Nature Human Behaviour
9     4.0%         Top 0.1%    54                      Acta Crystallographica Section D Structural Biology
10    3.6%         Top 2%      147                     Nature Biotechnology
11    2.1%         Top 50%     3102                    Scientific Reports
12    1.5%         Top 6%      389                     eNeuro
13    1.5%         Top 12%     575                     Nature
14    1.2%         Top 6%      240                     Nature Genetics
15    1.2%         Top 16%     429                     Science
16    0.9%         Top 2%      61                      Journal of the American Medical Informatics Association
17    0.8%         Top 6%      336                     Nature Methods
18    0.8%         Top 2%      70                      Patterns
19    0.7%         Top 2%      142                     Molecular Systems Biology
20    0.7%         Top 0.3%    15                      FASEB BioAdvances
21    0.7%         Top 24%     886                     Communications Biology
22    0.7%         Top 5%      193                     Royal Society Open Science
23    0.7%         Top 5%      120                     Computers in Biology and Medicine
24    0.7%         Top 7%      163                     BMC Medicine
25    0.7%         Top 7%      326                     Briefings in Bioinformatics
26    0.7%         Top 7%      383                     BMC Bioinformatics
27    0.6%         Top 0.5%    42                      FEBS Letters
28    0.6%         Top 0.5%    20                      Entropy
29    0.6%         Top 47%     2130                    Proceedings of the National Academy of Sciences
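
The 50% cutoff stated above can be checked by accumulating the listed probabilities in rank order. The following is a minimal sketch, not the page's own method (which is not described): the function name `journals_to_reach` is hypothetical, and the probabilities are the rounded values copied from the ten highest-ranked entries.

```python
# Predicted probabilities (in percent) for the ten highest-ranked journals,
# copied from the list above. The values are rounded as displayed.
ranked = [
    ("PLOS ONE", 18.3),
    ("eLife", 10.1),
    ("PLOS Biology", 10.1),
    ("Philosophical Transactions of the Royal Society B", 7.2),
    ("PLOS Computational Biology", 6.4),
    ("Nature Neuroscience", 4.8),
    ("Journal of Cell Biology", 4.3),
    ("Nature Human Behaviour", 4.3),
    ("Acta Crystallographica Section D Structural Biology", 4.0),
    ("Nature Biotechnology", 3.6),
]


def journals_to_reach(entries, threshold):
    """Hypothetical helper: return (count, cumulative mass) for the shortest
    prefix of `entries` whose summed probability reaches `threshold`,
    or None if the threshold is never reached."""
    cumulative = 0.0
    for count, (_, probability) in enumerate(entries, start=1):
        cumulative += probability
        if cumulative >= threshold:
            return count, cumulative
    return None


count, mass = journals_to_reach(ranked, 50.0)
print(f"{count} journals cover {mass:.1f}% of the probability mass")
# prints "5 journals cover 52.1% of the probability mass"
```

With the rounded values shown, the first five journals sum to 52.1%, which is consistent with the cutoff being placed after rank 5.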