AI predictions and the expansion of scientific frontiers: Evidence from structural biology

Sun, M.; Choi, S.; Yin, Y.

2026-04-07 bioinformatics
10.64898/2026.04.06.716821 bioRxiv

Artificial intelligence holds the potential to expand the frontier of scientific research [1], yet recent work has raised concern that it may instead narrow scientific attention to well-established areas [2-4]. Here, leveraging the 2021 release of AlphaFold2 [5] as a quasi-experimental opportunity, we provide field-level evidence that AI can redirect collective attention toward more novel research targets. Tracking 245,396 experimental structures in the Protein Data Bank [6], we show that a long-running decline in the study of novel proteins halted after AlphaFold2's release, with the shift concentrated among studies citing AlphaFold2 and targets with high-confidence predictions. This pattern extends to 248,191 downstream papers that consume structural knowledge, where engagement with genes lacking experimental structures and with understudied human genes increased since 2021. Amid rising concern that AI may reinforce scientific canons [7-10], our findings offer an early field-level case where AI predictions expand scientific frontiers, consistent with the idea that the real-world consequences of AI on science depend on where their informational gains are greatest. These results suggest AI can complement human knowledge and redirect collective attention in science, with broad implications for emerging AI for science models.

Matching journals

The top-ranked journal alone accounts for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Top % | Predicted probability |
|------|---------|------------------------|-------|-----------------------|
| 1 | Nature | 575 | 0.1% | 50.8% |
| 2 | Nature Communications | 4913 | 34% | 4.8% |
| 3 | Cell Genomics | 162 | 2% | 3.5% |
| 4 | Nature Biotechnology | 147 | 3% | 3.5% |
| 5 | Nature Methods | 336 | 3% | 2.7% |
| 6 | Science | 429 | 11% | 2.5% |
| 7 | Nature Medicine | 117 | 2% | 2.0% |
| 8 | Nature Genetics | 240 | 4% | 2.0% |
| 9 | Nature Plants | 84 | 0.9% | 1.8% |
| 10 | Nature Machine Intelligence | 61 | 2% | 1.7% |
| 11 | Science Advances | 1098 | 18% | 1.7% |
| 12 | Nature Metabolism | 56 | 1% | 1.7% |
| 13 | Cell Systems | 167 | 7% | 1.7% |
| 14 | Advanced Science | 249 | 12% | 1.6% |
| 15 | Proceedings of the National Academy of Sciences | 2130 | 35% | 1.5% |
| 16 | Nature Microbiology | 133 | 3% | 1.3% |
| 17 | Nature Chemical Biology | 104 | 3% | 1.2% |
| 18 | Cell Research | 49 | 2% | 1.2% |
| 19 | Cell | 370 | 16% | 0.9% |
| 20 | Molecular Cell | 308 | 10% | 0.7% |
| 21 | Nature Structural & Molecular Biology | 218 | 5% | 0.7% |
| 22 | Communications Biology | 886 | 30% | 0.6% |
| 23 | Nature Cell Biology | 99 | 5% | 0.6% |

50% of the probability mass lies at or above rank 1.
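
The statement that the top journal covers 50% of the predicted probability mass can be checked by accumulating the listed probabilities in rank order. The sketch below does this for the first six entries; the function name `journals_to_reach` and the 50% default threshold are illustrative assumptions, not the site's documented method.

```python
from itertools import accumulate

# Predicted probabilities (%) for ranks 1-6, copied from the list above.
probs = [50.8, 4.8, 3.5, 3.5, 2.7, 2.5]


def journals_to_reach(probabilities, threshold=50.0):
    """Hypothetical helper: count how many top-ranked entries are needed
    before the running total of probabilities reaches `threshold`.

    Returns None if the threshold is never reached.
    """
    for count, running_total in enumerate(accumulate(probabilities), start=1):
        if running_total >= threshold:
            return count
    return None


print(journals_to_reach(probs))  # prints 1
```

Because Nature alone is listed at 50.8%, the running total crosses 50% at the first entry, which matches the "top 1 journal" wording.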