Back

The Limits of Sequence-Based Biosecurity Screening Tools in the Age of AI-Assisted Protein Design

Wittmann, B. J.; Wheeler, N. E.; Murphey, S. T.; Mitchell, T.; Magalis, B.; Gemler, B.; Flyangolts, K.; Diggans, J.; Clore, A.; Beal, J.; Bartling, C.; Alexanian, T.; Horvitz, E.

2026-03-05 synthetic biology
10.64898/2026.03.04.709671 bioRxiv
Show abstract

Rapid advancements in AI have enabled significant progress in protein and nucleic acid design, but they also pose biosecurity challenges. We examine the vulnerabilities of biosecurity screening software (BSS) to AI-reformulated synthetic homologs of proteins of concern (POCs) that have been fragmented into smaller segments. We evaluate four BSS tools that were recently patched to enhance their AI resiliency. Without any further modification, we found that two of the four tools were capable of robustly detecting fragments as short as 50 nucleotides, demonstrating screening capabilities that exceed those requested in the U.S. Framework for Nucleic Acid Synthesis. Upgraded versions of the other two tools improved performance. Although our findings confirm the effectiveness of the tested BSS tools, at the same time, they emphasize the urgency of developing alternate BSS approaches to counter evolving AI-enabled biosecurity risks.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
ACS Synthetic Biology
256 papers in training set
Top 0.2%
22.8%
2
Cell Systems
167 papers in training set
Top 1%
8.5%
3
Journal of Chemical Information and Modeling
207 papers in training set
Top 1.0%
4.9%
4
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.9%
5
Nucleic Acids Research
1128 papers in training set
Top 4%
4.9%
6
Nature Communications
4913 papers in training set
Top 37%
4.0%
50% of probability mass above
7
PLOS Computational Biology
1633 papers in training set
Top 11%
2.9%
8
Nature Biotechnology
147 papers in training set
Top 3%
2.6%
9
Nature Methods
336 papers in training set
Top 3%
2.5%
10
PLOS ONE
4510 papers in training set
Top 46%
2.4%
11
Bioinformatics
1061 papers in training set
Top 6%
2.1%
12
Scientific Reports
3102 papers in training set
Top 50%
2.1%
13
Protein Science
221 papers in training set
Top 0.7%
1.8%
14
eLife
5422 papers in training set
Top 47%
1.3%
15
Cell
370 papers in training set
Top 13%
1.3%
16
Advanced Science
249 papers in training set
Top 13%
1.3%
17
Nature Machine Intelligence
61 papers in training set
Top 2%
1.2%
18
Synthetic Biology
21 papers in training set
Top 0.1%
1.2%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.2%
20
iScience
1063 papers in training set
Top 23%
1.1%
21
Journal of Molecular Biology
217 papers in training set
Top 3%
1.0%
22
International Journal of Molecular Sciences
453 papers in training set
Top 13%
0.9%
23
Computational and Structural Biotechnology Journal
216 papers in training set
Top 8%
0.8%
24
ACS Omega
90 papers in training set
Top 4%
0.8%
25
mSystems
361 papers in training set
Top 7%
0.8%
26
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 5%
0.8%
27
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
28
Science
429 papers in training set
Top 20%
0.7%
29
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 1%
0.7%
30
Journal of Cheminformatics
25 papers in training set
Top 0.6%
0.7%