
MolClaw: An Autonomous Agent with Hierarchical Skills for Drug Molecule Evaluation, Screening, and Optimization

Zhang, L.; Wang, L.; Sun, X.; Tang, W.; Su, H.; Qian, Y.; Yang, Q.; Li, Q.; Tang, Z.; Sun, H.; Han, Y.; Jiang, Y.; Lou, W.; Zhou, B.; Wang, X.; Bai, L.; Xie, Z.

2026-04-06 bioinformatics
10.64898/2026.04.03.716272 bioRxiv

Computational drug discovery, particularly drug molecule screening and optimization, requires orchestrating dozens of specialized tools in multi-step workflows, yet current AI agents struggle to maintain robust performance in these high-complexity scenarios. Here we present MolClaw, an autonomous agent for drug molecule evaluation, screening, and optimization. It unifies over 30 specialized domain resources through a three-tier hierarchical skill architecture (70 skills in total) that supports long-term agent interaction at runtime: tool-level skills standardize atomic operations, workflow-level skills compose them into validated pipelines with quality checks and reflection, and a discipline-level skill supplies the scientific principles that govern planning and verification across all scenarios in the field. Additionally, we introduce MolBench, a benchmark comprising molecular screening, optimization, and end-to-end discovery challenges that span 8 to 50+ sequential tool calls. MolClaw achieves state-of-the-art performance across all metrics, and ablation studies confirm that the gains concentrate on tasks that demand structured workflows and vanish on tasks solvable with ad hoc scripting, establishing workflow orchestration competence as the primary capability bottleneck for AI-driven drug discovery.
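
The three tiers described in the abstract can be pictured with a small sketch. This is not MolClaw's implementation: the class names (`ToolSkill`, `WorkflowSkill`, `DisciplineSkill`), the toy string operations, and the single quality check are all hypothetical, chosen only to show how tool-level operations might be composed into a checked pipeline that a discipline-level rule set approves before it runs. The reflection step mentioned in the abstract is omitted.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ToolSkill:
    """Tool-level skill (hypothetical): one atomic operation behind a uniform call."""
    name: str
    run: Callable[[dict], dict]


@dataclass
class WorkflowSkill:
    """Workflow-level skill (hypothetical): ordered tool skills plus a quality check."""
    name: str
    steps: List[ToolSkill]
    quality_check: Callable[[dict], bool]

    def run(self, state: dict) -> dict:
        # Run each tool in order and validate the state after every step.
        for step in self.steps:
            state = step.run(state)
            if not self.quality_check(state):
                raise ValueError(f"quality check failed after {step.name}")
        return state


@dataclass
class DisciplineSkill:
    """Discipline-level skill (hypothetical): principles used to vet a workflow."""
    principles: List[Callable[[WorkflowSkill], bool]]

    def approves(self, workflow: WorkflowSkill) -> bool:
        return all(principle(workflow) for principle in self.principles)


# Toy placeholder operations; real tools would call domain software instead.
def strip_whitespace(state: dict) -> dict:
    return {**state, "smiles": state["smiles"].strip()}


def count_characters(state: dict) -> dict:
    return {**state, "length": len(state["smiles"])}


workflow = WorkflowSkill(
    name="toy_evaluation",
    steps=[
        ToolSkill("strip_whitespace", strip_whitespace),
        ToolSkill("count_characters", count_characters),
    ],
    # Assumed check: the molecule string must never be empty.
    quality_check=lambda state: bool(state["smiles"]),
)
discipline = DisciplineSkill(principles=[lambda wf: len(wf.steps) > 0])

result = None
if discipline.approves(workflow):
    result = workflow.run({"smiles": " CCO "})
```

Keeping the quality check inside the workflow tier means a failing intermediate state stops the pipeline at the step that produced it, rather than surfacing only in the final output.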

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

Rank  Probability  Percentile  Papers in training set  Journal
1     9.1%         Top 3%      1061                    Bioinformatics
2     9.1%         Top 1%      167                     Cell Systems
3     8.4%         Top 22%     4913                    Nature Communications
4     4.9%         Top 32%     4510                    PLOS ONE
5     4.9%         Top 0.7%    184                     Bioinformatics Advances
6     4.3%         Top 1.0%    97                      npj Digital Medicine
7     3.6%         Top 6%      1128                    Nucleic Acids Research
8     3.6%         Top 3%      383                     BMC Bioinformatics
9     3.6%         Top 3%      336                     Nature Methods
50% of probability mass above
10    3.3%         Top 11%     1633                    PLOS Computational Biology
11    2.4%         Top 0.9%    172                     GigaScience
12    2.4%         Top 48%     3102                    Scientific Reports
13    2.1%         Top 27%     2130                    Proceedings of the National Academy of Sciences
14    1.8%         Top 4%      326                     Briefings in Bioinformatics
15    1.7%         Top 15%     1063                    iScience
16    1.7%         Top 4%      216                     Computational and Structural Biotechnology Journal
17    1.7%         Top 2%      61                      Nature Machine Intelligence
18    1.7%         Top 42%     5422                    eLife
19    1.7%         Top 0.3%    25                      Journal of Cheminformatics
20    1.7%         Top 11%     249                     Advanced Science
21    1.5%         Top 1%      70                      Patterns
22    1.5%         Top 2%      207                     Journal of Chemical Information and Modeling
23    1.2%         Top 3%      117                     Nature Medicine
24    1.1%         Top 6%      147                     Nature Biotechnology
25    0.9%         Top 13%     575                     Nature
26    0.9%         Top 6%      555                     Genome Biology
27    0.9%         Top 7%      154                     Genome Medicine
28    0.9%         Top 3%      217                     Journal of Molecular Biology
29    0.9%         Top 0.5%    17                      IEEE Transactions on Computational Biology and Bioinformatics
30    0.8%         Top 6%      162                     Cell Genomics
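
The statement that the top 9 journals account for 50% of the predicted probability mass can be checked from the listed values with a running sum. The sketch below copies the first ten probabilities from the listing; the function name `journals_to_reach` is illustrative, and because the displayed percentages are rounded, the totals are approximate.

```python
from itertools import accumulate

# Predicted probabilities (percent) for ranks 1-10, copied from the listing.
probabilities = [9.1, 9.1, 8.4, 4.9, 4.9, 4.3, 3.6, 3.6, 3.6, 3.3]


def journals_to_reach(mass_percent, probs):
    """Return how many top-ranked entries are needed for the running sum
    to reach mass_percent, or None if the list never gets there."""
    for count, total in enumerate(accumulate(probs), start=1):
        if total >= mass_percent:
            return count
    return None


top_n = journals_to_reach(50.0, probabilities)
print(top_n)
```

With these values the running sum is about 47.9% after rank 8 and about 51.5% after rank 9, so rank 9 is the first point at which the 50% threshold is crossed.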