Back

Human-supervised Agentic AI for Hypothesis Generation and Experimental Assistance in Drug Repurposing

Huynh, D.-L.; Asp, E.; Ballante, F.; Puigvert, J. C.; DeGrave, A.; Karki, R.; Nader, K.; Östling, P.; Pokharel, B.; Rietdijk, J.; Schlotawa, L.; Schmidt, L.; Seal, S.; Seashore-Ludlow, B.; Aittokallio, T.; Spjuth, O.

2026-04-22 bioinformatics
10.64898/2026.04.20.719538 bioRxiv
Show abstract

Computational drug repurposing has largely been focused on rapid hypothesis generation, yet real-world applications span a far broader lifecycle, from drug candidate suggestion to designing experiments, analyzing assay data, and iteratively refining candidates. Here, we demonstrate that agentic AI can fulfill this entire scope. To this end, we developed RepurAgent, a hierarchical multi-agent AI system comprising a supervisor agent and a planning agent that coordinate four specialized sub-agents -- research, prediction, data, and report -- through a human-in-the-loop design, with episodic memory and retrieval-augmented generation. The system is grounded in data, tools, and standard operating procedures specific for drug repurposing, developed within the REMEDi4ALL consortium. We validated the agentic system across three scenarios spanning the various stages within the repurposing lifecycle: in Acute Myeloid Leukemia, RepurAgent recovered up to 97% of disease-relevant pathways identified by Google Co-Scientist, completing the workflow within 60 minutes; in a retrospective COVID-19 antiviral screen, RepurAgent acted as an adaptive experimental collaborator, prioritizing compounds with AUC-ROC up to 0.98 without predefined thresholds and flagging confounders missed in manual review; and for Multiple Sulfatase Deficiency, it prioritized 82 high-confidence candidates from 5000 compounds, which were further corroborated by domain experts. These results demonstrate that agentic AI can support across the full drug repurposing lifecycle, from hypothesis generation to experimental analysis. RepurAgent is open source and deployed at https://repuragent.serve.scilifelab.se/.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 3%
9.1%
2
Nature Communications
4913 papers in training set
Top 22%
8.4%
3
Nature Methods
336 papers in training set
Top 2%
4.8%
4
Bioinformatics Advances
184 papers in training set
Top 0.7%
4.8%
5
Cell Systems
167 papers in training set
Top 3%
4.8%
6
Nucleic Acids Research
1128 papers in training set
Top 5%
4.1%
7
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.9%
8
Genome Medicine
154 papers in training set
Top 2%
3.7%
9
PLOS Computational Biology
1633 papers in training set
Top 9%
3.7%
10
Journal of Cheminformatics
25 papers in training set
Top 0.2%
3.6%
50% of probability mass above
11
Patterns
70 papers in training set
Top 0.3%
3.1%
12
Journal of Chemical Information and Modeling
207 papers in training set
Top 1%
3.1%
13
BMC Bioinformatics
383 papers in training set
Top 3%
2.7%
14
PLOS ONE
4510 papers in training set
Top 44%
2.7%
15
Nature Machine Intelligence
61 papers in training set
Top 1%
2.3%
16
npj Digital Medicine
97 papers in training set
Top 2%
2.1%
17
Advanced Science
249 papers in training set
Top 9%
1.9%
18
npj Systems Biology and Applications
99 papers in training set
Top 1.0%
1.8%
19
Scientific Reports
3102 papers in training set
Top 56%
1.8%
20
Nature Medicine
117 papers in training set
Top 2%
1.7%
21
Nature Biotechnology
147 papers in training set
Top 5%
1.7%
22
iScience
1063 papers in training set
Top 18%
1.5%
23
GigaScience
172 papers in training set
Top 2%
1.3%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
25
Nature
575 papers in training set
Top 14%
0.9%
26
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.2%
0.8%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
28
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
29
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.7%
30
Genome Biology
555 papers in training set
Top 7%
0.7%