Back

Surgical Information Assistant: A technical report on an agentic information retrieval System for surgical information

Bhattacharyya, K.

2025-05-21 surgery
10.1101/2025.05.20.25328046 medRxiv
Show abstract

We present the Surgical Information Assistant, an agentic retrieval-augmented generation (RAG) system designed to improve access to surgical knowledge in resource-constrained settings. Built on the Open Manual of Surgery for Resource-Limited Settings, the assistant uses a retrieval-method we call DeRetSyn (Decom-pose-Retrieve-Synthesize). We evaluate DeRetSyn using automated metrics and partial human validation across 14,500 synthesized question-answer pairs and find that it achieves 63% top-1 accuracy using a 3B Llama model - outperforming GPT-4o (42.5%) without RAG and a 8B Llama model with conventional RAG ([~=]53%) while being significantly smaller and more computationally efficient. We also find that the DeRetSyn system with the Llama 3B model outperforms GPT-4o on the publicly available PubMedQA dataset on overall accuracy under specific prompting patterns. The Surgical Information Assistant demonstrates how agentic orchestration can extend the capabilities of small language models and offers a deployable framework for point-of-care medical decision support, education, and QA in low-bandwidth environments. We plan to release our benchmark dataset, codebase, prompt library, and RAG evaluation results for all categories for the entire dataset along with chain-of-thought reasoning from GPT-4o, Llama-3.1-8B, and Llama-3.2-3B upon publication.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
npj Digital Medicine
97 papers in training set
Top 0.1%
63.6%
50% of probability mass above
2
Nature Medicine
117 papers in training set
Top 1%
2.8%
3
PLOS Computational Biology
1633 papers in training set
Top 13%
2.2%
4
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 30%
1.9%
5
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
1.8%
6
iScience
1063 papers in training set
Top 13%
1.8%
7
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
8
Scientific Reports
3102 papers in training set
Top 61%
1.5%
9
Nature Methods
336 papers in training set
Top 5%
1.4%
10
Bioinformatics
1061 papers in training set
Top 8%
1.3%
11
Nature Human Behaviour
85 papers in training set
Top 3%
1.0%
12
Cell Systems
167 papers in training set
Top 10%
1.0%
13
PLOS ONE
4510 papers in training set
Top 63%
0.9%
14
Nucleic Acids Research
1128 papers in training set
Top 16%
0.8%
15
BMC Bioinformatics
383 papers in training set
Top 7%
0.8%
16
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
17
GigaScience
172 papers in training set
Top 4%
0.7%
18
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%
19
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.7%
20
Nature Communications
4913 papers in training set
Top 66%
0.5%
21
PLOS Digital Health
91 papers in training set
Top 3%
0.5%
22
Nature
575 papers in training set
Top 17%
0.5%