Back

Improved precision oncology question-answering using agentic LLM

Das, R.; Maheswari, K.; Siddiqui, S.; Arora, N.; Paul, A.; Nanshi, J.; Udbalkar, V.; Sarvade, A.; Chaturvedi, H.; Shvartsman, T.; Masih, S.; Thippeswamy, R.; Patil, S.; Nirni, S. S.; Garsson, B.; Bandyopadhyay, S.; Maulik, U.; Farooq, M.; Sengupta, D.

2024-09-24 oncology
10.1101/2024.09.20.24314076
Show abstract

The clinical adoption of Large Language Models (LLMs) in biomedical research has been limited by concerns regarding the quality, accuracy, and reliability of their outputs, particularly in precision oncology, where clinical decision-making demands high precision. Current models, often based on fine-tuned foundational LLMs, are prone to issues such as hallucinations, incoherent reasoning, and loss of context. In this work, we present GeneSilico Copilot, an advanced agent-based architecture that transforms LLMs from simple response synthesizers to clinical reasoning systems. Our approach is centred around a bespoke ReAct agent that orchestrates a suite of specialized tools for asynchronous information retrieval and synthesis. These tools access curated document vector stores containing clinical treatment guidelines, genomic insights, drug information, clinical trials, and breast cancer-specific literature. To leverage large context windows of current LLMs, we implement a hybrid search strategy that prioritizes key information and dynamically integrates summarized content, reducing context fragmentation. Incorporating additional metadata further allows for precise, transparent and evidence-backed reasoning at each step of the thought process. The system ensures that at every stage, the agent can synthesize meaningful, context-aware observations that contribute to a coherent and comprehensive final response that aligns with clinical standards. Evaluations on real-world breast cancer cases show that GeneSilico Copilot significantly improves response accuracy and personalization. This system represents a critical advancement toward making LLMs clinically deployable in precision oncology and has potential applications in broader medical domains requiring complex, data-driven decision-making.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
JCO Clinical Cancer Informatics
based on 14 papers
Top 0.1%
15.9%
2
PLOS Computational Biology
based on 141 papers
Top 1%
11.5%
3
PLOS ONE
based on 1737 papers
Top 48%
10.5%
4
npj Precision Oncology
based on 14 papers
Top 0.2%
5.5%
5
Scientific Reports
based on 701 papers
Top 32%
5.5%
6
Nature Communications
based on 483 papers
Top 20%
3.1%
50% of probability mass above
7
Nature Medicine
based on 88 papers
Top 3%
2.9%
8
npj Digital Medicine
based on 85 papers
Top 7%
2.5%
9
Cancers
based on 57 papers
Top 5%
2.4%
10
Clinical Cancer Research
based on 22 papers
Top 2%
2.4%
11
JAMIA Open
based on 35 papers
Top 4%
2.4%
12
Scientific Data
based on 30 papers
Top 1%
1.9%
13
International Journal of Radiation Oncology*Biology*Physics
based on 13 papers
Top 1%
1.8%
14
Cancer Medicine
based on 17 papers
Top 3%
1.4%
15
JCO Precision Oncology
based on 11 papers
Top 2%
1.4%
16
iScience
based on 74 papers
Top 4%
1.4%
17
Computers in Biology and Medicine
based on 39 papers
Top 4%
1.4%
18
eLife
based on 262 papers
Top 23%
1.3%
19
Med
based on 26 papers
Top 0.5%
1.2%
20
Breast Cancer Research
based on 11 papers
Top 1%
0.9%
21
Journal of the American Medical Informatics Association
based on 53 papers
Top 6%
0.8%
22
Frontiers in Oncology
based on 34 papers
Top 5%
0.8%
23
Cureus
based on 64 papers
Top 16%
0.8%
24
Journal for ImmunoTherapy of Cancer
based on 14 papers
Top 3%
0.7%
25
Bulletin of Mathematical Biology
based on 17 papers
Top 2%
0.7%
26
JMIR Formative Research
based on 31 papers
Top 6%
0.7%
27
Radiotherapy and Oncology
based on 11 papers
Top 2%
0.7%
28
Leukemia
based on 11 papers
Top 2%
0.7%