Back

Generation of guideline-based clinical decision trees in oncology using large language models

Miao, B. Y.; Rodriguez Almaraz, E.; Ashraf Ganjouei, A.; Suresh, A.; Zack, T.; Bravo, M.; Raghavendran, S.; Oskotsky, B.; Alaa, A.; Butte, A. J.

2024-03-06 oncology
10.1101/2024.03.04.24303737 medRxiv
Show abstract

BackgroundMolecular biomarkers play a pivotal role in the diagnosis and treatment of oncologic diseases but staying updated with the latest guidelines and research can be challenging for healthcare professionals and patients. Large Language Models (LLMs), such as MedPalm-2 and GPT-4, have emerged as potential tools to streamline biomedical information extraction, but their ability to summarize molecular biomarkers for oncologic disease subtyping remains unclear. Auto-generation of clinical nomograms from text guidelines could illustrate a new type of utility for LLMs. MethodsIn this cross-sectional study, two LLMs, GPT-4 and Claude-2, were assessed for their ability to generate decision trees for molecular subtyping of oncologic diseases with and without expert-curated guidelines. Clinical evaluators assessed the accuracy of biomarker and cancer subtype generation, as well as validity of molecular subtyping decision trees across five cancer types: colorectal cancer, invasive ductal carcinoma, acute myeloid leukemia, diffuse large B-cell lymphoma, and diffuse glioma. ResultsBoth GPT-4 and Claude-2 "off the shelf" successfully produced clinical decision trees that contained valid instances of biomarkers and disease subtypes. Overall, GPT-4 and Claude-2 showed limited improvement in the accuracy of decision tree generation when guideline text was added. A Streamlit dashboard was developed for interactive exploration of subtyping trees generated for other oncologic diseases. ConclusionThis study demonstrates the potential of LLMs like GPT-4 and Claude-2 in aiding the summarization of molecular diagnostic guidelines in oncology. While effective in certain aspects, their performance highlights the need for careful interpretation, especially in zero-shot settings. Future research should focus on enhancing these models for more nuanced and probabilistic interpretations in clinical decision-making. The developed tools and methodologies present a promising avenue for expanding LLM applications in various medical specialties. Key Points- Large language models, such as GPT-4 and Claude-2, can generate clinical decision trees that summarize best-practice guidelines in oncology - Providing guidelines in the prompt query improves the accuracy of oncology biomarker and cancer subtype information extraction - However, providing guidelines in zero-shot settings does not significantly improve generation of clinical decision trees for either GPT-4 or Claude-2

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Artificial Intelligence in Medicine
15 papers in training set
Top 0.1%
14.2%
2
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
14.2%
3
Frontiers in Oncology
95 papers in training set
Top 0.3%
8.1%
4
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.4%
6.8%
5
Biology Methods and Protocols
53 papers in training set
Top 0.1%
4.8%
6
PLOS ONE
4510 papers in training set
Top 34%
4.3%
50% of probability mass above
7
Cancer Medicine
24 papers in training set
Top 0.3%
3.9%
8
Scientific Reports
3102 papers in training set
Top 37%
3.6%
9
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
10
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.7%
11
JMIR Medical Informatics
17 papers in training set
Top 0.5%
2.6%
12
PeerJ
261 papers in training set
Top 6%
1.8%
13
BMC Cancer
52 papers in training set
Top 1%
1.7%
14
Database
51 papers in training set
Top 0.4%
1.7%
15
BMJ Open
554 papers in training set
Top 10%
1.3%
16
npj Digital Medicine
97 papers in training set
Top 3%
1.3%
17
Journal of Translational Medicine
46 papers in training set
Top 2%
1.2%
18
BMC Research Notes
29 papers in training set
Top 0.3%
0.9%
19
Informatics in Medicine Unlocked
21 papers in training set
Top 0.8%
0.9%
20
BMC Infectious Diseases
118 papers in training set
Top 4%
0.9%
21
Frontiers in Bioinformatics
45 papers in training set
Top 0.6%
0.9%
22
JAMIA Open
37 papers in training set
Top 1%
0.9%
23
iScience
1063 papers in training set
Top 29%
0.8%
24
JCO Precision Oncology
14 papers in training set
Top 0.4%
0.8%
25
JMIR Formative Research
32 papers in training set
Top 2%
0.7%
26
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%
27
JAMA Network Open
127 papers in training set
Top 5%
0.6%
28
Cancers
200 papers in training set
Top 5%
0.6%
29
European Journal of Cancer
10 papers in training set
Top 0.7%
0.6%
30
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.6%