Back

Assessing the performance of GPT-4 in the filed of osteoarthritis and orthopaedic case consultation

li, j.; Gao, X.; Dou, T.; Gao, Y.; Zhu, W.

2023-08-09 orthopedics
10.1101/2023.08.06.23293735 medRxiv
Show abstract

BackgroundLarge Language Models (LLMs) like GPT-4 demonstrate potential applications in diverse areas, including healthcare and patient education. This study evaluates GPT-4s competency against osteoarthritis (OA) treatment guidelines from the United States and China and assesses its ability in diagnosing and treating orthopedic diseases. MethodsData sources included OA management guidelines and orthopedic examination case questions. Queries were directed to GPT-4 based on these resources, and its responses were compared with the established guidelines and cases. The accuracy and completeness of GPT-4s responses were evaluated using Likert scales, while case inquiries were stratified into four tiers of correctness and completeness. ResultsGPT-4 exhibited strong performance in providing accurate and complete responses to OA management recommendations from both the American and Chinese guidelines, with high Likert scale scores for accuracy and completeness. It demonstrated proficiency in handling clinical cases, making accurate diagnoses, suggesting appropriate tests, and proposing treatment plans. Few errors were noted in specific complex cases. ConclusionsGPT-4 exhibits potential as an auxiliary tool in orthopedic clinical practice and patient education, demonstrating high accuracy and completeness in interpreting OA treatment guidelines and analyzing clinical cases. Further validation of its capabilities in real-world clinical scenarios is needed.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
JMIR Medical Informatics
17 papers in training set
Top 0.1%
22.4%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.1%
17.4%
3
JAMIA Open
37 papers in training set
Top 0.1%
17.4%
50% of probability mass above
4
Scientific Reports
3102 papers in training set
Top 18%
6.3%
5
PLOS ONE
4510 papers in training set
Top 34%
4.3%
6
Medicine
30 papers in training set
Top 0.6%
3.2%
7
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
2.6%
8
PLOS Digital Health
91 papers in training set
Top 1%
2.4%
9
npj Digital Medicine
97 papers in training set
Top 2%
1.9%
10
Journal of Medical Internet Research
85 papers in training set
Top 2%
1.8%
11
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
12
Bioengineering
24 papers in training set
Top 0.5%
1.7%
13
Frontiers in Public Health
140 papers in training set
Top 6%
1.3%
14
Healthcare
16 papers in training set
Top 1%
1.2%
15
International Journal of Medical Informatics
25 papers in training set
Top 1%
1.2%
16
BMC Medical Education
20 papers in training set
Top 0.7%
0.9%
17
Artificial Intelligence in Medicine
15 papers in training set
Top 0.6%
0.9%
18
Frontiers in Human Neuroscience
67 papers in training set
Top 2%
0.8%
19
BMC Medicine
163 papers in training set
Top 7%
0.7%
20
Applied Sciences
24 papers in training set
Top 1%
0.7%
21
BJGP Open
12 papers in training set
Top 0.7%
0.7%