Back

Improved Performance of ChatGPT-4 on the OKAP Exam: A Comparative Study with ChatGPT-3.5

Teebagy, S.; Colwell, L.; Wood, E.; Yaghy, A.; Faustina, M.

2023-04-03 ophthalmology
10.1101/2023.04.03.23287957 medRxiv
Show abstract

This study aims to evaluate the performance of ChatGPT-4, an advanced Artificial Intelligence (AI) language model, on the Ophthalmology Knowledge Assessment Program (OKAP) examination compared to its predecessor, ChatGPT-3.5. Both models were tested on 180 OKAP practice questions covering various ophthalmology subject categories. Results showed that ChatGPT-4 significantly outperformed ChatGPT-3.5 (81% vs. 57%; p<0.001), indicating improvements in medical knowledge assessment. The superior performance of ChatGPT-4 suggests potential applicability in ophthalmologic education and clinical decision support systems. Future research should focus on refining AI models, ensuring a balanced representation of fundamental and specialized knowledge, and determining the optimal method of integrating AI into medical education and practice.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Computers in Biology and Medicine
120 papers in training set
Top 0.1%
26.6%
2
PLOS ONE
4510 papers in training set
Top 14%
12.8%
3
PLOS Digital Health
91 papers in training set
Top 0.2%
10.4%
4
Scientific Reports
3102 papers in training set
Top 11%
7.4%
50% of probability mass above
5
Journal of Medical Internet Research
85 papers in training set
Top 0.9%
5.0%
6
F1000Research
79 papers in training set
Top 0.3%
4.5%
7
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1.0%
2.8%
8
BMC Genomics
328 papers in training set
Top 1%
2.7%
9
JMIR Medical Informatics
17 papers in training set
Top 0.5%
2.1%
10
International Journal of Environmental Research and Public Health
124 papers in training set
Top 4%
1.7%
11
British Journal of Ophthalmology
14 papers in training set
Top 0.2%
1.5%
12
Bioengineering
24 papers in training set
Top 0.5%
1.5%
13
Eye
11 papers in training set
Top 0.3%
1.4%
14
International Journal of Medical Informatics
25 papers in training set
Top 1%
1.3%
15
Biology Methods and Protocols
53 papers in training set
Top 2%
1.1%
16
Frontiers in Public Health
140 papers in training set
Top 6%
1.1%
17
European Journal of Neuroscience
168 papers in training set
Top 0.9%
1.1%
18
BMC Medical Education
20 papers in training set
Top 0.7%
1.0%
19
Frontiers in Neuroscience
223 papers in training set
Top 6%
0.9%
20
Data in Brief
13 papers in training set
Top 0.3%
0.8%
21
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
22
Journal of Clinical Medicine
91 papers in training set
Top 7%
0.7%
23
Vaccines
196 papers in training set
Top 3%
0.7%
24
Annals of Translational Medicine
17 papers in training set
Top 2%
0.5%