Improved Performance of ChatGPT-4 on the OKAP Exam: A Comparative Study with ChatGPT-3.5

Teebagy, S.; Colwell, L.; Wood, E.; Yaghy, A.; Faustina, M.

2023-04-03 ophthalmology

10.1101/2023.04.03.23287957 medRxiv


This study evaluates the performance of ChatGPT-4, an advanced artificial intelligence (AI) language model, on the Ophthalmology Knowledge Assessment Program (OKAP) examination and compares it with that of its predecessor, ChatGPT-3.5. Both models were tested on 180 OKAP practice questions covering various ophthalmology subject categories. ChatGPT-4 significantly outperformed ChatGPT-3.5 (81% vs. 57%; p<0.001), indicating improved performance on medical knowledge assessment. This result suggests potential applicability in ophthalmologic education and clinical decision support systems. Future research should focus on refining AI models, ensuring a balanced representation of fundamental and specialized knowledge, and determining the optimal method of integrating AI into medical education and practice.
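As a rough check on the reported comparison (81% vs. 57% on 180 questions; p<0.001), the sketch below runs a pooled two-proportion z-test. The abstract does not say which statistical test the authors used, so the choice of test is an assumption, and the correct-answer counts are approximations rounded from the reported percentages rather than figures taken from the paper. The function name `two_proportion_z_test` is hypothetical.

```python
import math


def two_proportion_z_test(x1, n1, x2, n2):
    """Two-sided pooled z-test for a difference between two proportions.

    Returns the z statistic and the two-sided p-value.
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    # Standard error under the null hypothesis of equal proportions
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Two-sided tail probability of the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# ASSUMPTION: counts are reconstructed from the rounded percentages in the
# abstract; the paper's exact counts may differ by a question or two.
N_QUESTIONS = 180
correct_gpt4 = round(0.81 * N_QUESTIONS)   # 146
correct_gpt35 = round(0.57 * N_QUESTIONS)  # 103

z, p = two_proportion_z_test(correct_gpt4, N_QUESTIONS,
                             correct_gpt35, N_QUESTIONS)
print(f"z = {z:.2f}, p = {p:.2g}")
```

Under these assumed counts the p-value falls well below 0.001, in line with the reported result. Because both models answered the same 180 questions, a paired test such as McNemar's would likely be more appropriate if per-question outcomes were available; the unpaired test here is only an approximation.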

Matching journals

● Non-profit ◐ University press ○ Commercial

The top 4 journals account for 50% of the predicted probability mass.


Computers in Biology and Medicine

○ 120 papers in training set

● 4510 papers in training set

PLOS Digital Health

● 91 papers in training set

Scientific Reports

○ 3102 papers in training set

50% of probability mass above

Journal of Medical Internet Research

◐ 85 papers in training set

○ 79 papers in training set

BMC Medical Informatics and Decision Making

○ 39 papers in training set

○ 328 papers in training set

JMIR Medical Informatics

◐ 17 papers in training set

International Journal of Environmental Research and Public Health

○ 124 papers in training set

British Journal of Ophthalmology

● 14 papers in training set

○ 24 papers in training set

○ 11 papers in training set

International Journal of Medical Informatics

○ 25 papers in training set

Biology Methods and Protocols

◐ 53 papers in training set

Frontiers in Public Health

○ 140 papers in training set

European Journal of Neuroscience

○ 168 papers in training set

BMC Medical Education

○ 20 papers in training set

Frontiers in Neuroscience

○ 223 papers in training set

○ 13 papers in training set

npj Digital Medicine

○ 97 papers in training set

Journal of Clinical Medicine

○ 91 papers in training set

○ 196 papers in training set

Annals of Translational Medicine

○ 17 papers in training set