Back

Probing the Surgical Competence of LLMs: A global health study leveraging AfriMedQA benchmarks

2025-10-07 surgery Title + abstract only
View on medRxiv
Show abstract

Global surgical care faces a severe workforce shortage, with more than 1.2 million additional specialists needed by 2030, particularly in low- and middle-income countries (LMICs). Large language models (LLMs) have demonstrated impressive medical reasoning on standardized exams, but their safety, reliability, and specialty-specific performance--especially in procedural fields such as surgery--remain uncertain. Here we evaluate over 40 state-of-the-art LLMs on 3,900 expert-authored multiple-choice...

Predicted journal destinations