Back

Identifying academic success and underperformance: The discriminative power of very short answer questions and multiple-choice questions

van Wijk, E. V.; van Blankenstein, F. M.; Ruijter, B. N.; Rohling, J. H. T.; van der Kraan, J.; Dekker, F. W.; Langers, A.

2026-05-01 medical education
10.64898/2026.04.29.26352108 medRxiv
Show abstract

BackgroundMultiple-choice questions (MCQs) are widely used in medical education, but are criticized for cueing and guessing. Very short answer questions (VSAQs), which require students to generate responses independently, may better assess knowledge. While VSAQs demonstrate higher item discrimination within individual exams, their effectiveness in distinguishing academic performance across multiple assessments remains unclear - representing a key gap in the validation of VSAQs under Messicks framework, specifically the category of "relations to other variables". This study examines whether VSAQs or MCQs more effectively distinguish students of varying performance levels across multiple summative examinations. MethodsWe analyzed retrospective data from six mixed-format examinations with VSAQs and MCQs of three cohorts of first- and second-year medical students. Academic performance was measured using grade point average (GPA) across assessments. Linear regression assessed the relationship of each question format with GPA, while ROC curves and C-statistics evaluated their ability to identify poor and excellent performing students (lowest and highest quintile of GPA). ResultsVSAQs showed higher item discrimination (Rir-values) than MCQs in all exams. VSAQs also had a stronger positive association with GPA compared to MCQs, and higher C-statistics, indicating superior discriminative ability. ConclusionVSAQs outperform MCQs in distinguishing academic performance levels across multiple assessments. Their integration into examinations enhances discriminative ability and may facilitate earlier identification of poor and excellent performing students, enabling targeted interventions and support of students.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.