Identifying academic success and underperformance: The discriminative power of very short answer questions and multiple-choice questions
van Wijk, E. V.; van Blankenstein, F. M.; Ruijter, B. N.; Rohling, J. H. T.; van der Kraan, J.; Dekker, F. W.; Langers, A.
Show abstract
BackgroundMultiple-choice questions (MCQs) are widely used in medical education, but are criticized for cueing and guessing. Very short answer questions (VSAQs), which require students to generate responses independently, may better assess knowledge. While VSAQs demonstrate higher item discrimination within individual exams, their effectiveness in distinguishing academic performance across multiple assessments remains unclear - representing a key gap in the validation of VSAQs under Messicks framework, specifically the category of "relations to other variables". This study examines whether VSAQs or MCQs more effectively distinguish students of varying performance levels across multiple summative examinations. MethodsWe analyzed retrospective data from six mixed-format examinations with VSAQs and MCQs of three cohorts of first- and second-year medical students. Academic performance was measured using grade point average (GPA) across assessments. Linear regression assessed the relationship of each question format with GPA, while ROC curves and C-statistics evaluated their ability to identify poor and excellent performing students (lowest and highest quintile of GPA). ResultsVSAQs showed higher item discrimination (Rir-values) than MCQs in all exams. VSAQs also had a stronger positive association with GPA compared to MCQs, and higher C-statistics, indicating superior discriminative ability. ConclusionVSAQs outperform MCQs in distinguishing academic performance levels across multiple assessments. Their integration into examinations enhances discriminative ability and may facilitate earlier identification of poor and excellent performing students, enabling targeted interventions and support of students.
Matching journals
The top 4 journals account for 50% of the predicted probability mass.