
Domain-adapted language model using reinforcement learning for various dementias

Kowshik, S. S.; Jasodanand, V. H.; Bellitti, M.; Puducheri, S.; Xu, L.; Liu, Y.; Saichandran, K. S.; Dwyer, B. C.; Gabelle, A.; Hao, H.; Kedar, S.; Murman, D. L.; O'Shea, S.; Saint-Hilaire, M.-H.; Samudra, N. P.; Sartor, E. A.; Swaminathan, A.; Taraschenko, O.; Yuan, J.; Au, R.; Kolachalama, V. B.

medRxiv preprint · neurology · 2026-03-23 · doi:10.64898/2026.03.17.26348154

Large language models excel at processing complex clinical data and advanced reasoning, yet domain-specific adaptation is essential to realize their full potential in fields such as Alzheimer's disease and related dementias (ADRD). Here, we present a generative language model for ADRD fine-tuned via reinforcement learning with verifiable rewards using a self-certainty-aware advantage. Model development and validation leveraged data from five ADRD cohorts, totaling 54,535 participants. Our framework integrates demographics, personal and family medical histories, medication use, neuropsychological test results, functional assessments, physical and neurological examination findings, laboratory data and multimodal neuroimaging to construct comprehensive clinical profiles. On held-out testing data involving 36,688 participants, our model achieved robust performance on syndromic classification, primary etiological diagnosis and biomarker prediction. Model predictions were validated against postmortem-confirmed diagnoses, and clinical utility was demonstrated in a controlled within-subjects crossover study in which board-certified neurologists reviewed cases with and without model assistance, showing that exposure to model responses improved diagnostic performance. These results demonstrate that targeted domain adaptation with reinforcement learning can enable language models to deliver accurate, reasoning-driven support in ADRD evaluation. Prospective validation will be essential to translate these advances into improved patient outcomes.
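The training recipe named in the abstract — reinforcement learning with verifiable rewards (RLVR) using a self-certainty-aware advantage — can be sketched generically. The paper's exact formulation is not given here, so everything below is an assumption for illustration: the KL-to-uniform definition of self-certainty, the shaping coefficient `beta`, and the GRPO-style group-normalized advantage are common choices in the RLVR literature, not the authors' confirmed method.

```python
import numpy as np

def self_certainty(token_log_dists):
    """Average KL(model || uniform) over a response's tokens (assumed measure).

    token_log_dists: list of (V,)-shaped arrays of log-probabilities, one per
    generated token. Higher values mean a more confident (peaked) model.
    """
    kls = []
    for logp in token_log_dists:
        p = np.exp(logp)
        V = logp.size
        # KL(p || U) = sum_v p_v * log(p_v * V)
        kls.append(np.sum(p * (logp + np.log(V))))
    return float(np.mean(kls))

def group_advantages(rewards, certainties, beta=0.1):
    """GRPO-style advantage over a group of sampled responses to one prompt,
    shaped by a self-certainty bonus (hypothetical combination).

    rewards: verifiable 0/1 rewards (e.g., diagnosis label matches ground truth).
    certainties: self_certainty score for each sampled response.
    """
    r = np.asarray(rewards, dtype=float)
    c = np.asarray(certainties, dtype=float)
    # Shift rewards by how much more certain each response is than the group mean.
    shaped = r + beta * (c - c.mean())
    # Group-normalize so advantages are zero-mean, unit-scale within the prompt.
    return (shaped - shaped.mean()) / (shaped.std() + 1e-8)

# Usage: among two correct responses, the more self-certain one gets the
# larger advantage; incorrect responses get negative advantages.
adv = group_advantages(rewards=[1, 0, 0, 1], certainties=[2.0, 1.0, 1.0, 1.5])
```

The design intuition behind such schemes is that a verifiable reward alone is sparse and binary; adding a self-certainty term lets the policy gradient prefer confident correct reasoning over lucky guesses.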

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

Rank  Journal                                             Papers in training set  Percentile  Probability
1     Nature Medicine                                      117                    Top 0.1%    22.1%
2     npj Digital Medicine                                  97                    Top 0.3%    17.2%
3     Nature Communications                               4913                    Top 34%      4.8%
4     Brain                                                154                    Top 1%       3.9%
5     Alzheimer's Research & Therapy                        52                    Top 0.7%     3.5%
----- 50% of probability mass above -----
6     Advanced Science                                     249                    Top 6%       3.5%
7     eBioMedicine                                         130                    Top 0.4%     3.0%
8     Alzheimer's & Dementia                               143                    Top 2%       2.6%
9     Med                                                   38                    Top 0.2%     2.0%
10    Nature Biomedical Engineering                         42                    Top 0.6%     2.0%
11    Communications Medicine                               85                    Top 0.1%     2.0%
12    Scientific Reports                                  3102                    Top 60%      1.6%
13    Genome Medicine                                      154                    Top 5%       1.6%
14    Computers in Biology and Medicine                    120                    Top 2%       1.6%
15    Nature Computational Science                          50                    Top 0.8%     1.5%
16    Computational and Structural Biotechnology Journal   216                    Top 6%       1.3%
17    PLOS ONE                                            4510                    Top 59%      1.3%
18    Nature Machine Intelligence                           61                    Top 3%       1.2%
19    The Lancet Digital Health                             25                    Top 0.7%     1.2%
20    Briefings in Bioinformatics                          326                    Top 6%       0.9%
21    Human Brain Mapping                                  295                    Top 4%       0.8%
22    Frontiers in Digital Health                           20                    Top 1%       0.8%
23    Nucleic Acids Research                              1128                    Top 18%      0.7%
24    Frontiers in Aging Neuroscience                       67                    Top 3%       0.7%
25    Brain Communications                                 147                    Top 3%       0.7%
26    Proceedings of the National Academy of Sciences     2130                    Top 45%      0.7%
27    Medical Image Analysis                                33                    Top 1%       0.6%
28    The Innovation                                        12                    Top 1%       0.6%
29    npj Parkinson's Disease                               89                    Top 1%       0.6%
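As a quick sanity check on the "top 5 journals account for 50% of the predicted probability mass" claim, the listed probabilities can be summed directly (values taken from the table above):

```python
# Predicted probabilities (%) for the 29 listed journals, in rank order.
probs = [22.1, 17.2, 4.8, 3.9, 3.5, 3.5, 3.0, 2.6, 2.0, 2.0, 2.0,
         1.6, 1.6, 1.6, 1.5, 1.3, 1.3, 1.2, 1.2, 0.9, 0.8, 0.8,
         0.7, 0.7, 0.7, 0.7, 0.6, 0.6, 0.6]

# Rank 5 is the first point at which cumulative mass crosses 50%.
print(f"top-4 mass:  {sum(probs[:4]):.1f}%")   # 48.0%
print(f"top-5 mass:  {sum(probs[:5]):.1f}%")   # 51.5%

# The 29 listed journals cover ~85% of the mass; the remainder is
# spread over journals below the display cutoff.
print(f"listed mass: {sum(probs):.1f}%")       # 85.0%
```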