Can Large Language Models Diagnose Primary Immunodeficiency from Patient-Described Symptoms?

Reteig, L. C.; Woloshin, S.; Maglione, P. J.; Farmer, J. R.; Ong, M.-S.

2026-05-27 allergy and immunology

10.64898/2026.05.26.26353818 medRxiv

Show abstract

Patients with primary immunodeficiency (PID) often face prolonged diagnostic delays and may increasingly turn to large language models (LLMs) to interpret their symptoms during this period. We evaluated whether an LLM could recognize PID from symptom descriptions derived from interviews with 21 PID patients. In a prior study, we showed that GPT-4o identified PID in 96% of cases when prompted with physician-written patient histories (Rider et al., JACI, 2024). Here, when prompted with symptom descriptions in patients' own words, GPT-5 identified PID in only 7 cases (33%), although it more broadly suggested immune system issues in 18 cases (81%). The gap between these findings indicates that LLMs are sensitive to the language and framing of symptom descriptions, performing substantially worse when patients describe their own symptoms in everyday language than when clinicians summarize patient histories in structured medical terms. This study underscores the need to carefully evaluate how LLMs are used in patient-facing applications.

Matching journals

●Non-profit ◐University press ○Commercial

The top 8 journals account for 50% of the predicted probability mass.

Only show non-profit

International Journal of Medical Informatics

○ 25 papers in training set

Journal of The Royal Society Interface

● 189 papers in training set

PLOS Digital Health

● 91 papers in training set

Scientific Reports

○ 3102 papers in training set

● 4510 papers in training set

Cell Reports Methods

○ 141 papers in training set

Proceedings of the National Academy of Sciences

● 2130 papers in training set

● 5422 papers in training set

50% of probability mass above

Bioinformatics Advances

◐ 184 papers in training set

● 281 papers in training set

npj Digital Medicine

○ 97 papers in training set

Clinical Infectious Diseases

◐ 231 papers in training set

○ 1063 papers in training set

Journal of the American Medical Informatics Association

◐ 61 papers in training set

Frontiers in Digital Health

○ 20 papers in training set

Life Science Alliance

● 263 papers in training set

European Respiratory Journal

● 54 papers in training set

The American Journal of Human Genetics

○ 206 papers in training set

Journal of Medical Internet Research

◐ 85 papers in training set

Cell Reports Medicine

○ 140 papers in training set

Nature Medicine

○ 117 papers in training set

◐ 225 papers in training set

Computers in Biology and Medicine

○ 120 papers in training set

○ 104 papers in training set

Science Translational Medicine

● 111 papers in training set

European Journal of Human Genetics

○ 49 papers in training set

Journal of Clinical Microbiology

● 120 papers in training set

PLOS Computational Biology

● 1633 papers in training set

○ 16 papers in training set

Communications Biology

○ 886 papers in training set