Back

Genosolver: Rare Disease Diagnosis through Holistic Integration of Unstructured Clinical Narratives Using Large Language and Reasoning Models

Islam, T.; Danner, M.; Ziad, Z.; Begemann, M.; Beijer, D.; Lischka, A.; Lausberg, E.; Mattern, L.; Suh, J.; Wittig, P.; Guezel, N.; Schlaich, E.; Karaivanova, R.; D'Augello, S.; Franken, L.; Ruedebusch, J.; Mueller, R.; Perchalla, E.; Zempel, H.; Haag, N.; Eggermann, K.; Eggermann, T.; Meyer, R.; Kraft, F.; Elbracht, M.; Kurth, I.; Krause, J.

2026-06-05 health informatics
10.64898/2026.06.04.26354845 medRxiv
Show abstract

Background: Molecular medicine has made genetic diagnostics crucial for rare diseases, but the majority of patients remains without diagnosis even after state-of-the-art assessment. Standardized systems for integrating clinical features, such as the Human Phenotype Ontology (HPO), offer assistance, but are often insufficiently detailed and fail to capture crucial clinical parameters such as age at onset, longitudinal changes in symptoms, detailed characteristics of a clinical symptom, or the absence of a feature. Results: We present Genosolver an integrated workflow that utilizes machine learning to address this bottleneck. Using Large Language Models (LLMs) and Large Reasoning Models (LRMs) on unstructured clinical notes and electronic health care data, we generate a workflow that unifies phenotype extraction, generates differential diagnosis, and prioritizes genetic variants from genome data. We evaluated the performance on 233 previously genetically solved cases, where Genosolver ranked the causative gene first in 72% of cases and in 94% of cases in the top 10 gene list, outperforming the existing benchmarking tool Exomiser by 9%. Semi-automated reanalysis of 1,875 unsolved rare disease cases yielded an additional diagnostic rate of 1.7%. Incorporating rich, unstandardized clinical narratives substantially enhanced model performance beyond HPO-only inputs and demonstrated competitive results using data security compliant local models. Conclusion: Integrating unstandardized clinical data with local LLMs and reasoning offers a scalable, data-secure workflow that increases molecular diagnoses in rare diseases.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
22.7%
2
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
12.5%
3
npj Digital Medicine
97 papers in training set
Top 0.5%
10.2%
4
Scientific Reports
3102 papers in training set
Top 27%
4.3%
5
Journal of Biomedical Informatics
45 papers in training set
Top 0.3%
4.3%
50% of probability mass above
6
Bioinformatics
1061 papers in training set
Top 5%
4.0%
7
Med
38 papers in training set
Top 0.1%
3.6%
8
The Lancet Digital Health
25 papers in training set
Top 0.2%
2.9%
9
Genetics in Medicine
69 papers in training set
Top 0.5%
2.4%
10
Frontiers in Digital Health
20 papers in training set
Top 0.5%
1.9%
11
JAMIA Open
37 papers in training set
Top 0.7%
1.9%
12
Nature Communications
4913 papers in training set
Top 51%
1.7%
13
BMC Bioinformatics
383 papers in training set
Top 5%
1.5%
14
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.5%
15
International Journal of Medical Informatics
25 papers in training set
Top 0.9%
1.5%
16
iScience
1063 papers in training set
Top 19%
1.3%
17
Genome Medicine
154 papers in training set
Top 5%
1.3%
18
Nature Medicine
117 papers in training set
Top 3%
1.3%
19
eBioMedicine
130 papers in training set
Top 3%
1.1%
20
GENETICS
189 papers in training set
Top 1%
0.9%
21
PLOS ONE
4510 papers in training set
Top 63%
0.9%
22
Journal of Medical Internet Research
85 papers in training set
Top 4%
0.9%
23
JMIR Medical Informatics
17 papers in training set
Top 1%
0.8%
24
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.8%
25
BMC Medical Genomics
36 papers in training set
Top 1%
0.7%
26
Science Translational Medicine
111 papers in training set
Top 7%
0.6%
27
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.6%
28
Patterns
70 papers in training set
Top 4%
0.5%
29
Journal of Personalized Medicine
28 papers in training set
Top 2%
0.5%