Back

Clinical Evaluation of an AI System for Streamlined Variant Interpretation in Genetic Testing

Ruzicka, J.; Ravel, J.-M.; Audoux, J.; Boulat, A.; Thevenon, J.; Yauy, K.; Dancer, M.; Raymond, L.; Lombardi, Y.; Philippe, N.; Blum, M. G.; Duforet-Frebourg, N.; Mesnard, L.

2025-02-05 genetic and genomic medicine
10.1101/2025.02.04.25321641 medRxiv
Show abstract

The growing use of genomic sequencing to diagnose hereditary diseases has increased the interpretive workload for clinical laboratories. Efficient methods are needed to maximize diagnostic yield without overwhelming resources. We developed DiagAI, an integrative machine-learning system trained to prioritize and sort causal variants in rare diseases. DiagAI integrates Universal Pathogenicity Predictor (UP2), a machine-learning model trained to predict ACMG pathogenicity classes, PhenoGenius to match genotype-phenotype interactions and expert features such as inheritance and variant quality. We retrospectively analyzed 196 diagnosed exomes from a nephrology cohort. To benchmark UP2s performance, we evaluated the ranking of 62 causal missense variants. UP2 ranked variants most effectively beyond shortlist sizes of 10 and identified pathogenic variants missed by AlphaMissense. DiagAI identified 94.9% of causal variants in diagnostic exomes with HPO terms, compared to 90.8% without, with median shortlist sizes of 12 and 9 variants, respectively. With HPO terms, 74% of top-ranked variants were diagnostic, versus 42% without, outperforming Exomiser and AI-MARRVEL. DiagAI produces accurate shortlists that streamline variant interpretation, offering a scalable solution for growing diagnostic volumes.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
39.9%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.6%
8.3%
3
Nature Genetics
240 papers in training set
Top 1%
6.5%
50% of probability mass above
4
Nature Communications
4913 papers in training set
Top 28%
6.5%
5
Genetics in Medicine
69 papers in training set
Top 0.4%
4.4%
6
Nature Medicine
117 papers in training set
Top 0.8%
3.6%
7
Bioinformatics
1061 papers in training set
Top 6%
2.4%
8
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
2.1%
9
Med
38 papers in training set
Top 0.2%
2.1%
10
Cell Genomics
162 papers in training set
Top 3%
1.7%
11
Genome Biology
555 papers in training set
Top 5%
1.2%
12
Cell Systems
167 papers in training set
Top 10%
1.0%
13
Human Mutation
29 papers in training set
Top 0.5%
1.0%
14
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.9%
15
Scientific Reports
3102 papers in training set
Top 72%
0.8%
16
Journal of Clinical Investigation
164 papers in training set
Top 6%
0.8%
17
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
18
BMC Genomics
328 papers in training set
Top 6%
0.7%
19
Genetics in Medicine Open
10 papers in training set
Top 0.1%
0.7%
20
Cell
370 papers in training set
Top 18%
0.7%
21
Science Translational Medicine
111 papers in training set
Top 7%
0.7%
22
iScience
1063 papers in training set
Top 40%
0.5%
23
European Journal of Human Genetics
49 papers in training set
Top 2%
0.5%
24
Nature
575 papers in training set
Top 18%
0.5%
25
npj Digital Medicine
97 papers in training set
Top 4%
0.5%
26
Cancer Discovery
61 papers in training set
Top 2%
0.5%