Back

An optimized variant prioritization process for rare disease diagnostics: recommendations for Exomiser and Genomiser

Cooperstein, I. B.; Marwaha, S.; Ward, A.; Kobren, S. N.; Carter, J. N.; Undiagnosed Diseases Network, ; Wheeler, M. T.; Marth, G. T.

2025-04-20 genetic and genomic medicine
10.1101/2025.04.18.25326061 medRxiv
Show abstract

PurposeWhole-exome sequencing (WES) and whole-genome sequencing (WGS) are increasingly used as standard genetic tests to identify the diagnostic variants in rare disease cases. However, prioritizing these variants to reduce the time and burden of manual interpretation by clinical teams remains a significant challenge. The Exomiser/Genomiser software suite is the most widely adopted open-source software for prioritizing coding and non-coding variants. Despite its ubiquitous use, limited data-driven guidelines currently exist to optimize its performance for diagnostic variant prioritization. Based on detailed analyses of Undiagnosed Diseases Network (UDN) probands, this study presents optimized parameters and practical recommendations for deploying the Exomiser and Genomiser tools. We also highlight scenarios where diagnostic variants may be missed and propose alternative workflows to improve diagnostic success in such complex cases. MethodsWe analyzed 386 diagnosed probands from the UDN, including cases with coding and non-coding diagnostic variants. We systematically evaluated how tool performance was affected by key parameters, including gene:phenotype association data, variant pathogenicity predictors, phenotype term quality and quantity, and the inclusion and accuracy of family variant data. ResultsParameter optimization significantly improved Exomisers performance over default parameters. For WGS data, the percentage of coding diagnostic variants ranked within the top ten candidates increased from 49.7% to 85.5%, and for WES, from 67.3% to 88.2%. For non-coding variants prioritized with Genomiser, the top ten rankings improved from 15.0% to 40.0%. We also explored refinement strategies for Exomiser outputs, including using p-value thresholds and flagging genes that are frequently ranked in the top 30 candidates but rarely associated with diagnoses. ConclusionThis study provides an evidence-based framework for variant prioritization in WES and WGS data using Exomiser and Genomiser. These recommendations have been implemented in the Mosaic platform to support the ongoing analysis of undiagnosed UDN participants and provide efficient, scalable reanalysis to improve diagnostic yield. Our work also highlights the importance of tracking solved cases and diagnostic variants that can be used to benchmark bioinformatics tools.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Human Mutation
29 papers in training set
Top 0.1%
22.9%
2
Genetics in Medicine
69 papers in training set
Top 0.1%
18.9%
3
Genome Medicine
154 papers in training set
Top 0.4%
10.3%
50% of probability mass above
4
BMC Bioinformatics
383 papers in training set
Top 2%
6.4%
5
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.7%
3.7%
6
npj Genomic Medicine
33 papers in training set
Top 0.2%
3.3%
7
Scientific Reports
3102 papers in training set
Top 42%
2.9%
8
BMC Medical Genomics
36 papers in training set
Top 0.3%
2.1%
9
Bioinformatics
1061 papers in training set
Top 6%
2.1%
10
Human Genetics
25 papers in training set
Top 0.1%
1.9%
11
BMC Genomics
328 papers in training set
Top 2%
1.9%
12
European Journal of Human Genetics
49 papers in training set
Top 0.5%
1.8%
13
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
14
Genetic Epidemiology
46 papers in training set
Top 0.5%
1.4%
15
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.5%
1.0%
16
Human Genomics
21 papers in training set
Top 0.3%
0.9%
17
GigaScience
172 papers in training set
Top 3%
0.8%
18
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
19
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
20
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
21
PLOS ONE
4510 papers in training set
Top 67%
0.8%
22
The Journal of Molecular Diagnostics
36 papers in training set
Top 0.6%
0.7%
23
Frontiers in Genetics
197 papers in training set
Top 11%
0.7%
24
The American Journal of Human Genetics
206 papers in training set
Top 5%
0.5%
25
Genes
126 papers in training set
Top 4%
0.5%
26
Journal of Medical Genetics
28 papers in training set
Top 0.7%
0.5%