Back

VSPGx: A High-Accuracy Pharmacogenomics Interpretation Software Solution with Automated CPIC Guideline Integration

Fortier, N.; Rudy, G.; Scherer, A.

2025-11-26 bioinformatics
10.1101/2025.11.24.690276 bioRxiv
Show abstract

Accurate pharmacogenomic genotype determination and interpretation are essential for personalized medicine, yet existing bioinformatics tools face significant limitations in detecting named alleles, maintaining current allele definitions, and providing comprehensive clinical annotations. We present VSPGx, a pharmacogenomics interpretation software solution that identifies diplotypes from next-generation sequencing data and annotates them against Clinical Pharmacogenetics Implementation Consortium (CPIC) and FDA drug recommendations using automated curation of the latest allele definitions. We benchmarked VSPGx against established tools including Aldy, PharmCAT, and Stargazer using both synthetic datasets and real-world clinical samples. In a comprehensive synthetic benchmark spanning 3,655 CYP2C9 diplotype combinations, VSPGx achieved 99.97% concordance, matching PharmCATs performance and substantially outperforming Aldy (93.08%) and Stargazer (27.06%). Clinical validation using 11 TaqMan OpenArray samples demonstrated 88.2% allele concordance and 89.1% phenotype concordance across 110 gene-sample combinations, with all discrepancies attributed to the benchmark data utilizing outdated allele definitions rather than VSPGx errors. Our automated curation process ensures continuous alignment with current CPIC guidelines, addressing a critical gap in existing pharmacogenomic analysis tools. VSPGx provides a robust, clinically-validated solution for pharmacogenomic analysis that combines high-accuracy diplotype calling with up-to-date, evidence-based drug recommendations.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
44.9%
2
Nature Communications
4913 papers in training set
Top 26%
6.9%
50% of probability mass above
3
The American Journal of Human Genetics
206 papers in training set
Top 1.0%
4.6%
4
PLOS ONE
4510 papers in training set
Top 37%
3.9%
5
Clinical Pharmacology & Therapeutics
25 papers in training set
Top 0.1%
3.9%
6
Bioinformatics
1061 papers in training set
Top 6%
2.9%
7
Nucleic Acids Research
1128 papers in training set
Top 7%
2.9%
8
BMC Bioinformatics
383 papers in training set
Top 4%
2.2%
9
Briefings in Bioinformatics
326 papers in training set
Top 3%
2.2%
10
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
2.0%
11
Clinical and Translational Science
21 papers in training set
Top 0.4%
1.8%
12
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.3%
13
BioData Mining
15 papers in training set
Top 0.6%
1.0%
14
Bioinformatics Advances
184 papers in training set
Top 4%
1.0%
15
Scientific Reports
3102 papers in training set
Top 72%
0.9%
16
npj Digital Medicine
97 papers in training set
Top 4%
0.7%
17
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%
18
Nature Biotechnology
147 papers in training set
Top 8%
0.7%
19
Computational and Structural Biotechnology Journal
216 papers in training set
Top 11%
0.5%
20
Genome Biology
555 papers in training set
Top 9%
0.5%
21
JCO Clinical Cancer Informatics
18 papers in training set
Top 1%
0.5%
22
Clinical Chemistry
22 papers in training set
Top 1%
0.5%
23
Clinical Infectious Diseases
231 papers in training set
Top 5%
0.5%