Back

The performance of in silico prediction tools for variant curation in a panel of cancer genes.

Nelson, N.; Niaz, A.; Fairfax, K.; Bryan, T. M.; Lucas, S.; Dickinson, J.

2025-07-30 genetic and genomic medicine
10.1101/2025.07.29.25331316
Show abstract

Rare single base pair changes in genes are an important cause of disease, as they can reside in key regions of the gene influencing biological function by impacting the protein conformation and protein interactions. Generation of the necessary experimental evidence to define the outcome of the presence of these gene variants is time consuming and costly. These challenges have led to the development of a plethora of in silico prediction tools. These tools frequently use similar sources of information and are trained on overlapping multi-gene truth datasets. However, frequently there has been no quantitative validation of the performance of these in silico tools for individual genes. Here we have applied the ClinGen Sequence Variant Interpretation Working Groups recommended in silico score thresholds to a set of predisposition gene variants with established pathogenicity/benignity. Of the genes assessed (BRCA1, BRCA2, TP53, TERT and ATM), in silico tool predictions showed inferior sensitivity (<65%) for pathogenic TERT variants and inferior sensitivity ([&le;]81%) for benign TP53 variants. This validation study highlights in silico tool performance can be gene-specific and is dependent on the training set on which the algorithm is built. Where there are sufficient numbers of established benign and pathogenic missense variants based on clinical and functional evidence, the use of in silico tool scores should be validated for individual genes. For genes where this is not possible and gene-agnostic in silico score cut offs are used, consideration of missense variant-protein structural impact relationships is suggested.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.