Back

The prevalence of protein misfolding as a mechanism for hereditary deafness

Gogal, R. A.; Cox, G. M.; Kolbe, D. L.; Odell, A. M.; Ovel, C. E.; McCormick, K. I.; Hong, B.; Azaiez, H.; Casavant, T. L.; Smith, R. J. H.; Braun, T. A.; Schnieders, M. J.

2026-03-11 genetics
10.64898/2026.03.09.710547 bioRxiv
Show abstract

Hearing loss is the most common sensory deficit impacting [~]5% of the worlds population. The Deafness Variation Database (DVD) is a public resource of deafness variants, containing over 380,000 missense variants across 224 genes, with 303,577 classified as a variant of uncertain significance (VUS). To address the challenge of evaluating each deafness associated VUS, we evaluate a family of probabilistic frameworks to quantify the strength of computational evidence based on ACMG/AMP recommendations. First, CADD and REVEL are compared using Bayesian models parameterized using either a ClinVar 2019 dataset or labeled DVD variants. The REVEL model built using the DVD dataset demonstrates the best accuracy, sensitivity, and specificity. Incorporation of (in)tolerance to missense variation based on sorting each gene into three bins (tolerant, average, intolerant) shows that intolerant DVD genes are consistent with a higher prior probability of being pathogenic (25.7%) than average (10.7%) or tolerant (8.7%) genes. Finally, the impact of protein folding stability was incorporated using a 2D likelihood, which surpassed the simpler models while also offering a biophysical rationale for the disease mechanism. The protein folding-informed Bayesian model results in 28,866 prioritized VUSs reaching a posterior probability of pathogenicity above 98% with a false positive rate of only 0.14%. Overall, 54,752 missense variants are predicted to cause protein folding destabilization of greater than 1.0 kcal/mol, while 18,706 of the 28,886 prioritized VUS (65%) surpass this threshold. From these VUSs, we identify twelve probands where the patients genetic diagnosis is upgraded to likely pathogenic/pathogenic. We highlight two variants that cause clear structural disruption, demonstrating the impact of biophysical characterization on variant evaluation. Author SummaryWe investigate the impacts of single amino acid changes on protein structure and folding in the context of hearing loss. Hearing loss is the most common impairment of the main senses affecting nearly 5% of the worlds population. About 45% of people with hearing loss receive a diagnosis after targeted genetic testing. Here, we integrate biophysical data that quantifies the effect of a change to protein sequence on protein folding in combination with genetic data to improve our ability to identify protein amino acid changes that are likely to impact hearing. Our work leads to 12 patients receiving an upgraded diagnosis with their variant disrupting protein stability. Although the method is applied to hearing loss, it can be used for interpreting protein sequence changes in other disease contexts.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
12.2%
2
PLOS Genetics
756 papers in training set
Top 2%
8.3%
3
Scientific Reports
3102 papers in training set
Top 10%
8.3%
4
Genetic Epidemiology
46 papers in training set
Top 0.1%
6.3%
5
The American Journal of Human Genetics
206 papers in training set
Top 0.8%
6.3%
6
Bioinformatics Advances
184 papers in training set
Top 0.9%
4.3%
7
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
4.3%
8
Human Genetics
25 papers in training set
Top 0.1%
3.9%
50% of probability mass above
9
Human Mutation
29 papers in training set
Top 0.2%
3.9%
10
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
11
Journal of Medical Genetics
28 papers in training set
Top 0.2%
2.7%
12
GENETICS
189 papers in training set
Top 0.4%
2.6%
13
Genetics in Medicine
69 papers in training set
Top 0.6%
1.9%
14
Frontiers in Genetics
197 papers in training set
Top 4%
1.9%
15
European Journal of Human Genetics
49 papers in training set
Top 0.5%
1.9%
16
Nucleic Acids Research
1128 papers in training set
Top 10%
1.9%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 33%
1.7%
18
PLOS ONE
4510 papers in training set
Top 54%
1.7%
19
iScience
1063 papers in training set
Top 18%
1.5%
20
Genome Biology
555 papers in training set
Top 5%
1.5%
21
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.5%
22
BMC Bioinformatics
383 papers in training set
Top 5%
1.3%
23
Genetics
225 papers in training set
Top 3%
1.3%
24
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
25
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.8%
26
Biochemistry and Biophysics Reports
28 papers in training set
Top 2%
0.7%
27
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
28
Communications Biology
886 papers in training set
Top 25%
0.7%
29
npj Genomic Medicine
33 papers in training set
Top 1%
0.7%
30
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%