Back

In vivo versus in silico assessment of potentially pathogenic missense variants in human reproductive genes

Ding, X.; Singh, P.; Tran, T. N.; Fragoza, R.; Yu, H.; Schimenti, J. C.

2021-10-12 genetics
10.1101/2021.10.12.464112 bioRxiv
Show abstract

Infertility is a heterogeneous condition, with genetic causes estimated to be involved in approximately half of the cases. High-throughput sequencing (HTS) is becoming an increasingly important tool for genetic diagnosis of diseases including idiopathic infertility, however, most rare or minor alleles revealed by HTS are variants of uncertain significance (VUS). Interpreting the functional impacts of VUS is challenging but profoundly important for clinical management and genetic counseling. To determine the consequences of population polymorphisms in key fertility genes, we functionally evaluated 11 missense variants in the genes ANKRD31, BRDT, DMC1, EXOI, FKBP6, MCM9, M1AP, MEI1, MSH4 and SEPT12 by generating genome-edited mouse models. Nine variants were classified as deleterious by most functional prediction algorithms, and two disrupted a protein-protein interaction in the yeast 2 hybrid assay. Even though these genes are known to be essential for normal meiosis or spermiogenesis in mice, only one of the tested human variants (rs1460351219, encoding p.R581H in MCM9), which was observed in a male infertility patient, compromised fertility or gametogenesis in the mouse models. To explore the disconnect between predictions and outcomes, we compared pathogenicity calls of missense variants made by ten widely-used algorithms to: 1) those present in ClinVar, and 2) those which have been evaluated in mice. We found that all the algorithms performed poorly in terms of predicting the effects of human missense variants that have been modeled in mice. These studies emphasize caution in the genetic diagnoses of infertile patients based primarily on pathogenicity prediction algorithms, and emphasize the need for alternative and efficient in vitro or vivo functional validation models for more effective and accurate VUS delineation to either pathogenic or benign categories. SignificanceAlthough infertility is a substantial medical problem that affects up to 15% of couples, the potential genetic causes of idiopathic infertility have been difficult to decipher. This problem is complicated by the large number of genes that can cause infertility when perturbed, coupled with the large number of VUS that are present in the genomes of affected patients. Here, we present and analyze mouse modeling data of missense variants that are classified as deleterious by commonly-used pathogenicity prediction algorithms but which caused no detectible phenotype when introduced into mice by genome editing. We find that augmenting pathogenicity predictions with preliminary screens for biochemical defects substantially enhanced the proportion of prioritized variants that caused phenotypes in mice. The results emphasize that, in the absence of substantial improvements of in silico prediction tools or other compelling pre-existing evidence, in vivo analysis is crucial for confident attribution of infertility alleles.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
The American Journal of Human Genetics
206 papers in training set
Top 0.1%
32.3%
2
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
14.1%
3
PLOS Genetics
756 papers in training set
Top 2%
8.2%
50% of probability mass above
4
eLife
5422 papers in training set
Top 15%
6.2%
5
Molecular Human Reproduction
11 papers in training set
Top 0.1%
4.7%
6
Human Reproduction
18 papers in training set
Top 0.2%
3.5%
7
GENETICS
189 papers in training set
Top 0.3%
3.0%
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 27%
2.3%
9
Journal of Medical Genetics
28 papers in training set
Top 0.2%
2.0%
10
Genetics in Medicine
69 papers in training set
Top 0.6%
2.0%
11
Scientific Reports
3102 papers in training set
Top 54%
1.8%
12
Human Molecular Genetics
130 papers in training set
Top 2%
1.7%
13
Frontiers in Genetics
197 papers in training set
Top 6%
1.5%
14
Disease Models & Mechanisms
119 papers in training set
Top 2%
1.3%
15
European Journal of Human Genetics
49 papers in training set
Top 0.9%
1.2%
16
The Journal of Clinical Endocrinology & Metabolism
35 papers in training set
Top 1%
0.9%
17
Nature Communications
4913 papers in training set
Top 62%
0.8%
18
Genetics
225 papers in training set
Top 4%
0.7%
19
G3: Genes, Genomes, Genetics
222 papers in training set
Top 1%
0.7%
20
BMC Biology
248 papers in training set
Top 5%
0.7%
21
PLOS Computational Biology
1633 papers in training set
Top 28%
0.6%