Back

PhenoScore: AI-based phenomics to quantify rare disease and genetic variation

Dingemans, A. J. M.; Hinne, M.; Truijen, K. M. G.; Goltstein, L.; van Reeuwijk, J.; de Leeuw, N.; Schuurs-Hoeijmakers, J.; Pfundt, R.; Diets, I. J. M.; den Hoed, J.; de Boer, E.; Coenen-van der Spek, J.; Jansen, S.; van Bon, B. W.; Jonis, N.; Ockeloen, C.; Vulto-van Silfhout, A. T.; Kleefstra, T.; Koolen, D. A.; Campeau, P. M.; Palmer, E. E.; Van Esch, H.; Lyon, G. J.; Alkuraya, F. S.; Rauch, A.; Marom, R.; Baralle, D.; van der Sluijs, P. J.; Santen, G. W. E.; Kooy, R. F.; van Gerven, M. A. J.; Vissers, L. E. L. M.; de Vries, B. B. A.

2022-10-26 genetic and genomic medicine
10.1101/2022.10.24.22281480 medRxiv
Show abstract

While both molecular and phenotypic data are essential when interpreting genetic variants, prediction scores (CADD, PolyPhen, and SIFT) have focused on molecular details to evaluate pathogenicity -- omitting phenotypic features. To unlock the full potential of phenotypic data, we developed PhenoScore: an open source, artificial intelligence-based phenomics framework. PhenoScore combines facial recognition technology with Human Phenotype Ontology (HPO) data analysis to quantify phenotypic similarity at both the level of individual patients as well as of cohorts. We prove PhenoScores ability to recognize distinct phenotypic entities by establishing recognizable phenotypes for 25 out of 26 investigated genetic syndromes against clinical features observed in individuals with other neurodevelopmental disorders. Moreover, PhenoScore was able to provide objective clinical evidence for two distinct ADNP-related phenotypes, that had already been established functionally, but not yet phenotypically. Hence, PhenoScore will not only be of use to unbiasedly quantify phenotypes to assist genomic variant interpretation at the individual level, such as for reclassifying variants of unknown clinical significance, but is also of importance for detailed genotype-phenotype studies.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
40.9%
2
Nature Communications
4913 papers in training set
Top 27%
6.6%
3
Nature Medicine
117 papers in training set
Top 0.6%
4.1%
50% of probability mass above
4
Scientific Reports
3102 papers in training set
Top 33%
3.7%
5
Genetics in Medicine
69 papers in training set
Top 0.4%
3.7%
6
Cell Genomics
162 papers in training set
Top 2%
3.2%
7
European Journal of Human Genetics
49 papers in training set
Top 0.5%
1.9%
8
npj Genomic Medicine
33 papers in training set
Top 0.3%
1.8%
9
Human Genetics
25 papers in training set
Top 0.1%
1.8%
10
Nature Genetics
240 papers in training set
Top 4%
1.8%
11
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.8%
12
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.8%
13
Nucleic Acids Research
1128 papers in training set
Top 10%
1.8%
14
Brain
154 papers in training set
Top 3%
1.3%
15
Med
38 papers in training set
Top 0.5%
1.1%
16
Human Mutation
29 papers in training set
Top 0.5%
1.0%
17
Frontiers in Genetics
197 papers in training set
Top 7%
1.0%
18
iScience
1063 papers in training set
Top 25%
0.9%
19
Communications Biology
886 papers in training set
Top 18%
0.9%
20
Bioinformatics
1061 papers in training set
Top 9%
0.7%
21
PLOS ONE
4510 papers in training set
Top 70%
0.7%
22
npj Digital Medicine
97 papers in training set
Top 4%
0.7%
23
BMC Medical Genomics
36 papers in training set
Top 2%
0.5%
24
Molecular Systems Biology
142 papers in training set
Top 2%
0.5%
25
Science Advances
1098 papers in training set
Top 34%
0.5%