Back

Comprehensive analysis of de novo variants across 2,497 orofacial cleft trios reveals novel genetic drivers of disease

Kurtas, N. E.; Sanchis-Juan, A.; Shin, E.; Curtis, S. W.; Robinson, K. R.; Lee, A. S.; Alade, A. A.; Zhao, X.; Fu, J.; Diaz Perez, K. K.; Gowans, J. J. L.; Eshete, M. A.; Adeyemo, W. L.; Buxo, C. J.; Padilla, C. D.; Poletta, F. A.; Carreno Torres, A.; Wehby, G. L.; Hecht, J. T.; Moreno Uribe, L. M.; Mukhopadhyay, N.; Shaffer, J. R.; Weinberg, S. M.; Murray, J. C.; Beaty, T. H.; Butali, A.; Talkowski, M.; Marazita, M. L.; Leslie-Clarkson, E. J.; Brand, H.

2026-05-24 genetic and genomic medicine
10.64898/2026.05.21.26352934 medRxiv
Show abstract

Background Orofacial clefts (OFCs) and other palate abnormalities (PAs) are among the most common birth defects worldwide and are characterized by the abnormal formation of the lip and/or palate. Genetic studies have traditionally classified OFC cases as either syndromic, involving OFCs alongside other congenital anomalies, or nonsyndromic, which represent the majority of cases and occur in isolation. Emerging genomic evidence indicates that genes traditionally associated with syndromic forms of OFC can also harbor variants contributing to isolated cases, challenging the notion of a strict dichotomy between these categories and supporting their integration for gene discovery. Methods In this study, we applied multiple analytic approaches to characterize the genetic architecture of OFC and PAs by integrating genomic data from 2,497 trios with an OFC (n=2080) and PA (n=417) affected proband. We compared these findings across OFC subtypes and syndromic status with those from 5,515 control trios to identify enriched biological pathways and mechanisms and to prioritize candidate genes using variant burden testing. Results We observed a significant enrichment of de novo protein-truncating and damaging missense variants in cases compared to controls (OR = 2.17, p = 1.21x10-32), with particularly strong signals in biologically relevant gene sets involving OFC-associated, constrained, Mendelian disorder, and mouse candidate genes. Variant burden testing identified 39 OFC risk genes at FDR [≤] 0.05, which we then integrated with 593 established OFC genes to interrogate the functional underpinnings of OFC via network analysis. This analysis revealed 309 high-order interactor genes not previously associated with OFC. Notably, this OFC network clustered into ten distinct biological pathways, with nucleosome-associated genes showing significant enrichment among cases in our cohort (OR = 14.8, p = 8.1x10-4). In a final integrative step, we combined evidence across all analyses to nominate 231 candidate genes, 32 of which contained at least two deleterious de novo variants in our cohort. Conclusions These findings underscore the value of integrating diverse OFC and PA subtypes, syndromic status, and variant classes to refine the genetic architecture of these disorders, highlighting both phenotypic expansion of known disease genes and the emergence of novel gene-phenotype associations.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genetics in Medicine
69 papers in training set
Top 0.1%
38.4%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.4%
10.3%
3
European Journal of Human Genetics
49 papers in training set
Top 0.2%
4.4%
50% of probability mass above
4
American Journal of Medical Genetics Part A
17 papers in training set
Top 0.1%
4.0%
5
Journal of Medical Genetics
28 papers in training set
Top 0.1%
3.6%
6
Human Molecular Genetics
130 papers in training set
Top 0.7%
3.6%
7
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
3.6%
8
npj Genomic Medicine
33 papers in training set
Top 0.2%
3.3%
9
Human Genetics
25 papers in training set
Top 0.1%
2.1%
10
Genetics in Medicine Open
10 papers in training set
Top 0.1%
1.8%
11
Genome Medicine
154 papers in training set
Top 4%
1.7%
12
Nature Genetics
240 papers in training set
Top 5%
1.4%
13
Nature Communications
4913 papers in training set
Top 55%
1.4%
14
Genetic Epidemiology
46 papers in training set
Top 0.6%
1.2%
15
The Journal of Pediatrics
15 papers in training set
Top 0.4%
1.2%
16
JAMA Pediatrics
10 papers in training set
Top 0.1%
1.1%
17
PLOS Genetics
756 papers in training set
Top 12%
1.0%
18
Scientific Reports
3102 papers in training set
Top 70%
0.9%
19
Human Mutation
29 papers in training set
Top 0.6%
0.9%
20
Cell Genomics
162 papers in training set
Top 8%
0.7%
21
Archives of Disease in Childhood
15 papers in training set
Top 0.5%
0.7%
22
Annals of Clinical and Translational Neurology
29 papers in training set
Top 1%
0.7%
23
Journal of Clinical Investigation
164 papers in training set
Top 8%
0.7%
24
Brain
154 papers in training set
Top 5%
0.5%
25
Cell Reports Medicine
140 papers in training set
Top 10%
0.5%
26
Molecular Psychiatry
242 papers in training set
Top 4%
0.5%