Genome-Wide Significance Reconsidered: Low-Frequency Variants and Regulatory Networks in Autism
Mendes de Aquino, M.; Engchuan, W.; Thompson, S.; Zhou, X.; Safarian, N.; Chen, D. Z.; Trost, B.; Salazar, N. B.; Ma, C.; Thiruvahindrapuram, B.; Vorstman, J.; Scherer, S. W.; Breetvelt, E.
Show abstract
Low-frequency variants (LFVs), defined by minor allele frequencies (MAF) of 1-5%, occupy the gap between common and rare variants in both frequency and effect size. The conventional genome-wide association study (GWAS) significance threshold (5x10-) is overly conservative for LFVs, which account for more than 25% of variants in GWAS. This limitation may obscure meaningful associations in highly heritable yet genetically complex disorders such as autism spectrum disorder (ASD). We hypothesize that the scarcity of significant LFVs in ASD GWAS reflects statistical constraints rather than a true lack of association. To address this, we derived a MAF-specific genome-wide significance threshold using linkage disequilibrium-informed simulations applied to ASD GWAS summary statistics, identifying 2.03x10- as optimal. Applying this threshold revealed three novel LFVs mapping to zinc finger proteins (ZNF420, ZNF781) and known ASD-related genes (KMT2E, PRKDC, MCM4). Enrichment analyses suggested their function in nervous system development and gene regulation. Our findings highlight the contribution of LFVs to ASD risk and underscore the importance of frequency-aware association strategies.
Matching journals
The top 10 journals account for 50% of the predicted probability mass.