
Epistatic SNP network analysis (ESNA): A scalable framework for genome-wide detection of higher-order genetic interactions

Zhang, Y.; Han, M.; Ambalavanan, A.; Topouza, D.; Fang, Z. Y.; Stickley, S. A.; Anand, S.; Turvey, S.; Mandhane, P. J.; Simons, E.; Moraes, T. J.; Subbarao, P.; Choi, J.; Duan, Q.

2026-05-13 · genetic and genomic medicine
medRxiv · DOI: 10.64898/2026.05.08.26352667

Although genome-wide association studies (GWASs) have been widely applied to investigate the genetic basis of common traits and diseases in human populations, the associated loci do not fully account for the estimated heritability. The missing heritability may be explained, in part, by epistasis or gene-gene interactions. Existing methods for detecting epistasis, however, are limited to pair-wise interactions and/or targeted genomic regions. Here, we present a novel model, termed the Epistatic SNP Network Analysis (ESNA), which detects higher-order epistatic interactions using genome-wide SNP data. ESNA employs a scale-free network algorithm within a parallel computing framework that identifies modules of correlated SNPs (potentially interacting variants that converge on common biological pathways) while enhancing computational efficiency. We applied ESNA to investigate epistatic interactions contributing to respiratory outcomes such as recurrent wheeze and asthma among preschool-aged children in the CHILD Cohort Study. Using genome-wide data comprising 775,569 SNPs from 1,899 children, ESNA identified 914 SNP network modules, 9 of which were significantly associated with recurrent wheeze between ages 2 and 5 years (P < 5.47 × 10⁻⁵). Furthermore, 7 of these wheeze-associated modules were also associated with asthma by age 5 years (P < 5.47 × 10⁻⁵). Pathway enrichment analysis revealed that the associated modules consist of SNPs located in genes previously implicated in asthma and related biological processes, such as cellular response to stimuli and nervous system development. Compared to existing network-based methods for epistasis, ESNA demonstrated substantial improvements in computational efficiency, reducing memory usage by 50% and processing genome-wide SNP data 48 times faster. The code implementation and documentation are available at https://github.com/ComputationalGenomicsLaboratory/ESNA.
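
The abstract does not say how the significance threshold was derived. One plausible reading, offered here only as an assumption, is a Bonferroni correction of a 0.05 significance level across the 914 modules tested. The short sketch below checks that arithmetic; the function name and the choice of alpha are illustrative and not taken from the ESNA code.

```python
# Assumption: the reported threshold is a Bonferroni correction of
# alpha = 0.05 over the 914 SNP network modules. The abstract does not
# state the correction method; this only checks that the numbers agree.

ALPHA = 0.05      # assumed family-wise significance level
N_MODULES = 914   # number of modules reported in the abstract


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Return the per-test significance threshold alpha / n_tests."""
    if n_tests <= 0:
        raise ValueError("n_tests must be a positive integer")
    return alpha / n_tests


threshold = bonferroni_threshold(ALPHA, N_MODULES)
print(f"{threshold:.2e}")  # prints 5.47e-05
```

Under that assumption the result matches the reported cutoff to three significant figures.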

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

Rank  Journal                                 Papers in training set  Percentile  Probability
1     Bioinformatics                          1061                    Top 0.8%    27.4%
2     Genome Medicine                         154                     Top 1%      6.3%
3     Nature Communications                   4913                    Top 30%     6.3%
4     The American Journal of Human Genetics  206                     Top 0.8%    6.3%
5     Genome Research                         409                     Top 0.4%    6.3%
---- 50% of probability mass above ----
6     Scientific Reports                      3102                    Top 31%     3.9%
7     iScience                                1063                    Top 5%      3.6%
8     Cell Genomics                           162                     Top 1%      3.6%
9     Genetic Epidemiology                    46                      Top 0.2%    3.6%
10    PLOS Genetics                           756                     Top 5%      3.6%
11    PLOS Computational Biology              1633                    Top 13%     2.3%
12    Human Genetics and Genomics Advances    70                      Top 0.2%    2.1%
13    BMC Bioinformatics                      383                     Top 4%      2.1%
14    Briefings in Bioinformatics             326                     Top 3%      1.9%
15    Communications Biology                  886                     Top 6%      1.9%
16    Bioinformatics Advances                 184                     Top 3%      1.8%
17    Nature Genetics                         240                     Top 5%      1.6%
18    eLife                                   5422                    Top 46%     1.5%
19    PLOS ONE                                4510                    Top 60%     1.2%
20    Frontiers in Molecular Biosciences      100                     Top 3%      0.9%
21    Frontiers in Genetics                   197                     Top 8%      0.9%
22    Human Genetics                          25                      Top 0.3%    0.8%
23    Human Molecular Genetics                130                     Top 3%      0.7%
24    Genome Biology                          555                     Top 8%      0.7%
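
The 50% cutoff marked in the list can be checked by accumulating the listed probabilities in rank order. The sketch below uses only the values shown for ranks 1 to 6; the helper name is hypothetical, and the page's own computation may differ in rounding.

```python
from itertools import accumulate

# Predicted probabilities (percent) for ranks 1-6, copied from the list above.
probabilities = [27.4, 6.3, 6.3, 6.3, 6.3, 3.9]


def rank_reaching(mass_percent, probs):
    """Return the first 1-based rank whose cumulative probability reaches
    mass_percent, or None if the listed values never reach it."""
    for rank, total in enumerate(accumulate(probs), start=1):
        if total >= mass_percent:
            return rank
    return None


cutoff_rank = rank_reaching(50.0, probabilities)
print(cutoff_rank)  # prints 5
```

The cumulative mass is about 46.3% after four journals and about 52.6% after five, which is consistent with the cutoff being drawn below rank 5.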