Back

Optimizing Gene Selection and Network-Level Insights in Hypertrophic Cardiomyopathy: A Novel Genetic Algorithm Combined with WGCNA and Statistical Filtering

Mandal, S.; Sahaya, A.; Thakur, A.; Biswas, S.

2025-07-02 health informatics
10.1101/2025.07.01.25330641 medRxiv
Show abstract

A cardiac condition known as hypertrophic cardiomyopathy (HCM) is characterized by an irregular thickening of the heart muscle. There is still much to learn about its molecular mechanics. In order to pinpoint important genes and regulatory abnormalities in HCM, this work offers a thorough computational analysis of gene expression data. Two strategies are employed here. Initially, hub genes were identified, co-expression networks were constructed, gene modules were detected, and they were linked to clinical characteristics using Weighted Gene Co-expression Network Analysis (WGCNA). Second, the same dataset was subjected to three different gene selection techniques: variance-based filtering, volcano plot analysis, and a Genetic Algorithm for Novel Gene Acquisition (GANGA). For the first time, GANGA successfully incorporates a previously defined objective function from simulated annealing into a genetic algorithm. Additionally, it uses two-point crossover, meticulous parameter optimization, and customizable elitism. Three genes were shown to be shared by all approaches, including WGCNA: RASID1, CEBPD, and S100A9. Through enrichment analysis, these were confirmed to be implicated in pathways linked to inflammation. Their incorporation into cytokine-driven networks was validated by investigation of protein-protein interactions. S100A9 emerged as a crucial regulator that activates RASD1 in illness, according to co-expression networks constructed for normal and HCM samples, which showed changed regulatory patterns. The methodological development of modifying and optimizing a simulated annealing-based objective function within a GA framework in GANGA for efficient gene selection, as well as the comprehensive multi-method pipeline for HCM analysis, are what make this study distinctive.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Computers in Biology and Medicine
120 papers in training set
Top 0.1%
40.1%
2
Scientific Reports
3102 papers in training set
Top 5%
10.6%
50% of probability mass above
3
PLOS ONE
4510 papers in training set
Top 31%
4.9%
4
BioMed Research International
25 papers in training set
Top 0.7%
3.7%
5
Physical Biology
43 papers in training set
Top 0.7%
2.4%
6
Informatics in Medicine Unlocked
21 papers in training set
Top 0.4%
1.9%
7
Vaccines
196 papers in training set
Top 1%
1.8%
8
BMC Cardiovascular Disorders
14 papers in training set
Top 1.0%
1.7%
9
Chaos, Solitons & Fractals
32 papers in training set
Top 1%
1.7%
10
Journal of Personalized Medicine
28 papers in training set
Top 0.4%
1.5%
11
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.4%
1.5%
12
Biomedicines
66 papers in training set
Top 1%
1.4%
13
Cognitive Neurodynamics
15 papers in training set
Top 0.2%
1.4%
14
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 2%
1.4%
15
PLOS Computational Biology
1633 papers in training set
Top 18%
1.4%
16
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.4%
17
International Journal of Molecular Sciences
453 papers in training set
Top 10%
1.2%
18
BMC Bioinformatics
383 papers in training set
Top 6%
1.1%
19
Advanced Biology
29 papers in training set
Top 0.7%
1.0%
20
Bioinformatics
1061 papers in training set
Top 9%
0.9%
21
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
22
Journal of Translational Medicine
46 papers in training set
Top 2%
0.8%
23
Frontiers in Physiology
93 papers in training set
Top 5%
0.8%
24
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.8%
25
Biology Methods and Protocols
53 papers in training set
Top 2%
0.8%
26
Mathematics
11 papers in training set
Top 0.4%
0.8%
27
Biomedical Signal Processing and Control
18 papers in training set
Top 0.5%
0.8%
28
Computational and Structural Biotechnology Journal
216 papers in training set
Top 11%
0.7%
29
Physica A: Statistical Mechanics and its Applications
10 papers in training set
Top 0.3%
0.7%