Back

An AI-Integrated Framework for Precision Genomics in Coronary Artery Disease Using Whole Exome and Phenotypic Data

UPPALURI, K. R.; CHALLA, H. J.; VEMPATI, K. K.; KADALI, L. N.; PALASAMUDRAM, K.; RAYALA, M.

2026-01-30 genetic and genomic medicine
10.64898/2026.01.28.26345099 medRxiv
Show abstract

Coronary artery disease (CAD) is a multifactorial condition influenced by genetic, phenotypic, and environmental factors. Traditional risk prediction models fall short in capturing the polygenic complexity of CAD, particularly in underrepresented populations. This study presents SIGMA (Scoring Importance of Genes specific to disease using Machine learning Algorithms), a novel AI-powered framework that enhances CAD risk prediction by integrating genomic and phenotypic data. Our approach leverages GEMS (GeneConnectRx Evidence Metrics), an LLM-driven system to score 1772 CAD-associated genes, and CASCADE (Comprehensive Assessment of Sequence and Clinical Annotation Data Evaluation), a tiered variant scoring pipeline. Using whole exome sequencing (WES) data from 1,243 individuals (628 controls, 615 CAD cases), the model integrates age and gender as key non-modifiable phenotypes. Results show significant improvements in sensitivity (from 0.41 to 0.79), specificity (0.70 to 0.72), and AUC (0.59 to 0.81) when phenotype data are incorporated. Our findings highlight the potential of AI-integrated genomics for population-specific CAD risk stratification.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Genome Medicine
154 papers in training set
Top 0.1%
18.4%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.7%
6.7%
3
Scientific Reports
3102 papers in training set
Top 25%
4.8%
4
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.4%
4.8%
5
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.8%
6
Nature Communications
4913 papers in training set
Top 41%
3.5%
7
Bioinformatics
1061 papers in training set
Top 5%
3.5%
8
Human Genetics
25 papers in training set
Top 0.1%
3.5%
50% of probability mass above
9
Genetic Epidemiology
46 papers in training set
Top 0.2%
3.5%
10
Journal of Biomedical Informatics
45 papers in training set
Top 0.5%
3.0%
11
Frontiers in Genetics
197 papers in training set
Top 3%
3.0%
12
BMC Genomics
328 papers in training set
Top 1%
2.8%
13
BMC Medical Genomics
36 papers in training set
Top 0.3%
2.3%
14
Nucleic Acids Research
1128 papers in training set
Top 8%
2.3%
15
European Journal of Human Genetics
49 papers in training set
Top 0.5%
2.1%
16
npj Digital Medicine
97 papers in training set
Top 2%
1.8%
17
Bioinformatics Advances
184 papers in training set
Top 3%
1.7%
18
PLOS ONE
4510 papers in training set
Top 54%
1.7%
19
npj Genomic Medicine
33 papers in training set
Top 0.4%
1.6%
20
International Journal of Epidemiology
74 papers in training set
Top 1%
1.6%
21
Human Genomics
21 papers in training set
Top 0.1%
1.5%
22
Genetics in Medicine
69 papers in training set
Top 0.7%
1.3%
23
Cell Genomics
162 papers in training set
Top 4%
1.3%
24
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.5%
0.9%
25
Human Molecular Genetics
130 papers in training set
Top 3%
0.9%
26
Journal of Translational Medicine
46 papers in training set
Top 2%
0.8%
27
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
28
PLOS Computational Biology
1633 papers in training set
Top 24%
0.8%
29
Human Mutation
29 papers in training set
Top 0.7%
0.7%
30
Communications Biology
886 papers in training set
Top 25%
0.7%