Back

Genetic underpinnings of predicted changes in cardiovascular function using self supervised learning

Levine, Z.; Lutsker, G.; Godneva, A.; Weinberger, A.; Lotan-Pompan, M.; Talmor-Barkan, Y.; Reisner, Y.; Rossman, H.; Segal, E.

2024-08-21 systems biology
10.1101/2024.08.15.608061 bioRxiv
Show abstract

BackgroundThe genetic underpinnings of cardiovascular disease remain elusive. Contrastive learning algorithms have recently shown cutting-edge performance in extracting representations from electrocardiogram (ECG) signals that characterize cross-temporal cardiovascular state. However, there is currently no connection between these representations and genetics. MethodsWe designed a new metric, denoted as Delta ECG, which measures temporal shifts in patients cardiovascular state, and inherently adjusts for inter-patient differences at baseline. We extracted this measure for 4,782 patients in the Human Phenotype Project using a novel self-supervised learning model, and quantified the associated genetic signals with Genome-Wide-Association Studies (GWAS). We predicted the expression of thousands of genes extracted from Peripheral Blood Mononuclear Cells (PBMCs). Downstream, we ran enrichment and overrepresentation analysis of genes we identified as significantly predicted from ECG. FindingsIn a Genome-Wide Association Study (GWAS) of Delta ECG, we identified five associations that achieved genome-wide significance. From baseline embeddings, our models significantly predict the expression of 57 genes in men and 9 in women. Enrichment analysis showed that these genes were predominantly associated with the electron transport chain and the same immune pathways as identified in our GWAS. ConclusionsWe validate a novel method integrating self-supervised learning in the medical domain and simple linear models in genetics. Our results indicate that the processes underlying temporal changes in cardiovascular health share a genetic basis with CVD, its major risk factors, and its known correlates. Moreover, our functional analysis confirms the importance of leukocytes, specifically eosinophils and mast cells with respect to cardiac structure and function.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.1%
44.1%
2
Bioinformatics
1061 papers in training set
Top 3%
10.7%
50% of probability mass above
3
European Heart Journal - Digital Health
15 papers in training set
Top 0.1%
6.8%
4
Scientific Reports
3102 papers in training set
Top 15%
6.7%
5
PLOS Computational Biology
1633 papers in training set
Top 11%
3.3%
6
Human Genomics
21 papers in training set
Top 0.1%
2.0%
7
PLOS ONE
4510 papers in training set
Top 51%
1.8%
8
BMC Medicine
163 papers in training set
Top 3%
1.8%
9
BMC Bioinformatics
383 papers in training set
Top 4%
1.8%
10
BMC Medical Genomics
36 papers in training set
Top 0.4%
1.8%
11
Bioinformatics Advances
184 papers in training set
Top 3%
1.4%
12
npj Digital Medicine
97 papers in training set
Top 2%
1.3%
13
Circulation
66 papers in training set
Top 2%
1.3%
14
Frontiers in Physiology
93 papers in training set
Top 4%
1.3%
15
Nature Communications
4913 papers in training set
Top 57%
1.2%
16
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 2%
1.0%
17
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.9%
18
iScience
1063 papers in training set
Top 28%
0.8%
19
Nature Cardiovascular Research
28 papers in training set
Top 0.5%
0.8%
20
Journal of the American Heart Association
119 papers in training set
Top 4%
0.8%
21
Journal of Molecular and Cellular Cardiology
39 papers in training set
Top 0.9%
0.7%
22
Circulation Research
39 papers in training set
Top 1%
0.5%