Back

Identification of candidate biomarkers and molecular networks associated with Pulmonary Arterial Hypertension using machine learning and plasma multi-Omics analysis

Kheyfets, V. O.; Sweatt, A. J.; Zhang, H.; Nemkov, T.; Heerdt, P.; Dzieciatkowska, M.; Stephenson, D.; LaCroix, I. S.; D'Alessandro, A.; Oldham, W. M.; Hansen, K.; Zamanian, R. T.; Stenmark, K. R.

2024-12-20 cardiovascular medicine
10.1101/2024.12.17.24319117 medRxiv
Show abstract

BackgroundPulmonary arterial hypertension (PAH) is a rare but severe and life- threatening condition that primarily affects the pulmonary blood vessels and the right ventricle of the heart. The limited availability of human tissue for research--most of which represents only end-stage disease--has led to a reliance on preclinical animal models. However, these models often fail to capture the heterogeneity and complexity of the human condition. Analyzing the molecular signatures in patient plasma provides a unique opportunity to gain insights into PAH pathobiology, explore disease heterogeneity absent in animal models, and identify potential therapeutic targets. ObjectiveThis study aims to characterize the circulating peptides, metabolites, and lipids most relevant to PAH by leveraging unbiased mass spectrometry and advanced computational tools. Building on prior research that identified individual circulating factors, this work seeks to integrate these molecular layers to better understand their interactions and collective contribution to PAH pathobiology. MethodsPeripheral blood samples were collected from 402 patients with PAH and 76 healthy individuals. Various types of molecules in the blood - peptides, metabolites, and lipids- were measured. Statistical and machine learning methods were used to identify differences between PAH patients and healthy individuals, and further to understand how these molecules might interact with each other. A survival model was also trained to examine the association between the blood molecular signature and patient outcomes. ResultsDifferential abundance analysis revealed 832 peptides (from 291 proteins), 45 metabolites, and 222 lipids significantly altered in PAH compared to controls. Machine learning- based feature selection identified 11 key molecules, including 2-Hydroxyglutarate, that together achieved a classification accuracy of 98.6% for PAH in a multivariate model tested on a withheld cohort. Latent network discovery uncovered 7 distinct networks, highlighting interacting molecules from pathways--such as hypoxia, glycolysis, fatty acid metabolism, and complement activation--that we and others have previously linked to vascular lesions in PAH patients. A survival model incorporating 155 molecular features predicted outcomes in PAH patients with a c-index of 0.762, independent of traditional clinical parameters. This model stratified patients into risk categories consistent with established markers of cardiac function, exercise tolerance, and the REVEAL 2.0 risk score. ConclusionThis study underscores the utility of integrated omics in unraveling PAH pathobiology in human subjects. Our findings highlight the central role of hypoxia signaling pathways interacting with disrupted fatty acid metabolism, complement activation, inflammation, and mitochondrial dysfunction. These interactions, revealed through latent network analysis, emphasize the metabolic and immune dysregulation underlying PAH. Furthermore, many of the molecules identified in the circulation were consistent with pathways enriched in pulmonary vascular lesions, reinforcing their biological relevance. Circulating plasma molecules from these networks demonstrated strong prognostic capabilities, comparable to current clinical risk scores, offering insights into disease progression and potential for future clinical application.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
American Journal of Respiratory Cell and Molecular Biology
38 papers in training set
Top 0.1%
22.2%
2
Journal of the American Heart Association
119 papers in training set
Top 0.6%
10.3%
3
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.2%
8.3%
4
American Journal of Respiratory and Critical Care Medicine
39 papers in training set
Top 0.1%
7.1%
5
Arteriosclerosis, Thrombosis, and Vascular Biology
65 papers in training set
Top 0.5%
4.8%
50% of probability mass above
6
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 0.9%
3.9%
7
European Respiratory Journal
54 papers in training set
Top 0.5%
3.2%
8
The Journal of Heart and Lung Transplantation
10 papers in training set
Top 0.1%
3.0%
9
EBioMedicine
39 papers in training set
Top 0.2%
2.3%
10
JACC: Basic to Translational Science
15 papers in training set
Top 0.1%
2.0%
11
Hypertension
32 papers in training set
Top 0.3%
2.0%
12
European Heart Journal
16 papers in training set
Top 0.3%
2.0%
13
Respiratory Research
19 papers in training set
Top 0.2%
2.0%
14
Circulation: Heart Failure
14 papers in training set
Top 0.3%
1.7%
15
BMC Cardiovascular Disorders
14 papers in training set
Top 1%
1.7%
16
Scientific Reports
3102 papers in training set
Top 59%
1.7%
17
BMC Medical Genomics
36 papers in training set
Top 0.7%
1.3%
18
PLOS ONE
4510 papers in training set
Top 59%
1.3%
19
Cardiovascular Research
33 papers in training set
Top 0.7%
1.2%
20
The American Journal of Cardiology
15 papers in training set
Top 1%
1.1%
21
JCI Insight
241 papers in training set
Top 6%
0.9%
22
Frontiers in Physiology
93 papers in training set
Top 6%
0.7%
23
British Journal of Haematology
15 papers in training set
Top 0.5%
0.7%
24
Circulation
66 papers in training set
Top 3%
0.7%
25
Journal of the American College of Cardiology
12 papers in training set
Top 0.8%
0.6%
26
Medicine
30 papers in training set
Top 3%
0.6%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.6%
28
The Journal of Infectious Diseases
182 papers in training set
Top 6%
0.6%