Back

Contrastive Multi-modal Training with Electrocardiography and Natural Language Echocardiography Reports for Zero-shot Prediction of Structural Heart Disease

WONG, W.-C.; LIU, C.; ELIAS, P.; HUGHES, J. W.; LEUNG, C.-Y.; QIAN, X.-Y.; LI, H.-L.; LAU, Y.-M.; TAO, C.-F.; CHOO, A.; YUNG, C.-H.; FONG, C.-H.; CHOI, W.-K.; CHENG, C.-K.; CHENG, L.-L.; LAU, L.-M.; RELWANI, R.; QIN, J.; YU, L.; LUI, H.-W.; CHIU, H.-O. A. C.; TSE, H.-F.; SIU, C.-W.; ARCUCCI, R.; HO, J. W.-K.; WONG, C.-K.

2025-09-18 cardiovascular medicine
10.1101/2025.09.16.25335870 medRxiv
Show abstract

BackgroundMachine learning models for predicting structural heart disease (SHD) from electrocardiography (ECG) traditionally required structured echocardiographic data. The potential of echocardiography (ECHO) natural language reports remains underused. We describe MERL-ECHO, a multimodal model using contrastive language-image pre-training (CLIP) that aligns ECG with ECHO natural language reports for zero-shot SHD prediction. MethodsWe conducted a multi-center retrospective study using paired ECG and ECHO natural language reports from Queen Mary Hospital and Tung Wah Hospital in Hong Kong. MERL-ECHO was trained on 45,016 pairs ECG-ECHO pairs. Performance was evaluated on an internal test set covering 10 SHDs and on an external test set of 5,442 ECGs with ECHO-derived labels for 6 SHDs from Columbia University Irving Medical Center, USA. ResultsThe cohort included 8,192 patients (mean age 73.7{+/-}16.5 years; 55.3% male). In the internal test set, MERL-ECHO achieved an average AUROC of 0.69, with strongest performance for left ventricular dilation (0.78), right ventricular systolic dysfunction (0.71), and tricuspid regurgitation (0.71). In the external test set, the average AUROC was 0.72, with highest performance for left ventricular systolic dysfunction (0.76) and aortic stenosis (0.76). Pre-training improved AUROC by up to 5%, performance scaled with larger datasets, and ResNet18 outperformed ViT-Tiny as ECG encoder by 7%. Saliency analysis revealed interpretable ECG features, including unexpected P-wave changes in aortic stenosis, suggesting novel disease markers. ConclusionsMERL-ECHO leverages ECHO natural language reports for multimodal training with ECG. This CLIP-based model enables accurate zero-shot prediction of SHDs and highlights interpretable ECG features with potential clinical relevance.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
European Heart Journal - Digital Health
15 papers in training set
Top 0.1%
25.7%
2
Circulation
66 papers in training set
Top 0.2%
14.3%
3
npj Digital Medicine
97 papers in training set
Top 0.8%
6.3%
4
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 0.7%
4.8%
50% of probability mass above
5
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.3%
4.8%
6
Medical Image Analysis
33 papers in training set
Top 0.3%
4.3%
7
Journal of the American Heart Association
119 papers in training set
Top 2%
3.9%
8
European Heart Journal
16 papers in training set
Top 0.2%
3.6%
9
Scientific Reports
3102 papers in training set
Top 37%
3.6%
10
Journal of the American College of Cardiology
12 papers in training set
Top 0.3%
1.7%
11
Nature Communications
4913 papers in training set
Top 52%
1.7%
12
Frontiers in Physiology
93 papers in training set
Top 3%
1.7%
13
The American Journal of Cardiology
15 papers in training set
Top 0.9%
1.7%
14
The Lancet Digital Health
25 papers in training set
Top 0.5%
1.5%
15
European Journal of Human Genetics
49 papers in training set
Top 0.9%
1.2%
16
Genome Medicine
154 papers in training set
Top 6%
1.1%
17
PLOS ONE
4510 papers in training set
Top 64%
0.9%
18
Nature Medicine
117 papers in training set
Top 4%
0.9%
19
EBioMedicine
39 papers in training set
Top 1.0%
0.8%
20
Nature Cardiovascular Research
28 papers in training set
Top 0.5%
0.8%
21
Heart
10 papers in training set
Top 0.9%
0.7%
22
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.7%
23
BMC Medicine
163 papers in training set
Top 8%
0.7%
24
Computers in Biology and Medicine
120 papers in training set
Top 6%
0.6%
25
JACC: Clinical Electrophysiology
11 papers in training set
Top 0.4%
0.6%