Back

Intraplaque haemorrhage quantification and molecular characterisation using attention based multiple instance learning

Cisternino, F.; Song, Y.; Peters, T. S.; Westerman, R.; de Borst, G. J.; Diez Benavente, E.; van den Dungen, N. A. M.; van der Kraak, P. H.; De Kleijn, D.; Mekke, J. M.; Mokry, M.; Pasterkamp, G.; den Ruijter, H. M.; Velema, E.; Miller, C.; Glastonbury, C. A.; van der Laan, S. W.

2025-03-06 cardiovascular medicine
10.1101/2025.03.04.25323316 medRxiv
Show abstract

Intraplaque haemorrhage (IPH) represents a critical feature of plaque vulnerability as it is robustly associated with adverse cardiovascular events, including stroke and myocardial infarction. How IPH drives plaque instability is unknown. However, its identification and quantification in atherosclerotic plaques is currently performed manually, with high inter-observer variability, limiting its accurate assessment in large cohorts. Leveraging the Athero-Express biobank, an ongoing study comprising a comprehensive dataset of histological, transcriptional, and clinical information from 2,595 carotid endarterectomy patients, we developed an attention-based additive multiple instance learning (MIL) framework to automate the detection and quantification of IPH across whole-slide images of nine distinct histological stains. We demonstrate that routinely available Haematoxylin and Eosin (H&E) staining outperformed all other plaque relevant Immunohistochemistry (IHC) stains tested (AUROC = 0.86), underscoring its utility in quantifying IPH. When combining stains through ensemble models, we see that H&E + CD68 (a macrophage marker) as well as H&E + Verhoeff-Van Gieson elastic fibers staining (EVG) leads to a substantial improvement (AUROC = 0.92). Using our model, we could derive IPH area from the MIL-derived patch-level attention scores, enabling not only classification but precise localisation and quantification of IPH area in each plaque, facilitating downstream analyses of its association and cellular composition with clinical outcomes. By doing so, we demonstrate that IPH presence and area are the most significant predictors of both preoperative symptom presentation and major adverse cardiovascular events (MACE), outperforming manual scoring methods. Automating IPH detection also allowed us to characterise IPH on a molecular level at scale. Pairing IPH measurements with single-cell transcriptomic analyses revealed key molecular pathways involved in IPH, including TNF- signalling, extracellular matrix remodelling and the presence of foam cells. This study represents the largest effort in the cardiovascular field to integrate digital pathology, machine learning, and molecular data to predict and characterize IPH which leads to better understanding how it drives symptoms and MACE. Our model provides a scalable, interpretable, and reproducible method for plaque phenotyping, enabling the derivation of plaque phenotypes for predictive modelling of MACE outcomes.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Medical Image Analysis
33 papers in training set
Top 0.1%
17.4%
2
Nature Cardiovascular Research
28 papers in training set
Top 0.1%
12.4%
3
Nature Communications
4913 papers in training set
Top 16%
10.4%
4
Circulation
66 papers in training set
Top 0.6%
6.3%
5
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.4%
3.9%
50% of probability mass above
6
Scientific Reports
3102 papers in training set
Top 31%
3.9%
7
npj Digital Medicine
97 papers in training set
Top 1%
3.9%
8
European Heart Journal - Digital Health
15 papers in training set
Top 0.2%
3.6%
9
Atherosclerosis
29 papers in training set
Top 0.6%
2.1%
10
EMBO Molecular Medicine
85 papers in training set
Top 1%
2.1%
11
Nature Medicine
117 papers in training set
Top 2%
1.9%
12
Arteriosclerosis, Thrombosis, and Vascular Biology
65 papers in training set
Top 1%
1.9%
13
eLife
5422 papers in training set
Top 38%
1.9%
14
iScience
1063 papers in training set
Top 12%
1.9%
15
Nature Genetics
240 papers in training set
Top 4%
1.8%
16
Cell Genomics
162 papers in training set
Top 3%
1.7%
17
Journal of the American Heart Association
119 papers in training set
Top 3%
1.5%
18
Biomaterials
78 papers in training set
Top 0.8%
1.1%
19
Cardiovascular Research
33 papers in training set
Top 0.8%
0.9%
20
Advanced Science
249 papers in training set
Top 16%
0.9%
21
Genome Medicine
154 papers in training set
Top 7%
0.9%
22
Cell Reports
1338 papers in training set
Top 32%
0.8%
23
PLOS Computational Biology
1633 papers in training set
Top 23%
0.8%
24
Communications Biology
886 papers in training set
Top 24%
0.7%
25
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
26
Science Translational Medicine
111 papers in training set
Top 7%
0.6%
27
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.6%
28
Nature Biomedical Engineering
42 papers in training set
Top 2%
0.6%
29
EBioMedicine
39 papers in training set
Top 2%
0.6%