Back

Artificial Intelligence for Cardiac Biomarkers After Myocardial Infarction: A Systematic Review and a Leakage-Aware Modeling Framework

Piorkowska, N. J.; Olejnik, A.; Madeyski, L.; Musz, A.; Kuliczkowski, W.; Mysiak, A.; Zyłka, A.; Bil-Lula, I.

2026-04-29 cardiovascular medicine
10.64898/2026.04.28.26351958 medRxiv
Show abstract

AimsTo systematically evaluate how artificial intelligence and machine-learning (AI/ML) methods are applied to cardiac biomarkers after myocardial infarction (MI), identify recurring methodological limitations, and operationalize a leakage-aware modelling workflow in a proof-of-concept post-MI dataset using a controlled proxy classification task. Methods and resultsA PRISMA 2020-compliant systematic review of studies published between 2015 and 2025 identified 120 eligible studies from 1,389 records. Most studies used multimodal inputs combining biomarkers with clinical or functional variables (109/120, 90.8%) and focused on prediction or prognostic modelling (89/120, 74.2%). Logistic or regularized regression (76/120, 63.3%) and Random Forest (69/120, 57.5%) were the most frequently used approaches. Internal validation predominated, whereas independent external validation was reported in only 44/120 studies (36.7%). Area under the receiver operating characteristic curve (ROC-AUC) was reported in 114/120 studies (95.0%), while calibration analyses and decision-curve analysis remained limited. Formal explainability methods were used inconsistently, and public code availability was uncommon. To translate these observations into a practical framework, we implemented a leakage-aware machine-learning workflow in a proof-of-concept dataset of 152 patients with MI and 117 variables. The analytical task was defined as a binary classification problem (STEMI vs NSTEMI), used intentionally as a methodological proxy rather than a clinically relevant prognostic endpoint. Three predefined feature-set variants were benchmarked using nested cross-validation. The FULL variant achieved near-perfect discrimination [ROC-AUC 0.9988 (95% CI 0.9925-1.000)], the CLINICAL variant showed modest performance [0.6025 (0.4463-0.7450)], and the BIOMARKERS variant yielded strong discrimination with low dimensionality [0.9300 (0.8537-0.9863)]. Permutation-based falsification testing reduced performance towards chance level, supporting the procedural integrity of the workflow. ConclusionsAI/ML research on cardiac biomarkers after MI is expanding rapidly but remains limited by heterogeneous methodology, insufficient external validation, incomplete interpretability, and weak reproducibility practices. A leakage-aware framework integrating explicit feature governance, nested validation, calibration assessment, robustness analyses, and falsification testing may improve the credibility and translational relevance of biomarker-based cardiovascular AI studies. However, the proof-of-concept case study is intended as a methodological demonstration and does not represent prognostic modelling of post-MI outcomes. Translational PerspectiveAI models using cardiac biomarkers after MI often report strong discrimination, but their clinical value is undermined by limited external validation, incomplete calibration assessment, and poor transparency. Our systematic review identifies these recurrent weaknesses, while the proof-of-concept case study demonstrates how a leakage-aware workflow can distinguish clinically plausible signals from structurally inflated performance under controlled analytical conditions. The use of a proxy classification task highlights methodological behavior rather than clinical prognosis, underscoring the need for future studies to validate such frameworks on clinically meaningful post-MI outcomes. Integrating explicit feature governance, nested validation, calibration, decision-analytic assessment, and falsification testing may help move biomarker-based cardiovascular AI from promising retrospective performance towards more reproducible and clinically trustworthy prediction models.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
European Heart Journal - Digital Health
15 papers in training set
Top 0.1%
28.0%
2
npj Digital Medicine
97 papers in training set
Top 0.6%
8.5%
3
Circulation
66 papers in training set
Top 0.5%
7.3%
4
BMC Medicine
163 papers in training set
Top 0.5%
6.5%
50% of probability mass above
5
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.4%
3.6%
6
Scientific Reports
3102 papers in training set
Top 35%
3.6%
7
The Lancet Digital Health
25 papers in training set
Top 0.2%
2.9%
8
Journal of the American Heart Association
119 papers in training set
Top 2%
2.6%
9
The American Journal of Cardiology
15 papers in training set
Top 0.7%
2.6%
10
PLOS ONE
4510 papers in training set
Top 47%
2.1%
11
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 1%
2.1%
12
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.3%
1.9%
13
Heart
10 papers in training set
Top 0.5%
1.7%
14
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.7%
15
Nature Communications
4913 papers in training set
Top 55%
1.3%
16
JACC: Clinical Electrophysiology
11 papers in training set
Top 0.2%
1.2%
17
eLife
5422 papers in training set
Top 51%
1.0%
18
Journal of the American College of Cardiology
12 papers in training set
Top 0.6%
0.9%
19
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.9%
20
BMJ Open
554 papers in training set
Top 12%
0.8%
21
Medical Image Analysis
33 papers in training set
Top 1%
0.8%
22
European Journal of Preventive Cardiology
13 papers in training set
Top 1.0%
0.7%
23
European Heart Journal
16 papers in training set
Top 0.9%
0.7%
24
Genome Medicine
154 papers in training set
Top 9%
0.7%
25
BMC Cardiovascular Disorders
14 papers in training set
Top 2%
0.7%
26
BMJ Health & Care Informatics
13 papers in training set
Top 1%
0.7%
27
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%
28
Journal of Clinical Medicine
91 papers in training set
Top 8%
0.5%
29
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.5%
30
EBioMedicine
39 papers in training set
Top 2%
0.5%