
ShortKit-ML: A Unified Multi-Perspective Framework for Detecting Shortcut Learning in Medical Imaging Embeddings

Cajas, S.; Marzullo, A.; Kapadia, S.; Santos, F.; Ocampo Osorio, F.; Kong, Q.; Quarta, A.; Kuo, P.-C.; Patel, M.; Rojas Sillery, R. I.; Celi, L. A.

2026-04-30 health informatics
10.64898/2026.04.29.26352053 medRxiv

Abstract: Shortcut learning poses a significant challenge in clinical artificial intelligence, as models may rely on spurious signals rather than clinically relevant features, leading to biased predictions and poor generalization. Existing detection methods are fragmented and lack systematic evaluation across datasets and model architectures. To address this issue, we propose ShortKit-ML, an open-source Python framework for unified shortcut analysis in embedding spaces. The framework integrates over 20 detection methods and six mitigation strategies within a modular pipeline, encompassing embedding analysis, fairness metrics, training dynamics, causal methods, explainability, and representation analysis. We evaluate the framework on chest X-ray datasets (CheXpert and MIMIC-CXR), synthetic benchmarks, and an out-of-domain dataset (CelebA). Experimental results demonstrate that multi-method auditing provides more stable and interpretable evidence than individual methods, while detector disagreement reveals meaningful representational differences. The proposed framework offers automated reporting and interactive visualization, and is available as a pip-installable package. The source code and documentation are publicly available at https://github.com/criticaldata/ShortKit-ML and https://criticaldata.github.io/ShortKit-ML/.

Matching journals

The top 6 journals account for just over 50% (about 51%) of the predicted probability mass.

1. Medical Image Analysis | 33 papers in training set | Top 0.1% | 18.9%
2. npj Digital Medicine | 97 papers in training set | Top 0.5% | 10.2%
3. Nature Machine Intelligence | 61 papers in training set | Top 0.4% | 6.4%
4. Patterns | 70 papers in training set | Top 0.1% | 6.4%
5. Nature Communications | 4913 papers in training set | Top 33% | 4.9%
6. Scientific Reports | 3102 papers in training set | Top 27% | 4.4%
50% of probability mass above
7. JCO Clinical Cancer Informatics | 18 papers in training set | Top 0.2% | 3.6%
8. Nature Biomedical Engineering | 42 papers in training set | Top 0.3% | 3.6%
9. PLOS ONE | 4510 papers in training set | Top 42% | 3.1%
10. Artificial Intelligence in Medicine | 15 papers in training set | Top 0.2% | 1.9%
11. Journal of Biomedical Informatics | 45 papers in training set | Top 0.7% | 1.9%
12. NeuroImage: Clinical | 132 papers in training set | Top 2% | 1.7%
13. IEEE Journal of Biomedical and Health Informatics | 34 papers in training set | Top 1% | 1.7%
14. BMC Medical Informatics and Decision Making | 39 papers in training set | Top 2% | 1.5%
15. Computers in Biology and Medicine | 120 papers in training set | Top 2% | 1.5%
16. Communications Medicine | 85 papers in training set | Top 0.4% | 1.3%
17. eBioMedicine | 130 papers in training set | Top 2% | 1.2%
18. Biology Methods and Protocols | 53 papers in training set | Top 2% | 1.1%
19. Human Brain Mapping | 295 papers in training set | Top 4% | 1.1%
20. Bioinformatics | 1061 papers in training set | Top 8% | 1.0%
21. PLOS Computational Biology | 1633 papers in training set | Top 21% | 1.0%
22. Communications Biology | 886 papers in training set | Top 17% | 1.0%
23. Computer Methods and Programs in Biomedicine | 27 papers in training set | Top 0.8% | 0.8%
24. IEEE Transactions on Medical Imaging | 18 papers in training set | Top 0.5% | 0.8%
25. Frontiers in Artificial Intelligence | 18 papers in training set | Top 0.7% | 0.8%
26. Science Advances | 1098 papers in training set | Top 30% | 0.8%
27. NeuroImage | 813 papers in training set | Top 6% | 0.8%
28. Science Translational Medicine | 111 papers in training set | Top 7% | 0.7%
29. Nature Medicine | 117 papers in training set | Top 6% | 0.7%
30. Advanced Science | 249 papers in training set | Top 22% | 0.7%
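
The 50% cutoff stated for the ranking can be checked by accumulating the listed probabilities in rank order. The snippet below is a minimal sketch, not the site's own method: the helper name `journals_to_reach` is hypothetical, and the values are the rounded percentages of the ten highest-ranked journals as listed here.

```python
from itertools import accumulate

# Rounded predicted probabilities (%) for ranks 1-10, copied from the ranking.
probs = [18.9, 10.2, 6.4, 6.4, 4.9, 4.4, 3.6, 3.6, 3.1, 1.9]


def journals_to_reach(probs, threshold=50.0):
    """Hypothetical helper: number of top-ranked entries whose cumulative
    probability first reaches `threshold` percent, or None if it never does."""
    for rank, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return rank
    return None  # threshold not reached with the entries provided


top_k = journals_to_reach(probs)
print(top_k)
```

With these rounded values the running total is about 46.8% after five journals and about 51.2% after six, which is consistent with the divider placed between ranks 6 and 7.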