Back

Adversarial Robustness of Capsule Networks for Medical Image Classification

Srinivasan, A.; Sritharan, D. V.; Chadha, S.; Fu, D.; Hossain, J. O.; Breuer, G. A.; Aneja, S.

2026-03-10 health informatics
10.64898/2026.03.09.26347900 medRxiv
Show abstract

PurposeDeep learning models are increasingly being used in medical diagnostics, but their vulnerability to adversarial perturbations raises concerns about their reliability in clinical applications. Capsule networks (CapsNets) are a promising architecture for medical imaging tasks, given their ability to model spatial relationships and train with smaller amounts of data. Although previous studies have focused on adversarial training approaches to improve robustness, exploring alternative architectures is an underexplored direction for combating poor adversarial stability. Prior work has suggested that CapsNets may exhibit improved robustness to adversarial perturbations compared to convolutional neural networks (CNNs), but performance on adversarial images has not been studied systematically in clinical environments. We evaluated the robustness of CapsNets compared to CNNs and vision transformers (ViTs) across multiple medical image classification tasks. MethodsWe trained two CNNs (ResNet-18 and ResNet-50), one ViT (MedViT), and two CapsNets (DR-CapsNet and BP-CapsNet) on four distinct medical imaging datasets (PneumoniaMNIST, BreastMNIST, NoduleMNIST3D, and BloodMNIST) and one natural image dataset (MNIST). Models were evaluated on adversarial examples generated by projected gradient descent and fast gradient sign method across a range of perturbation bounds. Interpretability experiments, including latent space and Gradient-weighted Class Activation Mapping (Grad-CAM) analyses, were conducted to better understand model stability on adversarial inputs. ResultsCapsNets demonstrated superior robustness under adversarial perturbations compared to CNNs and ViTs across all medical imaging datasets and the natural image dataset. Latent space and Grad-CAM visualizations revealed that CapsNets maintained more consistent embedding representations and attention maps after adversarial perturbations compared to CNNs and ViTs, suggesting that advantages in CapsNet robustness are supported, at least in part, by more stable feature encodings. Bayes-Pearson routing further improved robustness over standard dynamic routing in CapsNets without compromising baseline performance, suggesting a potential architectural improvement. ConclusionCapsNets exhibit intrinsic advantages in adversarial robustness over CNN- and ViT-based models on medical imaging tasks, suggesting they are a reliable alternative for medical image classification. These findings support the use of CapsNets in clinical applications where model reliability is critical.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Biology Methods and Protocols
53 papers in training set
Top 0.1%
14.2%
2
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.1%
6.7%
3
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.4%
6.7%
4
Scientific Reports
3102 papers in training set
Top 15%
6.7%
5
PLOS Digital Health
91 papers in training set
Top 0.5%
4.8%
6
Medical Image Analysis
33 papers in training set
Top 0.3%
3.9%
7
PLOS ONE
4510 papers in training set
Top 36%
3.9%
8
Computers in Biology and Medicine
120 papers in training set
Top 0.9%
3.6%
50% of probability mass above
9
npj Digital Medicine
97 papers in training set
Top 1%
3.6%
10
Artificial Intelligence in Medicine
15 papers in training set
Top 0.2%
2.6%
11
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.2%
2.1%
12
Informatics in Medicine Unlocked
21 papers in training set
Top 0.4%
1.9%
13
Patterns
70 papers in training set
Top 0.7%
1.9%
14
Bioinformatics
1061 papers in training set
Top 7%
1.9%
15
Diagnostics
48 papers in training set
Top 0.9%
1.8%
16
PLOS Computational Biology
1633 papers in training set
Top 17%
1.6%
17
Expert Systems with Applications
11 papers in training set
Top 0.1%
1.6%
18
Human Brain Mapping
295 papers in training set
Top 3%
1.5%
19
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.5%
20
JMIR Medical Informatics
17 papers in training set
Top 1%
1.2%
21
IEEE Access
31 papers in training set
Top 0.6%
1.2%
22
Journal of Medical Imaging
11 papers in training set
Top 0.2%
1.1%
23
BMJ Open
554 papers in training set
Top 11%
0.9%
24
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.9%
25
Journal of Pathology Informatics
13 papers in training set
Top 0.3%
0.9%
26
Journal of Medical Internet Research
85 papers in training set
Top 4%
0.8%
27
JMIRx Med
31 papers in training set
Top 2%
0.8%
28
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.8%
29
Journal of Biomedical Informatics
45 papers in training set
Top 2%
0.7%
30
NeuroImage
813 papers in training set
Top 6%
0.7%