Back

OCR-Mediated Modality Dominance in Vision-Language Models: Implications for Radiology AI Trustworthiness

2026-02-24 health informatics Title + abstract only
View on medRxiv
Show abstract

1.BackgroundVision-language models (VLMs) are increasingly proposed for radiologic decision support, yet the security implications of deploying general-domain, OCR-capable models in diagnostic workflows remain poorly characterized. When image-embedded text is not treated as untrusted input, the visual channel becomes vulnerable to adversarial manipulation through OCR-readable overlays. MethodsNine commercial VLMs, none intended or validated for clinical diagnosis, were evaluated on 600 brain MR...

Predicted journal destinations