Back

Pre-trained Vision Transformers for Seizure Prediction: A Reproducible Baseline with Event-Based Evaluation and Statistical Validation

Yin, Z.; Wang, S.; Moraros, J.

2026-03-14 bioinformatics
10.64898/2026.03.11.711230 bioRxiv
Show abstract

BackgroundScalp electroencephalography (EEG) based seizure prediction plays a critical role in improving the quality of life for patients with drug-resistant epilepsy, offering the potential for real-time warnings and timely interventions. Despite its clinical significance and decades of research, the field still lacks an open benchmark with reproducible baselines and deployment-oriented event-level evaluation. Most prior work relies on the small and outdated Childrens Hospital Boston (CHB-MIT) dataset and reports window-level metrics only, leaving the false-alarm burden of a real warning system underspecified. In seizure prediction, the cost of false alarm is significantly high since patients may receive painful electrical stimulation to suppress seizure. Hence, false alarms per hour (FA/h) and partial AUC (pAUC) are the most deployment-relevant metrics, reflecting alarm burden and discriminability in the low-false-alarm operating region that a usable warning system can realistically tolerate. However, few studies have systematically reported such metrics. In addition, vision transformers event-level performance under deployable FA/h constraints remains underexplored, and newer backbones such as MambaVision have yet to be evaluated under this setting. MethodsIn this work, we introduce a reproducible 5-fold benchmark derived from the Temple University Hospital EEG Seizure Corpus (TUSZ) dataset, and evaluate models using a pseudo-real-time event pipeline, reporting event-level sensitivity, false alarms per hour (FA/h) and partial AUC (pAUC). All models are compared to random predictors for statistical validation. We benchmark pre-trained vision transformers (SegFormer and MambaVision) under three EEG-to-image encoding methods, including a self-proposed Temporal-Patchify encoding for SegFormer. ResultsOur proposed Temporal-Patchify encoding method achieves state-of-the-art performance. We achieved 0.61 pAUC, which is 16.2% higher than the baseline Temporal-Tile SegFormer of Parani et al. The false-alarm burden (0.40{+/-}0.28 FA/h) is 44.4% lower than the Temporal-Tile SegFormer baseline while maintaining clinically usable sensitivity (60.7%{+/-}5.0%). We further perform statistical validation against a matched Poisson random predictor, confirming performance exceeds chance. Finally, we report end-to-end inference through-put up to 920 windows/s, confirming MambaVisions fastest inference speed, exceeding SegFormer by over 20%. ConclusionsThis work bridges the gap between seizure prediction algorithms and clinically usable seizure prediction systems in real-world settings. Our findings indicate that pre-trained vision transformers, when coupled with appropriate EEG encoding methods, can achieve robust performance in low-false-alarm operating regimes, which is critical for real-world deployment. This benchmark and evaluation framework may facilitate more clinically meaningful and reproducible seizure prediction research.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.1%
10.2%
2
Bioinformatics
1061 papers in training set
Top 3%
8.3%
3
Scientific Reports
3102 papers in training set
Top 12%
7.3%
4
PLOS ONE
4510 papers in training set
Top 25%
6.9%
5
Epilepsia
49 papers in training set
Top 0.3%
4.9%
6
PLOS Computational Biology
1633 papers in training set
Top 8%
4.4%
7
Frontiers in Neuroscience
223 papers in training set
Top 1%
3.7%
8
Journal of Neural Engineering
197 papers in training set
Top 0.7%
3.6%
9
NeuroImage
813 papers in training set
Top 3%
3.1%
50% of probability mass above
10
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.4%
11
Biology Methods and Protocols
53 papers in training set
Top 0.5%
2.4%
12
Brain Communications
147 papers in training set
Top 1%
2.1%
13
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.9%
14
BMC Bioinformatics
383 papers in training set
Top 4%
1.9%
15
BioData Mining
15 papers in training set
Top 0.3%
1.7%
16
Nature Communications
4913 papers in training set
Top 51%
1.7%
17
Epilepsy & Behavior
12 papers in training set
Top 0.2%
1.7%
18
npj Systems Biology and Applications
99 papers in training set
Top 1%
1.7%
19
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.5%
20
iScience
1063 papers in training set
Top 21%
1.2%
21
Briefings in Bioinformatics
326 papers in training set
Top 5%
1.1%
22
Nature Machine Intelligence
61 papers in training set
Top 3%
1.0%
23
Journal of Medical Internet Research
85 papers in training set
Top 4%
1.0%
24
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
25
Epilepsy Research
12 papers in training set
Top 0.3%
0.8%
26
Cell Reports Methods
141 papers in training set
Top 5%
0.8%
27
Expert Systems with Applications
11 papers in training set
Top 0.4%
0.8%
28
IEEE Access
31 papers in training set
Top 1.0%
0.8%
29
Frontiers in Physiology
93 papers in training set
Top 6%
0.7%
30
Bioengineering
24 papers in training set
Top 2%
0.7%