Fail-closed trust-gated synthetic augmentation governs tail risk under subject shift in EEG

Choi, D.; Yip, C.; Choi, A.; Park, J.

2026-01-28 bioinformatics

10.64898/2026.01.26.701638 bioRxiv

Synthetic augmentation can silently harm subject-disjoint EEG generalization. We propose trust-gated augmentation (TGA), a control layer that scores synthetic windows with a teacher trained on real data for label consistency and confidence; only samples above a confidence quantile q are eligible. A fail-closed selector injects synthetic data only if validation AUROC exceeds real-only by a margin, otherwise reverting to real-only. In PainMunich chronic-pain EEG (n = 189) at 5% subject scarcity, ungated augmentation harmed 56% of paired runs (ΔAUROC < -0.01), whereas TGA at q = 0.99 reduced harm to 24% with comparable mean AUROC. In BCI IV-2a motor imagery (n = 9) at 25% scarcity, strict gating improved AUROC (0.679 vs 0.627) and reduced harm (0.16 vs 0.44). A covariance-manifold audit showed synthetic windows were strongly off-manifold (mean distance ratio 2.39 × 10^4), motivating explicit governance.
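The gating and fail-closed logic described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names, the array layout, the choice to compute the quantile over all synthetic windows, and the default margin of 0.01 are all assumptions. Only the harm threshold (ΔAUROC < -0.01) and q = 0.99 come from the abstract.

```python
import numpy as np


def trust_gate(teacher_probs, synth_labels, q=0.99):
    """Boolean mask of synthetic windows eligible for injection.

    teacher_probs: (n, n_classes) class probabilities from a teacher
                   trained on real data (hypothetical input layout).
    synth_labels:  (n,) labels attached to the synthetic windows.
    q:             confidence quantile; 0.99 is the value in the abstract.
    """
    teacher_probs = np.asarray(teacher_probs, dtype=float)
    synth_labels = np.asarray(synth_labels)
    confidence = teacher_probs.max(axis=1)
    # Label consistency: the teacher's prediction agrees with the label.
    consistent = teacher_probs.argmax(axis=1) == synth_labels
    # Assumption: the quantile is taken over all synthetic windows.
    threshold = np.quantile(confidence, q)
    return consistent & (confidence >= threshold)


def fail_closed_select(auroc_real_only, auroc_augmented, margin=0.01):
    """Use augmentation only if validation AUROC beats real-only by more
    than `margin` (an assumed value); otherwise revert to real-only."""
    if auroc_augmented - auroc_real_only > margin:
        return "augmented"
    return "real_only"


def harm_rate(delta_auroc, harm_threshold=-0.01):
    """Fraction of paired runs with delta AUROC below the harm threshold."""
    delta = np.asarray(delta_auroc, dtype=float)
    return float(np.mean(delta < harm_threshold))
```

The selector defaults to the real-only model whenever the evidence for augmentation is missing or marginal, which is what makes it fail-closed: a noisy validation estimate can cost a small gain but should not silently admit harmful synthetic data.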

Matching journals

● Non-profit ◐ University press ○ Commercial

The top 6 journals account for 50% of the predicted probability mass.

Nature Communications

○ 4913 papers in training set

Nature Machine Intelligence

○ 61 papers in training set

Proceedings of the National Academy of Sciences

● 2130 papers in training set

○ 336 papers in training set

Advanced Science

○ 249 papers in training set

Nature Medicine

○ 117 papers in training set

50% of probability mass above

● 4510 papers in training set

○ 282 papers in training set

PLOS Computational Biology

● 1633 papers in training set

○ 575 papers in training set

Science Advances

● 1098 papers in training set

Communications Biology

○ 886 papers in training set

○ 167 papers in training set

Scientific Reports

○ 3102 papers in training set

◐ 1061 papers in training set

● 5422 papers in training set

Nature Computational Science

○ 50 papers in training set

npj Digital Medicine

○ 97 papers in training set

Nature Biomedical Engineering

○ 42 papers in training set

○ 70 papers in training set

Nature Biotechnology

○ 147 papers in training set

● 429 papers in training set

Nature Neuroscience

○ 216 papers in training set

Nature Genetics

○ 240 papers in training set

○ 813 papers in training set

Physical Review Research

● 46 papers in training set

Genome Research

● 409 papers in training set

Cancer Research

● 116 papers in training set

The American Journal of Human Genetics

○ 206 papers in training set

npj Systems Biology and Applications

○ 99 papers in training set