Wearable sleep staging using photoplethysmography and accelerometry across sleep apnea severity: a focus on very severe sleep apnea

Ogaki, S.; Kaneda, M.; Nohara, T.; Fujita, S.; Osako, N.; Yagi, T.; Tomita, Y.; Ogata, T.

2026-04-13 health informatics
10.64898/2026.04.09.26350266 medRxiv

Study Objectives: To evaluate wearable sleep staging across sleep apnea severity, including very severe sleep apnea defined as an apnea-hypopnea index (AHI) ≥ 50 events/h, and to assess how training-set composition affects performance in this subgroup.

Methods: We analyzed 552 overnight recordings, 318 from the Sleep Lab Dataset and 234 from the Hospital Dataset. In the Hospital Dataset, 26.5% had very severe sleep apnea. We developed a deep learning model for sleep staging using RR intervals from wrist-worn photoplethysmography and three-axis accelerometry. Baseline performance was assessed by cross-validation under 5-stage and 4-stage staging. We examined night-level associations with AHI severity. We also compared the baseline model with an ablation model trained on the same number of recordings but with more Sleep Lab Dataset and lower-AHI Hospital Dataset recordings, evaluating both models in the very severe subgroup.

Results: In 5-stage classification, Cohen's kappa was 0.586 in the Sleep Lab Dataset and 0.446 in the Hospital Dataset. Under 4-stage staging, the gap narrowed, with kappa values of 0.632 and 0.525, respectively. In the Hospital Dataset, performance declined with increasing AHI severity. Among 62 recordings with very severe sleep apnea, reducing high-AHI representation in training lowered kappa from 0.365 to 0.303.

Conclusions: Wearable sleep staging performance declined across greater sleep apnea severity in this clinical cohort. Clinical utility may benefit from training data that better represent the target severity spectrum and from selecting staging granularity to match the intended use case.

Statement of Significance: Repeated laboratory polysomnography is impractical for long-term sleep apnea management. Wearable sleep staging could support scalable monitoring, yet its reliability in clinically severe sleep apnea has remained unclear. This study developed and evaluated a wearable sleep staging approach in both sleep-laboratory and hospital cohorts. The hospital cohort included many severe and very severe cases. Performance was lower in the hospital cohort and declined with greater sleep apnea severity. A coarser staging scheme reduced the gap between cohorts, and models trained without representative very severe cases performed worse in this target population. These findings highlight the value of severity-aware model development and motivate future multi-night home validation with reliability cues.
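
The abstract reports agreement as Cohen's kappa under 5-stage and 4-stage staging. As a minimal sketch of how that metric responds to staging granularity, the Python below computes kappa from epoch-by-epoch labels and collapses five stages into four. The label names, the N1 + N2 to "Light" merge, and the toy label sequences are assumptions for illustration only; the abstract does not state the authors' stage mapping or evaluation code.

```python
import math
from collections import Counter

# Assumed 5-stage labels and an assumed 4-stage merge (N1 and N2 become "Light").
# The abstract does not specify which stages were merged.
FIVE_TO_FOUR = {"W": "W", "N1": "Light", "N2": "Light", "N3": "Deep", "REM": "REM"}


def to_four_stage(labels):
    """Map a sequence of 5-stage labels onto the assumed 4-stage scheme."""
    return [FIVE_TO_FOUR[stage] for stage in labels]


def cohen_kappa(reference, predicted):
    """Cohen's kappa: (p_o - p_e) / (1 - p_e) for two equal-length label sequences."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    n = len(reference)
    # Observed agreement: fraction of epochs where both labels match.
    observed = sum(r == p for r, p in zip(reference, predicted)) / n
    # Chance agreement: product of the marginal label frequencies, summed over labels.
    ref_counts = Counter(reference)
    pred_counts = Counter(predicted)
    expected = sum(ref_counts[c] * pred_counts[c] for c in ref_counts) / (n * n)
    if expected == 1.0:
        return math.nan  # kappa is undefined when both sequences use a single label
    return (observed - expected) / (1.0 - expected)


# Toy epochs (hypothetical, not study data): the only errors are N1/N2 confusions.
reference = ["W", "N1", "N2", "N2", "N3", "REM", "N2", "W"]
predicted = ["W", "N2", "N2", "N2", "N3", "REM", "N1", "W"]

kappa_5 = cohen_kappa(reference, predicted)
kappa_4 = cohen_kappa(to_four_stage(reference), to_four_stage(predicted))
```

In this toy case the N1/N2 confusions disappear once the two stages are merged, so the 4-stage kappa is higher than the 5-stage kappa. That is one mechanism by which a coarser scheme can raise agreement; it does not show that this mechanism explains the reported numbers.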

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

| Rank | Journal | Papers in training set | Percentile | Predicted probability |
|---|---|---|---|---|
| 1 | Journal of Sleep Research | 31 | Top 0.1% | 12.4% |
| 2 | Journal of Medical Internet Research | 85 | Top 0.4% | 10.3% |
| 3 | Scientific Reports | 3102 | Top 7% | 10.0% |
| 4 | JMIR mHealth and uHealth | 10 | Top 0.1% | 6.3% |
| 5 | Frontiers in Digital Health | 20 | Top 0.2% | 3.9% |
| 6 | PLOS ONE | 4510 | Top 36% | 3.9% |
| 7 | npj Digital Medicine | 97 | Top 1% | 3.9% |
| | 50% of probability mass above | | | |
| 8 | Annals of Neurology | 57 | Top 0.6% | 3.6% |
| 9 | Physiological Measurement | 12 | Top 0.1% | 3.6% |
| 10 | BMJ Open | 554 | Top 6% | 3.6% |
| 11 | Frontiers in Neurology | 91 | Top 2% | 3.0% |
| 12 | Frontiers in Physiology | 93 | Top 2% | 1.9% |
| 13 | Journal of Biological Rhythms | 21 | Top 0.1% | 1.9% |
| 14 | eClinicalMedicine | 55 | Top 0.6% | 1.7% |
| 15 | Critical Care | 14 | Top 0.4% | 1.3% |
| 16 | Sensors | 39 | Top 1% | 1.3% |
| 17 | Sleep | 26 | Top 0.4% | 1.3% |
| 18 | Neurophotonics | 37 | Top 0.4% | 1.3% |
| 19 | Sleep Medicine | 18 | Top 0.3% | 1.3% |
| 20 | JAMIA Open | 37 | Top 1% | 1.2% |
| 21 | JMIR Formative Research | 32 | Top 1% | 1.2% |
| 22 | IEEE Journal of Biomedical and Health Informatics | 34 | Top 1% | 1.2% |
| 23 | Journal of NeuroEngineering and Rehabilitation | 28 | Top 0.8% | 0.9% |
| 24 | Computers in Biology and Medicine | 120 | Top 4% | 0.9% |
| 25 | Wellcome Open Research | 57 | Top 2% | 0.8% |
| 26 | eBioMedicine | 130 | Top 4% | 0.7% |
| 27 | NeuroImage | 813 | Top 6% | 0.7% |
| 28 | PLOS Digital Health | 91 | Top 3% | 0.6% |
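
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked with a running sum over the listed percentages. The sketch below is illustrative only: the function name `journals_for_mass` is hypothetical, the input is assumed to be sorted in descending order, and the displayed percentages are rounded, so the cumulative total is approximate.

```python
def journals_for_mass(probabilities, threshold=50.0):
    """Return (rank, cumulative_percent) at which the running sum first reaches threshold.

    Assumes `probabilities` are percentages sorted from highest to lowest.
    Returns None if the threshold is never reached.
    """
    total = 0.0
    for rank, percent in enumerate(probabilities, start=1):
        total += percent
        if total >= threshold:
            return rank, total
    return None


# Percentages for ranks 1 to 8, copied from the list above.
listed = [12.4, 10.3, 10.0, 6.3, 3.9, 3.9, 3.9, 3.6]

result = journals_for_mass(listed)
```

With these values the running sum is about 46.8 after six journals and about 50.7 after seven, which matches the position of the 50% marker in the list.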