Back

Temporally Continuous Automated Sleep-Wake Classification Using Deep Learning

Somaskandhan, P.; Korkalainen, H.; Leppänen, T.; Töyräs, J.; Melehan, K.; Ruehland, W.; Sands, S. A.; Mann, D. L.; Wilson, D. L.; Terrill, P. I.

2025-12-04 health informatics
10.64898/2025.12.03.25341129 medRxiv
Show abstract

IntroductionSegmenting sleep into fixed 30-second epochs remains central to current sleep scoring practice, yet it imposes rigid boundaries that may not accurately reflect the true temporal sleep dynamics. We aimed to develop a deep learning-based, high-temporal-resolution sleep-wake classifier leveraging temporally continuous manual reference scoring without fixed epoch boundaries and transfer learning techniques to facilitate progress toward a more physiologically consistent sleep assessment. MethodsThree independent datasets were utilized, of which two included sleep-wake scoring manually conducted in a temporally continuous manner. A U-Net based model was initially trained on a large dataset scored using 30-second epochs, with post hoc scoring modifications (n=2034). It was then fine-tuned via transfer learning using a subset of one of the datasets with temporally continuous scoring (n=39) and validated on both its holdout portion (n=40) and the other independent temporally continuous scoring dataset (n=20). Wakefulness and arousals were consolidated, acknowledging their shared physiological characteristics. Prediction confidence estimates were also generated. ResultsThe model achieved overall concordance of 88.96% ({kappa}=0.78) and 88.23% ({kappa}=0.76) in the holdout and second independent evaluation dataset, respectively, with temporally continuous scoring. Correlation between 1-second automatic predictions and temporally continuous manual scoring was r=0.93 (p<0.001) for total sleep time and r=0.67 (p<0.001) for sleep-to-wake transition index. ConclusionsThese findings support the utility of our model in addressing key limitations of 30-second epoch-based scoring and progressing toward more physiologically consistent sleep-wake assessment by providing a practical basis for subsequent analyses. Misclassifications generally showed lower confidences, indicating additional value for targeted review. Statement of SignificanceConventional sleep scoring remains constrained by fixed 30-second epochs, which may fail to capture the true temporal dynamics of the underlying changes between sleep and wakefulness. In this study, we used polysomnography data manually scored on a temporally continuous basis as the gold standard to develop and validate a deep learning model capable of classifying sleep and wakefulness-like states (consolidating wakefulness and arousal) at high temporal resolution without fixed 30-second epochs. The model demonstrated strong agreement with the gold standard, and as such, lays a practical foundation for deriving improved physiologically meaningful biomarkers of sleep fragmentation and continuity, with potential diagnostic and prognostic value and broad applicability toward a more precise and physiologically consistent sleep assessment.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Journal of Sleep Research
31 papers in training set
Top 0.1%
26.4%
2
Journal of Medical Internet Research
85 papers in training set
Top 0.4%
10.3%
3
Scientific Reports
3102 papers in training set
Top 8%
9.3%
4
Annals of Neurology
57 papers in training set
Top 0.3%
4.9%
50% of probability mass above
5
PLOS ONE
4510 papers in training set
Top 30%
4.9%
6
Sleep
26 papers in training set
Top 0.2%
4.0%
7
Sleep Medicine
18 papers in training set
Top 0.2%
3.7%
8
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.4%
9
JMIR mHealth and uHealth
10 papers in training set
Top 0.1%
2.1%
10
npj Digital Medicine
97 papers in training set
Top 2%
2.1%
11
SLEEP
28 papers in training set
Top 0.2%
1.9%
12
Frontiers in Neurology
91 papers in training set
Top 3%
1.7%
13
eClinicalMedicine
55 papers in training set
Top 0.6%
1.7%
14
Frontiers in Digital Health
20 papers in training set
Top 0.6%
1.7%
15
BMJ Open
554 papers in training set
Top 10%
1.4%
16
BMC Medicine
163 papers in training set
Top 4%
1.4%
17
Journal of Biological Rhythms
21 papers in training set
Top 0.2%
1.2%
18
NeuroImage
813 papers in training set
Top 5%
1.1%
19
eBioMedicine
130 papers in training set
Top 3%
0.8%
20
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.8%
21
Frontiers in Physiology
93 papers in training set
Top 6%
0.8%
22
Experimental Neurology
57 papers in training set
Top 2%
0.7%
23
Critical Care
14 papers in training set
Top 0.6%
0.7%
24
Psychiatry Research
35 papers in training set
Top 2%
0.7%
25
Sensors
39 papers in training set
Top 2%
0.7%
26
Physiological Measurement
12 papers in training set
Top 0.5%
0.5%
27
Movement Disorders
62 papers in training set
Top 1%
0.5%
28
Life Sciences
25 papers in training set
Top 2%
0.5%
29
Communications Biology
886 papers in training set
Top 32%
0.5%