Back

Development and internal validation of a prediction model for sleep apnea syndrome treated with continuous positive airway pressure based on claims and health checkup data linked to personal health records

Muraki, T.; Ueda, T.; Hasegawa, C.; Usui, H.; Koshimizu, H.; Ariyada, K.; Kusajima, K.; Tomita, Y.; Yanagisawa, M.; Iwagami, M.

2026-02-11 epidemiology
10.64898/2026.02.08.26345272 medRxiv
Show abstract

PurposeTo develop and validate a prediction model for sleep apnea syndrome (SAS) treated with continuous positive airway pressure (CPAP) in the general population. MethodsUsing claims and health checkup data held by JMDC Inc., linked to personal health records (Pep Up), we developed and internally validated a prediction model for SAS treated with CPAP, defined as a diagnosis of SAS and reimbursement records of CPAP. Every three months from January 1, 2022 to July 1, 2024 (i.e., 11 timepoints), we identified eligible individuals with available data both 1 year before and 1 year after that timepoint to define the presence/absence of SAS treated with CPAP, as well as 279 predictor variables. We developed a LightGBM model for the training and tuning datasets and evaluated its performance on the validation dataset. ResultsAmong 18,692,873 observations (mean age 44.8{+/-}11.3 years, women 37.5%) obtained from 1,858,566 people, 300,868 (1.6%) had SAS treated with CPAP. The area under the receiver operating characteristic curve was 0.898 (95% confidence interval 0.895-0.901). The positive predictive values among people with the top 1% and 10% prediction scores were 28.3% and 10.3%, respectively. According to the SHapley Additive exPlanations plot, male sex was the most important predictor, followed by age, body mass index, and waist circumference. We also demonstrated that personal health records significantly improved the predictive performance. ConclusionWe developed a prediction model to identify people at high risk of SAS and encourage them to undergo polysomnography or related tests.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 8%
19.6%
2
Scientific Reports
3102 papers in training set
Top 5%
10.6%
3
Journal of Sleep Research
31 papers in training set
Top 0.1%
4.1%
4
BMJ Open
554 papers in training set
Top 5%
3.8%
5
JMIR mHealth and uHealth
10 papers in training set
Top 0.1%
3.8%
6
Journal of Medical Internet Research
85 papers in training set
Top 1%
3.8%
7
Journal of Clinical Medicine
91 papers in training set
Top 3%
1.8%
8
BMC Medical Research Methodology
43 papers in training set
Top 0.7%
1.6%
9
Frontiers in Public Health
140 papers in training set
Top 5%
1.4%
50% of probability mass above
10
JMIR Formative Research
32 papers in training set
Top 1%
1.3%
11
Annals of Epidemiology
19 papers in training set
Top 0.3%
1.3%
12
Swiss Medical Weekly
12 papers in training set
Top 0.2%
1.3%
13
Sleep
26 papers in training set
Top 0.4%
1.3%
14
Clinical Infectious Diseases
231 papers in training set
Top 4%
1.2%
15
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.9%
16
Genetic Epidemiology
46 papers in training set
Top 0.7%
0.9%
17
BMJ Open Respiratory Research
32 papers in training set
Top 0.5%
0.9%
18
Biology Methods and Protocols
53 papers in training set
Top 2%
0.8%
19
Frontiers in Neurology
91 papers in training set
Top 5%
0.8%
20
International Journal of Epidemiology
74 papers in training set
Top 2%
0.8%
21
Sleep Medicine
18 papers in training set
Top 0.4%
0.8%
22
Frontiers in Medicine
113 papers in training set
Top 6%
0.8%
23
International Journal of Medical Informatics
25 papers in training set
Top 2%
0.8%
24
Wellcome Open Research
57 papers in training set
Top 2%
0.8%
25
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
0.8%
26
European Respiratory Journal
54 papers in training set
Top 2%
0.8%
27
npj Digital Medicine
97 papers in training set
Top 3%
0.8%
28
Pharmacoepidemiology and Drug Safety
13 papers in training set
Top 0.4%
0.8%
29
Bioinformatics
1061 papers in training set
Top 9%
0.8%
30
Frontiers in Digital Health
20 papers in training set
Top 1%
0.8%