Back

Comparing Machine and Deep Learning Models for Pediatric Anxiety Classification using Structured EHRs and Area-based Measures of Health Data

Lee, E. W.; Choo, S.; Maguire, D.; Shivanna, A.; Santel, D.; Bhatnagar, S.; Goethert, I.; Patterson, K.; Gholap, J.; Hanson, H. A.; Chandrashekar, M.; Ammerman, R. T.; Pestian, J. P.; Glauser, T.; Brokamp, C.; Strawn, J. R.; Kapadia, A.; Agasthya, G.

2025-05-02 health informatics
10.1101/2025.05.01.25326789 medRxiv
Show abstract

ObjectiveThis study investigates the performance of various machine learning (ML) and deep learning (DL) models to classify pediatric patients at risk of anxiety disorders using electronic health records (EHRs). By leveraging EHR data and including Area-based measures of health (ABMH) data, this approach aims to enable proactive care by monitoring potential anxiety onset comprehensively across various age groups. MethodsIn this study, we trained a series of ML and DL models to classify youth at risk of developing anxiety disorders. ML models (Logistic Regression, Decision Tree, Random Forest, K-Nearest Neighbors, XGBoost) and DL models (LSTM, GRU, RETAIN, Dipole) were trained using structured EHR data from 30-day periods before anxiety diagnoses. Two datasets per age group were used: one with structured EHR data only and another with incorporating both structured EHR and ABMH data. Model performance was assessed using accuracy, the AUROC, AUPRC, PPV, NPV, and F1 scores. ResultsThe ML models provided a solid performance baseline, with XGBoost showing strong baseline performance across age groups, with AUROC scores of 0.817 (structured EHR) and 0.816 (structured EHR + ABMH). Between DL models, RETAIN and Dipole performed the best. For example, RETAIN achieved AUROC scores of 0.851 (structured EHR) and 0.853 (structured + ABMH), while Dipole scored 0.853 and 0.857, respectively, for 8-year-olds. These results underscore the viability of both ML and DL models for the early detection of pediatric anxiety disorders. ConclusionThis study comprehensively investigated ML and DL models for diagnosing pediatric anxiety. We demonstrated that ML and DL models can effectively monitor probable anxiety onset within an EHR system and also with the ABMH data. We discovered that model performance varied with age, indicating the need for personalized model development per age group for effective clinical predictive analytics.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
International Journal of Medical Informatics
25 papers in training set
Top 0.1%
22.9%
2
Journal of Medical Internet Research
85 papers in training set
Top 0.2%
15.0%
3
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.3%
8.6%
4
JMIR Formative Research
32 papers in training set
Top 0.1%
8.3%
50% of probability mass above
5
JMIR Medical Informatics
17 papers in training set
Top 0.2%
4.9%
6
PLOS ONE
4510 papers in training set
Top 38%
3.6%
7
PLOS Digital Health
91 papers in training set
Top 0.7%
3.6%
8
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.6%
2.9%
9
Frontiers in Digital Health
20 papers in training set
Top 0.4%
2.1%
10
Scientific Reports
3102 papers in training set
Top 49%
2.1%
11
Journal of Biomedical Informatics
45 papers in training set
Top 0.7%
1.9%
12
JAMIA Open
37 papers in training set
Top 0.7%
1.9%
13
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
14
Frontiers in Public Health
140 papers in training set
Top 5%
1.7%
15
PeerJ
261 papers in training set
Top 7%
1.7%
16
BMC Medical Research Methodology
43 papers in training set
Top 0.8%
1.2%
17
International Journal of Environmental Research and Public Health
124 papers in training set
Top 6%
1.0%
18
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
0.9%
19
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.8%
20
Frontiers in Psychiatry
83 papers in training set
Top 3%
0.8%
21
Physiological Measurement
12 papers in training set
Top 0.4%
0.7%
22
Acta Neuropsychiatrica
12 papers in training set
Top 1%
0.7%
23
BMC Psychiatry
22 papers in training set
Top 0.9%
0.7%
24
BJPsych Open
25 papers in training set
Top 0.8%
0.7%
25
Database
51 papers in training set
Top 1%
0.7%
26
Informatics in Medicine Unlocked
21 papers in training set
Top 2%
0.5%
27
Journal of Personalized Medicine
28 papers in training set
Top 2%
0.5%
28
Healthcare
16 papers in training set
Top 3%
0.5%