Back

Early prediction and fairness evaluation of perinatal depression using EHR: A study of 18,000+ Pregnancies

Sarwal, V.; Pimplaskar, A.; Richards, M.; Sobowale, K.; Chiang, J. N.; Loohuis, L. O.

2025-07-03 psychiatry and clinical psychology
10.1101/2025.06.19.25329946 medRxiv
Show abstract

Perinatal depression (PND), defined as a depressive illness occurring during pregnancy or following childbirth, affects between 10-20% of mothers. It is one of the greatest causes of mortality and morbidity in mothers and is associated with poor outcomes in children. Early identification of at-risk mothers has the potential to greatly reduce its impact. While specific risk factors for PND have been identified, most notably a history of prior depression, it is unclear whether mothers Electronic Health Records (EHR) can be used early in pregnancy to predict who will go on to develop PND, especially in mothers without a history of prior depression. In this paper, we used clinical EHR data from the UCLA health system to develop predictive models of perinatal depression at a patients first prenatal visit (n = 18,081 pregnant mothers, n=4,307 with PND). We used a variety of predictive models, including Ridge Regression, Gradient Boosting Trees, Random Forests, and ExtraTrees. We performed separate analyses including only mothers without a history of prior depression. We further evaluated the robustness and fairness of our algorithms comparing models stratified by self-reported ethnoracial group and social determinants of health (e.g., social vulnerability index (SVI)). All model architectures used perform similarly. The Random Forest model provided robust performance with the highest accuracy and well-balanced sensitivity and specificity (AUROC 0.75, CI [0.66,0.84] in the full cohort). However, performance was reduced among mothers without prior depression (AUROC 0.71, CI [0.6,0.8]). Important risk factors identified by our model include known risk factors, such as prior mental health histories (prior depression, anxiety disorders), socioeconomic factors (social vulnerability), patient vitals (blood pressure), and measures of inflammation in blood (white blood cell counts, platelet counts), as well as novel ones (patient pulse, mean platelet volume (MPV), red blood cell distribution width (RDWSD) and rapid plasma reagin (RPR)). We observed similar model performance when stratifying our cohort by social determinants of health, with overlapping ROC bounds, equalized odds ratios between groups close to 0.8, and largely overlapping predictors of importance across models. This was not the case for ethnoracial groups, where despite observing top predictive features varied by ethnoracial category.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Nature Medicine
117 papers in training set
Top 0.1%
32.0%
2
npj Digital Medicine
97 papers in training set
Top 0.4%
12.1%
3
Nature Communications
4913 papers in training set
Top 24%
8.0%
50% of probability mass above
4
Translational Psychiatry
219 papers in training set
Top 1%
4.7%
5
Molecular Psychiatry
242 papers in training set
Top 0.8%
3.9%
6
Scientific Reports
3102 papers in training set
Top 39%
3.5%
7
Communications Medicine
85 papers in training set
Top 0.1%
2.8%
8
Acta Psychiatrica Scandinavica
10 papers in training set
Top 0.1%
2.0%
9
Nature Neuroscience
216 papers in training set
Top 4%
2.0%
10
Biological Psychiatry
119 papers in training set
Top 1%
2.0%
11
BMC Medicine
163 papers in training set
Top 3%
2.0%
12
Frontiers in Psychiatry
83 papers in training set
Top 2%
1.6%
13
PLOS Digital Health
91 papers in training set
Top 2%
1.4%
14
eLife
5422 papers in training set
Top 48%
1.3%
15
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
16
PLOS Medicine
98 papers in training set
Top 3%
1.3%
17
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.3%
18
Biological Psychiatry Global Open Science
54 papers in training set
Top 0.9%
1.3%
19
PLOS ONE
4510 papers in training set
Top 61%
1.2%
20
European Psychiatry
10 papers in training set
Top 0.5%
0.9%
21
Journal of Affective Disorders
81 papers in training set
Top 1%
0.9%
22
Science
429 papers in training set
Top 20%
0.7%
23
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 48%
0.6%
24
Social Science & Medicine
15 papers in training set
Top 1%
0.6%
25
JAMA Network Open
127 papers in training set
Top 5%
0.6%
26
BJPsych Open
25 papers in training set
Top 0.8%
0.6%