Back

Unintended Pregnancy and Preterm Birth in the United States: Causal Inference and Risk Prediction Using National Survey of Family Growth Data

ADETUNJI, S. A.; Nerandra, P. M.

2025-11-27 sexual and reproductive health
10.1101/2025.11.25.25341033 medRxiv
Show abstract

BackgroundUnintended pregnancy remains common in high income countries and has been linked to poorer maternal and neonatal outcomes. Whether pregnancy intention has an independent, causal effect on preterm birth, beyond social and clinical risk factors, is uncertain. MethodsWe conducted a cross-sectional analysis of a nationally representative sample of singleton live births from a US reproductive health survey. Pregnancy intention (intended vs unintended) was reported at conception. Preterm birth was defined as delivery before 37 completed weeks. We used survey-weighted logistic regression and a suite of causal estimators, including inverse probability weighted marginal structural models, augmented inverse probability weighting, targeted maximum likelihood estimation with Super Learner, Bayesian g-computation, and causal forests. Models adjusted for maternal age, race and ethnicity, parity, marital or cohabiting status, education, poverty ratio, insurance, and body mass index. We also trained Super Learner prediction models with 10-fold cross validation and evaluated discrimination, calibration, high risk stratification, and net clinical benefit. FindingsIn the weighted population, 39.1% of pregnancies were unintended. Preterm birth occurred in 12.6% of unintended vs 9.0% of intended pregnancies. In survey-weighted logistic models, unintended pregnancy was associated with higher odds of preterm birth (adjusted odds ratio 1.43, 95% CI 1.06 to 1.94). Across advanced causal estimators, the risk difference for unintended vs intended pregnancy was small but consistent, around 3 excess preterm births per 100 live births, with limited positivity and modest E-values suggesting that unmeasured confounding could attenuate or explain part of the association. A Super Learner ensemble achieved excellent discrimination (area under the curve about 0.98 vs 0.56 for baseline logistic regression), good calibration, and identified a top 10% risk stratum with markedly higher observed preterm birth risk than the lower 90%. InterpretationIn this national sample, unintended pregnancy functioned primarily as a marker of concentrated social and clinical vulnerability rather than a large, isolated causal driver of preterm birth. Nonetheless, pregnancy intention materially improved risk stratification when combined with standard covariates. Joint use of causal inference and machine learning provides a defensible framework to target intensified antenatal support to women at highest risk while avoiding overinterpretation of intention as a deterministic cause. FundingNo external funding.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
International Journal of Epidemiology
74 papers in training set
Top 0.1%
42.6%
2
PLOS Medicine
98 papers in training set
Top 0.1%
11.2%
50% of probability mass above
3
Social Science & Medicine
15 papers in training set
Top 0.1%
5.2%
4
Nature Human Behaviour
85 papers in training set
Top 0.5%
5.2%
5
Scientific Reports
3102 papers in training set
Top 32%
3.8%
6
Clinical Infectious Diseases
231 papers in training set
Top 2%
2.8%
7
BMC Pregnancy and Childbirth
20 papers in training set
Top 0.3%
2.5%
8
Nature Communications
4913 papers in training set
Top 46%
2.2%
9
SSM - Population Health
17 papers in training set
Top 0.1%
2.2%
10
PLOS Computational Biology
1633 papers in training set
Top 15%
1.8%
11
PLOS ONE
4510 papers in training set
Top 56%
1.6%
12
European Journal of Human Genetics
49 papers in training set
Top 0.8%
1.3%
13
The Lancet Digital Health
25 papers in training set
Top 0.7%
1.2%
14
JAMA Network Open
127 papers in training set
Top 3%
1.0%
15
BMJ
49 papers in training set
Top 0.9%
1.0%
16
BMC Medicine
163 papers in training set
Top 5%
1.0%
17
Canadian Medical Association Journal
15 papers in training set
Top 0.3%
0.8%
18
Journal of the American Heart Association
119 papers in training set
Top 4%
0.8%
19
European Journal of Public Health
20 papers in training set
Top 0.9%
0.8%
20
Frontiers in Public Health
140 papers in training set
Top 7%
0.8%
21
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.8%
22
Journal of Medical Economics
10 papers in training set
Top 0.1%
0.8%
23
British Journal of General Practice
22 papers in training set
Top 0.5%
0.8%
24
PLOS Digital Health
91 papers in training set
Top 3%
0.5%
25
BMC Biology
248 papers in training set
Top 6%
0.5%