Back

Unmasking Disparities in Caesarean Section Utilization in Nigeria: A K-Means Cluster Analysis of Obstetric Risk Profiles Using Machine Learning Approach

AJEBORIOGBON, S. A.; Ogunetimoju, A. M.; Bisiriyu, O. L.

2026-05-01 epidemiology
10.64898/2026.04.30.26352131 medRxiv
Show abstract

Caesarean section rates in Nigeria remain suboptimal, with significant disparities across socioeconomic and geographic strata. The objective of this research is to identify and characterize distinct obstetric risk profiles associated with caesarean section utilization in Nigeria using K-means cluster analysis, and to examine the sociodemographic and geographic factors driving these disparities. We analyzed data from 13,915 women with recent births in the 2024 Nigeria Demographic and Health Survey. Fourteen variables spanning demographics, socioeconomic status, healthcare access, medical history, and geography were used as clustering features. K-Means clustering was performed with optimal cluster selection based on silhouette score, Davies Bouldin index, and Calinski Harabasz index. Bootstrap validation with 100 iterations assessed cluster stability, while chi-square tests and logistic regression examined associations between cluster membership and surgical delivery. Ten distinct clusters were identified with rates ranging from 1.7% to 14.4%, representing an 8.4-fold variation. The highest utilization cluster at 14.4% comprised urban, highly educated, wealthy women with extensive antenatal care averaging 16.5 visits, while the lowest utilization cluster at 1.7% consisted of rural, poorly educated, impoverished women with minimal healthcare access averaging 2.3 visits. Cluster membership was significantly associated with utilization, and bootstrap analysis confirmed cluster stability with a mean silhouette of 0.220. Machine learning based clustering reveals profound disparities in utilization across distinct population subgroups in Nigeria, highlighting the dual challenge of underutilization among disadvantaged rural populations and potential overutilization among urban elites. Targeted interventions addressing geographic, economic, and healthcare access barriers are essential to optimize utilization across all population segments.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 5%
23.6%
2
Scientific Reports
3102 papers in training set
Top 0.5%
19.5%
3
PLOS Global Public Health
293 papers in training set
Top 0.8%
10.6%
50% of probability mass above
4
Journal of Global Health
18 papers in training set
Top 0.1%
4.8%
5
Frontiers in Public Health
140 papers in training set
Top 1%
4.5%
6
International Journal of Medical Informatics
25 papers in training set
Top 0.5%
2.9%
7
Epidemics
104 papers in training set
Top 0.8%
2.0%
8
BMC Pregnancy and Childbirth
20 papers in training set
Top 0.4%
1.9%
9
PLOS Digital Health
91 papers in training set
Top 1%
1.7%
10
International Journal of Environmental Research and Public Health
124 papers in training set
Top 4%
1.6%
11
The American Journal of Tropical Medicine and Hygiene
60 papers in training set
Top 3%
1.4%
12
JMIRx Med
31 papers in training set
Top 1%
1.2%
13
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.9%
14
International Journal of Epidemiology
74 papers in training set
Top 2%
0.9%
15
PeerJ
261 papers in training set
Top 13%
0.8%
16
BMJ Global Health
98 papers in training set
Top 3%
0.8%
17
BMC Medicine
163 papers in training set
Top 6%
0.8%
18
Journal of Clinical Medicine
91 papers in training set
Top 6%
0.8%
19
Disaster Medicine and Public Health Preparedness
16 papers in training set
Top 1%
0.8%
20
Epidemiology and Infection
84 papers in training set
Top 3%
0.8%
21
BMJ Open
554 papers in training set
Top 14%
0.5%
22
BMC Health Services Research
42 papers in training set
Top 3%
0.5%