Back

Human-supervised, large language model-based clinical decision support aligned to national newborn protocols in Kenya: a pragmatic, early-stage evaluation

Kuria, T.; Kamau, G.; Makokha, F.; Omondi, P.; Mbugua, G.; David, K.; Mbugua, S.; Gitaka, J.

2026-03-25 health informatics
10.64898/2026.03.22.26348994 medRxiv
Show abstract

Introduction: Timely, protocol-adherent clinical decisions are crucial for reducing neonatal mortality in low-resource settings. Translating extensive national guidelines into bedside practice remains challenging. Objective: We developed and evaluated AIFYA, a human-supervised, large language model LLM based clinical decision support system CDSS aligned with Kenya's national newborn care protocols. Methods: This prospective mixed methods early stage evaluation guided by the DECIDE-AI framework embedded AIFYA into routine workflows at two public health facilities Level 5 and Level 4 in Bungoma County Kenya from September 2024 to June 2025. Primary outcomes were adoption measured by cumulative neonatal cases managed training reach assessed by credentialed healthcare workers HCWs and guideline and citation concordance evaluated through blinded review of 118 AI generated recommendations by two neonatologists with adjudication by a third. Secondary outcomes included protocol adherence and triage to decision time. Results: A total of 50 HCWs were trained and 550 neonatal cases were managed over 10 months. Among surveyed HCWs n equals 33, 76 percent were female with mean age 32.1 years. Expert review found 75 percent of recommendations were correct and 15 percent partially correct with strong inter rater reliability weighted Cohen's kappa 0.85 and 95 percent CI 0.79 to 0.91. Citation accuracy was 96 percent. In 40 complex dosing scenarios 75 percent of outputs were rated correct. The median triage to decision time was 23 minutes with interquartile range 18 to 31. Implementation was supported by an offline first architecture and a facility based coaching model sustaining engagement despite staff turnover. Conclusion: A human supervised AI CDSS directly and transparently anchored to national clinical guidelines can be successfully implemented in routine low resource neonatal care settings. The system demonstrated high user adoption and strong expert rated concordance. High citation accuracy builds clinical trust ensuring safety and enabling auditable AI. These findings support progression to controlled multi site trials to evaluate clinical effectiveness. Keywords: Neonatal care Clinical decision support system Large language model Artificial intelligence Human supervised Low resource settings Guideline adherence Digital health Kenya

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
PLOS Digital Health
91 papers in training set
Top 0.1%
31.9%
2
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.2%
13.9%
3
BMJ Health & Care Informatics
13 papers in training set
Top 0.1%
6.2%
50% of probability mass above
4
PLOS ONE
4510 papers in training set
Top 32%
4.7%
5
Frontiers in Digital Health
20 papers in training set
Top 0.1%
4.7%
6
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 1.0%
4.2%
7
International Journal of Medical Informatics
25 papers in training set
Top 0.4%
3.5%
8
Journal of Medical Internet Research
85 papers in training set
Top 2%
3.5%
9
Wellcome Open Research
57 papers in training set
Top 0.4%
3.0%
10
Scientific Reports
3102 papers in training set
Top 52%
2.0%
11
DIGITAL HEALTH
12 papers in training set
Top 0.3%
2.0%
12
BMC Medicine
163 papers in training set
Top 4%
1.4%
13
BMJ Open
554 papers in training set
Top 10%
1.3%
14
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.3%
15
BMJ Open Quality
15 papers in training set
Top 0.6%
1.2%
16
JMIR Medical Informatics
17 papers in training set
Top 1%
0.9%
17
Frontiers in Public Health
140 papers in training set
Top 8%
0.8%
18
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.7%
19
JAMIA Open
37 papers in training set
Top 2%
0.7%
20
Healthcare
16 papers in training set
Top 2%
0.6%
21
CMAJ Open
12 papers in training set
Top 0.3%
0.6%
22
JMIR Formative Research
32 papers in training set
Top 2%
0.6%