Back

Design and preliminary safety validation of a hybrid deterministic-AI triage system for multilingual primary healthcare: a WhatsApp-based vignette study in South Africa

Nkosi-Mjadu, B. E.

2026-04-22 health informatics
10.64898/2026.04.21.26349781 medRxiv
Show abstract

BackgroundSouth Africas public healthcare system serves most of the population through approximately 3,900 primary healthcare clinics characterised by long waiting times and high volumes of repeat-prescription visits. No published pre-arrival digital triage system operates across all 11 official South African languages while aligning with the South African Triage Scale (SATS). This paper reports the design and preliminary safety validation of BIZUSIZO, a hybrid deterministic-AI WhatsApp triage system. MethodsBIZUSIZO delivers SATS-aligned triage via WhatsApp, combining AI-assisted free-text classification (Claude Haiku 4.5) with a Deterministic Clinical Safety Layer (DCSL) that overrides AI output for 53 clinical discriminator categories (14 RED, 19 ORANGE, 20 YELLOW) coded in all 11 official languages and independent of AI availability. A five-domain risk factor assessment can only upgrade triage level. One hundred and twenty clinical vignettes in patient language (English, isiZulu, isiXhosa, Afrikaans; 30 per language) were scored against a developer-assigned gold standard with independent blinded nurse review. A 121-vignette multilingual DCSL safety consistency check across all 11 languages and a 220-call post-hoc framing sensitivity evaluation (110 paired vignettes) were also conducted. ResultsUnder-triage was 3.3% (4/120; 95% CI: 0.9%-8.3%) with no RED under-triage; exact concordance was 80.0% (96/120) and quadratic weighted kappa 0.891 (95% CI: 0.827-0.932). One two-level under-triage was observed on a non-RED presentation (V072, isiXhosa burns vignette, ORANGEGREEN); one two-level over-triage was observed (V054, isiZulu deep laceration, YELLOWRED). In the framing sensitivity evaluation, AI-only classification achieved 50.9% RED invariance under adversarial framing; full-pipeline classification achieved 95.0% in four validated languages, with the DCSL rescuing 18 of 23 AI drift cases. ConclusionsA hybrid deterministic-AI triage system with DCSL-based emergency detection achieved zero RED under-triage and consistent RED detection across all 11 official languages. The 16.7% over-triage rate falls within published South African SATS ranges (13.1-49%). A single two-level under-triage event was observed on an isiXhosa burns vignette (ORANGEGREEN) and is discussed in Limitations. Findings are preliminary; prospective validation against independent nurse triage is the necessary next step.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
BMJ Open
554 papers in training set
Top 1%
14.5%
2
PLOS ONE
4510 papers in training set
Top 20%
9.9%
3
BMJ Health & Care Informatics
13 papers in training set
Top 0.1%
8.3%
4
Emergency Medicine Journal
20 papers in training set
Top 0.1%
7.0%
5
PLOS Digital Health
91 papers in training set
Top 0.4%
6.2%
6
BMC Health Services Research
42 papers in training set
Top 0.7%
3.5%
7
BMJ
49 papers in training set
Top 0.3%
3.5%
50% of probability mass above
8
Scientific Reports
3102 papers in training set
Top 39%
3.5%
9
The Lancet Digital Health
25 papers in training set
Top 0.2%
3.2%
10
JMIR Medical Informatics
17 papers in training set
Top 0.5%
2.4%
11
Journal of Medical Internet Research
85 papers in training set
Top 2%
2.3%
12
npj Digital Medicine
97 papers in training set
Top 2%
2.0%
13
BMC Medicine
163 papers in training set
Top 3%
1.9%
14
International Journal of Medical Informatics
25 papers in training set
Top 0.8%
1.8%
15
Frontiers in Digital Health
20 papers in training set
Top 0.7%
1.7%
16
eClinicalMedicine
55 papers in training set
Top 0.6%
1.7%
17
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
18
JMIR Public Health and Surveillance
45 papers in training set
Top 2%
1.6%
19
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 3%
1.6%
20
Nature Communications
4913 papers in training set
Top 53%
1.6%
21
BMJ Open Quality
15 papers in training set
Top 0.5%
1.5%
22
BMC Medical Research Methodology
43 papers in training set
Top 0.7%
1.5%
23
CMAJ Open
12 papers in training set
Top 0.2%
0.9%
24
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.9%
25
Trials
25 papers in training set
Top 2%
0.8%
26
JAMIA Open
37 papers in training set
Top 2%
0.7%
27
British Journal of General Practice
22 papers in training set
Top 0.6%
0.7%
28
Healthcare
16 papers in training set
Top 2%
0.7%
29
BMJ Paediatrics Open
21 papers in training set
Top 0.8%
0.7%
30
DIGITAL HEALTH
12 papers in training set
Top 0.8%
0.6%