Back

Leveraging Expert Knowledge and Causal Structure Learning to Build Parsimonious Models of Acute Brain Dysfunction in the Pediatric Intensive Care Unit

Perez Claudio, E.; Horvat, C.; Au, A. K.; Clark, R. S. B.; Taylor, M. W.; Cooper, G. F.; Li, R.; Nourelahi, M.; Hochheiser, H.

2026-02-18 health informatics
10.64898/2026.02.17.26345661 medRxiv
Show abstract

Machine learning adoption in clinical decision support systems remains limited by concerns about transparency and robustness. Causal structure learning (CSL) combined with expert knowledge may address these concerns by identifying potentially causal predictors, enabling more interpretable and clinically aligned models. In this study, we show that by integrating clinician expertise with CSL algorithms we can identify plausible causal drivers of acquired acute brain dysfunction (ABD) in the pediatric intensive care unit (PICU), which enables the development of parsimonious predictive models without substantial loss in performance. To do so, we analyzed 18,568 PICU encounters from the University of Pittsburgh Medical Center Childrens Hospital (2010-2022) and elicited knowledge from experienced clinicians. Encounters with acquired ABD were defined using the validated ABD computable phenotype. Expert knowledge was elicited from four clinicians through iterative interviews to construct a consensus directed acyclic graph (DAG). Clinician consensus achieved acceptable inter-rater reliability (Fleiss Kappa = 0.62) after two rounds of interviews and identified 16 biomarkers as potential causes of acquired ABD. Two CSL algorithms, GOLEM and PC-MB, were applied to enrich the clinicians consensus DAG. The PC-MB algorithm showed 78% concordance with expert consensus, while GOLEM showed 46%. Together, the CSL algorithms identified seven biomarkers as potential causes that were not included in the clinicians DAG: blood urea nitrogen, creatinine, dobutamine, glucose, potassium, PTT, SpO2. Using multiple variations of the enriched DAGs, XGBoost models were trained using biomarkers identified as potential causes of acquired ABD; these were evaluated primarily by area under the precision-recall curve (AUPRC). Models trained on the intersection of clinician consensus and PC-MB DAGs achieved an AUPRC of 0.79 (95% CI: 0.75-0.82) using only 14 biomarkers, compared to 0.81 (95% CI: 0.78-0.84) for the control model using all 45 biomarkers. When restricted to vitals and laboratory results alone, the best-performing model achieved an AUPRC of 0.77. Combining clinical expertise with causal structure learning enables the identification of causal hypotheses consistent with the clinical understanding of the participating clinicians and the development of parsimonious predictive models for acquired ABD in the PICU.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
23.1%
2
npj Digital Medicine
97 papers in training set
Top 0.4%
12.7%
3
PLOS Digital Health
91 papers in training set
Top 0.2%
8.6%
4
Scientific Reports
3102 papers in training set
Top 9%
8.6%
50% of probability mass above
5
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.3%
2.7%
6
The Lancet Digital Health
25 papers in training set
Top 0.2%
2.7%
7
International Journal of Medical Informatics
25 papers in training set
Top 0.6%
2.5%
8
Nature Machine Intelligence
61 papers in training set
Top 1%
2.4%
9
Journal of Medical Internet Research
85 papers in training set
Top 2%
2.1%
10
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
1.9%
11
JMIR Medical Informatics
17 papers in training set
Top 0.7%
1.7%
12
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.7%
13
Journal of Biomedical Informatics
45 papers in training set
Top 0.8%
1.7%
14
Nature Communications
4913 papers in training set
Top 53%
1.5%
15
Nature Medicine
117 papers in training set
Top 3%
1.4%
16
Frontiers in Medicine
113 papers in training set
Top 4%
1.4%
17
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
18
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.4%
1.3%
19
eLife
5422 papers in training set
Top 50%
1.1%
20
PLOS ONE
4510 papers in training set
Top 63%
0.9%
21
Frontiers in Digital Health
20 papers in training set
Top 1%
0.9%
22
Science Advances
1098 papers in training set
Top 26%
0.9%
23
Annals of Neurology
57 papers in training set
Top 2%
0.8%
24
Cell Reports Medicine
140 papers in training set
Top 7%
0.8%
25
BMJ Health & Care Informatics
13 papers in training set
Top 0.8%
0.8%
26
Patterns
70 papers in training set
Top 3%
0.7%
27
Artificial Intelligence in Medicine
15 papers in training set
Top 0.7%
0.7%
28
Communications Medicine
85 papers in training set
Top 1%
0.7%
29
Critical Care Explorations
15 papers in training set
Top 0.5%
0.7%
30
iScience
1063 papers in training set
Top 36%
0.7%