Back

From Charting Burden to Workflow Signal: Retrospective Validation of Documentation-Density Measures for ICU Complexity and Long-Stay Risk

Collier, A.

2026-06-06 health informatics
10.64898/2026.06.04.26354922 medRxiv
Show abstract

Background Electronic health record documentation patterns may reflect workflow complexity, monitoring intensity, and operational strain in intensive care settings. However, documentation-derived features can be sensitive to local documentation culture, data capture systems, and outcome definitions. Retrospective validation across multiple datasets is therefore needed before these signals are used in workflow intelligence or clinical AI governance tools. Objective To evaluate whether documentation-density and documentation-timing features show reproducible retrospective signal for ICU workflow complexity and long-stay proxy outcomes across de-identified critical care datasets, while distinguishing workflow and long-stay associations from unsupported claims about mortality prediction, burden reduction, or deployment readiness. Methods We synthesized retrospective validation results from de-identified ICU and workflow datasets generated through a prespecified documentation-density validation program. Feature families included Documentation Burden Score style features, Shift-End Documentation Rate style features, documentation reliability style metadata, and all-documentation feature sets where available. Outcomes included long ICU length of stay proxies, mortality where available, and workflow proxy endpoints. Models compared baseline feature sets with enhanced models containing documentation-density or workflow features. Performance was summarized using area under the receiver operating characteristic curve, Brier score where reported, delta AUROC, bootstrap confidence intervals where reported, and label-shuffle controls where available. Results The strongest external long-stay proxy evidence came from the NWICU chartevents analysis, which included 28,612 ICU stays, 20,267 stays with chart events, and 9,619,759 chart events. For ICU length of stay greater than the median, baseline AUROC was 0.5252. Enhanced AUROC was 0.9512 for Documentation Burden Score features, 0.9214 for Shift-End Documentation Rate features, 0.8470 for documentation reliability style features, and 0.9517 for all documentation features. Corresponding label-shuffle enhanced AUROCs were near random, ranging from 0.4897 to 0.5064. For ICU length of stay greater than the 75th percentile, baseline AUROC was 0.5155. Enhanced AUROC was 0.9433 for Documentation Burden Score features, 0.9194 for Shift-End Documentation Rate features, 0.8118 for documentation reliability style features, and 0.9427 for all documentation features, with label-shuffle enhanced AUROCs from 0.4836 to 0.4999. Additional retrospective support was observed in eICU workflow analyses, HiRID first-24-hour documentation-density analyses, MIMIC-IV HF ICU internal analyses, MIMIC-IV-Note metadata extensions, and nursing-chart or lab density proxy analyses. However, cross-institution discrimination transfer was weak without recalibration, and several analyses remained proxy validations rather than final clinical validations. Conclusions Documentation-density and documentation-timing features show promising retrospective signal for ICU workflow complexity and long-stay proxy outcomes, especially in NWICU chartevents and selected internal dataset-specific analyses. These findings support further preregistered, prospective, silent-mode validation of documentation-derived workflow intelligence. They do not establish prospective clinical performance, mortality reduction, clinician burden reduction, autonomous deterioration prediction, or deployment readiness.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.1%
22.3%
2
npj Digital Medicine
97 papers in training set
Top 0.5%
10.3%
3
Critical Care Explorations
15 papers in training set
Top 0.1%
8.3%
4
Journal of Medical Internet Research
85 papers in training set
Top 0.6%
6.7%
5
JMIR Medical Informatics
17 papers in training set
Top 0.1%
6.3%
50% of probability mass above
6
JAMIA Open
37 papers in training set
Top 0.3%
4.8%
7
Scientific Reports
3102 papers in training set
Top 37%
3.5%
8
BMJ Health & Care Informatics
13 papers in training set
Top 0.2%
2.7%
9
JAMA Network Open
127 papers in training set
Top 1%
2.6%
10
PLOS ONE
4510 papers in training set
Top 45%
2.6%
11
PLOS Digital Health
91 papers in training set
Top 1%
2.1%
12
Frontiers in Digital Health
20 papers in training set
Top 0.5%
1.9%
13
Journal of General Internal Medicine
20 papers in training set
Top 0.5%
1.7%
14
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.5%
15
BMC Medical Research Methodology
43 papers in training set
Top 0.7%
1.5%
16
BMJ
49 papers in training set
Top 0.7%
1.3%
17
Journal of Biomedical Informatics
45 papers in training set
Top 1%
1.2%
18
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
1.1%
19
The Lancet Digital Health
25 papers in training set
Top 0.9%
0.9%
20
BMJ Open
554 papers in training set
Top 13%
0.7%
21
Journal of Clinical and Translational Science
11 papers in training set
Top 0.4%
0.7%
22
BMC Medicine
163 papers in training set
Top 7%
0.7%
23
European Heart Journal - Digital Health
15 papers in training set
Top 0.6%
0.7%
24
Frontiers in Medicine
113 papers in training set
Top 7%
0.7%
25
Critical Care
14 papers in training set
Top 0.7%
0.7%
26
Annals of Internal Medicine
27 papers in training set
Top 1%
0.6%
27
Nature Communications
4913 papers in training set
Top 66%
0.6%