Back

Prioritising Hospital Complaints: An Innovative Tool Using Large Language Model-Assisted Content Analysis and Machine Learning Algorithms

Sulaiman, M. H.; Muda, N.; Abdul Razak, F.

2025-06-08 health systems and quality improvement
10.1101/2025.06.07.25329193 medRxiv
Show abstract

BackgroundIn clinical settings, patients often express dissatisfaction through narrative speech or written text. However, most complaints management systems still rely on manual review or rulebased methods that fail to capture the severity or urgency of complaints. This leads to inconsistent triage, delayed resolution and missed opportunities for systemic improvement. A novel model leveraging large language model-assisted content analysis (LACA) and machine learning (ML) can transform subjective narratives into standardized, machine-readable severity scores, facilitating the prioritisation of complaints. ObjectiveThis study aims to (1) determine the precision, recall fscore and accuracy of the proposed predictive models used to classify comments into low-alert and high-alert comment, (2) determine the construct validity and internal consistency (Cronbachs ) of the themes found in LACA conducted on hospital web-based review data, (3) determine the predictors of low-alert and high-alert comments and their ability to change the log-odds of the outcome in logistic regression, and (4) to measure the robustness of the explanatory model measured by pseudo-R2. MethodologyLACA was performed using a set of thematic codes to generate an independent variable dataset (x), with a scale of 0: not an issue, 1: a small issue, 2: a moderate issue, 3: a serious issue, and 4: an extremely serious issue. The independent variables (x) and the dependent variable (y, representing the review rating) were then split into training and testing sets to build predictive ML models. Grid search was used to determine the optimal combination of hyperparameters. The performance of the predictive and explanatory models was evaluated. ResultsML classification was able to produce f1-score of 0.88 - 0.94 and accuracy of 0.92 for LR model; and f1-score of 0.87 - 0.94 and accuracy of 0.92 for ANN model The behaviour of predictive models was successfully explained by the explanatory model: Six (6) themes were determined with cumulative explained variance (CEV) of 0.74 and average Cronbachs of 0.86. LR shows significance on 5 themes with pseudo-R2 of 0.55. ConclusionThis study demonstrates that a data pipeline utilizing LACA and ML algorithms shows excellent performance in classifying patient comments in a hospital setting. All effectiveness parameters including CEV, Cronbachs , precision, recall, f1-score, and accuracy indicate strong performance in differentiating high-alert from low-alert comments.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.1%
38.2%
2
Journal of Medical Internet Research
85 papers in training set
Top 0.3%
12.5%
50% of probability mass above
3
PLOS ONE
4510 papers in training set
Top 27%
6.4%
4
JMIRx Med
31 papers in training set
Top 0.1%
4.4%
5
PLOS Digital Health
91 papers in training set
Top 0.9%
2.9%
6
JMIR Medical Informatics
17 papers in training set
Top 0.5%
2.4%
7
BMJ Health & Care Informatics
13 papers in training set
Top 0.3%
2.4%
8
Healthcare
16 papers in training set
Top 0.4%
2.1%
9
International Journal of Medical Informatics
25 papers in training set
Top 0.6%
2.1%
10
JMIR Formative Research
32 papers in training set
Top 0.6%
1.9%
11
BMJ Open Quality
15 papers in training set
Top 0.5%
1.7%
12
Frontiers in Public Health
140 papers in training set
Top 5%
1.7%
13
Journal of Biomedical Informatics
45 papers in training set
Top 0.8%
1.7%
14
Frontiers in Digital Health
20 papers in training set
Top 0.8%
1.3%
15
Bioengineering
24 papers in training set
Top 1%
0.9%
16
Journal of Clinical and Translational Science
11 papers in training set
Top 0.3%
0.9%
17
JAMIA Open
37 papers in training set
Top 1%
0.9%
18
Artificial Intelligence in Medicine
15 papers in training set
Top 0.5%
0.9%
19
Scientific Reports
3102 papers in training set
Top 70%
0.9%
20
BMC Medical Education
20 papers in training set
Top 0.8%
0.8%
21
Public Health in Practice
11 papers in training set
Top 0.3%
0.8%
22
Journal of Clinical Medicine
91 papers in training set
Top 6%
0.8%
23
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.8%
0.8%
24
Research Synthesis Methods
20 papers in training set
Top 0.2%
0.8%
25
Sensors
39 papers in training set
Top 2%
0.8%
26
Journal of Public Health
23 papers in training set
Top 1%
0.7%
27
Disaster Medicine and Public Health Preparedness
16 papers in training set
Top 2%
0.5%
28
Cureus
67 papers in training set
Top 6%
0.5%
29
Medicine
30 papers in training set
Top 3%
0.5%