Back

Improving and Interpreting Surgical Case Duration Prediction with Machine Learning Methodology

Lai, J.; Huang, C.-C.; Liu, S.-C.; Huang, J.-Y.; Cho, D.-Y.; Yu, J.

2020-12-08 health systems and quality improvement
10.1101/2020.06.10.20127910 medRxiv
Show abstract

Predictive accuracy of surgical case duration plays a critical role in reducing cost of operation room (OR) utilization. The most common approaches used by hospitals rely on historic averages based on a specific surgeon or a specific procedure type obtained from the electronic medical record (EMR) scheduling systems. However, low predictive accuracy of EMR leads to negative impacts on patients and hospitals, such as rescheduling of surgeries and cancellation. In this study, we aim to improve prediction of operation case duration with advanced machine learning (ML) algorithms. We obtained a large data set containing 170,748 operation cases (from Jan 2017 to Dec 2019) from a hospital. The data covered a broad variety of details on patients, operations, specialties and surgical teams. Meanwhile, a more recent data with 8,672 cases (from Mar to Apr 2020) was also available to be used for external evaluation. We computed historic averages from EMR for surgeon- or procedure-specific and they were used as baseline models for comparison. Subsequently, we developed our models using linear regression, random forest and extreme gradient boosting (XGB) algorithms. All models were evaluated with R-squre (R2), mean absolute error (MAE), and percentage overage (case duration > prediction + 10 % & 15 mins), underage (case duration < prediction - 10 % & 15 mins) and within (otherwise). The XGB model was superior to the other models by having higher R2 (85 %) and percentage within (48 %) as well as lower MAE (30.2 mins). The total prediction errors computed for all the models showed that the XGB model had the lowest inaccurate percent (23.7 %). As a whole, this study applied ML techniques in the field of OR scheduling to reduce medical and financial burden for healthcare management. It revealed the importance of operation and surgeon factors in operation case duration prediction. This study also demonstrated the importance of performing an external evaluation to better validate performance of ML models.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.1%
59.3%
50% of probability mass above
2
PLOS ONE
4510 papers in training set
Top 25%
6.8%
3
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.1%
3.7%
4
JMIR Medical Informatics
17 papers in training set
Top 0.4%
3.1%
5
Scientific Reports
3102 papers in training set
Top 44%
2.7%
6
Journal of Medical Internet Research
85 papers in training set
Top 2%
1.9%
7
Frontiers in Public Health
140 papers in training set
Top 5%
1.7%
8
International Journal of Environmental Research and Public Health
124 papers in training set
Top 5%
1.3%
9
International Journal of Medical Informatics
25 papers in training set
Top 1%
0.9%
10
BMC Health Services Research
42 papers in training set
Top 2%
0.9%
11
JMIRx Med
31 papers in training set
Top 1%
0.9%
12
Bioengineering
24 papers in training set
Top 1%
0.9%
13
JAMIA Open
37 papers in training set
Top 1%
0.8%
14
Artificial Intelligence in Medicine
15 papers in training set
Top 0.6%
0.8%
15
Physiological Measurement
12 papers in training set
Top 0.4%
0.8%
16
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.8%
17
Frontiers in Artificial Intelligence
18 papers in training set
Top 0.7%
0.8%
18
Informatics in Medicine Unlocked
21 papers in training set
Top 1%
0.7%
19
Epidemiology and Infection
84 papers in training set
Top 3%
0.7%
20
Sensors
39 papers in training set
Top 2%
0.6%
21
Heliyon
146 papers in training set
Top 8%
0.6%