Back

AI-Generated Responses to Patient's Messages: Effectiveness, Feasibility and Implementation

Bladder, K. J. M.; Verburg, A. C.; Arts-Tenhagen, M.; Willemsen, R.; van den Broek, G. B.; Driessen, C. M. L.; Driessen, R. J. B.; Robberts, B.; Scheffer, A. R. T.; de Vries, A. P.; Frenzel, T.; Swillens, J. E. M.

2026-03-02 health informatics
10.64898/2026.03.02.26347175 medRxiv
Show abstract

BackgroundGenerative artificial intelligence (GenAI) in healthcare may reduce administrative burden and enhance quality of care. Large language models (LLMs) can generate draft responses to patient messages using electronic health record (EHR) data. This could mitigate increased workload related to high message volumes. While effectiveness and feasibility of these GenAI tools have been studied in the United States, evidence from non-English contexts is scarce, particularly regarding user experience. ObjectiveThis study evaluated the effectiveness, feasibility and barriers and facilitators of implementing Epics Augmented Response Technology (Art) GenAI tool (Epic Systems Corporation, Verona, WI, USA) in a Dutch academic healthcare setting among a broad range of end users. It explored healthcare professionals (HCP) usage metrics, expectations, and early user experiences. MethodsWe conducted a hybrid type 1 effectiveness-implementation design. HCPs of four clinical departments (dermatology, medical oncology, otorhinolaryngology, and pulmonology) participated in a six-month study. Effectiveness of Art was assessed using efficiency indicators from Epic (including all InBasket users in the hospital) and survey scales measuring well-being and clinical efficiency at three time points: PRE, POST-1 (1 month), and POST-2 (4 months). Feasibility of Art was evaluated through adoption indicators from Epic and survey scales on use and usability. Barriers and facilitators of Art implementation were collected through the survey and thematized using the NASSS framework (Nonadoption, Abandonment, Scale-up, Spread and Sustainability). Results237 unique HCPs generated a total of 8,410 drafts. Review and drafting times were similar for users with and without Art, indicating minimal differences. Perceived clinical efficiency declined significantly from PRE to POST-2, while well-being remained unchanged. Adoption was initially high but decreased over time, averaging 16.7% across departments. Usability and intention-to-use scores also declined significantly. Oualitative findings highlighted time savings, well-structured drafts, and patient-centered language as facilitators. Reported barriers included limited impact on time, low practical utility, content inaccuracies, and style misalignment. ConclusionsThis evaluation of a GenAI tool for patient-provider communication in a non-English academic hospital revealed mixed perceptions of effectiveness and feasibility. High initial expectations contrasted with limited perceived impact on time-savings, well-being and clinical efficiency, alongside declining adoption and usability. Barriers and facilitators revealed contrasting views. These findings underscore the need for a workflow for the handling of user feedback, guidance on clinical responsibilities, along with clear communication about the tools purpose and limitations to manage expectations. Additionally, establishing consensus on a set of quality indicators and their thresholds that indicate when a GenAI tool is sufficiently robust will be critical for responsible scaling of GenAI in clinical practice.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Journal of Medical Internet Research
85 papers in training set
Top 0.2%
13.9%
2
BMJ Health & Care Informatics
13 papers in training set
Top 0.1%
8.1%
3
DIGITAL HEALTH
12 papers in training set
Top 0.1%
7.9%
4
JMIR Formative Research
32 papers in training set
Top 0.2%
6.2%
5
Frontiers in Digital Health
20 papers in training set
Top 0.1%
6.2%
6
JAMIA Open
37 papers in training set
Top 0.3%
4.7%
7
PLOS Digital Health
91 papers in training set
Top 0.6%
4.2%
50% of probability mass above
8
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.6%
4.0%
9
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.8%
3.5%
10
BMJ Open
554 papers in training set
Top 6%
3.5%
11
npj Digital Medicine
97 papers in training set
Top 1%
3.5%
12
PLOS ONE
4510 papers in training set
Top 43%
3.0%
13
BMJ Open Quality
15 papers in training set
Top 0.3%
3.0%
14
JMIR Medical Informatics
17 papers in training set
Top 0.4%
2.8%
15
BMC Health Services Research
42 papers in training set
Top 0.9%
2.3%
16
JMIR Public Health and Surveillance
45 papers in training set
Top 1%
2.0%
17
Healthcare
16 papers in training set
Top 0.5%
1.8%
18
Frontiers in Public Health
140 papers in training set
Top 5%
1.7%
19
International Journal of Medical Informatics
25 papers in training set
Top 1.0%
1.4%
20
Pilot and Feasibility Studies
12 papers in training set
Top 0.4%
1.2%
21
Journal of General Internal Medicine
20 papers in training set
Top 0.7%
1.2%
22
JMIR Research Protocols
18 papers in training set
Top 1%
1.1%
23
CMAJ Open
12 papers in training set
Top 0.2%
0.9%
24
Scientific Reports
3102 papers in training set
Top 72%
0.9%
25
Cancer Medicine
24 papers in training set
Top 1%
0.9%
26
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 7%
0.6%