Back

Trilingual (EN/ZH-CN/JP) synthetic dataset of cerebral infarction patient-nurse bedside dialogs with metadata

2026-01-06 nursing Title + abstract only
View on medRxiv
Show abstract

We propose a large-scale synthetic dataset that correlates structured background information aligned with the actual distribution of patients with cerebral infarction, nurse characteristics, and nurse-patient dialogues across diverse scenarios. Medical dialogue corpora are scarce due to privacy and access restrictions. Even when available, they primarily focus on physician-patient interactions and offer limited metadata (clinical covariates, staff characteristics, etc.). To address this gap, thi...

Predicted journal destinations