Back

Leveraging State-of-the-Art LLMs for the De-identification of Sensitive Health Information in Clinical Speech

Dai, H.-J.; Mir, T. H.; Fang, L.-C.; Chen, C.-T.; Feng, H.-H.; Lai, J.-R.; Hsu, H.-C.; Nandy, P.; Panchal, O.; Liao, W.-H.; Tien, Y.-Z.; Chen, P.-Z.; Lin, Y.-R.; Jonnagaddala, J.

2026-04-17 health informatics
10.64898/2026.04.13.26349911 medRxiv
Show abstract

Accurate recognition and deidentification of sensitive health information (SHI) in spoken dialogues requires multimodal algorithms that can understand medical language and contextual nuance. However, the recognition and deidentification risks expose sensitive health information (SHI). Additionally, the variability and complexity of medical terminology, along with the inherent biases in medical datasets, further complicate this task. This study introduces the SREDH/AI-Cup 2025 Medical Speech Sensitive Information Recognition Challenge, which focuses on two tasks: Task-1: Speech transcription systems must accurately transcribe speech into text; and Task-2: Medical speech de-identification to detect and appropriately classify mentions of SHI. The competition attracted 246 teams; top-performing systems achieved a mixed error rate (MER) of 0.1147 and a macro F1-score of 0.7103, with average MER and macro F1-score of 0.3539 and 0.2696, respectively. Results were presented at the IW-DMRN workshop in 2025. Notably, the results reveal that LLMs were prevalent across both tasks: 97.5% of teams adopted LLMs for Task 1 and 100% for Task 2. Highlighting their growing role in healthcare. Furthermore, we finetuned six models, demonstrating strong precision ([~]0.885-0.889) with slightly lower recall ([~]0.830-0.847), resulting in F1-scores of 0.857-0.867.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Biomedical Informatics
45 papers in training set
Top 0.1%
28.5%
2
Frontiers in Digital Health
20 papers in training set
Top 0.1%
7.0%
3
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.2%
6.6%
4
Scientific Reports
3102 papers in training set
Top 16%
6.6%
5
Artificial Intelligence in Medicine
15 papers in training set
Top 0.1%
4.4%
50% of probability mass above
6
npj Digital Medicine
97 papers in training set
Top 0.9%
4.4%
7
Computers in Biology and Medicine
120 papers in training set
Top 0.8%
3.7%
8
Journal of Medical Internet Research
85 papers in training set
Top 1%
3.7%
9
Scientific Data
174 papers in training set
Top 0.9%
1.8%
10
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.8%
11
PLOS Digital Health
91 papers in training set
Top 1%
1.7%
12
JAMIA Open
37 papers in training set
Top 0.9%
1.5%
13
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 3%
1.5%
14
Nature Communications
4913 papers in training set
Top 58%
1.0%
15
Informatics in Medicine Unlocked
21 papers in training set
Top 0.9%
0.9%
16
Journal of Personalized Medicine
28 papers in training set
Top 0.8%
0.9%
17
International Journal of Medical Informatics
25 papers in training set
Top 1%
0.9%
18
eBioMedicine
130 papers in training set
Top 3%
0.8%
19
Patterns
70 papers in training set
Top 2%
0.8%
20
Bioinformatics
1061 papers in training set
Top 9%
0.8%
21
iScience
1063 papers in training set
Top 28%
0.8%
22
Med
38 papers in training set
Top 0.6%
0.8%
23
PLOS ONE
4510 papers in training set
Top 65%
0.8%
24
Sensors
39 papers in training set
Top 2%
0.8%
25
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 0.9%
0.8%
26
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
0.8%
27
Biomedical Signal Processing and Control
18 papers in training set
Top 0.5%
0.7%
28
Biology Methods and Protocols
53 papers in training set
Top 3%
0.7%
29
JMIR Medical Informatics
17 papers in training set
Top 2%
0.7%
30
Cureus
67 papers in training set
Top 6%
0.7%