Back

An Effective Automated Algorithm to Isolate Patient Speech from Conversations with Clinicians

Jaquenoud, T.; Keene, S.; Shlayan, N.; Federman, A.; Pandey, G.

2022-11-30 health informatics
10.1101/2022.11.29.22282914
Show abstract

A growing number of algorithms are being developed to automatically identify disorders or disease biomarkers from digitally recorded audio of patient speech. An important step in these analyses is to identify and isolate the patients speech from that of other speakers or noise that are captured in a recording. However, current algorithms, such as diarization, only label the identified speech segments in terms of non-specific speakers, and do not identify the specific speaker of each segment, e.g., clinician and patient. In this paper, we present a novel algorithm that not only performs diarization on clinical audio, but also identifies the patient among the speakers in the recording and returns an audio file containing only the patients speech. Our algorithm first uses pretrained diarization algorithms to separate the input audio into different tracks according to nonspecific speaker labels. Next, in a novel step not conducted in other diarization tools, the algorithm uses the average loudness (quantified as power) of each audio track to identify the patient, and return the audio track containing only their speech. Using a practical expert-based evaluation methodology and a large dataset of clinical audio recordings, we found that the best implementation of our algorithm achieved near-perfect accuracy on two validation sets. Thus, our algorithm can be used for effectively identifying and isolating patient speech, which can be used in downstream expert and/or data-driven analyses.

Matching journals

1
Computers in Biology and Medicine
Elsevier BV · based on 39 published papers
#1
60× avg
2
Scientific Reports
Springer Science and Business Media LLC · based on 701 published papers
Top 12%
2.3× avg
3
Journal of Biomedical Informatics
Elsevier BV · based on 37 published papers
Top 0.4%
44× avg
4
PLOS ONE
Public Library of Science (PLoS) · based on 1737 published papers
Top 57%
7.6%
5
PLOS Digital Health
Public Library of Science (PLoS) · based on 88 published papers
Top 2%
14× avg
6
IEEE Journal of Biomedical and Health Informatics
Institute of Electrical and Electronics Engineers (IEEE) · based on 14 published papers
#1
77× avg
7
JAMIA Open
Oxford University Press (OUP) · based on 35 published papers
Top 3%
9.9× avg
8
Journal of Medical Internet Research
JMIR Publications Inc. · based on 81 published papers
Top 7%
3.6× avg
9
Patterns
Elsevier BV · based on 15 published papers
Top 0.6%
26× avg
10
Journal of the American Medical Informatics Association
Oxford University Press (OUP) · based on 53 published papers
Top 4%
5.8× avg
11
BMC Medical Informatics and Decision Making
Springer Science and Business Media LLC · based on 36 published papers
Top 5%
5.1× avg
12
npj Digital Medicine
Springer Science and Business Media LLC · based on 85 published papers
Top 8%
2.3× avg
13
Frontiers in Digital Health
Frontiers Media SA · based on 18 published papers
Top 2%
12× avg
14
Frontiers in Artificial Intelligence
Frontiers Media SA · based on 11 published papers
Top 1%
16× avg
15
Frontiers in Neuroscience
Frontiers Media SA · based on 29 published papers
Top 3%
7.7× avg
16
Frontiers in Psychiatry
Frontiers Media SA · based on 56 published papers
Top 6%
3.6× avg
17
IEEE Access
Institute of Electrical and Electronics Engineers (IEEE) · based on 11 published papers
Top 1%
16× avg
18
JMIR Medical Informatics
JMIR Publications Inc. · based on 16 published papers
Top 5%
4.6× avg
19
Informatics in Medicine Unlocked
Elsevier BV · based on 11 published papers
Top 2%
8.9× avg
20
Journal of Neural Engineering
IOP Publishing · based on 19 published papers
Top 2%
6.4× avg
21
Nature Medicine
Springer Science and Business Media LLC · based on 88 published papers
Top 19%
0.7%