
Psychiatric Voice Biomarkers: Methodological flaws in pediatric populations

Hamoudi, H. J. A. S.; Wu, M.-J.; Sanches, M.; Soutullo, C. A.; Olmos, C.; Taylor, L. K.; Zunta-Soares, G.; Soares, J. C.; Mwangi, B.

2025-10-15 psychiatry and clinical psychology
10.1101/2025.10.13.25337901 medRxiv

Introduction: Psychiatric assessments rely on patient self-reports, clinician observations, and standardized scales; objective technological tools are not yet reliable enough for clinical use. Voice could serve as a biomarker in several scenarios, including differential diagnosis, assessment of symptom severity, and prediction of suicidality. However, its use depends on accurate automatic speech recognition (ASR). Current gold-standard open-source ASR systems are trained mainly on adult speech and perform poorly on children's speech, which limits their application in pediatric psychiatry.

Methods: We benchmarked two open-source ASR models, NVIDIA Parakeet and Whisper-small, on the Ohio Child Speech Corpus (303 children, ages 4-9), using the reference human transcripts provided with the dataset. Audio was standardized to each model's expected sampling rate. No model fine-tuning or adaptation was performed. For each utterance, we computed word error rate (WER) and character error rate (CER), and assessed semantic fidelity using Sentence Mover's Distance (SMD) and BERTScore F1. Metrics were summarized overall, stratified by single-year age bins (4, 5, 6, 7, 8, 9), and also grouped into two broader categories: younger children (ages 4-6) and older children (ages 7-9). We compared WER, CER, SMD, and BERTScore F1 between the two age groups and evaluated age trends using nonparametric statistical tests.

Results: Both models showed significant age effects: younger children had markedly higher word error rates (WER >40%) and character error rates (CER >30%) than older children (WER ~30%, CER ~20%). Sentence Mover's Distance improved with age, while BERTScore F1 remained stable. Despite age-related improvements, overall transcription accuracy was low.

Discussion: Commonly used open-source ASR systems are currently inadequate for pediatric audio transcription, particularly in younger children. To build clinically translatable tools, collecting child-specific data and fine-tuning models through structured speech paradigms are essential.
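The two error-rate metrics named in the Methods can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not say how text was normalized or which library was used, so the lowercasing, the whitespace tokenization, and the helper names `edit_distance`, `wer`, `cer`, and `age_group` are all assumptions made for illustration. WER is the word-level edit distance divided by the number of reference words, and CER is the same ratio at the character level.

```python
from typing import Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,         # deletion from the reference
                            curr[j - 1] + 1,     # insertion into the reference
                            prev[j - 1] + cost)) # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate; lowercasing and whitespace splitting are assumptions."""
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference transcript is empty")
    return edit_distance(ref_words, hyp_words) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate over lowercased text, spaces included (assumption)."""
    ref_chars = list(reference.lower())
    hyp_chars = list(hypothesis.lower())
    if not ref_chars:
        raise ValueError("reference transcript is empty")
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


def age_group(age: int) -> str:
    """Map an age in years to the two broad groups described in the abstract."""
    if 4 <= age <= 6:
        return "younger (4-6)"
    if 7 <= age <= 9:
        return "older (7-9)"
    raise ValueError("age outside the 4-9 range covered by the corpus")


# Made-up utterance pair, not taken from the corpus.
reference = "the cat sat on the mat"
hypothesis = "the cat sit on mat"
example_wer = wer(reference, hypothesis)  # 1 substitution + 1 deletion over 6 words
example_cer = cer(reference, hypothesis)  # 1 substitution + 4 deletions over 22 characters
```

Because both rates divide by the reference length, a hypothesis with many insertions can produce a value above 1.0, so per-utterance scores are not bounded percentages.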

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Translational Psychiatry: 219 papers in training set; Top 0.2%; 14.6%
2. Psychiatry Research: 35 papers in training set; Top 0.1%; 14.3%
3. Frontiers in Psychiatry: 83 papers in training set; Top 0.6%; 6.3%
4. Acta Psychiatrica Scandinavica: 10 papers in training set; Top 0.1%; 6.3%
5. PLOS ONE: 4510 papers in training set; Top 33%; 4.6%
6. Frontiers in Digital Health: 20 papers in training set; Top 0.2%; 4.3%
50% of probability mass above
7. Journal of Medical Internet Research: 85 papers in training set; Top 2%; 3.2%
8. Journal of Speech, Language, and Hearing Research: 10 papers in training set; Top 0.1%; 3.1%
9. Schizophrenia Bulletin: 29 papers in training set; Top 0.3%; 2.4%
10. Schizophrenia Research: 29 papers in training set; Top 0.3%; 2.3%
11. Acta Neuropsychiatrica: 12 papers in training set; Top 0.3%; 2.3%
12. Schizophrenia: 19 papers in training set; Top 0.2%; 2.1%
13. European Psychiatry: 10 papers in training set; Top 0.3%; 1.7%
14. The British Journal of Psychiatry: 21 papers in training set; Top 0.6%; 1.7%
15. BioData Mining: 15 papers in training set; Top 0.4%; 1.5%
16. Scientific Reports: 3102 papers in training set; Top 64%; 1.3%
17. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics: 22 papers in training set; Top 0.3%; 1.2%
18. NeuroImage: Clinical: 132 papers in training set; Top 3%; 1.2%
19. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging: 62 papers in training set; Top 1%; 1.2%
20. JMIR Formative Research: 32 papers in training set; Top 1%; 0.9%
21. IEEE Journal of Biomedical and Health Informatics: 34 papers in training set; Top 2%; 0.8%
22. Molecular Psychiatry: 242 papers in training set; Top 3%; 0.7%
23. Psychological Medicine: 74 papers in training set; Top 2%; 0.7%
24. Psychiatry and Clinical Neurosciences: 11 papers in training set; Top 0.4%; 0.7%
25. Journal of Alzheimer’s Disease: 39 papers in training set; Top 1%; 0.7%
26. Contemporary Clinical Trials Communications: 11 papers in training set; Top 0.8%; 0.6%
27. BJPsych Open: 25 papers in training set; Top 0.8%; 0.6%
28. JAMA Network Open: 127 papers in training set; Top 5%; 0.6%
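
The statement that the top 6 journals account for 50% of the predicted probability mass can be checked with a short cumulative sum over the probabilities listed above. This is only a sketch: the values are the rounded percentages shown on the page, the site presumably works from unrounded values, and the function name `journals_to_reach` is hypothetical.

```python
from itertools import accumulate

# Displayed probabilities (percent) for the ten highest-ranked journals, in rank order.
probabilities = [14.6, 14.3, 6.3, 6.3, 4.6, 4.3, 3.2, 3.1, 2.4, 2.3]


def journals_to_reach(mass_percent: float, probs: list) -> int:
    """Return the smallest rank whose cumulative probability reaches mass_percent."""
    for rank, total in enumerate(accumulate(probs), start=1):
        if total >= mass_percent:
            return rank
    raise ValueError("listed probabilities never reach the requested mass")


top_k = journals_to_reach(50.0, probabilities)
```

With the displayed values, the first five journals sum to about 46.1% and the first six to about 50.4%, which is consistent with the cutoff falling after rank 6.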