Back

Conversational Speech for Respiratory Triage in Primary Care: A Pilot Study

Ravi, V.; Noufi, C.

2026-06-11 respiratory medicine
10.64898/2026.06.09.26355284 medRxiv
Show abstract

Background. Respiratory complaints account for a substantial share of adult ambulatory care visits, and triaging them accurately has direct consequences for antibiotic stewardship and pathogen-specific therapy. Prior work has investigated voice as a triage signal, but that literature is dominated by single-condition detection from scripted speech in crowdsourced or controlled clinical settings and has not been evaluated at primary care scale on conversational ambient audio. Methods. A dataset of 514,377 ambient-recorded primary care visits from 379,225 adult patients at a US clinic network was used, with per-visit clinically assigned ICD-10 diagnosis codes and de-identified demographic and geographic metadata. Patient audio was extracted from each doctor-patient conversation, and spectral, voice quality, and prosodic features were computed. Eleven binary classification tasks were defined, aligned with a respiratory triage cascade (e.g., acute respiratory versus acute non-respiratory illness, and lower versus upper respiratory tract infection). An acoustic model (feed-forward network) was trained independently for each task using patient-stratified five-fold cross-validation and evaluated on a held-out test set. Each task's model was also compared against six non-acoustic baselines using a single demographic, geographic, or temporal variable. The 11 trained classifiers were composed into a hierarchical cascade and illustrated as case studies on selected patients. Results. Test-set AUC across the 11 tasks ranged from 0.602 (95% CI: 0.588-0.614) to 0.745 (95% CI: 0.742-0.748), with a mean expected calibration error of 0.018. Six of eleven binaries outperformed all confounder baselines. Four binaries showed median within-stratum AUC of 0.62-0.70 when the confounder was held fixed, indicating acoustic discrimination beyond what the confounder alone explains. The exception was the pneumonia versus non-pneumonia lower respiratory tract infection binary, which failed against the patient-city confounder baseline, plausibly reflecting a clinic-level difference in ICD-10 coding. Conclusion. Conversational primary care audio carries acoustic signal that discriminates clinically meaningful respiratory contrasts. Absolute performance is moderate, but the conditions are stricter than prior work: conversational speech and differential-diagnosis contrasts among sick patients. This pilot study is a baseline for voice-based clinical AI moving beyond sick-versus-healthy detection toward differential-diagnosis panels and a proof-of-concept for hierarchical reasoning.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 0.2%
23.1%
2
Journal of Medical Internet Research
85 papers in training set
Top 0.2%
13.0%
3
npj Digital Medicine
97 papers in training set
Top 0.8%
6.5%
4
ERJ Open Research
44 papers in training set
Top 0.2%
5.0%
5
PLOS ONE
4510 papers in training set
Top 32%
4.4%
50% of probability mass above
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 19%
3.7%
7
PLOS Digital Health
91 papers in training set
Top 0.8%
3.3%
8
European Respiratory Journal
54 papers in training set
Top 0.6%
2.7%
9
Royal Society Open Science
193 papers in training set
Top 1%
2.5%
10
Computers in Biology and Medicine
120 papers in training set
Top 1%
2.4%
11
JMIR Medical Informatics
17 papers in training set
Top 0.6%
1.9%
12
iScience
1063 papers in training set
Top 11%
1.9%
13
IEEE Access
31 papers in training set
Top 0.3%
1.9%
14
Critical Care Explorations
15 papers in training set
Top 0.2%
1.7%
15
BMJ Open
554 papers in training set
Top 10%
1.3%
16
Frontiers in Digital Health
20 papers in training set
Top 1%
1.0%
17
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.9%
1.0%
18
JMIRx Med
31 papers in training set
Top 1%
0.9%
19
BJGP Open
12 papers in training set
Top 0.5%
0.9%
20
JMIR Formative Research
32 papers in training set
Top 1%
0.9%
21
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 5%
0.8%
22
eBioMedicine
130 papers in training set
Top 4%
0.8%
23
Communications Medicine
85 papers in training set
Top 1%
0.8%
24
Life
27 papers in training set
Top 0.3%
0.8%
25
International Journal of Epidemiology
74 papers in training set
Top 3%
0.7%
26
BMJ Open Respiratory Research
32 papers in training set
Top 0.7%
0.7%
27
Frontiers in Medicine
113 papers in training set
Top 7%
0.7%
28
Infection
15 papers in training set
Top 0.5%
0.5%
29
Respiratory Research
19 papers in training set
Top 0.7%
0.5%
30
Journal of Translational Medicine
46 papers in training set
Top 4%
0.5%