Back

Evaluating Few-Shot Prompting for Spectrogram-Based Lung Sound Classification Using a Multimodal Language Model

2025-07-28 respiratory medicine Title + abstract only
View on medRxiv
Show abstract

IntroductionTraditional deep learning models for lung sound analysis require large, labeled datasets; multimodal LLMs may offer a flexible, prompt-based alternative. This study aimed to evaluate the utility of a general-purpose multimodal LLM, GPT-4o, for lung sound classification from mel-spectrograms and assess whether a few-shot prompt approach improves performance over zero-shot prompting. MethodsUsing the ICBHI 2017 Respiratory Sound Database, 6898 annotated respiratory cycles were convert...

Predicted journal destinations