Back

A standardized naturalistic audio stimuli database with unsupervised labeling

Al-Naji, A.; Schubotz, R. I.; Zahedi, A.

2026-04-21 neuroscience
10.64898/2026.04.16.718910 bioRxiv
Show abstract

Research in cognitive neuroscience has relied on simple, highly controlled stimuli due to the difficulty in developing standardized, ecologically valid stimulus sets. However, there is a consensus that using ecologically valid stimuli is imperative to generalize results beyond controlled laboratory settings. The current study introduces a naturalistic audio stimulus database, consisting of short, recognizable, and emotionally rated stimuli. To create such a database, the current study collected 291 audio files from a wide range of sources. 361 participants rated the audio clips on emotionality, arousal, and recognizability, and subsequently freely described the audios by typing what they believed the sound to be. The text responses of the participants were embedded and clustered using an unsupervised machine-learning algorithm to derive a participant-grounded organization of auditory object categories. The results indicate audio clips were easily recognizable, while emotionality and arousal ratings showed broad variability, making the database suitable for diverse experimental needs. Furthermore, the final database comprises 10 distinct semantic categories, providing a diverse set of auditory stimuli.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 0.7%
18.9%
2
PLOS ONE
4510 papers in training set
Top 12%
14.9%
3
Scientific Data
174 papers in training set
Top 0.1%
12.7%
4
Frontiers in Neuroscience
223 papers in training set
Top 0.3%
7.3%
50% of probability mass above
5
Frontiers in Human Neuroscience
67 papers in training set
Top 0.3%
4.4%
6
Trends in Hearing
12 papers in training set
Top 0.1%
3.6%
7
The Journal of the Acoustical Society of America
33 papers in training set
Top 0.1%
2.1%
8
PLOS Computational Biology
1633 papers in training set
Top 16%
1.7%
9
European Journal of Neuroscience
168 papers in training set
Top 0.4%
1.7%
10
NeuroImage
813 papers in training set
Top 4%
1.5%
11
eneuro
389 papers in training set
Top 6%
1.5%
12
Hearing Research
49 papers in training set
Top 0.2%
1.4%
13
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 4%
1.2%
14
Behavior Research Methods
25 papers in training set
Top 0.1%
1.2%
15
Frontiers in Psychology
49 papers in training set
Top 1.0%
0.9%
16
Journal of Neural Engineering
197 papers in training set
Top 2%
0.9%
17
Heliyon
146 papers in training set
Top 6%
0.8%
18
Advanced Science
249 papers in training set
Top 19%
0.8%
19
The Journal of Neuroscience
928 papers in training set
Top 8%
0.8%
20
Journal of Neuroscience Methods
106 papers in training set
Top 2%
0.8%
21
Neuroscience
88 papers in training set
Top 3%
0.8%
22
Nature Human Behaviour
85 papers in training set
Top 4%
0.8%
23
Journal of Cognitive Neuroscience
119 papers in training set
Top 2%
0.7%
24
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
25
Sensors
39 papers in training set
Top 2%
0.7%
26
Ear & Hearing
15 papers in training set
Top 0.3%
0.5%
27
IEEE Transactions on Neural Systems and Rehabilitation Engineering
40 papers in training set
Top 0.7%
0.5%
28
iScience
1063 papers in training set
Top 40%
0.5%
29
Neuroscience Letters
28 papers in training set
Top 2%
0.5%
30
BioData Mining
15 papers in training set
Top 1%
0.5%