From sound to source: Human and model recognition of environmental sounds

Alavilli, S.; McDermott, J. H.

2026-03-14 neuroscience
10.64898/2026.03.12.711349 bioRxiv

Our ability to recognize sound sources in the world is critical to daily life, but is not well documented or understood in computational terms. We developed a large-scale behavioral benchmark of human environmental sound recognition, built stimulus-computable models of sound recognition, and used the benchmark to compare models to humans. The behavioral benchmark measured how sound recognition varied across source categories, audio distortions, and concurrent sound sources, all of which influenced recognition performance in humans. Artificial neural network models trained to recognize sounds in multi-source scenes reached near-human accuracy and qualitatively matched human patterns of performance in many conditions. By contrast, traditional models of the cochlea and auditory cortex that were trained to recognize sounds produced worse matches to human performance. Models trained on larger datasets exhibited stronger alignment with both human behavior and brain responses. The results suggest that many aspects of human sound recognition emerge in systems optimized for the problem of real-world recognition. The benchmark results set the stage for future explorations of auditory scene perception involving salience and attention.

Matching journals

The top 6 journals together account for just over 50% of the predicted probability mass (52.8% combined).

1. PLOS Computational Biology (1633 papers in training set; Top 1%): 18.3%
2. Scientific Reports (3102 papers in training set; Top 7%): 9.9%
3. Frontiers in Neuroscience (223 papers in training set; Top 0.2%): 9.0%
4. PLOS ONE (4510 papers in training set; Top 26%): 6.7%
5. The Journal of the Acoustical Society of America (33 papers in training set; Top 0.1%): 4.8%
6. The Journal of Neuroscience (928 papers in training set; Top 3%): 4.1%

50% of probability mass above

7. eNeuro (389 papers in training set; Top 2%): 3.9%
8. Philosophical Transactions of the Royal Society B (51 papers in training set; Top 1%): 3.6%
9. Nature Communications (4913 papers in training set; Top 41%): 3.5%
10. Proceedings of the National Academy of Sciences (2130 papers in training set; Top 21%): 3.5%
11. Trends in Hearing (12 papers in training set; Top 0.1%): 3.2%
12. Nature Human Behaviour (85 papers in training set; Top 1%): 3.0%
13. eLife (5422 papers in training set; Top 31%): 2.7%
14. NeuroImage (813 papers in training set; Top 4%): 1.9%
15. Journal of the Association for Research in Otolaryngology (11 papers in training set; Top 0.1%): 1.3%
16. Frontiers in Computational Neuroscience (53 papers in training set; Top 2%): 1.2%
17. Hearing Research (49 papers in training set; Top 0.3%): 0.9%
18. iScience (1063 papers in training set; Top 25%): 0.9%
19. Journal of Cognitive Neuroscience (119 papers in training set; Top 1%): 0.7%
20. Neural Computation (36 papers in training set; Top 0.7%): 0.7%
21. Science Advances (1098 papers in training set; Top 30%): 0.7%
22. Nature Machine Intelligence (61 papers in training set; Top 4%): 0.7%
23. Journal of Neural Engineering (197 papers in training set; Top 2%): 0.6%
24. Communications Biology (886 papers in training set; Top 30%): 0.6%