Back

Dual pathway architecture in songbirds enables robust sensorimotor learning

Sankar, R.; Suryawanshi, A.; Rougier, N. P.; Leblois, A.

2026-05-08 neuroscience
10.64898/2026.05.07.723469 bioRxiv
Show abstract

The acquisition of sensorimotor skills critically depends on basal ganglia (BG)-thalamo-cortical circuits. Prevailing theories propose that the BG optimize motor output through reinforcement learning (RL), using internal performance evaluations to approximate stochastic gradient ascent. However, this framework struggles in non-convex performance landscapes, where local optima hinder efficient learning. Songbirds provide a compelling biological example of robust sensorimotor learning, mastering complex vocalizations through trial-and-error within a specialized BG-thalamo-cortical architecture. Here, we present a computational model constrained by the anatomy, physiology, and developmental trajectory of the zebra finch song system. The model combines a BG-driven RL pathway with a parallel cortical motor pathway that progressively consolidates successful motor patterns via Hebbian plasticity. In addition, we incorporate synaptic volatility within the BG pathway, introducing structured variability across learning. Through simulations of vocal learning using both a biophysical syrinx model and synthetic performance landscapes, we demonstrate that this dual-pathway architecture reliably converges to global optima and outperforms standard and noise-annealed RL approaches. The model reproduces key experimental features of song learning, including non-monotonic learning trajectories, a gradual reduction in motor variability, and the developmental transfer of motor control from subcortical to cortical circuits. Mechanistically, delayed maturation of the cortical pathway provides an implicit regulation of the exploration-exploitation trade-off, while synaptic volatility enables escape from local optima. These results highlight the importance of neural circuit architecture and dynamics in efficient learning, and suggest biologically inspired design principles for improving the robustness and sample efficiency of artificial RL systems in complex sensorimotor domains.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 2%
18.2%
2
PLOS Computational Biology
1633 papers in training set
Top 1%
17.1%
3
Nature Communications
4913 papers in training set
Top 15%
12.2%
4
eLife
5422 papers in training set
Top 4%
12.1%
50% of probability mass above
5
Frontiers in Computational Neuroscience
53 papers in training set
Top 0.7%
3.6%
6
Cell Reports
1338 papers in training set
Top 16%
3.5%
7
Science Advances
1098 papers in training set
Top 13%
2.0%
8
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 2%
2.0%
9
Nature Neuroscience
216 papers in training set
Top 4%
2.0%
10
Neuron
282 papers in training set
Top 5%
1.8%
11
Proceedings of the Royal Society B: Biological Sciences
341 papers in training set
Top 4%
1.7%
12
PRX Life
34 papers in training set
Top 0.4%
1.7%
13
The Journal of Neuroscience
928 papers in training set
Top 6%
1.7%
14
Scientific Reports
3102 papers in training set
Top 59%
1.7%
15
Science
429 papers in training set
Top 15%
1.6%
16
iScience
1063 papers in training set
Top 22%
1.2%
17
PNAS Nexus
147 papers in training set
Top 0.9%
0.9%
18
Current Biology
596 papers in training set
Top 13%
0.9%
19
Physical Review X
23 papers in training set
Top 0.5%
0.9%
20
Communications Biology
886 papers in training set
Top 25%
0.7%
21
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.7%
22
Nature Human Behaviour
85 papers in training set
Top 5%
0.6%