Back

Sample size dependence of Occam's razor in human decision-making

Rinaldi, F. G.; Piasini, E.

2026-01-27 neuroscience
10.64898/2026.01.25.701581 bioRxiv
Show abstract

To make sense of a noisy world, living beings constantly face decisions between competing interpretations for ambiguous sensory data. This process parallels statistical model selection, where most frameworks, like the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC), are based on a trade-off between a models goodness-of-fit and its complexity. The same tradeoff has been observed in humans. However, a core tenet of normative frameworks is that the trade-off should depend on the sample size (N); as more data becomes available, the goodness-of-fit grows faster than the complexity penalty, weakening the overall bias towards simplicity. It is unknown whether humans also conform to this scaling principle, and if so, whether it arises from an optimal computation or a simpler heuristic. Here, we investigate this question using a preregistered visual task where subjects inferred the number of latent Gaussian sources generating clusters of data-points, and where the number of points (N) presented on each trial is varied systematically. We use three kinds of model to describe their behavior: a model with linear scaling of evidence in N (as in BIC and AIC), a model with no scaling, and a model with sub-linear scaling inspired by known biases in numerosity perception. Our results demonstrate that the normative, linear scaling model provides the worst account of human behavior. Instead, we find strong evidence for a sub-linear scaling of sample size. By inferring the shape of this scaling with Gaussian Processes, we reveal a distinct logarithmic trend for smaller N, and a flattening for higher values, both consistent with numerosity perception biases. This finding suggests that, when selecting for competing explanations for sensory data, humans do not perform a principled computation but rather employ a more efficient heuristic that repurposes lower perceptual mechanisms to dynamically weight evidence against model complexity.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
PLOS Computational Biology
1633 papers in training set
Top 0.3%
28.7%
2
Journal of Vision
92 papers in training set
Top 0.1%
10.5%
3
Nature Communications
4913 papers in training set
Top 21%
8.7%
4
Scientific Reports
3102 papers in training set
Top 11%
7.4%
50% of probability mass above
5
The Journal of Neuroscience
928 papers in training set
Top 2%
5.0%
6
eLife
5422 papers in training set
Top 19%
4.5%
7
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 17%
4.1%
8
PLOS ONE
4510 papers in training set
Top 43%
2.8%
9
Nature Human Behaviour
85 papers in training set
Top 1%
2.5%
10
eneuro
389 papers in training set
Top 4%
2.2%
11
Cognition
44 papers in training set
Top 0.2%
2.2%
12
Frontiers in Neuroscience
223 papers in training set
Top 3%
1.8%
13
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 3%
1.8%
14
Communications Biology
886 papers in training set
Top 13%
1.3%
15
Science Advances
1098 papers in training set
Top 22%
1.3%
16
Neuroscience of Consciousness
12 papers in training set
Top 0.2%
1.1%
17
Journal of Neurophysiology
263 papers in training set
Top 0.6%
1.0%
18
Current Biology
596 papers in training set
Top 12%
1.0%
19
NeuroImage
813 papers in training set
Top 5%
0.9%
20
PLOS Biology
408 papers in training set
Top 16%
0.9%
21
Communications Psychology
20 papers in training set
Top 0.3%
0.8%
22
Neural Networks
32 papers in training set
Top 0.8%
0.8%
23
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.7%
24
Vision Research
26 papers in training set
Top 0.3%
0.5%