Cognition
Elsevier BV
Preprints posted in the last 30 days, ranked by how well they match Cognition's content profile, based on 44 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.
Zyryanov, A.; Pierz, V.; Oganian, Y.
Humans comprehend language incrementally, updating the representation of sentence meaning with each incoming word. These updates are guided by the distance between each perceived word and prior expectations--the prediction error. The alignment between large language models (LLMs) and cortical activity inspires the hypothesis that the cortical computation of prediction error is Surface-based, driven by statistical patterns of word form co-occurrence. In contrast, psycholinguistic models propose that prediction error computation is Meaning-based, driven by word semantics. We used polysemic words with ambiguous semantics to distinguish these models: ambiguity would introduce uncertainty into meaning representations and hence the prediction error, if Meaning-based, but would not affect the prediction error, if Surface-based. We examined how ambiguity influenced prediction error signatures in self-paced reading times and magnetoencephalographic (MEG) neural responses during sentence processing. While an LLM-based proxy of prediction error robustly predicted reading times and neural responses to unambiguous words, it failed to predict either under ambiguity. That is, prediction error computation was altered by uncertainty in word meaning, which supports the Meaning-based model and corroborates the essential role of word meaning in predictive language processing. Our findings highlight an important limitation of LLMs as in silico models of the human language faculty.
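The abstract does not say which LLM-based proxy of prediction error was used. A common choice in this literature is surprisal, the negative log probability of a word given its context. The sketch below assumes that choice; the next-word probabilities are hypothetical values, not model output.

```python
import math


def surprisal_bits(prob: float) -> float:
    """Surprisal in bits of a word with conditional probability `prob`."""
    if not 0.0 < prob <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(prob)


# Hypothetical next-word probabilities after some sentence context
# (illustrative numbers only, not taken from any real model).
next_word_probs = {"river": 0.5, "bank": 0.25, "cloud": 0.125, "idea": 0.125}

# Less expected words carry higher surprisal, i.e. a larger prediction error.
surprisals = {word: surprisal_bits(p) for word, p in next_word_probs.items()}

for word, bits in surprisals.items():
    print(f"{word}: {bits:.2f} bits")
```

Note that this quantity depends only on the probability of the word form, which is why it serves as a Surface-based measure: it has no term for uncertainty about which meaning of a polysemic word is intended.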
Zylberberg, A.; Alvarez Heduan, F.
We study how confidence in perceptual decisions depends on whether it is communicated verbally (e.g., "very likely") or numerically (e.g., "80% certainty"). We find that verbal expressions more reliably distinguish correct from incorrect choices than numerical reports, challenging the common assumption that numerical probabilities provide more precise representations of uncertainty. Additionally, in a dyadic decision-making task in which participants can revise their initial reports based on a partner's choice and expressed confidence, verbal and numerical reports are equally effective in supporting accurate revisions of initial judgments. Together, these results underscore the effectiveness of verbal expressions as a means of conveying decision confidence.
Shalu, S.; Muralikrishnan, R.; Schlesewsky, M.; Bornkessel-Schlesewsky, I.; Choudhary, K. K.
The present study examined whether thematic reversal anomalies (TRAs) are processed similarly across subject and object experiencer constructions in Malayalam. Event-related brain potentials (ERPs) were recorded as 30 first-language speakers of Malayalam read transitive sentences with the two types of experiencer verbs, in which the thematic role assignment for the preceding arguments was either correct or reversed. The reversal anomaly became apparent only at the position of the experiencer verb. A linear mixed-models analysis confirmed a biphasic N400-P600 effect at the verb for both verb types when the argument roles were reversed. Thus, our results suggest a uniform processing strategy for TRAs irrespective of the type of experiencer verb involved. However, the N400 amplitude was larger for object experiencer verbs than for subject experiencer verbs. We suggest that the quantitative difference observed for object experiencer verbs is due to the inverse linking of grammatical function and thematic roles associated with these verbs. In other words, verb-specific linking properties modulate the processing of TRAs involving object experiencer verbs. We argue that this modulation occurs because the parser recalibrates cue weighting when the expected form-to-meaning mappings are overridden by the inverse linking properties of object experiencer verbs.
Colak, H.; Benzaquen, E.; Guo, X.; Lad, M.; Sedley, W.; Griffiths, T. D.
Understanding speech in noisy environments (SPIN) is an important everyday ability, and engaging in musical activities has been proposed as a factor that may support this ability. However, the cognitive mechanisms underlying a potential musical advantage in SPIN perception remain unclear. Here we investigated whether musical sophistication is associated with better SPIN perception in a large population-based sample, and whether this relationship is mediated by auditory working memory (AWM), verbal working memory (VWM), or non-verbal intelligence. We recruited 203 participants and measured SPIN perception at both word and sentence levels. Musical sophistication was assessed using the Goldsmiths Musical Sophistication Index (Gold-MSI). AWM was measured using delayed matching of tone frequency or the modulation rate of amplitude modulated white noise, VWM was based on backward digit span task, and non-verbal intelligence used matrix reasoning. Mediation analyses revealed that AWM fully mediated the relationship between musical sophistication and SPIN perception, whereas VWM showed no mediation effect. Non-verbal intelligence showed a partial mediating effect. Additional control analyses using structural equation modelling revealed that the indirect effect through AWM remained significant after accounting for age, hearing thresholds, and non-verbal intelligence. Together, these findings suggest that individuals with greater musical sophistication demonstrate better daily life listening abilities, and that superior auditory working memory may be the key cognitive mechanism underlying this advantage.
Nakao, A.; Yamada, N.; Wakatsuki, T.
Internal forward models predict the sensory consequences of motor commands; however, whether the anticipated availability of post-action feedback contributes to the precision of the action itself remains unknown. We manipulated the predictability of post-release visual occlusion in skilled basketball players. Participants performed three-point shots while wearing liquid-crystal shutter goggles. The study tested three conditions: a no-occlusion baseline, a certain-occlusion condition in which players knew that their vision would be occluded at ball release in every trial, and a random-occlusion condition in which they could not predict whether an occlusion would occur. Shooting accuracy declined in the certain-occlusion condition relative to the no-occlusion condition (41.7% vs 49.2%). The random-occlusion condition (46.1%) did not differ from the baseline. Within the random condition, the accuracy in occluded trials was virtually identical to that in non-occluded trials (46.6% vs 46.2%), even though the immediate visual occlusion was the same as in the certain-occlusion condition. These results demonstrate that it is not the absence of post-action information per se that disrupts motor execution, but the prior certainty that action consequences will be unavailable. We interpret this finding as a prospective influence of anticipated consequence loss, whereby motor execution depends on whether the prediction-outcome loop remains closable.
Mugleston, J. D.; Huang, S.-M.; Dahl, C. D.
Human pointing is often used to test whether dogs extract object-specific information from human communicative cues. However, above-chance responses in standard object-choice tasks do not by themselves distinguish between a referential interpretation, in which the gesture identifies a specific target, and an attentional interpretation, in which it primarily biases behaviour toward a broader spatial region. We addressed this issue using an asymmetric six-cup arrangement designed to separate coarse side guidance from exact cup localisation more clearly than a symmetric multi-cup design. Performance in domestic dogs was analysed using three measures: the probability of reaching the correct side, the probability of choosing the correct cup overall, and the probability of choosing the correct cup conditional on having first reached the correct side. The principal comparison involved three matched trial classes: the symmetric 3-vs-3 condition, 2-vs-4 trials with the baited cup on the 2-cup side, and 2-vs-4 trials with the baited cup on the 4-cup side. Descriptively, pointing trials exceeded matched no-point control trials more clearly for side selection than for overall cup choice. The clearest condition effect was observed at the level of side guidance. Dogs were most likely to reach the correct side when the baited cup was located on the 4-cup side of the unequal arrangement. Mixed-effects models confirmed a reliable group effect for side accuracy, whereas overall cup accuracy showed only a weaker and less robust condition effect, and within-side localisation revealed no reliable group difference once condition-specific chance baselines were taken into account. A complementary generative model comparison converged on the same conclusion: a referential-only model fit poorly, an attention-only model captured most of the grouped outcome structure, and a combined model yielded only a modest improvement. 
Dog point-following is therefore best understood as a layered process dominated by attentional guidance, with only limited additional target-specific localisation.
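The three performance measures described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' analysis: the `Trial` record, the example trials, and the chance baseline (which assumes a uniformly random choice among the cups on a side) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Trial:
    side_size: int      # cups on the baited side (2, 3, or 4 in a six-cup layout)
    correct_side: bool  # dog reached the side holding the baited cup
    correct_cup: bool   # dog chose the baited cup itself


def accuracy_measures(trials):
    """Return (P(correct side), P(correct cup), P(correct cup | correct side))."""
    n = len(trials)
    side_hits = [t for t in trials if t.correct_side]
    p_side = len(side_hits) / n
    p_cup = sum(t.correct_cup for t in trials) / n
    # Conditional accuracy is only defined when at least one trial reached the side.
    p_cup_given_side = (
        sum(t.correct_cup for t in side_hits) / len(side_hits)
        if side_hits
        else float("nan")
    )
    return p_side, p_cup, p_cup_given_side


def chance_within_side(side_size: int) -> float:
    """Assumed chance level for picking the baited cup once on the correct side."""
    return 1.0 / side_size


# Hypothetical trials with the baited cup on the 4-cup side.
trials = [
    Trial(4, True, True),
    Trial(4, True, False),
    Trial(4, True, False),
    Trial(4, False, False),
]
p_side, p_cup, p_cond = accuracy_measures(trials)
print(p_side, p_cup, p_cond, chance_within_side(4))
```

Separating the conditional measure from overall cup accuracy matters because the within-side chance level differs by condition (1/2 on a 2-cup side, 1/4 on a 4-cup side under the uniform assumption), so raw cup accuracy alone cannot distinguish side guidance from exact localisation.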
Lipinska, A.; Ciupinska, K.; Rutiku, R.
Visual working memory (vWM) is often linked to conscious experience and visual imagery, but it is typically described as a system that stores separate, independent items. These assumptions are difficult to reconcile, given the unified nature of conscious experience. Here, we test the hypothesis that vWM relies on at least two distinct representations: an underlying, unconscious memory trace and a consciously accessible, integrated representation. A total of 216 participants performed a change-detection task, in which they rated their perceptual awareness of the memory display during the maintenance interval. Critically, we manipulated the statistical properties of the displays (average item size and size variability) to probe sensitivity to unified ensemble-level structure. Results revealed a dissociation between subjective and objective measures. Perceptual awareness increased for displays with larger, more variable items, whereas objective performance improved for displays with smaller, less variable items. Despite this difference, subjective awareness still predicted performance, and even incorrect responses showed consistent biases rather than random guesses. Importantly, individual differences in imagery vividness (VVIQ) were selectively associated with subjective awareness and estimation bias, but not with objective correctness. These precision biases were further shaped by display statistics, suggesting that multiple representations can guide behavior. Together, our findings support a reinterpretation of vWM performance in which task responses can draw on both unconscious and consciously accessible representations. One possible explanation for these behavioral patterns is that subjective experience reflects integrated, ensemble-like representations, while objective performance depends more strongly on item-specific information. 
Public significance statements: Working memory allows us to temporarily hold and use information, and differences in this ability are closely linked to broader cognitive skills such as intelligence. This study shows that these differences may not depend only on how much information people can store, but also on how they experience it: some individuals appear to rely more on consciously accessible, image-like representations, especially when memory is uncertain or prone to error. By demonstrating that subjective experience and the vividness of imagery can shape behavior independently of objective accuracy, these findings suggest that how we use memory may be as important as how much we can store, with implications for understanding individual differences in cognition.
Mori, K.; Yamada, M.
The willingness to exert cognitive effort is essential but is constrained by the subjective cost of effort. Although effortful tasks are often avoided, positive bias about one's own performance may help sustain engagement with cognitive demands. Here, participants completed an effort-based decision-making task and reported trial-by-trial predictions of their own performance, allowing us to quantify performance prediction error (PPE) as the discrepancy between subjective and objective accuracy. The results showed that PPE was predominantly positive and increased with effort level, indicating greater overestimation under higher cognitive demands. Using a computational model, we show that choices were best explained by a learning model in which rewarded trials accompanied by positive PPE decreased subsequent sensitivity to effort. A confidence-based control model did not provide a better account of choices, suggesting that this effect was better captured by positive performance bias than by confidence alone. Our findings provide a computational account of how biased self-evaluation may attenuate the subjective cost of cognitive effort and extend the positive bias literature to tasks that require cognitive effort.
Mahesan, D.; Sharma, K.; Weinerth, M. K.; Dhaka, S.; Meinzer, M.; Fischer, R.
Response inhibition, the ability to suppress contextually inappropriate actions, is a cornerstone of cognitive control and is commonly assessed using paradigms such as the go/no-go task. However, traditional go/no-go paradigms rely on binary outcomes such as commission errors, which offer limited insight into the dynamic, graded behavioral adjustments underlying successful stopping. The present study developed a novel mouse-tracking go/no-go paradigm with a dynamic start to capture inhibitory processes during ongoing execution. Twenty-three healthy young adults completed the task in two sessions separated by approximately one week to evaluate the test-retest reliability of standard behavioral measures (error rates and reaction times), and three kinematic features: path length, mean velocity, and mean acceleration. Results revealed robust differences between go and no-go trials across all measures. Successful inhibition was characterized by significantly shorter path lengths and reduced mean velocity and acceleration compared to go trials. Critically, all measures demonstrated moderate-to-good test-retest reliability across sessions, with intraclass correlation coefficients ranging from .75 to .85 for go trials and from .59 to .83 for no-go trials. These findings establish construct validity and psychometric reliability of the current mouse-tracking go/no-go paradigm. The demonstrated stability of these measures provides the methodological foundation for their use in cross-sectional, longitudinal, and intervention research targeting inhibitory control.
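The three kinematic features named in this abstract (path length, mean velocity, mean acceleration) could be computed from a cursor trajectory roughly as follows. This is a sketch under stated assumptions: uniform sampling every `dt` seconds, speed taken per segment, and acceleration taken as the absolute change in speed between consecutive segments. The abstract does not give the authors' exact definitions.

```python
import math


def kinematics(points, dt):
    """Return (path_length, mean_velocity, mean_acceleration).

    points: list of (x, y) cursor positions sampled every `dt` seconds.
    """
    if len(points) < 2:
        raise ValueError("need at least two samples")
    # Euclidean length of each segment between consecutive samples.
    steps = [math.dist(p, q) for p, q in zip(points, points[1:])]
    path_length = sum(steps)
    # Speed on each segment, then averaged over the trajectory.
    speeds = [s / dt for s in steps]
    mean_velocity = sum(speeds) / len(speeds)
    # Absolute change in speed between consecutive segments (assumed definition).
    accels = [abs(v2 - v1) / dt for v1, v2 in zip(speeds, speeds[1:])]
    mean_acceleration = sum(accels) / len(accels) if accels else 0.0
    return path_length, mean_velocity, mean_acceleration


# Hypothetical trajectory: move, pause for one sample, move again.
trajectory = [(0, 0), (3, 4), (3, 4), (6, 8)]
print(kinematics(trajectory, dt=0.5))
```

Under these definitions a successfully inhibited movement, which stops early, would yield a shorter path and lower mean speed than a completed go movement, matching the direction of the effects reported above.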
Razi, H.; Sambrook, T.; Garrett, N.
Confirmation bias impacts judgments and decisions across a range of domains including finance, policy and science. Here we examine whether explicitly labelling information as true or false disrupts a core underlying computational mechanism that can generate this pervasive bias - asymmetric learning. Human participants (Study 1: N=47; Study 2: N=57) completed a two-alternative forced-choice (2AFC) task previously used to test for the presence of confirmation bias. Participants made choices between pairs of options that could win or lose money and received either factual or counterfactual feedback after each choice. We introduced a key novel feature into the task - providing explicit cues that signalled to participants whether feedback they had seen was true (verified) or false (debunked). Learning in response to feedback was attenuated under false compared to true labels but was present under both. Fitting computational models to participants' choices enabled us to examine how sensitivity to the feedback varied as a function of both the label (true/false) and confirmation (confirmatory/disconfirmatory). This revealed a distinct pattern of learning rates typical of confirmation bias (enhanced learning from positive prediction errors for chosen options and from negative prediction errors for unchosen options) in response to both true and false labels. The findings highlight how confirmation bias plays an important role in the effectiveness of interventions designed to verify true and/or debunk false claims. Verification is less likely to succeed when information disconfirms prior beliefs. Conversely, debunking false claims is unlikely to succeed when the information confirms one's prior beliefs.
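The asymmetric learning pattern described here can be illustrated with a simple value-update rule that uses a larger learning rate for confirmatory prediction errors. This is a minimal sketch, not the authors' fitted model: the function name and the learning-rate values are hypothetical, and a fuller version would presumably hold a separate pair of rates for true and false labels.

```python
def update_value(q, outcome, chosen, lr_confirm, lr_disconfirm):
    """Update an option's value estimate with an asymmetric learning rate.

    A prediction error counts as confirmatory when it is positive for the
    chosen option or negative for the unchosen option.
    """
    delta = outcome - q  # prediction error
    confirmatory = (delta > 0) if chosen else (delta < 0)
    lr = lr_confirm if confirmatory else lr_disconfirm
    return q + lr * delta


# Hypothetical learning rates: confirmatory evidence is weighted more heavily.
LR_CONFIRM, LR_DISCONFIRM = 0.5, 0.25

# Chosen option wins: confirmatory, large update.
print(update_value(0.0, 1.0, True, LR_CONFIRM, LR_DISCONFIRM))
# Chosen option loses: disconfirmatory, small update.
print(update_value(0.0, -1.0, True, LR_CONFIRM, LR_DISCONFIRM))
# Unchosen option loses (counterfactual feedback): confirmatory, large update.
print(update_value(0.0, -1.0, False, LR_CONFIRM, LR_DISCONFIRM))
```

Over many trials this rule inflates the value of chosen options relative to unchosen ones, which is how asymmetric learning can generate confirmation bias.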
Shurygina, O.; Wirth, L. A.; Rolfs, M.; Ohl, S.
Saccades made during memory maintenance prioritize memory for the saccade target, but it is unclear if this benefit is specific to a location or extends across memorized objects. In three experiments, we examined whether saccadic selection spreads to other locations within the same object. In Experiment 1, we asked observers to remember three oriented Gabors presented either within contour-defined objects or without object structure. A subsequent movement cue prompted observers to move their eyes to the indicated location. We then probed memory for stimuli at locations equidistant from the saccade target, in either the same or a different object. Memory was best for stimuli at locations congruent with the saccade target, and consistently weaker for other stimuli presented in the same or a different object than the saccade target. In Experiment 2, we created more complex objects by adding more object features to the stimulus. Again, memory performance was best for stimuli congruent with the saccade target location, whereas memory in incongruent trials was worse and similar for stimuli in the same and different object as the saccade target. In Experiment 3, we tested if saccadic selection is present and propagates within the object in a change detection task. Again, memory performance (i.e., change detection) was best at the saccade target location. However, this memory benefit also spread to other locations within the same object. Our results imply that saccadic selection in visual working memory is primarily space-based but can also spread towards locations within the object where a saccade was directed.
Bartling, B. A.
Flow state, characterized by optimal engagement and performance, represents a key concept in understanding human performance and cognitive resource allocation. Grounded in Csikszentmihalyi's and Sherry's flow theory and the Limited Capacity Model of Motivated Mediated Message Processing (LC4MP), this study investigated physiological and neural correlates of flow state during a simulated driving task under different music conditions and difficulty levels. Using a 2 x 3 factorial design with 20 participants, this study examined self-selected versus non-self-selected music across three difficulty levels, testing the relationship between task switching, cognitive resource allocation, and flow state. Physiological measures included heart rate and EEG (alpha/theta power) using a 4-channel Muse 2 headband, alongside a self-report measure of flow experience. Hierarchical linear modeling revealed significant physiological changes during self-selected music: heart rate decreased (β = -5.15, p < .001), while alpha (β = 5829.77, p < .001) and theta power (β = 7637.24, p < .001) increased. Task difficulty also showed significant effects, with heart rate decreasing during hard (β = -6.70, p < .001) and moderate (β = -3.40, p = .001) conditions. Notably, while physiological measures showed robust changes, the self-reported flow state did not reach significance. Task switching rates showed significant decreases during self-selected music (β = -0.86, p < .001) and hard difficulty (β = -0.61, p < .001), supporting the LC4MP framework's predictions regarding cognitive resource allocation. These findings demonstrate how task switching and cognitive resource allocation relate to flow state induction. The results highlight the importance of multimodal measurement approaches and demonstrate that personal relevance through music selection and task difficulty significantly influence physiological and neural responses during performance. 
Future research should employ more comprehensive measurement approaches to better capture the complexity of flow-related neural activity and its relationship to task switching and cognitive resource allocation.
Kumar, G. V.; Lacey, S.; Nygaard, L.; Sathian, K.
Iconicity refers to systematic links between word form and meaning. Although evidence for iconicity in natural language continues to grow, its neural basis remains unclear. Using functional magnetic resonance imaging (fMRI) and multivariate pattern analysis (MVPA), we examined iconic shape associations of auditory real words and pseudowords. The pseudowords were matched to the real words in phonemic and phonotactic properties, while differing primarily in the absence of learned semantic representations. Participants listened to each item and judged whether it sounded rounded or pointed. Searchlight MVPA revealed significant decoding for both stimulus types. For real words, iconic shape associations were decoded above chance in regions associated with visual and haptic shape processing (left lateral occipital complex and left anterior intraparietal sulcus), visual imagery (bilateral precuneus), phonological processing (bilateral supramarginal gyri), and semantic processing (left middle frontal and right superior frontal gyri). For pseudowords, significant decoding was found in regions associated with multisensory feature organization (right posterior intraparietal sulcus) and language processing (left angular and inferior frontal gyri). Together, these findings provide evidence for neural mechanisms mediating iconic associations, with language-related areas involved for both real words and pseudowords, and visual processing for real words.
Chaigneau, A.; Moretti, R.; Iodice, P.; Pessiglione, M.; Pezzulo, G.
Goal-directed behavior often requires sustained effort across a sequence of interdependent decisions, yet the determinants of persistence in such contexts remain poorly understood. Here, we investigated how individuals regulate persistence in a novel sequential effort-based task in which they controlled an avatar through successive checkpoints to reach a final goal and could make repeated attempts following failure. At each attempt, participants could choose either to persist in the same task or to disengage toward an easier but less rewarding alternative. We found that decisions to persist or disengage were jointly shaped by multiple interacting factors. Disengagement increased with task difficulty and lower skill level. It also increased with repeated attempts and time-on-task, indexing fatigue, and with accumulated errors, indexing lack of progress. Conversely, proximity to the goal promoted persistence and shaped decision dynamics by reducing choice conflict during persistence decisions and increasing hesitation during disengagement near the goal. Notably, clearing the first checkpoint produced a sharp increase in persistence, suggesting that early success plays a pivotal role. Furthermore, persistence reflected both retrospective and prospective evaluations of effort, with prior investment promoting commitment and anticipated effort reducing it. Finally, disengagement was preceded by short-term performance decline but not by gradual increases in decision conflict, suggesting relatively abrupt strategy shifts following repeated failures. Together, these findings provide a comprehensive account of persistence in sequential effortful tasks, showing that decisions to persist or disengage are jointly shaped by multiple factors related to fatigue, (lack of) progress, goal proximity, and early success.
Segura, E.; Lorenzo-Seva, U.; Zatorre, R.; Kleber, B. A.; Rodriguez-Fornells, A.
Singing is an innate human behaviour present across cultures and the lifespan. Despite lacking direct biological advantages, its ubiquity suggests that it is intrinsically rewarding. This research aimed to investigate the underlying factors that explain variability in sensitivity to deriving reward and enjoyment from natural singing in the general population. In Study 1 (n = 606), an initial pool of items describing daily, non-professional singing behaviours was administered to an international adult sample. Exploratory factor analysis revealed a unidimensional structure of 20 items with acceptable model fit, organized into five facets representing distinct domains of singing-related rewards: 1) pleasure and emotional evocation, 2) social singing reward, 3) singing frequency, 4) mood regulation through singing, and 5) inattentional singing during routine tasks. In Study 2 (n = 430), confirmatory factor analysis in a new sample supported this structure. When both samples were combined (n = 1036), the unidimensional model defined by these five facets showed acceptable to excellent goodness-of-fit indices, supporting the conceptualization of singing reward as a multidimensional construct with differentiated facets. This led to the Barcelona-Aarhus Natural Singing Engagement Questionnaire (BANSEQ), which demonstrated excellent reliability ( = .94) and population-level stability. Study 3 (n = 1036) tested the convergent validity of BANSEQ with measures of music reward and engagement and identified sociodemographic and psychological correlates across the five facets of singing reward. Overall, these findings characterize the sources of individual differences in the hedonic experience of natural singing and propose BANSEQ as a robust psychometric tool for its assessment in the general population.
Flo, E. E.; Flo, G. M.
Summary paragraph: A hallmark of learning is the need for sensory stimuli (Ginns, 2015; McGraw et al., 2009; Reinwein, 2012; Spence, 1950) so that learning is fundamentally based on sensory input signals affecting behaviour, physiology, and neurology. If behavioural measures of learning can be causally linked to physiological and neurological variables, a broader understanding of the mechanisms related to learning in schools, learning disabilities, and learning and health issues may emerge (McGraw et al., 2009). Despite decades of research on the physiological/neurological variable of sympathetic activation, learning, and achievement (Horvers et al., 2021), any causal relation remains unclear (Cowley et al., 2014; Mason et al., 2020; Pijeira-Diaz et al., 2016; Sung et al., 2023; Yu et al., 2024) and issues with instrument validation remain (Costantini et al., 2023; Hu et al., 2024; Milstein & Gordon, 2020; Van Der Mee et al., 2021). Here we investigate the effect of sensory input on sympathetic activation by using validated instruments for skin conductance measurement (Batista et al., 2019) and whether sympathetic activation is connected to learning in a cognitive laboratory context and an ecologically valid classroom context. In both contexts, we found a physiological variable which correlated with learning and that sensory input affected this variable while student movement did not. These sensory inputs varied depending on the different instructional activities the students participated in. Together, these findings bring us one step closer to a model linking sensory input to behavioural, physiological, and neurological variables.
Staples, R.; DeMarco, A. T.; Laks, A. B.; Turkeltaub, P. E.
Computational models are a linchpin in our understanding of the neurocognitive basis of reading. These models can simulate idealized profiles of alexia syndromes, but in reality, individuals with alexia present with a wide range of mixed deficits rather than idealized syndromes. To provide a complete cognitive theory of reading, computational models must be able to account for this individual variation. However, this has never been demonstrated. We test oral reading and non-reading phonological and semantic processing in 83 left-hemisphere stroke survivors. We show that individual alexia profiles can be simulated by applying graded phonology and semantic lesions to an artificial neural network model of reading, creating "matched models" that represent individual stroke survivors. The severity of damage to the semantic and phonological layers of the matched models was highly correlated with directly-measured semantic and phonological processing deficits. However, we also identify systematic ways in which the models fail to simulate the reading performance of their matched stroke survivors. Our results support theories of alexia that rely on process-based deficits, demonstrate the feasibility of large-scale individualized modelling of alexia, and suggest ways to further improve the correspondence of models and human reading behavior.
Zhao, J.; Brennan, J. R.
The internal representations of large language models (LLMs) correlate, or "align", with human neural activity during language comprehension. One view holds that this alignment reflects shared sensitivity to statistical patterns in LLMs and humans, while others hold that it reflects, at least in part, the emergence of shared linguistic representations in these systems. Here, we investigate whether hierarchical linguistic composition, a property believed to be fundamental to human language, modulates LLM-brain alignment. To this end, we manipulated syntax, compositional semantics, and associative semantics in English sentences that were presented to both an LLM and human participants during an electroencephalography (EEG) experiment. We matched linguistically manipulated stimuli in predictability, which allows us to tease apart alignment induced by linguistic structure from statistical factors. By comparing LLM-EEG alignment scores that were derived using a linear encoding model across predictability-matched conditions, we evaluate how linguistic manipulations modulate the alignment between human EEG reading data and contextual embeddings extracted word-by-word from the hidden layers of GPT2-XL. Three key patterns emerge: (1) increased alignment for word sequences with syntactic structure, (2) decreased alignment for sentences with compositional semantics, and (3) associative semantics does not modulate alignment. These observed linguistic modulations of LLM-EEG alignment take place above and beyond predictability. Our results indicate that associative semantics is encoded similarly by LLMs and the brain, as are at least some aspects of syntactic structure, while compositional semantics is more uniquely encoded in the human brain.
Tabbane, E.; Figueira, S.; Benjamin, L.; Dehaene, S.; Al Roumi, F.
How do humans store sequences that far exceed working memory capacity? Using visuo-spatial and binary auditory sequences, we previously showed that a Language of Thought (LoT) architecture -- in which simple primitives are recursively combined into hierarchical programs -- enables efficient storage of structured sequences. Here we ask whether this principle extends to purely ordinal structure: sequences defined by how items repeat and in what order, as in AABBCCAABBCC, independently of their spatial content. Across three experiments, participants reproduced 12-item sequences of spatial locations with various ordinal structures. The minimal description length derived from the LoT model predicted recall accuracy with remarkable precision (r = .96), substantially outperforming Shannon entropy, Lempel-Ziv complexity, chunking models and subjective complexity ratings. Critically, fine-grained analyses of participants' inter-click intervals during reproduction revealed systematic slowdowns at the hierarchical boundaries predicted by the LoT programs, providing a behavioral signature of the underlying mental syntax. These results identify a compact vocabulary of mental primitives -- repetition, mirroring, and interleaving -- whose composition accounts for the symbolic compression of ordinal structures. For ordinal regularities, human sequence memory operates as a form of program induction, leveraging a domain-general capacity for hierarchical compression to encode complex structured information. Author Summary: Human short-term memory is heavily limited, holding no more than a few items at once. Yet humans routinely memorize complex sequences that far exceed this capacity. How is this possible? We propose that the brain acts like a programmer: rather than storing each element individually, it compresses sequences into short mental "programs." 
Just as a programmer writes "repeat ABC four times" instead of typing ABCABCABCABC, the brain leverages regularities such as repetitions (ABC-ABC) or mirror patterns (ABC-CBA) to encode sequences efficiently. We tested this idea across three experiments: two in which participants memorized and reproduced sequences of spatial positions on a screen, one where they only rated their perceived complexity. Sequences described by shorter programs were remembered far better and judged as simpler -- even when they were the same length as less structured sequences. When reproducing sequences, participants paused longer at structural boundaries, revealing the internal organization of their mental programs. Strikingly, program length predicted memory performance better than participants own complexity ratings, suggesting that these mental representations are not fully accessible to conscious awareness. Finally, we identified key new patterns -- including temporal inversion and interleaving -- that extend the Language of Thought framework. Together, these findings suggest that a compositional Language of Thought is a fundamental aspect of how the human brain efficiently store and represent structured information.
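The "repeat ABC four times" analogy can be made concrete with a toy sketch. This is only an illustration of the compression idea, not the authors' LoT model or their description-length measure; the function names `shortest_repeat` and `is_mirror` are hypothetical and cover only plain repetition and mirroring of a string of item labels.

```python
def shortest_repeat(seq):
    """Return (unit, count) such that unit repeated count times equals seq.

    The unit is the shortest one that works; an unstructured
    sequence comes back as (seq, 1).
    """
    n = len(seq)
    if n == 0:
        return seq, 0
    for size in range(1, n + 1):
        # Only unit sizes that divide the length evenly can tile the sequence.
        if n % size == 0 and seq[:size] * (n // size) == seq:
            return seq[:size], n // size


def is_mirror(seq):
    """True if the second half is the first half reversed (e.g. ABC-CBA)."""
    half, remainder = divmod(len(seq), 2)
    return remainder == 0 and seq[half:] == seq[:half][::-1]


if __name__ == "__main__":
    for s in ("ABCABCABCABC", "AABBCCAABBCC", "ABCCBA"):
        unit, count = shortest_repeat(s)
        print(s, "-> repeat", unit, "x", count, "| mirror:", is_mirror(s))
```

A sequence that reduces to a short unit and a repeat count needs fewer symbols to describe than one that must be stored item by item, which is the intuition behind preferring shorter programs.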
Sekine, K.; Okuma, R.; Ban, H.
People frequently gesture while speaking, even when listeners cannot see them--for instance, during phone calls or behind barriers. Congenitally blind individuals also gesture, indicating that gestures serve functions beyond visual communication. Previous models of gesture production (e.g., Kita & Ozyurek, 2003; Rauscher et al., 1996) suggest that gestures facilitate speech, but they rely heavily on behavioural data and provide limited insight into temporal dynamics. This study used magnetoencephalography (MEG), a neuroimaging technique with high temporal resolution, to investigate when gestures influence speech. Twenty-three native Japanese speakers took part in a storytelling task under two conditions: Gesture-Required (gesture use instructed) and Gesture-Prohibited (hands kept still). Participants described cartoon clips across multiple sessions (30 trials × 3 sessions per condition). Using speech onset as the reference point, we compared root mean square (RMS) values within a -0.25 to 0 second window. RMS values were higher in the Gesture-Prohibited condition, with increased activity in the bilateral anterior temporal lobes (Left ATL: p = .049; Right ATL: p = .027), but not in motor regions (p = .29). These findings suggest that gestures reduce neural load in language-related regions before articulation. Co-speech gestures may support speech planning by facilitating lexical retrieval or semantic structuring. The lack of motor region effects indicates that this influence is linguistic rather than motoric. This study provides direct neurophysiological evidence of the timing of gesture-speech interaction, supporting models that view gestures as an integral part of speech production.
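The windowed RMS measure mentioned above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: it assumes a single time course sampled at a known rate, takes the RMS over time samples in the -0.25 to 0 s window before speech onset, and uses a hypothetical function name `rms_in_window`. The abstract does not specify how sensors, sources or trials were combined.

```python
import math


def rms_in_window(samples, sfreq, onset_index, t_start=-0.25, t_end=0.0):
    """Root mean square of samples in [t_start, t_end) seconds around an onset.

    samples     : sequence of amplitude values for one time course (assumed)
    sfreq       : sampling frequency in Hz
    onset_index : sample index of speech onset
    """
    start = onset_index + round(t_start * sfreq)
    stop = onset_index + round(t_end * sfreq)
    if start < 0 or stop > len(samples) or start >= stop:
        raise ValueError("window falls outside the recording")
    window = samples[start:stop]
    return math.sqrt(sum(x * x for x in window) / len(window))


if __name__ == "__main__":
    # Synthetic signal at 100 Hz: silence, then 25 samples of amplitude 2
    # immediately before the onset at index 200.
    signal = [0.0] * 175 + [2.0 if i % 2 == 0 else -2.0 for i in range(25)] + [5.0] * 50
    print(rms_in_window(signal, sfreq=100, onset_index=200))
```

Because values are squared before averaging, the RMS reflects signal magnitude regardless of polarity, which makes it a convenient summary for comparing pre-onset activity between conditions.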