Back

Language models outperform cloze predictability in a cognitive model of reading

Lopes Rego, A. T.; Snell, J.; Meeter, M.

2024-04-30 neuroscience
10.1101/2024.04.29.591593 bioRxiv
Show abstract

Although word predictability is commonly considered an important factor in reading, sophisticated accounts of predictability in theories of reading are yet lacking. Computational models of reading traditionally use cloze norming as a proxy of word predictability, but what cloze norms precisely capture remains unclear. This study investigates whether large language models (LLMs) can fill this gap. Contextual predictions are implemented via a novel parallel-graded mechanism, where all predicted words at a given position are pre-activated as a function of contextual certainty, which varies dynamically as text processing unfolds. Through reading simulations with OB1-reader, a cognitive model of word recognition and eye-movement control in reading, we compare the models fit to eye-movement data when using predictability values derived from a cloze task against those derived from LLMs (GPT2 and LLaMA). Root Mean Square Error between simulated and human eye movements indicates that LLM predictability provides a better fit than Cloze. This is the first study to use LLMs to augment a cognitive model of reading with higher-order language processing while proposing a mechanism on the interplay between word predictability and eye movements. Author SummaryReading comprehension is a crucial skill that is highly predictive of later success in education. One aspect of efficient reading is our ability to predict what is coming next in the text based on the current context. Although we know predictions take place during reading, the mechanism through which contextual facilitation affects ocolarmotor behaviour in reading is not yet well-understood. Here, we model this mechanism and test different measures of predictability (computational vs. empirical) by simulating eye movements with a cognitive model of reading. Our results suggest that, when implemented with our novel mechanism, a computational measure of predictability provide better fits to eye movements in reading than a traditional empirical measure. With this model, we scrutinize how predictions about upcoming input affects eye movements in reading, and how computational approches to measuring predictability may support theory testing. In the short term, modelling aspects of reading comprehension helps reconnect theory building and experimentation in reading research. In the longer term, more understanding of reading comprehension may help improve reading pedagogies, diagnoses and treatments.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Cognition
44 papers in training set
Top 0.1%
41.0%
2
Developmental Science
15 papers in training set
Top 0.1%
5.0%
3
PLOS Computational Biology
1633 papers in training set
Top 8%
4.1%
50% of probability mass above
4
Psychological Review
19 papers in training set
Top 0.1%
4.1%
5
Scientific Reports
3102 papers in training set
Top 33%
3.7%
6
Journal of The Royal Society Interface
189 papers in training set
Top 2%
2.7%
7
PLOS ONE
4510 papers in training set
Top 45%
2.5%
8
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 2%
2.2%
9
Neuropsychologia
77 papers in training set
Top 0.5%
2.0%
10
eneuro
389 papers in training set
Top 5%
1.9%
11
Journal of Cognitive Neuroscience
119 papers in training set
Top 0.8%
1.8%
12
Journal of Vision
92 papers in training set
Top 0.3%
1.5%
13
Frontiers in Psychology
49 papers in training set
Top 0.6%
1.5%
14
Psychonomic Bulletin & Review
14 papers in training set
Top 0.1%
1.4%
15
Frontiers in Neuroscience
223 papers in training set
Top 5%
1.3%
16
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 37%
1.3%
17
Consciousness and Cognition
17 papers in training set
Top 0.2%
1.2%
18
Communications Psychology
20 papers in training set
Top 0.2%
0.9%
19
Journal of Neurophysiology
263 papers in training set
Top 0.7%
0.9%
20
Royal Society Open Science
193 papers in training set
Top 4%
0.8%
21
Neurobiology of Language
28 papers in training set
Top 0.1%
0.8%
22
Cortex
102 papers in training set
Top 0.5%
0.8%
23
npj Science of Learning
17 papers in training set
Top 0.1%
0.8%
24
Behavior Research Methods
25 papers in training set
Top 0.2%
0.8%
25
Nature Human Behaviour
85 papers in training set
Top 5%
0.7%
26
Cerebral Cortex Communications
36 papers in training set
Top 0.4%
0.7%
27
Current Biology
596 papers in training set
Top 15%
0.7%
28
Psychological Science
14 papers in training set
Top 0.2%
0.7%
29
Journal of Experimental Psychology: General
20 papers in training set
Top 0.2%
0.5%
30
Attention, Perception, & Psychophysics
17 papers in training set
Top 0.2%
0.5%