Back

An fMRI dataset of verbalized spontaneous thought with annotated transcripts and self-report trait measures

Zhang, M.; Liu, P. R.; Su, H.; Zhao, M.; Li, X.; Born, S.; Lee, Y.; Honey, C.; Chen, J.; Lee, H.

2026-05-12 neuroscience
10.64898/2026.05.12.724488 bioRxiv
Show abstract

Spontaneous thought is pervasive in everyday human cognition, yet datasets capturing its neural dynamics under minimally interrupted conditions remain limited. The current dataset was acquired from a think-aloud functional MRI experiment in which 118 participants continuously verbalized their spontaneous thoughts during 10-minute scanning sessions. The raw MRI data and verbal transcripts with sentence-level timestamps were previously released and analyzed in our prior study examining neural activity associated with thought transitions. Building on that release, we additionally provide preprocessed MRI data, speech transcriptions with word-level timestamps aligned to image acquisition, large language model-generated ratings of transcribed thoughts across emotional and sensory dimensions, and self-report survey measures assessing personality, mental health, and cognitive abilities. Validation analyses demonstrated activation in expected cortical regions associated with speech production and sensory content identified from transcript annotations, agreement between language model and human ratings, and adequate internal consistency of survey measures, supporting the datasets overall quality. This dataset enables reuse for investigations of spontaneous thought, speech generation, and individual differences using naturalistic functional MRI data.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Scientific Data
174 papers in training set
Top 0.1%
22.5%
2
NeuroImage
813 papers in training set
Top 0.6%
18.6%
3
Imaging Neuroscience
242 papers in training set
Top 0.1%
14.4%
50% of probability mass above
4
Nature Communications
4913 papers in training set
Top 25%
7.2%
5
Human Brain Mapping
295 papers in training set
Top 2%
4.0%
6
Scientific Reports
3102 papers in training set
Top 36%
3.6%
7
Communications Biology
886 papers in training set
Top 3%
3.1%
8
Medical Image Analysis
33 papers in training set
Top 0.5%
2.1%
9
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 27%
2.1%
10
eLife
5422 papers in training set
Top 36%
2.1%
11
Science Advances
1098 papers in training set
Top 17%
1.7%
12
The Journal of Neuroscience
928 papers in training set
Top 7%
1.2%
13
Aperture Neuro
18 papers in training set
Top 0.3%
1.2%
14
Advanced Science
249 papers in training set
Top 16%
1.0%
15
eneuro
389 papers in training set
Top 8%
1.0%
16
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
17
NeuroImage: Clinical
132 papers in training set
Top 4%
0.8%
18
Nature Human Behaviour
85 papers in training set
Top 4%
0.8%
19
Nature Neuroscience
216 papers in training set
Top 6%
0.7%
20
Frontiers in Neuroimaging
11 papers in training set
Top 0.4%
0.7%
21
Nature Methods
336 papers in training set
Top 6%
0.7%
22
Cerebral Cortex
357 papers in training set
Top 2%
0.6%
23
Neuron
282 papers in training set
Top 9%
0.6%