Back

A computational approach for measuring sentence information via surprisal: theoretical implications in nonfluent primary progressive aphasia

Rezaii, N.; Michaelov, J.; Josephy-Hernandez, S.; Ren, B.; Hochberg, D.; Quimby, M.; Dickerson, B. C.

2022-11-29 neurology
10.1101/2022.11.25.22282630 medRxiv
Show abstract

Nonfluent aphasia is a language disorder characterized by simplified sentence structures as well as word-level abnormalities such as a reduced use of verbs and function words. According to the predominant account of the disorder, both structural and word-level features are caused by a core deficit in the processing of syntax. Under this account, however, it remains unclear why nonfluent patients choose semantically richer verbs and may have an intact comprehension of verbs and function words. Here, we propose and test the hypothesis that the word-level features of nonfluency reflect a process that selects lexically richer words to increase the information content of sentences. We use a computational linguistic method to measure the information content of sentences in the language of patients with nonfluent primary progressive aphasia (nfvPPA) (n = 36) and healthy controls (n = 133). We measure sentence information using surprisal, a metric calculated by the average probability of occurrence of words in a sentence given their preceding context. We found that by packaging their structurally simple sentences with lower frequency words, nfvPPA patients produce sentences with similar surprisal as that of healthy speakers. Furthermore, we found that higher sentence surprisal in nfvPPA correlates with a lower function-to-all-word ratio, a lower verb-to-noun ratio, and a higher heavy-to-all-verb ratio. Surprisal is an effective quantitative index of sentence information. Using surprisal allows for testing an account of nonfluent aphasia that regards word-level features of nonfluency as adaptive rather than defective symptoms, a finding that may entail revisions in therapeutic approaches to nonfluent speech.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Scientific Reports
3102 papers in training set
Top 0.5%
22.0%
2
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 10%
6.6%
3
Journal of Cognitive Neuroscience
119 papers in training set
Top 0.2%
6.2%
4
Brain Communications
147 papers in training set
Top 0.5%
4.2%
5
The Journal of Neuroscience
928 papers in training set
Top 3%
3.9%
6
NeuroImage: Clinical
132 papers in training set
Top 1%
3.6%
7
Neurobiology of Language
28 papers in training set
Top 0.1%
3.5%
8
Brain Sciences
52 papers in training set
Top 0.2%
3.5%
50% of probability mass above
9
Brain
154 papers in training set
Top 2%
3.5%
10
Journal of Speech, Language, and Hearing Research
10 papers in training set
Top 0.1%
3.5%
11
Frontiers in Neuroscience
223 papers in training set
Top 2%
3.0%
12
Cerebral Cortex
357 papers in training set
Top 0.4%
2.8%
13
Cortex
102 papers in training set
Top 0.1%
2.0%
14
PLOS ONE
4510 papers in training set
Top 49%
2.0%
15
Neuropsychologia
77 papers in training set
Top 0.6%
1.7%
16
Frontiers in Psychology
49 papers in training set
Top 0.6%
1.6%
17
Cognitive, Affective, & Behavioral Neuroscience
25 papers in training set
Top 0.1%
1.4%
18
Communications Biology
886 papers in training set
Top 11%
1.4%
19
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
20
Nature Communications
4913 papers in training set
Top 57%
1.2%
21
Philosophical Transactions of the Royal Society B
51 papers in training set
Top 4%
1.1%
22
Human Brain Mapping
295 papers in training set
Top 4%
0.9%
23
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.9%
24
eLife
5422 papers in training set
Top 54%
0.9%
25
Journal of The Royal Society Interface
189 papers in training set
Top 5%
0.7%
26
Cognition
44 papers in training set
Top 0.5%
0.7%
27
Frontiers in Human Neuroscience
67 papers in training set
Top 3%
0.7%
28
Neuroscience
88 papers in training set
Top 3%
0.7%
29
Neurology
44 papers in training set
Top 2%
0.7%
30
Developmental Science
15 papers in training set
Top 0.2%
0.6%