Back

Exploring the limits of LLMs in low-resource information extraction: Case study in brain MRI reports for Epilepsy

Truong, T. H.; Foster, E.; Fazio, T.; Holper, S.; Verspoor, K. M.

2025-08-05 neurology
10.1101/2025.08.02.25332570 medRxiv
Show abstract

Information extraction (IE) from specialized clinical texts such as brain MRI reports is important for various clinical and population health contexts. However, this topic is under-explored due to privacy concerns limiting data availability and the inherent complexity and domain-specificity of clinical language. Common methods relying on substantial amounts of training data fail. The recent advances in large language model (LLM) research provide a promising solution to bridge the data scarcity gap, with improved ability to adapt to novel tasks with little supervision. We introduce a new, challenging dataset of 100 expert-annotated brain MRI reports, featuring 152 fine-grained entity types and 4 relation types, characterised by low inter-annotator agreement. This task reflects the inherent complexity and real-world ambiguity of medical text. We evaluate a small, open-weight LLM across span detection, named entity recognition, and relation extraction tasks. We compare few-shot prompting and parameter-efficient fine-tuning against specialized off-the-shelf biomedical IE systems. Our results demonstrate that both few-shot and fine-tuned LLM approaches substantially outperform off-the-shelf baselines. While LLMs show superiority, absolute performance, particularly for complex relations and fine-grained entities, remains modest, correlating with the datasets inherent difficulty and the extreme low-resource setting.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Medicine
117 papers in training set
Top 0.1%
22.1%
2
Nature Computational Science
50 papers in training set
Top 0.1%
9.9%
3
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 14%
4.8%
4
npj Digital Medicine
97 papers in training set
Top 0.9%
4.8%
5
Scientific Reports
3102 papers in training set
Top 32%
3.9%
6
Med
38 papers in training set
Top 0.1%
3.5%
7
Nature Communications
4913 papers in training set
Top 41%
3.5%
50% of probability mass above
8
Journal of Biomedical Informatics
45 papers in training set
Top 0.5%
3.2%
9
Nature
575 papers in training set
Top 8%
3.0%
10
Medical Image Analysis
33 papers in training set
Top 0.4%
2.6%
11
Scientific Data
174 papers in training set
Top 0.9%
1.9%
12
Nucleic Acids Research
1128 papers in training set
Top 10%
1.9%
13
Advanced Science
249 papers in training set
Top 10%
1.9%
14
Artificial Intelligence in Medicine
15 papers in training set
Top 0.3%
1.8%
15
Nature Biomedical Engineering
42 papers in training set
Top 0.9%
1.6%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.5%
17
Imaging Neuroscience
242 papers in training set
Top 2%
1.5%
18
Human Brain Mapping
295 papers in training set
Top 3%
1.3%
19
Genome Medicine
154 papers in training set
Top 6%
1.3%
20
The Lancet Digital Health
25 papers in training set
Top 0.6%
1.3%
21
eBioMedicine
130 papers in training set
Top 2%
1.2%
22
Communications Medicine
85 papers in training set
Top 0.8%
0.9%
23
eLife
5422 papers in training set
Top 54%
0.9%
24
PLOS ONE
4510 papers in training set
Top 65%
0.9%
25
iScience
1063 papers in training set
Top 30%
0.8%
26
IEEE Access
31 papers in training set
Top 1%
0.7%
27
NeuroImage
813 papers in training set
Top 6%
0.7%
28
Network Neuroscience
116 papers in training set
Top 1%
0.7%
29
Nature Machine Intelligence
61 papers in training set
Top 4%
0.7%
30
Nature Neuroscience
216 papers in training set
Top 7%
0.6%