Back

Quantifying Scientific Consensus in Biomedical Hypotheses via LLM-Assisted Literature Screening

Kim, U.; Kwon, O.; Lee, D.

2026-04-09 bioinformatics
10.64898/2026.04.06.716861 bioRxiv
Show abstract

Systematic literature reviews are labor-intensive tasks in biomedical research. While Large Language Models (LLMs) using Retrieval-Augmented Generation (RAG) techniques have enhanced information accessibility, the inherent complexity of biological systems--characterized by high context dependency and conflicting data--remains a primary driver of LLM hallucinations. This imposes a structural constraint that limits the precision of evidence synthesis. To address these limitations, we propose an automated framework designed for the exhaustive identification of supporting and contradictory evidence within a target literature set. Rather than relying on a models pre-trained knowledge, our system requires the LLM to review each paper individually to determine its alignment with a specific research hypothesis. By evaluating semantic context, the framework captures subtle contradictions that are often overgeneralized by conventional methods. The frameworks performance was validated using the BioNLI task, where it demonstrated high classification accuracy in distinguishing whether evidence supports or contradicts a given hypothesis. Notably, the implementation of an ensemble approach provided superior stability and slightly higher precision compared to individual models. Furthermore, the framework exhibited robust performance across several well-established biological hypotheses, confirming its practical utility and reliability in real-world research. This approach provides a rigorous basis for biomedical discovery by enabling the precise, systematic analysis of biological literature and the robust collection of evidence.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 2%
10.0%
2
Bioinformatics
1061 papers in training set
Top 3%
10.0%
3
Advanced Science
249 papers in training set
Top 2%
7.1%
4
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.8%
5
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
4.3%
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 20%
3.6%
7
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.6%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.0%
9
Nature Communications
4913 papers in training set
Top 42%
3.0%
10
PLOS ONE
4510 papers in training set
Top 43%
2.9%
50% of probability mass above
11
NAR Genomics and Bioinformatics
214 papers in training set
Top 1%
2.7%
12
Scientific Reports
3102 papers in training set
Top 47%
2.4%
13
Database
51 papers in training set
Top 0.3%
2.1%
14
Bioinformatics Advances
184 papers in training set
Top 2%
1.9%
15
PLOS Computational Biology
1633 papers in training set
Top 15%
1.8%
16
iScience
1063 papers in training set
Top 13%
1.8%
17
GigaScience
172 papers in training set
Top 1%
1.8%
18
Genome Medicine
154 papers in training set
Top 5%
1.6%
19
Journal of Molecular Biology
217 papers in training set
Top 2%
1.5%
20
Patterns
70 papers in training set
Top 1%
1.3%
21
Cell Systems
167 papers in training set
Top 8%
1.3%
22
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.2%
23
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.2%
24
Genome Biology
555 papers in training set
Top 6%
1.2%
25
Nature Machine Intelligence
61 papers in training set
Top 3%
1.2%
26
Journal of Biomedical Informatics
45 papers in training set
Top 1%
1.1%
27
Communications Biology
886 papers in training set
Top 16%
1.1%
28
eLife
5422 papers in training set
Top 54%
0.9%
29
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.9%
30
BioData Mining
15 papers in training set
Top 0.8%
0.8%