Back

Large Language Models for Accessible Reporting of Bioinformatics Analyses in Interdisciplinary Contexts

Yu, L.; Kim, D.; Cao, Y.; Shu, M. W. S.; Shen, M.; Liang, X.; Gu, J.; Jayakumar, R.; Ding, W.; Yang, F.; Zhang, X.; Kim, J.; Yang, P.; Yang, J. Y. H.

2025-11-11 bioinformatics
10.1101/2025.11.09.687479 bioRxiv
Show abstract

Health and life scientists routinely collaborate with quantitative scientists for data analysis and interpretation, yet miscommunication often obscures the interpretation of complex results. Large Language Models (LLMs) offer a promising way to bridge this gap, but their cross-discipline interpretative skill remains limited on real-word bioinformatics analyses. We therefore benchmarked four state-of-the-art LLMs: GPT-4o, o1, Claude 3.7 Sonnet, and Gemini 2.0 Flash, using automated and human evaluation frameworks to ensure holistic evaluation. Automated assessment employed multiple choice questions designed using Blooms taxonomy to assess multiple levels of understanding, while human evaluation tasked scientists to score summaries for factual consistency, lack of harmfulness, comprehensiveness, and coherence. All generally produced readable and largely safe summaries, confirming their value for first-pass translation of technical analyses, however frequently misinterpreted visualisations, produced verbose summaries and rarely offered novel insights beyond what was already contained in the analytics. Our findings suggest that LLMs are best suited for easing interdisciplinary communication rather than replacing domain expertise and human oversight remains essential to guarantee accuracy, interpretative depth, and the generation of genuinely novel scientific insights.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
GigaScience
172 papers in training set
Top 0.1%
18.7%
2
Bioinformatics
1061 papers in training set
Top 3%
10.1%
3
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.1%
7.2%
4
PLOS Computational Biology
1633 papers in training set
Top 5%
6.8%
5
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.7%
3.9%
6
Patterns
70 papers in training set
Top 0.2%
3.7%
50% of probability mass above
7
BMC Bioinformatics
383 papers in training set
Top 3%
3.6%
8
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.6%
9
Bioinformatics Advances
184 papers in training set
Top 1%
3.6%
10
Nature Communications
4913 papers in training set
Top 45%
2.4%
11
Genome Biology
555 papers in training set
Top 4%
1.9%
12
Database
51 papers in training set
Top 0.4%
1.8%
13
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.7%
14
PLOS ONE
4510 papers in training set
Top 54%
1.7%
15
Scientific Reports
3102 papers in training set
Top 58%
1.7%
16
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
17
Cell Systems
167 papers in training set
Top 7%
1.7%
18
PLOS Biology
408 papers in training set
Top 12%
1.3%
19
Advanced Science
249 papers in training set
Top 14%
1.2%
20
npj Systems Biology and Applications
99 papers in training set
Top 2%
1.1%
21
npj Digital Medicine
97 papers in training set
Top 3%
1.1%
22
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 39%
1.1%
23
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
24
Genome Medicine
154 papers in training set
Top 7%
0.9%
25
eLife
5422 papers in training set
Top 53%
0.9%
26
Life Science Alliance
263 papers in training set
Top 1%
0.9%
27
Frontiers in Bioinformatics
45 papers in training set
Top 0.7%
0.8%
28
iScience
1063 papers in training set
Top 34%
0.7%
29
BioData Mining
15 papers in training set
Top 1.0%
0.7%
30
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 7%
0.6%