PromptBio-Bench: Benchmarking LLM-based Bioinformatics Agents for End-to-End Data Analysis

Guo, W.; Zhang, M.; Han, B.; Ma, Y.; Leng, Y.; Hebbar, S.; Zhou, X.; Gu, W.; Yang, X.; Dhar, S.

2026-05-08 bioinformatics

10.64898/2026.05.05.723092 bioRxiv

Show abstract

Large language model (LLM)-based agents hold transformative potential for automating bioinformatics workflows; however, systematic evaluations of their capabilities remain limited, hindering a clear assessment of their readiness for real-world application. We introduce PromptBio-Bench, a comprehensive evaluation suite of 194 expert-curated tasks spanning bioinformatics and data science at varied difficulty levels, and an evaluation framework for structured file comparison and scoring against expert reference answers. Benchmarking three state-of-the-art agents revealed that Biomni and ToolsGenie achieved comparable performance, and accuracy declined markedly at higher difficulty levels across all agents. As foundation models and agent frameworks continue to evolve, PromptBio-Bench provides a valuable benchmark infrastructure for the community to systematically track the progress of agentic bioinformatics.

Matching journals

●Non-profit ◐University press ○Commercial

The top 8 journals account for 50% of the predicted probability mass.

Only show non-profit

○ 167 papers in training set

○ 336 papers in training set

Nature Communications

○ 4913 papers in training set

Nature Biotechnology

○ 147 papers in training set

◐ 1061 papers in training set

Nucleic Acids Research

◐ 1128 papers in training set

◐ 172 papers in training set

Bioinformatics Advances

◐ 184 papers in training set

50% of probability mass above

○ 555 papers in training set

NAR Genomics and Bioinformatics

◐ 214 papers in training set

Genome Medicine

○ 154 papers in training set

Briefings in Bioinformatics

◐ 326 papers in training set

PLOS Computational Biology

● 1633 papers in training set

Nature Machine Intelligence

○ 61 papers in training set

○ 575 papers in training set

BMC Bioinformatics

○ 383 papers in training set

Genomics, Proteomics & Bioinformatics

◐ 171 papers in training set

● 4510 papers in training set

Nature Genetics

○ 240 papers in training set

Scientific Reports

○ 3102 papers in training set

Proceedings of the National Academy of Sciences

● 2130 papers in training set

Advanced Science

○ 249 papers in training set

Nature Computational Science

○ 50 papers in training set

Computational and Structural Biotechnology Journal

● 216 papers in training set

○ 70 papers in training set

Genome Research

● 409 papers in training set

JCO Clinical Cancer Informatics

● 18 papers in training set

Communications Biology

○ 886 papers in training set

Plant Communications

○ 35 papers in training set