
BioPipelines: Accessible Computational Protein and Ligand Design for Chemical Biologists

Quargnali, G.; Rivera-Fuentes, P.

2026-03-13 bioinformatics
10.64898/2026.03.11.711024 bioRxiv

Deep learning methods for protein structure generation, sequence design, and structure and property prediction have created unprecedented opportunities for protein engineering and drug discovery. However, using these tools often requires navigating incompatible software environments, diverse input/output formats, and high-performance computing infrastructure, any of which may hinder adoption by primarily experimental chemical biology laboratories. Here we present BioPipelines, an open-source Python framework that allows researchers to define multi-step computational design workflows in a few lines of code. Additionally, its robust yet modular architecture provides a straightforward way to expand the toolkit with different functionalities, particularly by leveraging coding agents, with little effort. The framework currently integrates over 30 tools encompassing structure generation, sequence design, structure prediction, compound screening, and analysis. The same workflow code can be prototyped interactively in a Jupyter notebook and then submitted for production-scale runs without modification. We demonstrate applications in inverse folding, gene synthesis, de novo protein design, compound library screening, iterative binding site optimization, and fusion-protein linker optimization. We hope this framework will empower researchers, allowing them to focus on the scientific question rather than computational logistics. BioPipelines is available under the MIT license at https://github.com/locbp-uzh/biopipelines.

Matching journals

The top 3 journals together account for at least 50% of the predicted probability mass (26.2% + 17.7% + 10.2% = 54.1%).

1 | Journal of Chemical Information and Modeling | 207 papers in training set | Top 0.1% | 26.2%
2 | Bioinformatics | 1061 papers in training set | Top 2% | 17.7%
3 | Journal of Cheminformatics | 25 papers in training set | Top 0.1% | 10.2%
50% of probability mass above
4 | Protein Science | 221 papers in training set | Top 0.2% | 4.9%
5 | Bioinformatics Advances | 184 papers in training set | Top 0.8% | 4.4%
6 | PLOS Computational Biology | 1633 papers in training set | Top 13% | 2.4%
7 | BMC Bioinformatics | 383 papers in training set | Top 4% | 2.1%
8 | Journal of Molecular Biology | 217 papers in training set | Top 1% | 1.8%
9 | PLOS ONE | 4510 papers in training set | Top 53% | 1.7%
10 | Journal of Chemical Theory and Computation | 126 papers in training set | Top 0.5% | 1.7%
11 | Computational and Structural Biotechnology Journal | 216 papers in training set | Top 4% | 1.7%
12 | Nature Methods | 336 papers in training set | Top 4% | 1.7%
13 | Frontiers in Molecular Biosciences | 100 papers in training set | Top 2% | 1.5%
14 | Chemical Science | 71 papers in training set | Top 1% | 1.3%
15 | Nucleic Acids Research | 1128 papers in training set | Top 13% | 1.2%
16 | ACS Synthetic Biology | 256 papers in training set | Top 2% | 1.2%
17 | Nature Biotechnology | 147 papers in training set | Top 6% | 0.9%
18 | Cell Systems | 167 papers in training set | Top 10% | 0.9%
19 | Nature Communications | 4913 papers in training set | Top 61% | 0.8%
20 | Communications Chemistry | 39 papers in training set | Top 0.9% | 0.8%
21 | Journal of Computational Chemistry | 11 papers in training set | Top 0.2% | 0.7%
22 | Briefings in Bioinformatics | 326 papers in training set | Top 7% | 0.7%
23 | Patterns | 70 papers in training set | Top 3% | 0.7%
24 | Proteins: Structure, Function, and Bioinformatics | 82 papers in training set | Top 1% | 0.5%
25 | Proceedings of the National Academy of Sciences | 2130 papers in training set | Top 49% | 0.5%
26 | mAbs | 28 papers in training set | Top 0.5% | 0.5%
27 | Artificial Intelligence in the Life Sciences | 11 papers in training set | Top 0.4% | 0.5%
28 | ACS Omega | 90 papers in training set | Top 5% | 0.5%