Back

Design to Data for mutants of β-glucosidase B from Paenibacillus polymyxa: N160L, N160S, N160C, N160M, N160G

Li, N.; Vater, A.; Siegel, J. B.

2025-11-27 bioengineering
10.1101/2025.11.24.690255 bioRxiv
Show abstract

Protein design is advancing toward quantitative modeling of enzyme function and stability. However, progress remains limited by the scarcity of standardized experimental datasets for training and benchmarking computational models. The Design to Data (D2D) program addresses this need by generating harmonized measurements of catalytic and stability parameters across an extensive {beta}-glucosidase B (BglB) variant library. Here, we expand the D2D dataset with kinetic and thermal characterization of five single-point BglB variants and the wild-type (WT), including soluble expression, Michaelis-Menten constants (kcat, KM, and kcat/KM), and melting temperature (TM,). Foldit Standalone was used to model the structural effects of the mutations. In this study, a weak but consistent association between Foldit total system energy (TSE) and TM was observed, suggesting local energetic effects that may influence stability. Together with the broader D2D corpus, these data enhance the functional mapping of BglB and provide model-ready benchmarks for developing and evaluating data-driven predictors of enzyme activity and stability.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Protein Engineering, Design and Selection
14 papers in training set
Top 0.1%
12.4%
2
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.2%
10.1%
3
Protein Science
221 papers in training set
Top 0.1%
10.1%
4
Proteins: Structure, Function, and Bioinformatics
82 papers in training set
Top 0.1%
7.2%
5
Journal of Chemical Information and Modeling
207 papers in training set
Top 1.0%
4.9%
6
ACS Catalysis
16 papers in training set
Top 0.1%
3.7%
7
PLOS ONE
4510 papers in training set
Top 39%
3.6%
50% of probability mass above
8
Nature Communications
4913 papers in training set
Top 42%
3.1%
9
ACS Omega
90 papers in training set
Top 0.7%
3.1%
10
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 0.7%
2.7%
11
PLOS Computational Biology
1633 papers in training set
Top 12%
2.7%
12
ACS Synthetic Biology
256 papers in training set
Top 1%
2.6%
13
International Journal of Molecular Sciences
453 papers in training set
Top 4%
2.6%
14
Angewandte Chemie International Edition
81 papers in training set
Top 2%
2.1%
15
Bioinformatics
1061 papers in training set
Top 7%
1.9%
16
Scientific Reports
3102 papers in training set
Top 55%
1.8%
17
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.7%
18
Journal of Molecular Biology
217 papers in training set
Top 2%
1.7%
19
Biophysical Journal
545 papers in training set
Top 4%
1.2%
20
Metabolic Engineering Communications
20 papers in training set
Top 0.2%
1.2%
21
Biochemistry
130 papers in training set
Top 1%
1.0%
22
Cell Systems
167 papers in training set
Top 10%
1.0%
23
Chemical Science
71 papers in training set
Top 2%
0.9%
24
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.7%
25
Frontiers in Chemistry
14 papers in training set
Top 0.4%
0.7%
26
Communications Biology
886 papers in training set
Top 24%
0.7%
27
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
28
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 47%
0.6%
29
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.6%
30
Nucleic Acids Research
1128 papers in training set
Top 20%
0.6%