Back

Benchmarking GROMACS on Optimized Colab Processors and the Flexibility of Cloud Computing for Molecular Dynamics

Karagöl, T.; Karagöl, A.

2024-11-15 bioinformatics
10.1101/2024.11.14.623563 bioRxiv
Show abstract

Molecular dynamics (MD) simulations are widely used computational tools in chemical and biological sciences. For these simulations, GROMACS is a popular open-source alternative among molecular dynamics simulation software designed for biochemical molecules. In addition to software, these simulations traditionally relied on costly infrastructure like supercomputers or clusters for High-Performance Computing (HPC). In recent years, there has been a significant shift towards using commercial cloud providers computing resources, in general. This shift is driven by the flexibility and accessibility these platforms offer, irrespective of an organizations financial capacity. Many commercial compute platforms such as Google Compute Engine (GCE) and Amazon Web Services (AWS) provide scalable computing infrastructure. An alternative to these platforms is Google Colab, a cloud-based platform, provides a convenient computing solution by offering GPU and TPU resources that can be utilized for scientific computing. The accessibility of Colab makes it easier for a wider audience to conduct computational tasks without needing specialized hardware or otherwise costly infrastructure. However, running GROMACS on Colab also comes with limitations. Google Colab imposes usage restrictions, such as time limits for continuous sessions, capped at several hours, and limits on the availability of high-performance GPUs. Users may also face disruptions due to session timeouts or hardware availability constraints, which can be challenging for large or long-running molecular simulations. We have significantly enhanced the performance of GROMACS on Google Colab by re-compiling the software, compared to its default pre-compiled version. We also present a method for integrating Google Drive to save and resume interrupted simulations, ensuring that users can secure files after session-timeouts. Additionally, we detail the setup and utilization of the CUDA and MPI environment in Colab to enhance GROMACS performance. Finally, we compare the efficiency of CUDA-enabled GPUs with Googles TPUv2 units, highlighting the trade-offs of each platform for molecular dynamics simulations. This work equips researchers, students, and educators with practical MD tools while providing insights to optimize their simulations within the Colab environment.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 3%
10.2%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.5%
10.2%
3
BMC Bioinformatics
383 papers in training set
Top 1%
8.5%
4
PLOS ONE
4510 papers in training set
Top 22%
8.5%
5
PLOS Computational Biology
1633 papers in training set
Top 4%
7.2%
6
SoftwareX
15 papers in training set
Top 0.1%
4.9%
7
Scientific Reports
3102 papers in training set
Top 36%
3.6%
50% of probability mass above
8
Journal of Computational Chemistry
11 papers in training set
Top 0.1%
3.6%
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.3%
10
Gigabyte
60 papers in training set
Top 0.4%
3.1%
11
PeerJ
261 papers in training set
Top 8%
1.5%
12
F1000Research
79 papers in training set
Top 2%
1.5%
13
Journal of Biomedical Optics
25 papers in training set
Top 0.4%
1.3%
14
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.0%
15
ACS Omega
90 papers in training set
Top 3%
1.0%
16
Frontiers in Neuroinformatics
38 papers in training set
Top 0.7%
0.8%
17
The Journal of Physical Chemistry B
158 papers in training set
Top 2%
0.8%
18
Physical Review E
95 papers in training set
Top 1%
0.8%
19
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
20
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.6%
0.7%
21
GigaScience
172 papers in training set
Top 4%
0.6%
22
Neuroinformatics
40 papers in training set
Top 1%
0.6%
23
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.6%
24
Journal of Proteome Research
215 papers in training set
Top 2%
0.6%
25
Archives of Clinical and Biomedical Research
28 papers in training set
Top 3%
0.6%
26
Physical Biology
43 papers in training set
Top 3%
0.5%
27
Frontiers in Physics
20 papers in training set
Top 1%
0.5%
28
Computer Methods and Programs in Biomedicine
27 papers in training set
Top 1%
0.5%
29
iScience
1063 papers in training set
Top 40%
0.5%
30
Quantitative Biology
11 papers in training set
Top 1%
0.5%