Back

MicrobeMS - A MATLAB Toolbox for Microbial Identification Based on Mass Spectrometry

Lasch, P.

2026-05-12 bioinformatics
10.64898/2026.05.08.723807 bioRxiv
Show abstract

1.Over the last two decades, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-ToF MS) has become the standard method for identifying bacteria and has found a wide range of applications, especially in clinical microbiology. The methods high taxonomic resolution, minimal sample preparation, and complete, ready-to-use commercial systems, which include instrumentation, experimental protocols, spectral databases, and identification analysis software, were key factors in the success of MALDI-ToF MS as the standard for identifying microorganisms in routine diagnostic laboratories. However, despite the availability of these commercial solutions, there is also a growing need for efficient, cost-effective, vendor-neutral databases and analysis tools. These tools would enable the compilation of user-defined mass spectral databases and the testing of new analysis methods and algorithms, particularly in an academic context. To this end, MicrobeMS software has been developed to cover all stages of MALDI-ToF MS-based identification analysis. MicrobeMS is an easy-to-use desktop application for analyzing mass spectra from microorganisms and performing tasks related to spectrum database compilation. It includes routines for direct data import and export, biomarker peak searches, management of spectrum metadata, testing of spectrum quality, supervised and unsupervised identification analysis and intuitive result display. MicrobeMS is implemented in MATLAB and is freely available as MATLAB pcode for Windows and Linux, as well as a standalone application. Over the last fifteen years, the software has undergone continuous development and is now used routinely in various settings at the Centre for Biological Threats and Special Pathogens (ZBS) at the Robert Koch Institute (RKI) in Berlin, Germany, for example in supporting spectrum database compilation, to identify special or rare pathogenic bacteria by advanced identification analysis concepts, or to test in silico MALDI-ToF MS databases derived from microbial genomes. In this software publication the versatility and capabilities of MicrobeMS are demonstrated using a test data set from highly pathogenic bacteria (HPB) which has been obtained as part of a published European Union (EU)-funded External Quality Assurance Exercise (EQAE). MicrobeMS and HPB test data can both be downloaded from https://wiki.microbe-ms.com/. The goal of this software publication is twofold: to raise awareness of MicrobeMS within the scientific community and to encourage the testing of the software and custom-developed MALDI-ToF MS databases of the RKI, which are published at the ZENODO data repository (https://doi.org/10.5281/zenodo.7702374).

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 16%
12.2%
2
Bioinformatics
1061 papers in training set
Top 3%
9.9%
3
Journal of Proteome Research
215 papers in training set
Top 0.4%
8.1%
4
Peer Community Journal
254 papers in training set
Top 0.4%
6.2%
5
SoftwareX
15 papers in training set
Top 0.1%
4.2%
6
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.5%
3.5%
7
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.5%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.5%
50% of probability mass above
9
Analytical Chemistry
205 papers in training set
Top 0.9%
3.2%
10
Scientific Reports
3102 papers in training set
Top 48%
2.3%
11
GigaScience
172 papers in training set
Top 1%
1.8%
12
Nature Communications
4913 papers in training set
Top 52%
1.7%
13
Scientific Data
174 papers in training set
Top 1%
1.7%
14
Journal of Open Source Software
22 papers in training set
Top 0.1%
1.6%
15
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.3%
1.6%
16
Frontiers in Plant Science
240 papers in training set
Top 4%
1.6%
17
Metabolites
50 papers in training set
Top 0.6%
1.5%
18
PeerJ
261 papers in training set
Top 9%
1.3%
19
Molecular & Cellular Proteomics
158 papers in training set
Top 1%
1.3%
20
International Journal of Molecular Sciences
453 papers in training set
Top 11%
1.2%
21
Journal of Proteomics
27 papers in training set
Top 0.3%
1.1%
22
Analytical and Bioanalytical Chemistry
17 papers in training set
Top 0.3%
0.9%
23
Frontiers in Bioinformatics
45 papers in training set
Top 0.6%
0.9%
24
Frontiers in Microbiology
375 papers in training set
Top 7%
0.9%
25
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.9%
26
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.8%
27
Communications Biology
886 papers in training set
Top 25%
0.7%
28
Viruses
318 papers in training set
Top 5%
0.7%
29
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
30
mSystems
361 papers in training set
Top 8%
0.7%