Back

GlyComboCLI enables command line-based FAIR workflows for glycan composition assignment in mass spectrometry data

Kelly, M. I.; Thang, W. C. M.; Pang, C. N. I.; Gustafsson, O. J. R.; Ashwood, C.

2026-05-14 bioinformatics
10.64898/2026.05.13.725058 bioRxiv
Show abstract

Glycans are integral biomolecules whose presence cannot be predicted from genomic data alone, necessitating experimental characterisation through approaches including mass spectrometry. Assignment of glycan compositions to observed mass to charge ratios is computationally challenging due to the potential monosaccharide diversity and existing tools lack the required flexibility for integration into automated bioinformatic workflows. Here, we present GlyComboCLI, an open-source command-line application for the assignment of glycan compositions to mass spectrometry data which expands upon our previous GUI application, GlyCombo. GlyComboCLI accepts mass lists and vendor-neutral mzML files, supports an extensive range of monosaccharides, derivatisation states, reducing-end modifications, and adducts to ensure compatibility with a breadth of glycomics approaches. Outputs are compatible with downstream tools including Skyline and GlycoWorkBench. This software is deployable as a standalone executable, a Docker container, and a Galaxy tool, adhering to FAIR principles. When applied to 52 raw files from a published mouse glycomics dataset, a local instance completed composition assignment and downstream quality control in under three hours, recovering biologically consistent findings. Furthermore, an integrated Galaxy workflow demonstrated reproducible detection of sialidase treatment effects. GlyComboCLI substantially reduces the pool of spectra requiring manual structural interpretation, offering a flexible and scalable solution for glycomics bioinformatic workflows.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Journal of Proteome Research
215 papers in training set
Top 0.2%
18.2%
2
Nature Communications
4913 papers in training set
Top 7%
18.2%
3
Bioinformatics
1061 papers in training set
Top 2%
17.8%
50% of probability mass above
4
Molecular & Cellular Proteomics
158 papers in training set
Top 0.3%
8.9%
5
Nature Methods
336 papers in training set
Top 2%
4.2%
6
Analytical Chemistry
205 papers in training set
Top 0.7%
3.9%
7
Cell Reports Methods
141 papers in training set
Top 2%
1.8%
8
PLOS ONE
4510 papers in training set
Top 51%
1.8%
9
Metabolites
50 papers in training set
Top 0.4%
1.8%
10
Bioinformatics Advances
184 papers in training set
Top 3%
1.8%
11
Nature Biotechnology
147 papers in training set
Top 5%
1.7%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
13
PLOS Computational Biology
1633 papers in training set
Top 17%
1.6%
14
GigaScience
172 papers in training set
Top 2%
1.4%
15
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
16
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.4%
0.9%
17
Nucleic Acids Research
1128 papers in training set
Top 17%
0.8%
18
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.8%
19
Genome Biology
555 papers in training set
Top 8%
0.7%
20
PROTEOMICS
35 papers in training set
Top 0.8%
0.7%
21
Scientific Reports
3102 papers in training set
Top 75%
0.7%
22
Communications Biology
886 papers in training set
Top 30%
0.6%