Back

GlycoDiveR: a modular R framework to analyze and visualize highly dimensional glycoproteomics data

Veth, T. S.; Riley, N. M.

2026-03-24 systems biology
10.64898/2026.03.21.713336 bioRxiv
Show abstract

Mass spectrometry-based glycoproteomics is a critical platform for understanding the complex roles of protein glycosylation in biological systems, yet visualizing multidimensional glycoproteomics datasets remains a significant bottleneck in data interpretation and communication. Glycan microheterogeneity, i.e., the potential for a glycosite to be modified by multiple glycans, defies the binary presence-absence logic used in analyses of other post-translational modifications. Instead, glycoproteomics necessitates intentionally designed data structures and visualizations that are glycoform-centric, not just site-centric. Additionally, there is a need for complementary degrees of data analysis that alternate between glycoproteome-scale patterns and glycosite-specific regulation. Several bespoke frameworks for visualizing glycoproteomics data have emerged, but they often require advanced programming expertise and are designed for a single study rather than broad application. Here, we present our efforts to harmonize post-search data analysis of glycoproteomics through a modular R framework called GlycoDiveR. This platform streamlines import, transformation, and curation of qualitative and quantitative glycopeptide identifications, including support for raw output from multiple search engines. GlycoDiveR is designed to integrate seamlessly into existing analysis workflows by enabling fast, flexible exploration of highly dimensional glycoproteomics datasets via a consistently formatted data architecture. Our goal is to offer a customizable set of glycosylation-specific visualizations with minimal coding, while keeping data accessible to users who wish to further customize their characterization strategies. It also maintains a modular design that supports the continual addition of visualizations, analyses, and export functions. Ultimately, GlycoDiveR is meant to improve accessibility of glycoproteomic-specific analyses and lower the barrier to exploring biological narratives embedded in rich glycoproteomic datasets. GlycoDiveR is open-source and freely available at https://github.com/riley-research/GlycoDiveR.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.3%
2
Journal of Proteome Research
215 papers in training set
Top 0.3%
12.4%
3
Bioinformatics Advances
184 papers in training set
Top 0.2%
9.1%
4
Nature Methods
336 papers in training set
Top 1%
8.4%
5
Nature Communications
4913 papers in training set
Top 29%
6.3%
50% of probability mass above
6
Cell Reports Methods
141 papers in training set
Top 0.5%
4.8%
7
Cell Systems
167 papers in training set
Top 3%
4.8%
8
Molecular & Cellular Proteomics
158 papers in training set
Top 0.6%
3.9%
9
PLOS ONE
4510 papers in training set
Top 40%
3.6%
10
PLOS Computational Biology
1633 papers in training set
Top 10%
3.6%
11
Nature Biotechnology
147 papers in training set
Top 4%
1.7%
12
Molecular Systems Biology
142 papers in training set
Top 0.6%
1.7%
13
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.7%
14
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
15
Metabolites
50 papers in training set
Top 0.6%
1.3%
16
BMC Bioinformatics
383 papers in training set
Top 5%
1.3%
17
Patterns
70 papers in training set
Top 1%
1.2%
18
iScience
1063 papers in training set
Top 25%
0.9%
19
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
20
Genome Biology
555 papers in training set
Top 6%
0.9%
21
Scientific Reports
3102 papers in training set
Top 73%
0.8%
22
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
23
Journal of Molecular Biology
217 papers in training set
Top 3%
0.8%
24
PROTEOMICS
35 papers in training set
Top 0.8%
0.7%
25
eLife
5422 papers in training set
Top 60%
0.7%
26
mSystems
361 papers in training set
Top 8%
0.7%
27
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%