Back

Characterization of proteoform post-translational modifications by top-down and bottom-up mass spectrometry in conjunction with UniProt annotations

Chen, W.; Ding, Z.; Zang, Y.; Liu, X.

2023-04-06 bioinformatics
10.1101/2023.04.04.535618 bioRxiv
Show abstract

Many proteoforms can be produced from a gene due to genetic mutations, alternative splicing, post-translational modifications (PTMs), and other variations. PTMs in proteoforms play critical roles in cell signaling, protein degradation, and other biological processes. Mass spectrometry (MS) is the primary technique for investigating PTMs in proteoforms, and two alternative MS approaches, top-down and bottom-up, have complementary strengths. The combination of the two approaches has the potential to increase the sensitivity and accuracy in PTM identification and characterization. In addition, protein and PTM knowledgebases, such as UniProt, provide valuable information for PTM characterization and validation. Here, we present a software pipeline called PTM-TBA (PTM characterization by Top-down, Bottom-up MS and Annotations) for identifying and localizing PTMs in proteoforms by integrating top-down and bottom-up MS as well as UniProt annotations. We identified 1,662 mass shifts from a top-down MS data set of SW480 cells, 545 (33%) of which were matched to 12 common PTMs, and 351 of which were localized. PTM-TBA validated 346 of the 1,662 mass shifts using UniProt annotations or a bottom-up MS data set of SW480 cells.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Proteome Research
215 papers in training set
Top 0.2%
18.7%
2
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.1%
12.7%
3
Bioinformatics
1061 papers in training set
Top 3%
10.1%
4
Analytical Chemistry
205 papers in training set
Top 0.3%
8.4%
5
Molecular & Cellular Proteomics
158 papers in training set
Top 0.4%
6.4%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 28%
6.3%
7
PROTEOMICS
35 papers in training set
Top 0.2%
3.6%
8
Frontiers in Molecular Biosciences
100 papers in training set
Top 0.6%
2.9%
9
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
2.1%
10
Scientific Reports
3102 papers in training set
Top 50%
2.1%
11
BMC Bioinformatics
383 papers in training set
Top 4%
1.7%
12
Nature Communications
4913 papers in training set
Top 53%
1.5%
13
Journal of Proteomics
27 papers in training set
Top 0.2%
1.3%
14
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.2%
15
iScience
1063 papers in training set
Top 23%
1.1%
16
PLOS Computational Biology
1633 papers in training set
Top 21%
1.0%
17
Genome Biology
555 papers in training set
Top 6%
0.9%
18
Briefings in Bioinformatics
326 papers in training set
Top 6%
0.9%
19
Metabolites
50 papers in training set
Top 1%
0.8%
20
Communications Biology
886 papers in training set
Top 26%
0.7%
21
ACS Omega
90 papers in training set
Top 4%
0.7%
22
mSystems
361 papers in training set
Top 8%
0.7%
23
mSphere
281 papers in training set
Top 6%
0.7%