Back

Qualitative and quantitative top-down proteomics of human colorectal cancer cell lines identified 23000 proteoforms and revealed drastic proteoform-level differences between metastatic and non-metastatic cancer cells

McCool, E. N.; Xu, T.; Chen, W.; Beller, N. C.; Nolan, S. M.; Hummon, A. B.; Liu, X.; Sun, L.

2021-10-28 cancer biology
10.1101/2021.10.27.466093 bioRxiv
Show abstract

Understanding cancer metastasis at the proteoform level is crucial for discovering new protein biomarkers for cancer diagnosis and drug development. Proteins are the primary effectors of function in biology and proteoforms from the same gene can have drastically different biological functions. Here, we present the first qualitative and quantitative top-down proteomics (TDP) study of a pair of isogenic human metastatic and non-metastatic colorectal cancer (CRC) cell lines (SW480 and SW620). This study pursues a global view of human CRC proteome before and after metastasis in a proteoform specific manner. We identified 23,319 proteoforms of 2,297 genes from the CRC cell lines using capillary zone electrophoresis-tandem mass spectrometry (CZE-MS/MS), representing nearly one order of magnitude improvement in the number of proteoform identifications from human cell lines compared to literature data. We identified 111 proteoforms containing single amino acid variants (SAAVs) using a proteogenomic approach and revealed drastic differences between the metastatic and non-metastatic cell lines regarding SAAVs profiles. Quantitative TDP analysis unveiled statistically significant differences in proteoform abundance between the SW480 and SW620 cell lines on a proteome scale for the first time. Ingenuity Pathway Analysis (IPA) disclosed that many differentially expressed genes at the proteoform level had diversified functions and were closely related to cancer. Our study represents a milestone in TDP towards the definition of human proteome in a proteoform specific manner, which will transform basic and translational biomedical research. For TOC only O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=186 SRC="FIGDIR/small/466093v1_ufig1.gif" ALT="Figure 1"> View larger version (38K): org.highwire.dtl.DTLVardef@3ee5faorg.highwire.dtl.DTLVardef@16cae5forg.highwire.dtl.DTLVardef@2c0bd0org.highwire.dtl.DTLVardef@1bb9530_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Journal of Proteome Research
215 papers in training set
Top 0.1%
23.0%
2
PROTEOMICS
35 papers in training set
Top 0.1%
10.3%
3
Analytical Chemistry
205 papers in training set
Top 0.3%
8.6%
4
Molecular & Cellular Proteomics
158 papers in training set
Top 0.4%
6.5%
5
Clinical Proteomics
10 papers in training set
Top 0.1%
6.5%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 30%
5.0%
7
Journal of Proteomics
27 papers in training set
Top 0.1%
3.7%
8
Biosensors and Bioelectronics
52 papers in training set
Top 0.6%
2.1%
9
Angewandte Chemie International Edition
81 papers in training set
Top 2%
1.7%
10
Heliyon
146 papers in training set
Top 2%
1.7%
11
iScience
1063 papers in training set
Top 17%
1.5%
12
Frontiers in Chemistry
14 papers in training set
Top 0.1%
1.5%
13
Analytica Chimica Acta
17 papers in training set
Top 0.3%
1.5%
14
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.4%
15
Nature Communications
4913 papers in training set
Top 56%
1.3%
16
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
1.0%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.0%
18
Scientific Reports
3102 papers in training set
Top 70%
0.9%
19
Molecules
37 papers in training set
Top 1%
0.9%
20
Metabolomics
11 papers in training set
Top 0.4%
0.8%
21
ACS Pharmacology & Translational Science
40 papers in training set
Top 1%
0.7%
22
ACS Omega
90 papers in training set
Top 5%
0.7%
23
Diagnostics
48 papers in training set
Top 2%
0.7%
24
Biomedicines
66 papers in training set
Top 4%
0.5%
25
Neoplasia
22 papers in training set
Top 0.9%
0.5%
26
Molecular Omics
21 papers in training set
Top 0.6%
0.5%
27
Cell Systems
167 papers in training set
Top 14%
0.5%
28
eBioMedicine
130 papers in training set
Top 6%
0.5%
29
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 12%
0.5%
30
Communications Biology
886 papers in training set
Top 31%
0.5%