Back

CAMP: a Convolutional Attention-based Neural Network for Multifaceted Peptide-protein Interaction Prediction

Lei, Y.; Li, S.; Liu, Z.; Wan, F.; Tian, T.; Li, S.; Zhao, D.; Zeng, J.

2020-11-16 bioinformatics
10.1101/2020.11.16.384784 bioRxiv
Show abstract

Peptide-protein interactions (PepPIs) are involved in various fundamental cellular functions and their identification is crucial for designing efficacious peptide therapeutics. To facilitate the peptide drug discovery process, a number of computational methods have been developed to predict peptide-protein interactions. However, most of the existing prediction approaches heavily depend on high-resolution structure data. Although several deep-learning-based frameworks have been proposed to predict compound-protein interactions or protein-protein interactions, few of them are particularly designed to specifically predict peptide-protein interactions. In this paper, We present a sequence-based Convolutional Attention-based neural network for Multifaceted prediction of Peptide-protein interactions, called CAMP, including predicting binary peptide-protein interactions and corresponding binding residues in the peptides. We also construct a benchmark dataset containing high-quality peptide-protein interaction pairs with the corresponding peptide binding residues for model training and evaluation. CAMP incorporates convolution neural network architectures and attention mechanism to fully exploit informative sequence-based features, including secondary structures, physicochemical properties, intrinsic disorder features and position-specific scoring matrix of the protein. Systematical evaluation of our benchmark dataset demonstrates that CAMP outperforms the state-of-the-art baseline methods on binary peptide-protein interaction prediction. In addition, CAMP can successfully identify the binding residues involved non-covalent interactions for peptides. These results indicate that CAMP can serve as a useful tool in peptide-protein interaction prediction and peptide binding site identification, which can thus greatly facilitate the peptide drug discovery process. The source code of CAMP can be found in https://github.com/twopin/CAMP.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Briefings in Bioinformatics
326 papers in training set
Top 0.1%
27.6%
2
Journal of Chemical Information and Modeling
207 papers in training set
Top 0.5%
12.3%
3
Bioinformatics
1061 papers in training set
Top 3%
8.4%
4
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
6.3%
50% of probability mass above
5
Nature Machine Intelligence
61 papers in training set
Top 0.8%
4.0%
6
Advanced Science
249 papers in training set
Top 6%
3.6%
7
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.4%
8
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.8%
2.1%
9
Quantitative Biology
11 papers in training set
Top 0.2%
1.9%
10
Scientific Reports
3102 papers in training set
Top 55%
1.8%
11
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.3%
1.3%
12
PLOS ONE
4510 papers in training set
Top 60%
1.2%
13
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
14
Science Bulletin
22 papers in training set
Top 0.6%
0.9%
15
Molecules
37 papers in training set
Top 1%
0.9%
16
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.5%
0.9%
17
Journal of Cheminformatics
25 papers in training set
Top 0.5%
0.9%
18
Communications Biology
886 papers in training set
Top 21%
0.8%
19
Communications Chemistry
39 papers in training set
Top 1.0%
0.8%
20
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.7%
21
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
22
Nature Communications
4913 papers in training set
Top 66%
0.6%
23
ACS Omega
90 papers in training set
Top 5%
0.6%
24
Science China Life Sciences
26 papers in training set
Top 3%
0.6%
25
Nucleic Acids Research
1128 papers in training set
Top 20%
0.6%