Back

A Structure-based B-cell Epitope Prediction Model Through Combing Local and Global Features

Lu, S.; Li, Y.; Nan, X.; Zhang, S.

2021-07-14 bioinformatics
10.1101/2021.07.13.452188 bioRxiv
Show abstract

B-cell epitopes (BCEs) are a set of specific sites on the surface of an antigen that binds to an antibody produced by B-cell. The recognition of BCEs is a major challenge for drug design and vaccines development. Compared with experimental methods, computational approaches have strong potential for BCEs prediction at much lower cost. Moreover, most of the currently methods focus on using local information around target residue without taking the global information of the whole antigen sequence into consideration. We propose a novel deep leaning method through combing local features and global features for BCEs prediction. In our model, two parallel modules are built to extract local and global features from the antigen separately. For local features, we use Graph Convolutional Networks(GCNs) to capture information of spatial neighbors of a target residue. For global features, Attention-Based Bidirectional Long Short-Term Memory(Att-BLSTM) networks are applied to extract information from the whole antigen sequence. Then the local and global features are combined to predict BCEs. The experiments show that the proposed method achieves superior performance over the state-of-the-art BCEs prediction methods on benchmark datasets. Also, we compare the performance differences between data with or without global features. The experimental results show that global features play an important role in BCEs prediction. Our detailed case study on the BCEs prediction for SARS-Cov-2 receptor binding domain confirms that our method is effective for predicting and clustering true BCEs.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Briefings in Bioinformatics
326 papers in training set
Top 0.1%
28.9%
2
Bioinformatics
1061 papers in training set
Top 3%
10.5%
3
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
4.5%
4
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 0.4%
3.8%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 1%
3.7%
50% of probability mass above
6
PLOS Computational Biology
1633 papers in training set
Top 9%
3.7%
7
Frontiers in Immunology
586 papers in training set
Top 3%
2.9%
8
Nature Machine Intelligence
61 papers in training set
Top 2%
2.0%
9
Quantitative Biology
11 papers in training set
Top 0.2%
2.0%
10
iScience
1063 papers in training set
Top 13%
1.8%
11
IEEE/ACM Transactions on Computational Biology and Bioinformatics
32 papers in training set
Top 0.2%
1.8%
12
Scientific Reports
3102 papers in training set
Top 56%
1.8%
13
PLOS ONE
4510 papers in training set
Top 52%
1.8%
14
Journal of Chemical Information and Modeling
207 papers in training set
Top 2%
1.7%
15
BMC Bioinformatics
383 papers in training set
Top 5%
1.6%
16
Computers in Biology and Medicine
120 papers in training set
Top 3%
1.4%
17
Communications Biology
886 papers in training set
Top 13%
1.3%
18
Advanced Science
249 papers in training set
Top 15%
1.2%
19
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.4%
1.0%
20
ImmunoInformatics
11 papers in training set
Top 0.1%
1.0%
21
Nature Communications
4913 papers in training set
Top 59%
0.9%
22
Frontiers in Genetics
197 papers in training set
Top 8%
0.8%
23
Science Bulletin
22 papers in training set
Top 0.8%
0.8%
24
Journal of Genetics and Genomics
36 papers in training set
Top 2%
0.8%
25
eLife
5422 papers in training set
Top 60%
0.7%
26
Expert Systems with Applications
11 papers in training set
Top 0.5%
0.7%
27
Neurocomputing
13 papers in training set
Top 0.7%
0.5%
28
Nucleic Acids Research
1128 papers in training set
Top 20%
0.5%
29
Bioengineering
24 papers in training set
Top 2%
0.5%
30
National Science Review
22 papers in training set
Top 3%
0.5%