Back

Molecular Mimicry Map (3M) of SARS-CoV-2: Prediction of potentially immunopathogenic SARS-CoV-2 epitopes via a novel immunoinformatic approach

An, H.; Park, J.

2020-11-12 bioinformatics
10.1101/2020.11.12.344424 bioRxiv
Show abstract

Currently, more than 33 million peoples have been infected by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and more than a million people died from coronavirus disease 2019 (COVID-19), a disease caused by the virus. There have been multiple reports of autoimmune and inflammatory diseases following SARS-CoV-2 infections. There are several suggested mechanisms involved in the development of autoimmune diseases, including cross-reactivity (molecular mimicry). A typical workflow for discovering cross-reactive epitopes (mimotopes) starts with a sequence similarity search between protein sequences of human and a pathogen. However, sequence similarity information alone is not enough to predict cross-reactivity between proteins since proteins can share highly similar conformational epitopes whose amino acid residues are situated far apart in the linear protein sequences. Therefore, we used a hidden Markov model-based tool to identify distant viral homologs of human proteins. Also, we utilized experimentally determined and modeled protein structures of SARS-CoV-2 and human proteins to find homologous protein structures between them. Next, we predicted binding affinity (IC50) of potentially cross-reactive T-cell epitopes to 34 MHC allelic variants that have been associated with autoimmune diseases using multiple prediction algorithms. Overall, from 8,138 SARS-CoV-2 genomes, we identified 3,238 potentially cross-reactive B-cell epitopes covering six human proteins and 1,224 potentially cross-reactive T-cell epitopes covering 285 human proteins. To visualize the predicted cross-reactive T-cell and B-cell epitopes, we developed a web-based application "Molecular Mimicry Map (3M) of SARS-CoV-2" (available at https://ahs2202.github.io/3M/). The web application enables researchers to explore potential cross-reactive SARS-CoV-2 epitopes alongside custom peptide vaccines, allowing researchers to identify potentially suboptimal peptide vaccine candidates or less ideal part of a whole virus vaccine to design a safer vaccine for people with genetic and environmental predispositions to autoimmune diseases. Together, the computational resources and the interactive web application provide a foundation for the investigation of molecular mimicry in the pathogenesis of autoimmune disease following COVID-19.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
ImmunoInformatics
11 papers in training set
Top 0.1%
17.2%
2
Bioinformatics
1061 papers in training set
Top 3%
8.3%
3
Briefings in Bioinformatics
326 papers in training set
Top 0.7%
7.0%
4
iScience
1063 papers in training set
Top 2%
6.3%
5
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.6%
6.3%
6
Scientific Reports
3102 papers in training set
Top 20%
6.2%
50% of probability mass above
7
Frontiers in Immunology
586 papers in training set
Top 2%
4.8%
8
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
9
Patterns
70 papers in training set
Top 0.3%
3.5%
10
Cell Systems
167 papers in training set
Top 5%
2.8%
11
Bioinformatics Advances
184 papers in training set
Top 2%
2.3%
12
Nature Machine Intelligence
61 papers in training set
Top 2%
1.7%
13
Frontiers in Genetics
197 papers in training set
Top 5%
1.6%
14
eLife
5422 papers in training set
Top 44%
1.6%
15
Communications Biology
886 papers in training set
Top 13%
1.3%
16
GigaScience
172 papers in training set
Top 2%
1.3%
17
Cell Reports Medicine
140 papers in training set
Top 5%
1.2%
18
Nucleic Acids Research
1128 papers in training set
Top 14%
1.2%
19
BMC Bioinformatics
383 papers in training set
Top 6%
1.2%
20
Cell Reports Methods
141 papers in training set
Top 4%
0.9%
21
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
0.9%
22
BMC Medical Genomics
36 papers in training set
Top 1%
0.9%
23
Nature Communications
4913 papers in training set
Top 60%
0.9%
24
Vaccines
196 papers in training set
Top 2%
0.8%
25
PLOS ONE
4510 papers in training set
Top 67%
0.8%
26
Viruses
318 papers in training set
Top 5%
0.8%
27
Journal of Chemical Information and Modeling
207 papers in training set
Top 3%
0.7%
28
Antibody Therapeutics
16 papers in training set
Top 0.7%
0.6%