Back

Emerging SARS-CoV-2 mutation hotspots associated with clinical outcomes

Pang, X.; Li, P.; Zhang, L.; Que, L.; Dong, M.; Wang, Q.; Wei, Y.; Xie, B.; Xie, X.; Li, L.; Yin, C.; Wei, L.; Zhou, Q.; Li, Y.; Yu, L.; Li, W.; Mo, Z.; Leng, J.; Hu, Y.

2021-03-31 bioinformatics
10.1101/2021.03.31.437666 bioRxiv
Show abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of the ongoing coronavirus disease 2019 (COVID-19) pandemic. Understanding the influence of mutations in the SARS-CoV-2 gene on clinical outcomes and related factors is critical for treatment and prevention. Here, we analyzed 209,551 high-coverage complete virus sequences and 321 RNA-seq samples to mine the mutations associated with clinical outcome in the SARS-CoV-2 genome. Several important hotspot variants were found to be associated with severe clinical outcomes. Q57H variant in ORF3a protein were found to be associated with higher mortality rate, and was high proportion in severe cases (39.36%) and 501Y.V2 strains (100%) but poorly proportional to asymptomatic cases (10.04%). T265I could change nsp2 structure and mitochondrial permeability, and evidently higher in severe cases (20.12%) and 501Y.V2 strains (100%) but lower in asymptomatic cases (1.43%). Additionally, R203K and G204R could decrease the flexibility and immunogenic property of N protein with high frequency among severe cases, VUI 202012/01 and 484K.V2 strains. Interestingly, the SARS-CoV-2 genome was more susceptible to mutation because of the high frequency of nt14408 mutation (which located in RNA polymerase) and the high expression levels of ADAR and APOBEC in severe clinical outcomes. In conclusion, several important mutation hotspots in the SARS-CoV-2 genome associated with clinical outcomes was found in our study, and that might correlate with different SARS-CoV-2 mortality rates.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Journal of Medical Virology
137 papers in training set
Top 0.1%
17.4%
2
Virologica Sinica
10 papers in training set
Top 0.1%
6.3%
3
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 1%
6.3%
4
Frontiers in Genetics
197 papers in training set
Top 2%
4.1%
5
Briefings in Bioinformatics
326 papers in training set
Top 2%
3.9%
6
Journal of Genetics and Genomics
36 papers in training set
Top 0.3%
3.6%
7
PLOS ONE
4510 papers in training set
Top 40%
3.6%
8
Cell Discovery
54 papers in training set
Top 1%
3.6%
9
Science Bulletin
22 papers in training set
Top 0.1%
3.1%
50% of probability mass above
10
Protein & Cell
25 papers in training set
Top 0.7%
3.1%
11
The Innovation
12 papers in training set
Top 0.3%
1.9%
12
National Science Review
22 papers in training set
Top 0.8%
1.8%
13
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.8%
14
Frontiers in Microbiology
375 papers in training set
Top 5%
1.7%
15
eLife
5422 papers in training set
Top 42%
1.7%
16
Scientific Reports
3102 papers in training set
Top 59%
1.7%
17
Computational and Structural Biotechnology Journal
216 papers in training set
Top 4%
1.7%
18
Cell Research
49 papers in training set
Top 2%
1.1%
19
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.1%
20
Emerging Microbes & Infections
74 papers in training set
Top 1%
1.1%
21
Virus Research
36 papers in training set
Top 0.9%
0.9%
22
Communications Biology
886 papers in training set
Top 17%
0.9%
23
Genomics
60 papers in training set
Top 2%
0.9%
24
PLOS Computational Biology
1633 papers in training set
Top 22%
0.9%
25
Science China Life Sciences
26 papers in training set
Top 2%
0.9%
26
Acta Pharmaceutica Sinica B
11 papers in training set
Top 0.7%
0.9%
27
PLOS Pathogens
721 papers in training set
Top 8%
0.8%
28
Journal of Virology
456 papers in training set
Top 3%
0.8%
29
Clinical and Translational Medicine
30 papers in training set
Top 0.9%
0.8%
30
PeerJ
261 papers in training set
Top 15%
0.7%