Taxonomical and ontological analysis of verified natural and laboratory human coronavirus hosts

Wang, Y.; Ye, M.; Zhang, F.; Freeman, Z. T.; Yu, H.; Ye, X.; He, Y.

2023-02-06 bioinformatics
10.1101/2023.02.05.527173 bioRxiv

To fully understand COVID-19, it is critical to identify and analyze all the possible hosts of SARS-CoV-2 (the pathogen of COVID-19) and compare them with the hosts of other human coronaviruses. In this study, we collected, annotated, and performed taxonomical and ontological analysis of all the reported and verified hosts for all human coronaviruses, including SARS-CoV, MERS-CoV, SARS-CoV-2, and four others that cause the common cold. A total of 37 natural hosts and 19 laboratory animal hosts of human coronaviruses were identified based on experimental or clinical evidence. Our taxonomical ontology-based analysis found that all the verified susceptible natural and laboratory animals are therian mammals. Specifically, the 37 natural therian hosts include one wildlife marsupial mammal (i.e., Didelphis virginiana) and 36 Eutheria mammals (a.k.a. placental mammals). The 19 laboratory animal hosts are also classified as placental mammals. While several non-therian animals (including snake, housefly, and zebrafish) were reported to be likely SARS-CoV-2 hosts, our analysis excluded them due to the lack of convincing evidence. Genetically modified mouse models expressing human angiotensin-converting enzyme 2 (ACE2) or dipeptidyl peptidase-4 (DPP4) protein were more susceptible to virulent human coronaviruses and showed clear symptoms. Coronaviruses often became more virulent and adaptive in the mouse hosts after a series of viral passages in the mice. To support knowledge standardization and analysis, we have also represented the annotated host knowledge in the Coronavirus Infectious Disease Ontology (CIDO) and provided ways to automatically query the knowledge.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. Database: 51 papers in training set; Top 0.1%; 22.8%
2. PLOS ONE: 4510 papers in training set; Top 15%; 12.6%
3. Genomics, Proteomics & Bioinformatics: 171 papers in training set; Top 1%; 4.9%
4. Scientific Reports: 3102 papers in training set; Top 27%; 4.4%
5. Journal of Medical Virology: 137 papers in training set; Top 2%; 2.1%
6. Briefings in Bioinformatics: 326 papers in training set; Top 3%; 1.9%
7. Animals: 20 papers in training set; Top 0.4%; 1.7%

50% of probability mass above

8. PLOS Computational Biology: 1633 papers in training set; Top 16%; 1.7%
9. Journal of Clinical Medicine: 91 papers in training set; Top 3%; 1.7%
10. Frontiers in Genetics: 197 papers in training set; Top 5%; 1.7%
11. Computers in Biology and Medicine: 120 papers in training set; Top 2%; 1.5%
12. BioMed Research International: 25 papers in training set; Top 2%; 1.2%
13. Genome Medicine: 154 papers in training set; Top 6%; 1.2%
14. Virologica Sinica: 10 papers in training set; Top 0.2%; 1.2%
15. Viruses: 318 papers in training set; Top 4%; 1.1%
16. Frontiers in Microbiology: 375 papers in training set; Top 7%; 1.1%
17. Transboundary and Emerging Diseases: 34 papers in training set; Top 0.6%; 0.9%
18. Microbiology Spectrum: 435 papers in training set; Top 4%; 0.9%
19. Life: 27 papers in training set; Top 0.2%; 0.9%
20. The Innovation: 12 papers in training set; Top 0.7%; 0.9%
21. Scientific Data: 174 papers in training set; Top 2%; 0.9%
22. Nucleic Acids Research: 1128 papers in training set; Top 15%; 0.9%
23. Informatics in Medicine Unlocked: 21 papers in training set; Top 1%; 0.8%
24. Communications Biology: 886 papers in training set; Top 21%; 0.8%
25. Clinical Infectious Diseases: 231 papers in training set; Top 4%; 0.8%
26. Journal of Genetics and Genomics: 36 papers in training set; Top 2%; 0.8%
27. Heliyon: 146 papers in training set; Top 5%; 0.8%
28. Biology: 43 papers in training set; Top 2%; 0.8%
29. Emerging Microbes & Infections: 74 papers in training set; Top 2%; 0.8%
30. F1000Research: 79 papers in training set; Top 4%; 0.8%
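
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked with a running sum over the listed percentages. The sketch below is only an illustration: the function name `journals_to_reach` is hypothetical, the values are the rounded percentages copied from the list above, and the result is therefore approximate rather than a reproduction of how the page computes its cutoff.

```python
# Predicted probabilities (percent) for the eight highest-ranked journals,
# copied from the list above. These are rounded display values.
probabilities = [
    ("Database", 22.8),
    ("PLOS ONE", 12.6),
    ("Genomics, Proteomics & Bioinformatics", 4.9),
    ("Scientific Reports", 4.4),
    ("Journal of Medical Virology", 2.1),
    ("Briefings in Bioinformatics", 1.9),
    ("Animals", 1.7),
    ("PLOS Computational Biology", 1.7),
]


def journals_to_reach(entries, threshold):
    """Return (count, cumulative) for the shortest ranked prefix whose
    summed probability reaches `threshold` (hypothetical helper)."""
    total = 0.0
    for count, (_, probability) in enumerate(entries, start=1):
        total += probability
        if total >= threshold:
            return count, round(total, 1)
    # Threshold not reached within the supplied entries.
    return None, round(total, 1)


count, mass = journals_to_reach(probabilities, 50.0)
print(count, mass)  # prints "7 50.4"
```

With the rounded values, the first six journals sum to about 48.7% and the seventh brings the total to about 50.4%, which agrees with the cutoff marked in the list.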