Back

Single source of pangolin CoVs with a near identical Spike RBD to SARS-CoV-2

Chan, Y. A.; Zhan, S. H.

2020-10-23 genomics
10.1101/2020.07.07.184374 bioRxiv
Show abstract

Multiple publications have independently described pangolin CoV genomes from the same batch of smuggled pangolins confiscated in Guangdong province in March, 2019. We analyzed the three metagenomic datasets that sampled this batch of pangolins and found that the two complete pangolin CoV genomes, GD_1 by Xiao et al. Nature and MP789 by Liu et al. PLoS Pathogens, were both built primarily using the 2019 dataset first described by Liu et al. Viruses. Other publications, such as Zhang et al. Current Biology and Lam et al. Nature, have also relied on this same dataset by Liu et al. Viruses for their assembly of the Guangdong pangolin CoV sequences and comparisons to SARS-CoV-2. To our knowledge, all of the published pangolin CoV genome sequences that share a highly similar Spike receptor binding domain with SARS-CoV-2 originate from this singular batch of smuggled pangolins. This raises the question of whether pangolins are truly reservoirs or hosts of SARS-CoV-2-related coronaviruses in the wild, or whether the pangolins may have contracted the CoV from another host species during trafficking. Our observations highlight the importance of requiring authors to publish their complete genome assembly pipeline and all contributing raw sequence data, particularly those supporting epidemiological investigations, in order to empower peer review and independent analysis of the sequence data. This is necessary to ensure both the accuracy of the data and the conclusions presented by each publication.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Cell
370 papers in training set
Top 0.1%
39.5%
2
Nature
575 papers in training set
Top 2%
14.7%
50% of probability mass above
3
Nature Communications
4913 papers in training set
Top 22%
8.4%
4
Science
429 papers in training set
Top 6%
4.9%
5
Nature Genetics
240 papers in training set
Top 2%
4.9%
6
Nature Microbiology
133 papers in training set
Top 1%
3.6%
7
eLife
5422 papers in training set
Top 40%
1.8%
8
Cell Reports
1338 papers in training set
Top 24%
1.7%
9
Cell Host & Microbe
113 papers in training set
Top 3%
1.7%
10
Science Translational Medicine
111 papers in training set
Top 3%
1.7%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 36%
1.3%
12
mBio
750 papers in training set
Top 9%
1.2%
13
Molecular Cell
308 papers in training set
Top 8%
1.2%
14
Molecular Biology and Evolution
488 papers in training set
Top 3%
1.2%
15
Nature Structural & Molecular Biology
218 papers in training set
Top 4%
1.0%
16
Communications Biology
886 papers in training set
Top 19%
0.9%
17
Nature Biotechnology
147 papers in training set
Top 7%
0.8%
18
Current Biology
596 papers in training set
Top 13%
0.8%
19
Virus Evolution
140 papers in training set
Top 1%
0.7%
20
Cell Discovery
54 papers in training set
Top 5%
0.7%
21
Science Advances
1098 papers in training set
Top 31%
0.7%
22
Cell Systems
167 papers in training set
Top 13%
0.7%
23
Cell Reports Medicine
140 papers in training set
Top 9%
0.6%
24
mSphere
281 papers in training set
Top 7%
0.6%