Back

The COVID-19 Infodemic: The complex task of elevating signal and eliminating noise.

DESAI, T.; Conjeevaram, A.

2021-01-20 medical education
10.1101/2021.01.19.21249936 medRxiv
Show abstract

In Situation Report #3 and 39 days before declaring COVID-19 a pandemic, the WHO declared a -19 infodemic. The volume of coronavirus tweets was far too great for one to find accurate or reliable information. Healthcare workers were flooded with which drowned the of valuable COVID-19 information. To combat the infodemic, physicians created healthcare-specific micro-communities to share scientific information with other providers. We analyzed the content of eight physician-created communities and categorized each message in one of five domains. We coded 1) an application programming interface to download tweets and their metadata in JavaScript Object Notation and 2) a reading algorithm using visual basic application in Excel to categorize the content. We superimposed the publication date of each tweet into a timeline of key pandemic events. Finally, we created NephTwitterArchive.com to help healthcare workers find COVID-19-related signal tweets when treating patients. We collected 21071 tweets from the eight hashtags studied. Only 9051 tweets were considered signal: tweets categorized into both a domain and subdomain. There was a trend towards fewer signal tweets as the pandemic progressed, with a daily median of 22% (IQR 0-42%. The most popular subdomain in Prevention was PPE (2448 signal tweets). In Therapeutics, Hydroxychloroquine/chloroquine wwo Azithromycin and Mechanical Ventilation were the most popular subdomains. During the active Infodemic phase (Days 0 to 49), a total of 2021 searches were completed in NephTwitterArchive.com, which was a 26% increase from the same time period before the pandemic was declared (Days -50 to -1). The COVID-19 Infodemic indicates that future endeavors must be undertaken to eliminate noise and elevate signal in all aspects of scientific discourse on Twitter. In the absence of any algorithm-based strategy, healthcare providers will be left with the nearly impossible task of manually finding high-quality tweets from amongst a tidal wave of noise.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Journal of Medical Internet Research
85 papers in training set
Top 0.1%
47.3%
2
npj Digital Medicine
97 papers in training set
Top 0.1%
27.6%
50% of probability mass above
3
Scientific Reports
3102 papers in training set
Top 43%
2.8%
4
International Journal of Medical Informatics
25 papers in training set
Top 0.6%
2.2%
5
Nature Communications
4913 papers in training set
Top 54%
1.4%
6
PLOS Biology
408 papers in training set
Top 13%
1.3%
7
PLOS ONE
4510 papers in training set
Top 62%
0.9%
8
Frontiers in Medicine
113 papers in training set
Top 5%
0.9%
9
PLOS Digital Health
91 papers in training set
Top 2%
0.8%
10
Nature Medicine
117 papers in training set
Top 4%
0.8%
11
Patterns
70 papers in training set
Top 2%
0.8%
12
Frontiers in Public Health
140 papers in training set
Top 7%
0.8%
13
Journal of Biomedical Informatics
45 papers in training set
Top 1%
0.8%
14
iScience
1063 papers in training set
Top 29%
0.8%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 44%
0.8%
16
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.8%
17
BMJ Open
554 papers in training set
Top 12%
0.8%
18
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%
19
Nature Human Behaviour
85 papers in training set
Top 5%
0.5%
20
Science Advances
1098 papers in training set
Top 34%
0.5%
21
BioData Mining
15 papers in training set
Top 1%
0.5%
22
The Lancet Digital Health
25 papers in training set
Top 1%
0.5%
23
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 3%
0.5%