Back

Correcting under-reported COVID-19 case numbers

Lachmann, A.

2020-03-18 health informatics
10.1101/2020.03.14.20036178 medRxiv
Show abstract

The COVID-19 virus has spread worldwide in a matter of a few months, while healthcare systems struggle to monitor and report current cases. Testing results have struggled with the relative capabilities, testing policies and preparedness of each affected country, making their comparison a non-trivial task. Since severe cases, which more likely lead to fatal outcomes, are detected at a higher rate than mild cases, the reported virus mortality is likely inflated in most countries. Lockdowns and changes in human behavior modulate the underlying growth rate of the virus. Under-sampling of infection cases may lead to the under-estimation of total cases, resulting in systematic mortality estimation biases. For healthcare systems worldwide it is important to know the expected number of cases that will need treatment. In this manuscript, we identify a generalizable growth rate decay reflecting behavioral change. We propose a method to correct the reported COVID-19 cases and death numbers by using a benchmark country (South Korea) with near-optimal testing coverage, with considerations on population demographics. We extrapolate expected deaths and hospitalizations with respect to observations in countries that passed the exponential growth curve. By applying our correction, we predict that the number of cases is highly under-reported in most countries and a significant burden on worldwide hospital capacity. The full analysis workflow and data is available at: https://github.com/lachmann12/covid19

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Patterns
70 papers in training set
Top 0.1%
12.5%
2
Scientific Reports
3102 papers in training set
Top 4%
12.3%
3
npj Digital Medicine
97 papers in training set
Top 0.5%
10.4%
4
Journal of Medical Internet Research
85 papers in training set
Top 0.4%
10.1%
5
Nature Communications
4913 papers in training set
Top 26%
6.8%
50% of probability mass above
6
JAMIA Open
37 papers in training set
Top 0.3%
4.3%
7
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.6%
4.2%
8
PLOS Computational Biology
1633 papers in training set
Top 11%
3.1%
9
PLOS ONE
4510 papers in training set
Top 44%
2.7%
10
Journal of Biomedical Informatics
45 papers in training set
Top 0.6%
2.4%
11
iScience
1063 papers in training set
Top 9%
2.4%
12
Communications Medicine
85 papers in training set
Top 0.1%
2.4%
13
eLife
5422 papers in training set
Top 38%
1.9%
14
JMIR Public Health and Surveillance
45 papers in training set
Top 1%
1.9%
15
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.5%
16
Communications Biology
886 papers in training set
Top 12%
1.3%
17
PNAS Nexus
147 papers in training set
Top 0.5%
1.3%
18
JMIR Medical Informatics
17 papers in training set
Top 1%
0.9%
19
PLOS Digital Health
91 papers in training set
Top 3%
0.8%
20
Med
38 papers in training set
Top 0.9%
0.7%
21
The Lancet Digital Health
25 papers in training set
Top 1%
0.6%
22
Nature Computational Science
50 papers in training set
Top 2%
0.6%
23
Computers in Biology and Medicine
120 papers in training set
Top 5%
0.6%
24
Nature Machine Intelligence
61 papers in training set
Top 4%
0.6%
25
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.6%