Back

A nationwide study of 331 rare diseases among 58 million individuals: prevalence, demographics, and COVID-19 outcomes

Thygesen, J. H.; ZHANG, H.; Issa, H.; Wu, J.; Hama, T.; Pinho-Gomes, A.-C.; Groza, T.; Khalid, S.; Lumbers, R.; Hocaoglu, M.; Khunti, K.; Priedon, R.; Banerjee, A.; Pontikos, N.; Tomlinson, C.; Torralbo, A.; Taylor, P.; Sudlow, C.; Denaxas, S.; Hemingway, H.; Wu, H.

2023-10-13 health informatics
10.1101/2023.10.12.23296948 medRxiv
Show abstract

BackgroundThe Global Burden of Disease study has provided key evidence to inform clinicians, researchers, and policy makers across common diseases, but no similar effort with single study design exists for hundreds of rare diseases. Consequently, many rare conditions lack population-level evidence including prevalence and clinical vulnerability. This has led to the absence of evidence-based care for rare diseases, prominently in the COVID-19 pandemic. MethodThis study used electronic health records (EHRs) of more than 58 million people in England, linking nine National Health Service datasets spanning healthcare settings for people alive on Jan 23, 2020. Starting with all rare diseases listed in Orphanet, we quality assured and filtered down to analyse 331 conditions with ICD-10 or SNOMED-CT mappings clinically validated in our dataset. We report 1) population prevalence, clinical and demographic details of rare diseases, and 2) investigate differences in mortality with SARs-CoV-2. FindingsAmong 58,162,316 individuals, we identified 894,396 with at least one rare disease. Prevalence data in Orphanet originates from various sources with varying degrees of precision. Here we present reproducible age and gender-adjusted estimates for all 331 rare diseases, including first estimates for 186 (56.2%) without any reported prevalence estimate in Orphanet. We identified 49 rare diseases significantly more frequent in females and 62 in males. Similarly we identified 47 rare diseases more frequent in Asian as compared to White ethnicity and 22 with higher Black to white ratios as compared to similar ratios in population controls. 37 rare diseases were overrepresented in the white population as compared to both Black and Asian ethnicities. In total, 7,965 of 894,396 (0.9%) of rare-disease patients died from COVID-19, as compared to 141,287 of 58,162,316 (0.2%) in the full study population. Eight rare diseases had significantly increased risks for COVID-19-related mortality in fully vaccinated individuals, with bullous pemphigoid (8.07[3.01-21.62]) being worst affected. InterpretationOur study highlights that National-scale EHRs provide a unique resource to estimate detailed prevalence, clinical and demographic data for rare diseases. Using COVID-19-related mortality analysis, we showed the power of large-scale EHRs in providing insights to inform public health decision-making for these often neglected patient populations. FundingBritish Heart Foundation Data Science Centre, led by Health Data Research UK. Research in contextO_ST_ABSEvidence before the studyC_ST_ABSWe have previously published the largest study looking at COVID-19 across rare diseases, but with a sample size of 158 COVID-19 infected rare disease patients and 125 unaffected relatives, from Genomics England, the power of that study was limited. We searched PubMed from database inception to Apr 21, 2023, for publications using the search terms "COVID-19" or "SARS-CoV-2" and "rare disease" or "ORPHANET", without language restrictions. There are many studies examining the severity of COVID-19 in rare disease patients. However, to date, most studies have focused on a single or a few rare diseases associated with severity of COVID-19, and not taken a comprehensive rare disease wide approach. So far no studies have examined the impact of vaccination on mortality in rare disease patients. Moreover, the sample size used to examine rare diseases is limited in most studies. The largest study we identified included 168,680 individuals but only focused on autoimmune rheumatic disease. Added value of this studyIn this study we use national scale EHR data from England to report age and gender adjusted point prevalence for 331 rare diseases, with clinically-validated ICD-10 and/or SNOMED-CT code lists. Among these, 186 (56.2%) diseases did not have existing point prevalence data available in Orphanet. To our knowledge, this is the first time that rare diseases have been examined on a national scale, encompassing a population of over 58 million people. The large sample size provides sufficient statistical power to detect and describe enough carriers of even very rare conditions <1 case per million. Our analysis of COVID-related mortality has demonstrated the clinical relevance of national data for rare diseases. Specifically, we identified eight rare conditions that are associated with a significantly increased risk of mortality from COVID-19, even among fully vaccinated individuals. Implication of all the available evidenceThese findings provide robust reproducible prevalence, gender, and ethnicity estimates for disease that may often have been under prioritised, and where such information in most cases was not previously available. Our COVID-19 mortality findings highlight the need for targeted policy and support addressing the high level of vulnerability of these patients to COVID-19.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Journal of the American College of Cardiology
12 papers in training set
Top 0.1%
10.2%
2
Nature Communications
4913 papers in training set
Top 17%
10.2%
3
BMJ
49 papers in training set
Top 0.1%
6.9%
4
European Respiratory Journal
54 papers in training set
Top 0.2%
6.4%
5
eClinicalMedicine
55 papers in training set
Top 0.1%
4.9%
6
BMJ Open
554 papers in training set
Top 4%
4.9%
7
Annals of the Rheumatic Diseases
32 papers in training set
Top 0.2%
4.3%
8
Rheumatology
21 papers in training set
Top 0.2%
3.7%
50% of probability mass above
9
Journal of Personalized Medicine
28 papers in training set
Top 0.1%
3.1%
10
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.1%
2.9%
11
The Lancet Rheumatology
11 papers in training set
Top 0.1%
2.9%
12
Scientific Reports
3102 papers in training set
Top 47%
2.4%
13
Science Translational Medicine
111 papers in training set
Top 2%
2.1%
14
eBioMedicine
130 papers in training set
Top 1%
1.8%
15
Journal of Infection
71 papers in training set
Top 1%
1.7%
16
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.5%
17
BMC Medical Genomics
36 papers in training set
Top 0.7%
1.2%
18
JAMA Pediatrics
10 papers in training set
Top 0.1%
0.9%
19
Gut
36 papers in training set
Top 0.8%
0.8%
20
Communications Medicine
85 papers in training set
Top 1%
0.8%
21
The Lancet Digital Health
25 papers in training set
Top 1%
0.8%
22
Wellcome Open Research
57 papers in training set
Top 2%
0.8%
23
PLOS Medicine
98 papers in training set
Top 5%
0.7%
24
European Journal of Epidemiology
40 papers in training set
Top 0.8%
0.7%
25
PLOS ONE
4510 papers in training set
Top 71%
0.6%
26
Genome Medicine
154 papers in training set
Top 9%
0.6%
27
Circulation
66 papers in training set
Top 3%
0.6%
28
PLOS Genetics
756 papers in training set
Top 17%
0.6%
29
Human Genetics and Genomics Advances
70 papers in training set
Top 1.0%
0.6%
30
Human Mutation
29 papers in training set
Top 0.8%
0.6%