Back

Phenotype Execution and Modelling Architecture (PhEMA) to support disease surveillance and real-world evidence studies: English sentinel network evaluation.

Jamie, G.; Elson, W.; de Lusignan, S.; Kar, D.; Wimalaratna, R.; Hoang, U.; Meza-Torres, B.; Forbes, A.; Hinton, W.; Anand, S.; Ferreira, F.; Ordonez-Mena, J.; Agrawal, U.; Byford, R.

2023-11-22 health informatics
10.1101/2023.11.21.23298758 medRxiv
Show abstract

ObjectiveTo evaluate Phenotype Execution and Modelling Architecture (PhEMA), to express sharable phenotypes using Clinical Query Language (CQL) and intensional SNOMED CT Fast Healthcare Interoperability Resources (FHIR) valuesets, for exemplar chronic disease, sociodemographic risk factor and surveillance phenotypes. MethodWe curated three phenotypes: Type 2 diabetes (T2DM), excessive alcohol use and incident influenza-like illness (ILI) using CQL to define clinical and administrative logic. We defined our phenotypes with valuesets, using SNOMEDs hierarchy and expression constraint language (ECL), and CQL, combining valuesets and adding temporal elements where needed. We compared the count of cases found using PhEMA with our existing approach using convenience datasets. ResultsThe T2DM phenotype could be defined as two intensionally defined SNOMED valuesets and a CQL script. It increased the prevalence from 7.2% to 7.3%. Excess alcohol phenotype was defined by valuesets that added qualitative clinical terms to the quantitative conceptual definitions we currently use; this change increased prevalence by 58%, from 1.2% to 1.9%. We created an ILI valueset with SNOMED concepts, adding a temporal element using CQL to differentiate new episodes. This increased the weekly incidence in our convenience sample (weeks 26 to 38) from 0.95 cases to 1.11 cases per 100,000 people. ConclusionsPhenotypes for surveillance and research can be described fully and comprehensibly using CQL and intensional FHIR valuesets. Our use case phenotypes identified a greater number of cases, whilst anticipated from excessive alcohol this was not for our other variable. This may have been due to our use of SNOMED CT hierarchy.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
JAMIA Open
37 papers in training set
Top 0.1%
22.3%
2
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.1%
14.2%
3
JMIR Medical Informatics
17 papers in training set
Top 0.1%
6.3%
4
JMIR Public Health and Surveillance
45 papers in training set
Top 0.4%
4.3%
5
PLOS ONE
4510 papers in training set
Top 34%
4.3%
50% of probability mass above
6
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.8%
3.6%
7
Journal of Biomedical Informatics
45 papers in training set
Top 0.5%
3.6%
8
BMJ Open
554 papers in training set
Top 6%
3.2%
9
International Journal of Medical Informatics
25 papers in training set
Top 0.5%
3.0%
10
npj Digital Medicine
97 papers in training set
Top 2%
2.3%
11
BMJ Health & Care Informatics
13 papers in training set
Top 0.3%
2.1%
12
BMC Medical Research Methodology
43 papers in training set
Top 0.5%
2.1%
13
Journal of Medical Internet Research
85 papers in training set
Top 2%
1.9%
14
International Journal of Environmental Research and Public Health
124 papers in training set
Top 3%
1.9%
15
Frontiers in Public Health
140 papers in training set
Top 5%
1.7%
16
Scientific Reports
3102 papers in training set
Top 59%
1.7%
17
BMC Health Services Research
42 papers in training set
Top 1%
1.6%
18
Frontiers in Digital Health
20 papers in training set
Top 0.8%
1.5%
19
The Lancet Digital Health
25 papers in training set
Top 0.7%
1.1%
20
JMIR Formative Research
32 papers in training set
Top 1%
0.9%
21
Journal of Personalized Medicine
28 papers in training set
Top 1%
0.8%
22
Acta Neuropsychiatrica
12 papers in training set
Top 0.9%
0.8%
23
BMC Infectious Diseases
118 papers in training set
Top 6%
0.6%
24
Wellcome Open Research
57 papers in training set
Top 3%
0.6%
25
DIGITAL HEALTH
12 papers in training set
Top 0.8%
0.6%