Back

Standardised Human Phenotype Ontology Annotation Enables High Quality Phenotypic Data Capture in a Real-World Common Variable Immunodeficiency Cohort

Campos, L. C.; Favreau, E.; Greene, D.; Blach, J.; Thomas, M.; Alsehaim, K.; Mutlu, L.; Elhadari, S.; Herwadkar, A.; Payne, J.; Lever, C.; Mahmoud, D.; Moreira, F.; O'Sullivan, M.; Berry, M.; Twigg, G.; Hart, A. C. J.; Joshi, N.; Fuller, S.; INTREPID Consortium, ; Smith, K. G. C.; Turro, E.; Cook, M. C.; Wallace, C.; Burns, S. O.

2026-04-29 allergy and immunology
10.64898/2026.04.29.26350963 medRxiv
Show abstract

BackgroundPatients with Common Variable Immunodeficiency (CVID) exhibit diverse clinical manifestations, indicating heterogeneity in pathogenic mechanisms. Systematic application of standardised phenotyping in large cohorts is essential to dissect this heterogeneity. The Human Phenotype Ontology (HPO) provides a structured framework for capturing and comparing disease phenotypes. ObjectiveTo evaluate the implementation and outcomes of HPO-based phenotyping in CVID patients enrolled for whole-genome sequencing in a large national adult primary immunodeficiency cohort. MethodsWe developed a web-based Phenotype Capture Tool and delivered structured clinician training to standardise HPO annotation. Numerical laboratory parameters were mapped to corresponding HPO terms to enrich patient records. ResultsWe coded the phenotypes of 526 CVID patients across 11 UK centres. Clinician training increased phenotype granularity and improved phenotyping consistency between clinicians. We assigned 883 unique HPO terms across the cohort and applied logical rules to the terms to classify patients into an infection-only group and a complex phenotype group (42% vs 58%, respectively). Patients in the complex phenotype group were significantly more likely to have reduced switched memory and expanded CD21low B cells, as well as pathogenic variants in IUIS-listed genes overall and pathogenic NFKB1 variants specifically. Having a pathogenic variant in an IUIS-listed gene was associated with Autoimmune hemolytic anemia and having a pathogenic NFKB1 variant specifically was associated with Autoimmune neutropenia. ConclusionThis is the first study to systematically collect granular HPO-coded phenotypes in a large real-world CVID cohort, refining the CVID landscape and providing a comprehensive CVID HPO term set relevant for international research. Clinical ImplicationHPO allows systematic capture of CVID phenotypes with low inter-clinician variability and improves comparison of cohorts, enhancing identification of disease heterogeneity essential to support genotype-phenotype studies and targeted therapeutic strategies. Capsule summaryHPO-based phenotyping of 526 CVID patients improved annotation quality, identifying immunological and genetic associations with clinical manifestations, distinguishing infection-only from complex disease and refining clinical characterisation to support international collaboration.

Matching journals

The top 14 journals account for 50% of the predicted probability mass.

1
Clinical Infectious Diseases
231 papers in training set
Top 0.6%
7.3%
2
PLOS ONE
4510 papers in training set
Top 24%
7.0%
3
Nature Communications
4913 papers in training set
Top 28%
6.5%
4
Frontiers in Immunology
586 papers in training set
Top 1%
5.0%
5
Genes & Immunity
11 papers in training set
Top 0.1%
4.4%
6
Wellcome Open Research
57 papers in training set
Top 0.3%
3.7%
7
The Journal of Infectious Diseases
182 papers in training set
Top 1%
2.8%
8
eBioMedicine
130 papers in training set
Top 0.6%
2.4%
9
Clinical Immunology
21 papers in training set
Top 0.2%
2.1%
10
Journal of Clinical Pathology
12 papers in training set
Top 0.1%
2.1%
11
British Journal of Haematology
15 papers in training set
Top 0.2%
1.9%
12
International Journal of Medical Informatics
25 papers in training set
Top 0.7%
1.8%
13
Annals of the Rheumatic Diseases
32 papers in training set
Top 0.4%
1.8%
14
Journal of Clinical Investigation
164 papers in training set
Top 3%
1.7%
50% of probability mass above
15
Archives of Disease in Childhood
15 papers in training set
Top 0.2%
1.7%
16
European Respiratory Journal
54 papers in training set
Top 0.9%
1.7%
17
Journal of Allergy and Clinical Immunology
25 papers in training set
Top 0.4%
1.7%
18
BMJ Open Respiratory Research
32 papers in training set
Top 0.4%
1.7%
19
Genetics
225 papers in training set
Top 3%
1.4%
20
Infection
15 papers in training set
Top 0.2%
1.4%
21
The Lancet Rheumatology
11 papers in training set
Top 0.1%
1.4%
22
Journal of Clinical Microbiology
120 papers in training set
Top 1%
1.4%
23
The American Journal of Human Genetics
206 papers in training set
Top 3%
1.3%
24
Immunology
29 papers in training set
Top 0.6%
1.3%
25
Cell Reports Methods
141 papers in training set
Top 3%
1.3%
26
BMJ Open
554 papers in training set
Top 10%
1.3%
27
JCI Insight
241 papers in training set
Top 5%
1.0%
28
BMC Medicine
163 papers in training set
Top 6%
0.9%
29
Rheumatology
21 papers in training set
Top 0.3%
0.9%
30
Journal of Translational Medicine
46 papers in training set
Top 2%
0.9%