Back

Tiny Babies, Big Data: ICD Billing Code Patterns in Neonates Diagnosed with Genetic Disease in the Neonatal Intensive Care Unit

Brokamp, E.; Arun, R.; Wojcik, M. H.; Chaudhari, B. P.; Antoniou, A. A.

2026-02-11 genetic and genomic medicine
10.64898/2026.02.08.26345857 medRxiv
Show abstract

PurposeGenetic diseases often present and are first diagnosed in the neonatal intensive care unit (NICU). Accurate identification of neonates with genetic diagnoses (GDs) in electronic health records (EHR) would enable a more complete understanding of their phenotypic spectrum, advancing care and personalized medicine. Prior research has used International Classification of Diseases (ICD) billing codes as proxies for GDs, though their accuracy for detecting confirmed GDs is uncertain. We evaluate the ICD codes for neonates with confirmed GDs and compare ICD billing code patterns between neonates with and without GD in two independent NICU cohorts. MethodsRetrospective analysis of patients admitted to the Boston Childrens Hospital (BCH) level IV NICU (1,344 neonates) and Nationwide Childrens Hospital (NCH)s neonatal network (33,315 neonates, mixed Level III/IV). For both cohorts, GDs captured by phecodes, aggregates of ICD codes, were compared with confirmed GDs. Two separate phenome-wide association studies (PheWAS) compared phecode patterns between neonates with GDs and those without, adjusting for sex, age at admission, gestational age, and NICU length of stay. ResultsGenetic phecodes were able to correctly identify 43.5% of neonates that received a GD in the BCH or NCH NICUs. Among 719 individuals with two or more genetic phecodes at BCH or NCH, 566 (78.72%) had a true GD. The BCH PheWAS analysis revealed a statistically significant positive association with atrioventricular septal defects and a negative association with bronchopulmonary dysplasia. The NCH pheWAS revealed 179 significantly associated phecodes, including many congenital anomalies. ConclusionThe use of ICD codes to identify NICU infants with GDs is neither sensitive nor accurate, though phecode analysis demonstrated stronger accuracy than sensitivity. Our data highlight clinical features of NICU infants more commonly seen in those that receive a GD (congenital heart defects) and those that are not (BPD). Our results can help to better predict and identify NICU neonates that receive a GD.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Genetics in Medicine
69 papers in training set
Top 0.1%
41.1%
2
The Journal of Pediatrics
15 papers in training set
Top 0.1%
7.1%
3
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.4%
6.6%
50% of probability mass above
4
PLOS ONE
4510 papers in training set
Top 47%
2.2%
5
JAMA Network Open
127 papers in training set
Top 1%
2.2%
6
Pediatric Research
18 papers in training set
Top 0.2%
1.8%
7
Scientific Reports
3102 papers in training set
Top 56%
1.8%
8
EClinicalMedicine
21 papers in training set
Top 0.2%
1.8%
9
Archives of Disease in Childhood
15 papers in training set
Top 0.2%
1.8%
10
Critical Care
14 papers in training set
Top 0.2%
1.8%
11
Circulation
66 papers in training set
Top 2%
1.5%
12
Journal of Clinical Medicine
91 papers in training set
Top 4%
1.5%
13
Open Heart
19 papers in training set
Top 0.8%
1.3%
14
BMJ Open
554 papers in training set
Top 10%
1.3%
15
British Journal of Anaesthesia
14 papers in training set
Top 0.6%
1.0%
16
Trials
25 papers in training set
Top 1%
0.9%
17
Journal of the American Heart Association
119 papers in training set
Top 3%
0.9%
18
Genome Medicine
154 papers in training set
Top 7%
0.9%
19
Frontiers in Pediatrics
29 papers in training set
Top 0.7%
0.9%
20
JAMA
17 papers in training set
Top 0.2%
0.8%
21
JMIR Medical Informatics
17 papers in training set
Top 1%
0.8%
22
European Journal of Human Genetics
49 papers in training set
Top 1%
0.8%
23
The American Journal of Human Genetics
206 papers in training set
Top 4%
0.8%
24
Nature Communications
4913 papers in training set
Top 61%
0.8%
25
BMJ Open Respiratory Research
32 papers in training set
Top 0.7%
0.7%
26
Communications Medicine
85 papers in training set
Top 1%
0.7%
27
Pediatric Infectious Disease Journal
16 papers in training set
Top 0.3%
0.7%
28
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 1%
0.5%
29
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.8%
0.5%
30
Pediatric Pulmonology
14 papers in training set
Top 0.4%
0.5%