Back

Single center Automated, Multi-Source deeply Phenotyped Heart Transplant Registry as a template to build tailored data infrastructure

Patel, K.; Eager, T. N.; Ghobrial, M.; Moore, L. W.; Guha, A.; Martin, C.; Akay, M. H.; Loza, L.; Jones, S. L.; Gaber, A. O.; Bhimaraj, A.

2026-02-09 cardiovascular medicine
10.64898/2026.02.07.26345785 medRxiv
Show abstract

BackgroundTraditional heart transplant registries often lack the granularity required for deep phenotyping and rely on labor-intensive manual abstraction. We describe the methodology and validation of a next-generation, automated, multi-source registry designed to address these limitations. MethodsUtilizing a High-Performance Computing environment, we integrated structured data from Epic data warehouses (Clarity and Caboodle), external molecular diagnostics, and verified UNOS survival records. A custom deterministic rule-based Natural Language Processing (NLP) engine was developed to extract echocardiographic measures, rejection grades, and vasculopathy scores from over 21,000 unstructured clinical reports. ResultsThe Houston Methodist J.C. Walter Jr. Transplant Center Precision Registry and Platform-Heart (TCPR-Heart) captures 1,687 heart transplants (1,636 patients) spanning the years 1984-2025. The TCPR-Heart comprises 1,054 transplants with active clinical follow-up: 555 transplants were extracted and abstracted from our modern electronic health record (EHR) in the decade since deployment, providing access to data throughout the patients course of heart transplant; 427 were legacy active transplants (transplanted pre-2016 with continued follow-up), and 72 were external transplants (transplanted elsewhere but followed at Methodist). Additionally, the registry houses a historic cohort of 633 transplants (last follow-up < June 2016) with limited variables. Automated deep phenotyping successfully generated longitudinal data trends across clinical domains, including immunosuppression strategies, rejection, immunologic HLA data, renal function, metabolic profiles, vasculopathy, graft function, hospitalization burden and survival information. ConclusionThis automated framework unifies clinical, administrative, and molecular data streams. By leveraging an automated, regularly updated registry, we established a scalable, high-fidelity data source as a foundation for further innovations and novel applications based on an expertly curated and validated data source.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Circulation
66 papers in training set
Top 0.3%
10.7%
2
European Heart Journal - Digital Health
15 papers in training set
Top 0.1%
10.4%
3
The Journal of Heart and Lung Transplantation
10 papers in training set
Top 0.1%
8.7%
4
npj Digital Medicine
97 papers in training set
Top 1.0%
4.4%
5
Journal of the American Heart Association
119 papers in training set
Top 2%
4.4%
6
Nature Communications
4913 papers in training set
Top 36%
4.1%
7
PLOS ONE
4510 papers in training set
Top 37%
3.7%
8
The American Journal of Cardiology
15 papers in training set
Top 0.6%
3.7%
50% of probability mass above
9
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.4%
3.7%
10
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.9%
3.0%
11
American Journal of Transplantation
15 papers in training set
Top 0.1%
2.1%
12
Kidney360
22 papers in training set
Top 0.3%
1.9%
13
Scientific Reports
3102 papers in training set
Top 52%
1.9%
14
The Lancet Digital Health
25 papers in training set
Top 0.3%
1.9%
15
Circulation: Heart Failure
14 papers in training set
Top 0.2%
1.8%
16
Genome Medicine
154 papers in training set
Top 4%
1.7%
17
Journal of the American College of Cardiology
12 papers in training set
Top 0.4%
1.4%
18
Frontiers in Cardiovascular Medicine
49 papers in training set
Top 2%
1.4%
19
Frontiers in Physiology
93 papers in training set
Top 3%
1.4%
20
Journal of Translational Medicine
46 papers in training set
Top 1%
1.3%
21
JACC: Clinical Electrophysiology
11 papers in training set
Top 0.2%
1.3%
22
American Journal of Respiratory Cell and Molecular Biology
38 papers in training set
Top 0.6%
1.0%
23
European Heart Journal
16 papers in training set
Top 0.6%
1.0%
24
iScience
1063 papers in training set
Top 28%
0.8%
25
Heart
10 papers in training set
Top 0.8%
0.8%
26
Diabetologia
36 papers in training set
Top 0.9%
0.8%
27
Nature Medicine
117 papers in training set
Top 5%
0.8%
28
BMC Medical Informatics and Decision Making
39 papers in training set
Top 3%
0.7%
29
EBioMedicine
39 papers in training set
Top 1%
0.7%
30
PLOS Digital Health
91 papers in training set
Top 3%
0.7%