Fine-Tuning PubMedBERT for Hierarchical Condition Category Classification

Wang, X.; Hammarlund, N.; Prosperi, M.; Zhu, Y.; Revere, L.

2026-04-15 health systems and quality improvement
10.64898/2026.04.13.26350814 medRxiv
Automating Hierarchical Condition Category (HCC) assignment directly from unstructured electronic health record (EHR) notes remains an important but understudied problem in clinical informatics. We present HCC-Coder, an end-to-end NLP system that maps narrative documentation to 115 Centers for Medicare & Medicaid Services (CMS) HCC codes in a multi-label setting. On the test dataset, HCC-Coder achieves a macro-F1 of 0.779 and a micro-F1 of 0.756, with a macro-sensitivity of 0.819 and a macro-specificity of 0.998. By contrast, Generative Pre-trained Transformer (GPT)-4o achieves its highest scores, a macro-F1 of 0.735 and a micro-F1 of 0.708, under five-shot prompting. The fine-tuned model thus shows consistent absolute improvements of 4 to 5 percentage points in F1 over GPT-4o. To address severe label imbalance, we incorporate inverse-frequency weighting and per-label threshold calibration. These findings suggest that domain-adapted transformers provide more balanced and reliable performance than prompt-based large language models for hierarchical clinical coding and risk adjustment.
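
The abstract names inverse-frequency weighting and per-label threshold calibration but gives no formulas, so the following is only a minimal sketch of one common way to do each, not the authors' implementation. The negatives-over-positives weight, the F1-maximizing grid search, the grid itself, and the toy arrays are all assumptions made for illustration.

```python
import numpy as np

def inverse_frequency_weights(Y):
    """Per-label positive weight = #negatives / #positives (one common form; assumed here)."""
    Y = np.asarray(Y, dtype=float)
    pos = Y.sum(axis=0)
    neg = Y.shape[0] - pos
    return neg / np.maximum(pos, 1.0)  # guard against labels with no positives

def f1_binary(y_true, y_pred):
    """F1 for one binary label; returns 0.0 when there are no positives at all."""
    tp = np.sum((y_true == 1) & (y_pred == 1))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    denom = 2 * tp + fp + fn
    return float(2 * tp / denom) if denom > 0 else 0.0

def calibrate_thresholds(Y_val, P_val, grid=None):
    """Pick, for each label, the grid threshold that maximizes F1 on validation data."""
    if grid is None:
        grid = np.linspace(0.05, 0.95, 19)  # hypothetical search grid
    Y_val, P_val = np.asarray(Y_val), np.asarray(P_val)
    thresholds = np.full(Y_val.shape[1], 0.5)
    for j in range(Y_val.shape[1]):
        scores = [f1_binary(Y_val[:, j], (P_val[:, j] >= t).astype(int)) for t in grid]
        thresholds[j] = grid[int(np.argmax(scores))]  # first best threshold wins ties
    return thresholds

def macro_micro_f1(Y_true, Y_pred):
    """Macro-F1 averages per-label F1; micro-F1 pools all label decisions."""
    Y_true, Y_pred = np.asarray(Y_true), np.asarray(Y_pred)
    macro = float(np.mean([f1_binary(Y_true[:, j], Y_pred[:, j])
                           for j in range(Y_true.shape[1])]))
    micro = f1_binary(Y_true.ravel(), Y_pred.ravel())
    return macro, micro

# Toy validation data: 4 notes, 2 labels (made up for illustration).
Y = np.array([[1, 0], [1, 0], [0, 1], [0, 0]])
P = np.array([[0.90, 0.22], [0.32, 0.10], [0.22, 0.37], [0.10, 0.05]])

weights = inverse_frequency_weights(Y)
thresholds = calibrate_thresholds(Y, P)
macro_default, micro_default = macro_micro_f1(Y, (P >= 0.5).astype(int))
macro_cal, micro_cal = macro_micro_f1(Y, (P >= thresholds).astype(int))
```

In a PyTorch training loop, weights of this kind would typically be passed as `pos_weight` to `torch.nn.BCEWithLogitsLoss`. Thresholds should be tuned on a held-out validation split rather than the test set, since tuning on test data inflates the reported F1.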

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1. Journal of Biomedical Informatics (45 papers in training set; Top 0.1%): 17.7%
2. Nature (575 papers in training set; Top 3%): 9.2%
3. Scientific Reports (3102 papers in training set; Top 23%): 4.9%
4. PLOS ONE (4510 papers in training set; Top 31%): 4.9%
5. npj Digital Medicine (97 papers in training set; Top 1.0%): 4.3%
6. BMC Medical Informatics and Decision Making (39 papers in training set; Top 0.7%): 4.0%
7. Proceedings of the National Academy of Sciences (2130 papers in training set; Top 18%): 3.9%
8. Advanced Science (249 papers in training set; Top 6%): 3.1%

50% of probability mass above

9. Journal of Personalized Medicine (28 papers in training set; Top 0.2%): 2.4%
10. Nature Computational Science (50 papers in training set; Top 0.5%): 1.8%
11. European Heart Journal - Digital Health (15 papers in training set; Top 0.3%): 1.8%
12. Bioinformatics (1061 papers in training set; Top 7%): 1.7%
13. Journal of the American Medical Informatics Association (61 papers in training set; Top 1%): 1.7%
14. Nature Communications (4913 papers in training set; Top 53%): 1.5%
15. Genome Medicine (154 papers in training set; Top 5%): 1.5%
16. Communications Medicine (85 papers in training set; Top 0.4%): 1.3%
17. Nucleic Acids Research (1128 papers in training set; Top 13%): 1.3%
18. Biometrics (22 papers in training set; Top 0.1%): 1.3%
19. BioData Mining (15 papers in training set; Top 0.5%): 1.2%
20. Nature Machine Intelligence (61 papers in training set; Top 3%): 1.1%
21. iScience (1063 papers in training set; Top 23%): 1.1%
22. NAR Genomics and Bioinformatics (214 papers in training set; Top 3%): 1.1%
23. BMC Bioinformatics (383 papers in training set; Top 6%): 1.0%
24. Medical Decision Making (10 papers in training set; Top 0.2%): 0.9%
25. Patterns (70 papers in training set; Top 2%): 0.9%
26. Science (429 papers in training set; Top 18%): 0.9%
27. Artificial Intelligence in Medicine (15 papers in training set; Top 0.6%): 0.8%
28. Nature Human Behaviour (85 papers in training set; Top 4%): 0.8%
29. Genome Biology (555 papers in training set; Top 7%): 0.8%
30. Heliyon (146 papers in training set; Top 6%): 0.8%
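
The statement that the top 8 journals account for 50% of the predicted probability mass can be checked by accumulating the listed percentages in rank order. The sketch below is illustrative only: it uses the rounded percentages of the first ten journals as displayed in the listing, so the running totals are approximate, and the variable names are hypothetical.

```python
from itertools import accumulate

# Displayed probabilities (%) for ranks 1-10, copied from the listing.
probs = [17.7, 9.2, 4.9, 4.9, 4.3, 4.0, 3.9, 3.1, 2.4, 1.8]

# Running total of probability mass by rank.
cumulative = list(accumulate(probs))

# Smallest rank at which the running total reaches 50%.
rank_at_half = next(i + 1 for i, c in enumerate(cumulative) if c >= 50.0)

print(rank_at_half)             # rank where the 50% mark is crossed
print(round(cumulative[6], 1))  # mass covered by the top 7
print(round(cumulative[7], 1))  # mass covered by the top 8
```

With these rounded values the top 7 journals cover about 48.9% and the top 8 about 52.0%, which is consistent with the 50% marker being placed after rank 8.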