Back

Fine-Tuning PubMedBERT for Hierarchical Condition Category Classification

Wang, X.; Hammarlund, N.; Prosperi, M.; Zhu, Y.; Revere, L.

2026-04-15 health systems and quality improvement
10.64898/2026.04.13.26350814 medRxiv
Show abstract

Automating Hierarchical Condition Category (HCC) assignment directly from unstructured electronic health record (EHR) notes remains an important but understudied problem in clinical informatics. We present HCC-Coder, an end-to-end NLP system that maps narrative documentation to 115 Centers for Medicare & Medicaid Services(CMS) HCC codes in a multi-label setting. On the test dataset, HCC-Coder achieves a macro-F1 of 0.779 and a micro-F1 of 0.756, with a macro-sensitivity of 0.819 and macro-specificity of 0.998. By contrast, Generative Pre-trained Transformer (GPT)-4o achieves the highest score of a macro-F1 of 0.735 and a micro-F1 of 0.708 under five-shot prompting. The fine-tuned model demonstrates consistent absolute improvements of 4%-5% in F1-scores over GPT-4o. To address severe label imbalance, we incorporate inverse-frequency weighting and per-label threshold calibration. These findings suggest that domain-adapted transformers provide more balanced and reliable performance than prompt-based large language models for hierarchical clinical coding and risk adjustment.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Journal of Biomedical Informatics
45 papers in training set
Top 0.1%
17.8%
2
Nature
575 papers in training set
Top 3%
9.3%
3
Scientific Reports
3102 papers in training set
Top 22%
4.9%
4
PLOS ONE
4510 papers in training set
Top 30%
4.9%
5
npj Digital Medicine
97 papers in training set
Top 1%
4.4%
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 19%
3.7%
7
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.7%
3.7%
8
Advanced Science
249 papers in training set
Top 6%
3.1%
50% of probability mass above
9
Journal of Personalized Medicine
28 papers in training set
Top 0.2%
2.1%
10
European Heart Journal - Digital Health
15 papers in training set
Top 0.3%
1.9%
11
Nature Computational Science
50 papers in training set
Top 0.5%
1.8%
12
Bioinformatics
1061 papers in training set
Top 7%
1.7%
13
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
14
Nature Communications
4913 papers in training set
Top 54%
1.5%
15
Genome Medicine
154 papers in training set
Top 5%
1.5%
16
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
17
Communications Medicine
85 papers in training set
Top 0.3%
1.5%
18
Biometrics
22 papers in training set
Top 0.1%
1.2%
19
BioData Mining
15 papers in training set
Top 0.5%
1.2%
20
iScience
1063 papers in training set
Top 23%
1.1%
21
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
1.1%
22
Nature Machine Intelligence
61 papers in training set
Top 3%
1.1%
23
BMC Bioinformatics
383 papers in training set
Top 6%
1.0%
24
Medical Decision Making
10 papers in training set
Top 0.2%
1.0%
25
Science
429 papers in training set
Top 18%
0.9%
26
Patterns
70 papers in training set
Top 2%
0.9%
27
Nature Human Behaviour
85 papers in training set
Top 4%
0.8%
28
Artificial Intelligence in Medicine
15 papers in training set
Top 0.6%
0.8%
29
The Lancet Digital Health
25 papers in training set
Top 1.0%
0.8%
30
Genome Biology
555 papers in training set
Top 7%
0.8%