Back

Biologically Informed Prediction of Response to Neoadjuvant Chemotherapy using Routine Clinical Data in Breast Cancer

Teng, X.; Jiang, Y.; Cho, W. C.; Wang, H.; Ma, J.; Zhao, M.; Meng, X.; Xiao, H.; Lai, Q.; Zhang, X.; Xie, H.; Li, T.; Li, Z.; Ren, G.; CHEUNG, A. L.-Y.; Cai, J.

2026-01-22 oncology
10.64898/2026.01.20.26344418
Show abstract

BackgroundEarly and accurate prediction of pathological complete response (pCR) is essential for personalizing neoadjuvant chemotherapy (NACT) in invasive breast cancer. However, most high-performing predictive models rely on costly, multi-modal data that are not routinely available in standard clinical practice. PurposeTo develop and validate Breast Cancer Biological Multi-modal Information Transfer for Response Prediction Model (BC-BioMIXER), a biologically informed predictive model that transfers multi-omics-derived knowledge to routine clinical data, enabling accurate prediction of pathological complete response prior to neoadjuvant chemotherapy initiation. Material and MethodsBC-BioMIXER was developed in a multi-modality cohort of 648 patients with invasive breast cancer (T2-4, any N, M0) incorporating transcriptomic, proteomic, MRI, and clinical data. The model was externally validated in three independent cohorts (total N = 830), including one multi-modality cohort, one clinical trial cohort, and one contemporary real-world cohort. All patients received NACT followed by surgery. The framework employs a teacher-student knowledge-transfer paradigm in which a multi-omics teacher model learns biologically integrated representations that are subsequently transferred to a student model using only routine clinical data. Predictive performance for pCR was benchmarked against a multi-modality reference model and evaluated across cohorts, receptor-defined subgroups (HER2 and hormone receptor [HR]), and treatment groups (NACT with or without immune checkpoint inhibitors [ICI]). Prognostic value was assessed using distant recurrence-free survival (DRFS). The potential to inform immunotherapy decision-making was explored by comparing DRFS between NACT + ICI and NACT-alone groups within model-predicted pCR and non-pCR subgroups. ResultsBC-BioMIXER achieved pCR prediction performance comparable to the multi-modality benchmark (AUC 0.82 vs. 0.85; p = 0.271) and demonstrated consistent discrimination across all validation cohorts (AUCs 0.82, 0.81, and 0.80; all p < 0.001). Patients predicted to achieve pCR experienced significantly improved 3-year DRFS (HR = 0.36; 95% CI, 0.20-0.67; p < 0.001). In patients treated with NACT + ICI, BC-BioMIXER showed numerically superior pCR prediction compared with PD-L1 expression alone (AUC 0.84 vs. 0.72; p = 0.08). Notably, within the model-predicted non-pCR subgroup, patients receiving NACT + ICI had significantly inferior DRFS compared with those receiving NACT alone (HR = 2.70; p = 0.032), whereas no significant difference was observed in the predicted pCR subgroup. ConclusionBC-BioMIXER translates multi-omics-derived biological knowledge into a robust, routine-data-based predictive tool for breast cancer NACT. Its consistent validation across evolving clinical settings and its potential to inform personalized immunotherapy strategies highlight a step toward scalable and accessible precision oncology. HighlightsO_LIBrings multi-omics power to routine clinical practice: Through cross-modality knowledge transfer, BC-BioMIXER leverages transcriptomic and proteomic data during training to enable highly accurate pCR prediction using only standard MRI and clinical variables (AUC 0.82 vs. 0.85 for full multi-modality benchmark, p=0.271). C_LIO_LIConsistently strong and generalizable performance: Validated in three independent cohorts (total N=830), the model maintained robust pCR discrimination (AUC 0.80-0.82, all p<0.001) across receptor subtypes (HR/HER2) and treatment regimens, including with or without immune checkpoint inhibitors. C_LIO_LIGuides personalized immunotherapy de-escalation: In HER2-negative patients predicted as non-pCR, adding ICI to neoadjuvant chemotherapy was associated with significantly worse distant recurrence-free survival (HR 2.70, p=0.032) compared to chemotherapy alone. This effect was not seen in the predicted pCR group, suggesting the model may help identify patients unlikely to benefit from additional immunotherapy. C_LI

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
based on 483 papers
Top 2%
17.7%
2
npj Precision Oncology
based on 14 papers
Top 0.1%
7.9%
3
Clinical Cancer Research
based on 22 papers
Top 0.4%
7.9%
4
eLife
based on 262 papers
Top 2%
6.7%
5
JCO Clinical Cancer Informatics
based on 14 papers
Top 0.3%
6.7%
6
Breast Cancer Research
based on 11 papers
Top 0.2%
4.9%
50% of probability mass above
7
Cancers
based on 57 papers
Top 4%
3.1%
8
Scientific Reports
based on 701 papers
Top 52%
2.9%
9
PLOS Computational Biology
based on 141 papers
Top 4%
2.9%
10
PLOS ONE
based on 1737 papers
Top 82%
2.5%
11
Journal for ImmunoTherapy of Cancer
based on 14 papers
Top 1.0%
2.5%
12
British Journal of Cancer
based on 22 papers
Top 2%
2.4%
13
Proceedings of the National Academy of Sciences
based on 100 papers
Top 7%
1.6%
14
JCO Precision Oncology
based on 11 papers
Top 1%
1.6%
15
JAMA Network Open
based on 125 papers
Top 10%
1.6%
16
Communications Medicine
based on 63 papers
Top 1%
1.4%
17
Cancer Medicine
based on 17 papers
Top 3%
1.4%
18
Frontiers in Oncology
based on 34 papers
Top 5%
1.4%
19
International Journal of Radiation Oncology*Biology*Physics
based on 13 papers
Top 2%
1.2%
20
Leukemia
based on 11 papers
Top 1%
0.9%
21
Communications Biology
based on 36 papers
Top 4%
0.8%
22
npj Digital Medicine
based on 85 papers
Top 12%
0.8%
23
Journal of Clinical Investigation
based on 50 papers
Top 3%
0.8%
24
BMC Cancer
based on 21 papers
Top 4%
0.8%
25
iScience
based on 74 papers
Top 6%
0.8%
26
Nature Genetics
based on 72 papers
Top 8%
0.8%
27
Cancer Epidemiology, Biomarkers & Prevention
based on 14 papers
Top 3%
0.8%
28
Cell Reports Medicine
based on 49 papers
Top 6%
0.7%
29
Nature
based on 58 papers
Top 10%
0.7%
30
Nature Medicine
based on 88 papers
Top 18%
0.7%