Back

Cancers

MDPI AG

Preprints posted in the last 7 days, ranked by how well they match Cancers's content profile, based on 200 papers previously published here. The average preprint has a 0.22% match score for this journal, so anything above that is already an above-average fit.

1
Quantitative and qualitative patient-reported analysis of misdiagnosis and/or late diagnosis of metastatic lobular cancer

Cody, M. E.; Chang, H.-C.; Foldi, J.; Jankowitz, R. C.; Balic, M.; Cushing, T.; Donnelly, C.; Freeney, S.; Levine, J.; Petitti, L.; Ryan, N.; Spencer, K.; Turner, C.; Tseng, G. C.; Desmedt, C.; Oesterreich, S.; Lee, A. V.

2026-04-20 oncology 10.64898/2026.04.16.26348799 medRxiv
Top 0.3%
12.0%
Show abstract

BackgroundInvasive lobular breast cancer (ILC) is the most commonly diagnosed special histological subtype of breast cancer (BC). Metastatic ILC (mILC) is less sensitive to FDG-PET imaging and often metastasizes to unusual sites --peritoneum, gastrointestinal (GI) tract, ovaries, urinary tract, and orbit--which may go unrecognized after a long disease-free interval. Some metastatic sites cause nonspecific symptoms, like abdominal/epigastric pain, with numerous published case reports of mILC misdiagnosed as gastric cancer. These atypical BC metastatic sites may lead to late and/or misdiagnosis, thereby delaying effective treatments. ObjectiveWe developed a patient survey to investigate the patient-reported prevalence of delayed diagnosis or misdiagnosis of mILC and their potential impact upon treatment outcomes. MethodsA 45-question survey was developed and piloted with breast cancer researchers, clinical oncologists, and patient advocates. This IRB-approved survey was then distributed to patients with ILC. Analyses including data QC and visualization were conducted in R using descriptive statistics. Incomplete or inconsistent responses were excluded, and summary statistics were stratified by four common mILC sites to highlight subgroup differences. Results525 patient surveys were completed, with 450 patients diagnosed with ILC, and of those 321 diagnosed with mILC. For those with mILC, 33.3% (n=107) were diagnosed with de novo mILC at initial presentation. Of the patients diagnosed with mILC, 32.1% (n=103) presented with other medical conditions at diagnosis. Misdiagnosis was reported by 26.2% (n=84) of patients with mILC, and of these cases, 31% (n=26) had [≥]2 misdiagnoses. The top 5 misdiagnoses were bone-related condition (24.7%), benign breast condition (23.4%), another type of BC (7.8%), diagnostic delay (7.8%), and menopause related (5.2%). 44.5% of patients waited [≥]1 year for an accurate diagnosis. 49 patients were treated for their misdiagnosis, and 6 received incorrect cancer treatments. The most frequently reported contributors to delayed or misdiagnosis were inconclusive imaging, providers lack of ILC knowledge, and initial misdiagnosis. Of the 321 patients with mILC, 138 (42.9%) reported symptoms before diagnosis; the most common were back pain (16.5%), fatigue/malaise (14.9%), GI symptoms (11.8%), bloating (8.4%), and weight loss (8.1%). Although 40% of patients reported having a mammogram at the time of their initial misdiagnosis, ILC was detected in only 20.5% (24/116) of these cases, and mammography detected only 5 (25%) of the 20 de novo mILC cases. Patients reported additional diagnostic testing within 1-3 months of their initial mammogram, includingbiopsy, ultrasound (US), and MRI. 47.9% of patients were in active BC surveillance after curative intent therapy at the time of their mILC diagnosis; however, no statistical difference was seen in time to diagnosis versus those patients not under surveillance. ConclusionOur survey results underscore the urgent need to improve diagnostic strategies for mILC. Addressing delays and diagnostic errors in mILC is critical to optimizing treatment strategies and improving patient outcomes.

2
Clinical outcomes and prognostic factors of low-grade serous ovarian cancer: A single-centre observational retrospective study

Prakash, R.; Khan, A.; Shahbazian, L.; Arthur, A.; Levin, G.; Gilbert, L.; Telleria, C. M.

2026-04-20 oncology 10.64898/2026.04.17.26351112 medRxiv
Top 0.4%
10.1%
Show abstract

ObjectiveThe purpose of the present study is to describe the survival outcomes of patients with low-grade serous ovarian cancer (LGSOC) in the post-operative setting from a tertiary gynecologic oncology referral centre in Quebec, including evaluation of patient characteristics, clinical outcomes and prognostic factors. MethodsThe study included 25 patients: 1) with a post-surgical histopathologic diagnosis of a low-grade serous tumour of the ovary, 2) underwent primary cytoreductive surgery prior to adjuvant therapy, and 3) for whom clinical data was available. Clinical and demographic features were characterized by descriptive statistics. Clinical endpoints of progression-free survival (PFS) and overall survival (OS) were assessed, utilizing the Kaplan-Meier method for estimating survival probabilities. ResultsThe median age of this cohort was 61 years (range, 26-81). Median OS was 140.6 months in patients with no residual disease (R0), 71 months in patients with microscopic residual disease (R1), and 27.7 months in patients with macroscopic residual disease (R2) (p=.001). Residual disease was also found to significantly impact PFS (p=.008). Administration of adjuvant chemotherapy failed to improve survival outcomes altogether (PFS, p = .270; OS, p = .300). ConclusionsThis study supports the shifting consensus that optimal cytoreductive surgery, where feasible, is paramount for successful treatment of LGSOC. Furthermore, treatment with adjuvant chemotherapy may lead to worse survival outcomes.

3
Histology-Derived Signatures Predict Recurrence Risk and Chemotherapy Benefit in Randomized Trials of Early Breast Cancer

Howard, F. M.; Li, A.; Kochanny, S.; Sullivan, M.; Flores, E. M.; Dolezal, J.; Khramtsova, G.; Hassan, S.; Medenwald, R.; Saha, P.; Fan, C.; McCart, L.; Watson, M.; Teras, L. R.; Bodelon, C.; Patel, A. V.; Symmans, W. F.; Partridge, A.; Carey, L.; Olopade, O. I.; Stover, D.; Perou, C.; Yao, K.; Pearson, A. T.; Huo, D.

2026-04-24 oncology 10.64898/2026.04.23.26351499 medRxiv
Top 0.4%
10.0%
Show abstract

Purpose: To test whether histology-derived gene-expression signatures from routine hematoxylin and eosin slides are prognostic for recurrence and predictive of chemotherapy benefit in early breast cancer. Methods: We conducted a multi-cohort study including CALGB 9344 (anthracycline +/- paclitaxel), CALGB 9741 (standard vs dose-dense chemotherapy), a pooled Chicago real-world cohort, and the American Cancer Society (ACS) Cancer Prevention Studies-II and -3. Whole-slide images were processed with a previously described pipeline to generate 61 histology-derived signatures per patient. The primary endpoint was distant recurrence-free interval (DRFI), except in ACS, where breast cancer-specific survival was used. Secondary endpoints include distant recurrence-free survival (DRFS) and overall survival. The most prognostic signature in CALGB 9344, selected by Harrell's C-index, was evaluated in additional cohorts. Signature-treatment interaction was assessed by likelihood-ratio tests. Multivariable Cox models incorporating age, tumor size, nodal status, estrogen/progesterone receptor status, and signature were fit in CALGB 9344 to improve risk stratification. Results: A total of 7,170 patients were included across four cohorts. The top histology-derived signature in CALGB 9344 showed strong prognostic performance for 5-year DRFI (C-index 0.63) and performed well across validation cohorts (C-index 0.60, 0.70, and 0.62 in CALGB 9741, Chicago, and ACS, respectively). The strongest predictive signal for treatment benefit was observed for DRFS. High-risk cases identified by the signature demonstrated greater benefit from taxane in CALGB 9344 (adjusted hazard ratio [aHR] 0.76 for DRFS, 95% CI 0.66-0.88; interaction p=0.028), from dose-dense chemotherapy in CALGB 9741 (aHR 0.69, 95% CI 0.56-0.85; interaction p=0.039), and differential chemotherapy benefit in the Chicago cohort (aHR 0.84, 95% CI 0.59-1.21; interaction p=0.009). Combined clinical-histology models improved risk stratification and identified low-risk groups with a 2%-10% risk of distant recurrence or breast cancer death. Conclusion: Histology-derived signatures from H&E images are broadly prognostic and, unlike clinical factors, may predict chemotherapy benefit.

4
PTHrP drives aggressive traits in colorectal cancer cells: Implications of tumor-stromal cells

Novoa Diaz, M. B.; Carriere, P. M.; Birkenstok, C.; Gonzalez Osorio, S.; Zwenger, A.; Contreras, H.; Gentili, C.

2026-04-21 cancer biology 10.64898/2026.04.16.718950 medRxiv
Top 0.7%
6.9%
Show abstract

In the tumor microenvironment (TME), dynamic interactions between cells and soluble factors promote tumor progression. We previously demonstrated that parathyroid hormone-related peptide (PTHrP), a TME-associated cytokine, enhances the aggressive phenotype of HCT116 colorectal cancer (CRC) cells, and that conditioned medium from PTHrP-treated HMEC-1 endothelial stromal cells (CM) induces epithelial-to-mesenchymal transition (EMT) in CRC cells. Here, Western blot analysis showed that CM modulates Met receptor expression and activation and promotes cancer stem cell (CSC) traits in HCT116 cells. Since PTHrP induces CPT-11 chemoresistance through Met signaling, we investigated the involvement of the CM-Met axis in this process. Viability assays revealed that CM increases cell number and confers CPT11 resistance through Met activation. Transforming growth factor beta 1 (TGF{beta}1), upregulated in PTHrP-treated HMEC-1 cells, was evaluated as a potential mediator. Its neutralization reversed the CM-induced increase in cell number but did not affect chemoresistance. In silico analyses revealed differences between CRC and normal tissues related to TGF{beta}1 signaling and Met activation, along with positive correlations among the analyzed markers. Immunohistochemical observation of human samples is consistent with our previous findings. Overall, these findings support a role for PTHrP in promoting CRC aggressiveness through coordinated effects on tumor and stromal compartments

5
CDK4/6 inhibitors enhance oxaliplatin efficacy in colorectal cancer with RB-dependent and tumor-selective activity in intestinal model

Souza, A. S. O.; Conceicao, J. S. M.; Ferraz, L. S.; Delou, J. M. A.; Miranda, B. R.; Verissimo, C.; Carneiro, M. S. C.; Rehen, S.; Bonamino, M. H.; Borges, H. L.

2026-04-19 cancer biology 10.64898/2026.04.15.718743 medRxiv
Top 0.9%
6.3%
Show abstract

Although the retinoblastoma protein (pRB) is functionally inactivated by hyperphosphorylation in the majority of colorectal cancers (CRC) - with RB1 rarely mutated and even amplified at the genomic level - three critical gaps remain unaddressed: no study has systematically compared which first-line chemotherapeutic agent best synergizes with CDK4/6 inhibition using head-to-head quantitative analysis; functional differences between palbociclib and abemaciclib in chemotherapy combinations have not been characterized in CRC; and direct genetic evidence of RB dependency in this combinatorial context is lacking. Here, we addressed these gaps by evaluating palbociclib and abemaciclib combined with oxaliplatin, 5-fluorouracil, and SN-38 in HCT116 CRC cells, with validation in SW480 cells, RB1-silenced HCT116 cells (shRNA-RB), and non-tumoral intestinal epithelial cells (IEC-6), using quantitative drug interaction analysis (Chou-Talalay), cell cycle profiling, apoptosis assessment, and pRB phosphorylation measurement. Oxaliplatin was the most consistently synergistic partner for both CDK4/6 inhibitors (CI < 1 across all tested concentrations), while combinations with SN-38 yielded variable results and 5-FU combinations approached additivity. The oxaliplatin combination reinforced G1 arrest and enhanced cell death, with abemaciclib producing more pronounced apoptotic induction than palbociclib - an effect not explained by differential pRB target engagement (both inhibitors reduced pRB Ser807/811 phosphorylation by [~]50%), likely reflecting abemaciclibs broader kinase inhibitory profile. shRNA-mediated RB1 silencing partially attenuated the combinatorial effect, providing direct genetic evidence that the synergy is RB-dependent. Importantly, the combination did not significantly potentiate oxaliplatin cytotoxicity in non-tumoral IEC-6 intestinal epithelial cells, in contrast to the pronounced enhancement observed in tumor cells, and synergistic benefit was preserved at sub-cytotoxic inhibitor concentrations. These findings identify oxaliplatin as the optimal chemotherapeutic partner for CDK4/6 inhibition in CRC, with a mechanism involving RB-dependent potentiation of apoptosis that is preferentially active against tumor cells and maintained at clinically relevant inhibitor doses.

6
Mechanistic learning to predict and understand minimal residual disease

Marzban, S.; Robertson-Tessi, M.; West, J.

2026-04-21 cancer biology 10.64898/2026.04.16.718968 medRxiv
Top 1%
4.8%
Show abstract

Mechanistic modeling has long been used as a tool to describe the dynamics of biological systems, especially cancer in response to treatment. Their key advantage lies in interpretability of relationships between input parameters and outcomes of interest. In contrast, machine learning techniques offer strong prediction performance, especially for high dimensional datasets that are common in oncology. Here, we employ a Mechanstic Learning framework that combines the advantages of both approaches by training machine learning models on mechanistic parameters inferred from clinical patient data. The mechanistic model (a Markov chain model) contains sixteen parameters that describe the rate of cell fate transitions that occur in patients with B-cell precursor acute lymphoblastic leukemia. The machine learning (a ridge logistic regression model) is trained on these parameters to predict two clinically-relevant features: BCR::ABL1 fusion gene status (positive or negative) and minimal residual disease status (positive or negative) post-induction chemotherapy. Model training is done in an iterative fashion to assess which (and how many) parameters are critical to maintain high predictive performance. Using machine learning models trained on the clinical flow-cytometry data, we find that the stem-like cell state alone is the most predictive feature for both BCR::ABL1-positive and MRD-positive disease, with combination scores (defined as the average of accuracy, balanced accuracy, and area under the curve) of 0.80 and 0.67, respectively. By comparison, mechanistic learning achieves comparable or improved combination scores for BCR::ABL1-positive and MRD-positive disease, with scores of 0.81 and 0.71, respectively, using only de-differentiation for BCR::ABL1 and primitive-state persistence together with differentiation-directed exit for MRD. Thus, the mechanistic-learning approach not only preserves predictive performance, but also provides a biological hypothesis for why stemness is predictive of these clinically relevant outcomes.

7
Integrated Single-Cell and Spatial Profiling of MMP Gene Expression in Colorectal Cancer

Danese, N. A.; Kurkcu, S. R.; Bleiler, M.; Nito, K.; Kuo, A.; Rosenberg, D. W.; Nakanishi, M.; Giardina, C.

2026-04-21 cancer biology 10.64898/2026.04.17.719089 medRxiv
Top 1%
4.8%
Show abstract

Increased matrix metalloproteinase (MMP) expression has long been recognized as a common feature of colorectal cancers (CRCs), yet less is known about how these enzymes interact to impact cancer progression. Taking advantage of single-cell and spatial transcriptomic data, we analyzed the cell-type-specific and spatial expression of MMPs in CRCs. Distinct colon cancer-associated fibroblast (CAF) subtypes were found to express different MMP combinations, including MMP1/3-expressing and MMP11-expressing CAFs. Conversely, myeloid cells (monocytes, macrophages, and dendritic cells) expressed varying levels of the "myeloid MMPs" 9, 12, and 14, which correlated closely with secretory gene expression. Finally, a small population of cancer cells expressed high levels of MMP7. The MMP7-expressing cancer cells frequently co-expressed MMP1, MMP14, and several Wnt-related genes, consistent with a cancer cell type at high risk of malignancy and metastasis. Spatial transcriptomic data showed MMP expression in discernible clusters driven in part by cell-type localization, including fibroblast-heavy stromal regions and inflammatory cell hubs. Epithelial-rich areas showed subregions of MMP7-expressing cancer cells, including areas where cancer cell and myeloid MMP expression overlap. Tumors showed a wide variation in MMP1-expressing CAFs, a variation reflected in primary CAF cell lines. In vitro, MMP1 expression was a stable phenotype that persisted through multiple rounds of division. MMP1-expressing CAFs were frequently positioned at the stromal interface, suggesting a role in facilitating cell movement across the tumor boundary. Our analysis indicates that cell-type and positional MMP expression varies between tumors and may play a role in determining lesion progression and cancer spread.

8
Racioethnic Disparities in Risk of Cardiometabolic Risk Factors and Cardiovascular Disease among Women Treated for Breast Cancer: The Pathways Heart Study

Yao, S.; Zimbalist, A.; Sheng, H.; Fiorica, P.; Cheng, R.; Medicino, L.; Omilian, A.; Zhu, Q.; Roh, J.; Laurent, C.; Lee, V.; Ergas, I.; Iribarren, C.; Rana, J.; Nguyen-Huynh, M.; Rillamas-Sun, E.; Hershman, D.; Ambrosone, C.; Kushi, L.; Greenlee, H.; Kwan, M.

2026-04-24 epidemiology 10.64898/2026.04.23.26351612 medRxiv
Top 1%
4.4%
Show abstract

Background: Few studies have examined racioethnic disparities in cardiovascular disease (CVD) in women after breast cancer treatment, who are at higher risk due to cardiotoxic cancer treatment. Methods: Based on the Pathways Heart Study of women with a history of breast cancer, this analysis examines the association between cardiometabolic risk factors (hypertension, diabetes, and dyslipidemia) and CVD events with self-reported race and ethnicity, as well as genetic similarity. Multivariable logistic and Cox proportional hazards regression models were used to test race and ethnicity and genetic similarity with prevalent and incident cardiometabolic risk factors and CVD events. Results: Of the 4,071 patients in this analysis, non-Hispanic Black (NHB), Asian, and Hispanic women were more likely to have prevalent and incident diabetes than non-Hispanic White (NHW) women. Analysis of genetic similarity revealed results consistent with self-reported race and ethnicity. For CVD risk, NHB women were more likely to develop heart failure and cardiomyopathy than NHW women. In contrast, Hispanic women were at lower risk of any incident CVD, serious CVD, arrhythmia, heart failure or cardiomyopathy, and ischemic heart disease, which was consistent with the associations found with Native American ancestry. Conclusions: This is the largest multi-ethnic study of disparities in CVD health in breast cancer survivors, demonstrating corroborating findings between self-reported race and ethnicity and genetic similarity. The results highlight disparities in cardiometabolic risk factors and CVD among breast cancer survivors that warrant more research and clinical attention in these distinct, high-risk populations.

9
Attention-Guided CNN Ensemble for Binary Classification of High-Grade and Low-Grade Serous Ovarian Carcinoma from Histopathological WSI Patches

rani, a.; mishra, s.

2026-04-22 oncology 10.64898/2026.04.21.26351441 medRxiv
Top 1%
4.0%
Show abstract

Accurate histopathological differentiation between High-Grade Serous Carcinoma (HGSC) and Low-Grade Serous Carcinoma (LGSC) remains a critical yet challenging aspect of ovarian cancer diagnosis due to their similar morphology and different clinical outcomes. This study presents a deep learning framework that uses custom attention mechanisms, including the Convolutional Block Attention Module (CBAM), Squeeze-and-Excitation (SE) blocks, and a Differential Attention module within five CNN architectures for automated binary classification of ovarian cancer subtypes from H&E WSI patches. Although individual models achieved higher accuracy, the ensemble stacking framework with a shallow MLP meta-learner delivered the best overall performance, with a ROC-AUC of 0.9211, an accuracy of 0.85, and F1-scores of 0.84 and 0.85 across both subtypes. These findings demonstrate that attention-guided feature recalibration combined with ensemble stacking provides robust and clinically interpretable discrimination of ovarian carcinoma subtypes.

10
Tumor Biology and Patterns of Recurrence in High-Grade Glioma: Implications for Radiation Target Delineation

Barve, R.; Gowda, D.; Illiayaraja, K. J.

2026-04-25 oncology 10.64898/2026.04.23.26351633 medRxiv
Top 1%
3.9%
Show abstract

Abstract: Purpose: Recurrence in high grade glioma (HGG) predominantly occurs within the high dose radiation field, raising the question of whether treatment failure reflects limitations in radiation target delineation or is driven by intrinsic tumor biology. This study evaluated recurrence patterns following standard chemoradiotherapy and their treatment implications. Material and Methods: This retrospective single center study included 41 patients with histologically confirmed HGG treated with surgery followed by radiotherapy with concurrent and adjuvant temozolomide (TMZ). Patients were followed through August 2018; those with recurrence were included in the analysis. Recurrence patterns were classified based on their spatial relationship to the 60 Gy isodose line as central, infield, marginal, or distant. Survival outcomes were estimated using the Kaplan-Meier method and compared using the log rank test. Results: The most common pattern of recurrence was central (15 patients, 36.5%), followed by infield (11, 26.8%), distant (6, 14.6%), marginal (5, 12.1%), and multicentric (4, 9.8%). Central and in field recurrences (local failures) accounted for 26 patients (63%). Median overall survival (OS) was 27 months, and median progression-free survival (PFS) was 12 months. Survival differed significantly by recurrence pattern (log-rank p = 0.018), with marginal recurrence associated with more favorable outcomes. Conclusion: The predominance of central and infield recurrences within the high-dose region suggests that treatment failure in HGG is not solely explained by inadequate target delineation and may also be driven, in part, by intrinsic tumor biology, including radioresistant subpopulations and tumor heterogeneity. Future strategies may benefit from incorporating biologically guided approaches alongside optimization of radiation treatment parameters.

11
A Cross-Cohort Validated Plasma Lipid Biomarker Assay for Early Breast Cancer Detection Using Machine Learning

Huang, T.; Koch, F. C.; Peake, D. A.; Adam, K.-P.; David, M.; Li, D.; Heffernan, K.; Lim, A.; Hurrell, J. G.; Preston, S.; Baterseh, A.; Vafaee, F.

2026-04-23 oncology 10.64898/2026.04.23.26351564 medRxiv
Top 1%
3.7%
Show abstract

Early detection of breast cancer remains essential for improving clinical outcomes, and complementary non-invasive approaches are needed to support existing screening methods, particularly for women with dense breast tissue. We have previously reported plasma lipid biomarker discovery using untargeted high-resolution liquid chromatography tandem mass spectrometry (LC-MS/MS). In this study, we performed biomarker confirmation and developed machine-learning models applied to targeted plasma lipid measurements for the non-invasive detection of early-stage breast cancer across international cohorts with independent external validation. Targeted LC-MS/MS was used to quantify candidate lipid panels in plasma samples from European discovery cohorts (n = 554) and an independent Australian cohort (n = 266) used for external validation. Data-driven feature selection identified a 15-lipid panel with strong performance in European cohorts (AUC >= 0.94). External validation prior to confidence stratification yielded 76% sensitivity, 64% specificity, and an AUC of 0.81 in the Australian validation cohort. Clinical assay development requires iterative panel and model testing to support translational feasibility and performance in the intended-use population. An analytically viable panel, excluding lipids requiring complex and costly synthesis, achieved comparable accuracy with improved assay robustness. Confidence-based analysis showed enhanced performance for predictions made with moderate to high confidence, with sensitivity up to 89% and AUC up to 0.85, suggesting that ongoing research should focus on strategies to enhance diagnostic model confidence. Importantly, model predictions were independent of breast density, tumour size, grade, subtype, and morphology, indicating biological specificity of the lipid signature. These results demonstrate that calibrated machine-learning models applied to plasma lipid biomarkers can support non-invasive breast cancer detection. Expanding training datasets to include greater diversity will further improve performance in the ongoing development of this lipid-based detection approach.

12
Diminished sex hormone levels influence the risk of skewed X chromosome inactivation

Roberts, A. L.; Osterdahl, M. F.; Sahoo, A.; Pickles, J.; Franklin-Cheung, C.; Wadge, S.; Mohamoud, N. A.; Morea, A.; Amar, A.; Morris, D. L.; Vyse, T. J.; Steves, C. J.; Small, K. S.

2026-04-22 genetic and genomic medicine 10.64898/2026.04.20.26351303 medRxiv
Top 2%
3.7%
Show abstract

BackgroundX chromosome inactivation (XCI) is the mechanism which randomly silences one X chromosome to equalise gene expression between 46, XX females and 46, XY males. Though XCI is expected to result in a random pattern of mosaicism across tissues, some females display a significantly unbalanced ratio in immune cells, termed XCI-skew, in which [&ge;]75% of cells have the same X inactivated. XCI-skew is associated with adverse health outcomes and its prevalence increases with age - particularly after midlife - yet the specific risk factors have yet to be identified. The menopausal transition, which is driven by profound shifts in sex hormone levels, has significant impact on chronic disease risk yet the molecular and cellular effects are incompletely understood. We hypothesised that the menopausal transition may impact XCI-skew. MethodsUsing XCI data measured in blood-derived DNA from 1,395 females from the TwinsUK population cohort, along with questionnaires, genetic data, and sex hormone measures, we carried out a cross-sectional study to assess the impact of the menopausal transition and sex hormones on XCI-skew. ResultsWe demonstrate that early menopause (<45yrs) is associated with increased risk of XCI-skew. In subset analyses across those who had a surgically induced or natural menopause, we find the association restricted to those who underwent a surgical menopause. We next identify a low polygenic score (PGS) for testosterone levels is significantly associated with XCI-skew, which we replicate in an independent dataset (n=149), while a PGS for age at natural menopause is not associated. Finally, using longitudinal measures across two time points spanning [~]18 years we show XCI-skew is a stable cellular phenotype that typically increases over time. DiscussionThese data represent the first environmental and genetic risk factors of XCI-skew, both of which implicate endogenous sex hormone levels, particularly testosterone. We propose XCI-skew may have clinical relevance in postmenopausal females.

13
Semaglutide is associated with improved breast cancer survival, lower metastatic burden, and a dose-survival relationship uncoupled from weight-loss magnitude

Murugadoss, K.; Venkatakrishnan, A. J.; Soundararajan, V.

2026-04-24 oncology 10.64898/2026.04.23.26351609 medRxiv
Top 2%
3.6%
Show abstract

Metabolic dysfunction is increasingly recognized as a risk factor for poor outcomes in breast cancer, but whether incretin-based therapies confer survival benefit beyond weight loss remains unresolved. Using a federated electronic health record platform spanning nearly 29 million patients, we evaluated breast cancer survival after semaglutide and tirzepatide initiation in routine care. In 1:1 propensity-matched pooled-comparator analyses, semaglutide was associated with improved overall survival versus metformin, sodium-glucose cotransporter 2 (SGLT2) inhibitor, and dipeptidyl peptidase 4 (DPP4) inhibitor users, with 54 deaths among 2,433 semaglutide users (2.2%) versus 395 deaths among 2,433 comparators (16.2%) over 24 months (log-rank P < 0.001). Tirzepatide showed a favorable survival association relative to pooled anti-diabetic comparators that did not meet statistical significance (P = 0.24), with 3 deaths among 220 users (1.4%) versus 64 deaths among 220 comparators (29.1%). In a head-to-head propensity-score-matched comparison, overall survival did not differ significantly between semaglutide and tirzepatide treated patients with pre-existing breast cancer (2,117 per arm; P = 0.12). In semaglutide-treated patients alive and observable at the 1-year landmark, higher maximum dose achieved was significantly associated with lower post-landmark mortality (P = 0.034), with an event rate of approximately 1.0% in the high-dose group (>=1.7 mg) versus approximately 4.5% in the low-dose group (0.25-1.0 mg). Despite a linear dose weight loss relationship for semaglutide, however, weight loss strata did not separate survival outcomes (global P = 0.22). In tirzepatide-treated patients alive and observable at the same landmark, neither maximum dose achieved nor weight loss strata separated post-landmark survival (P = 0.98 and P = 0.50, respectively). Structured EHR and AI-based clinical note analyses further showed significantly lower frequency of documented metastatic disease in semaglutide-treated patients relative to pooled anti-diabetic comparators, including any metastasis (7.0% versus 15.0%, rate ratio 0.5, P < 0.001), bone metastasis (1.0% versus 5.2%, rate ratio 0.2, P < 0.001), and liver, lung, or brain metastases (all P < 0.001). LLM-derived cause-of-death extraction further showed a 60% lower relative proportion of cancer-associated deaths in semaglutide-treated patients (19% of ascertainable deaths) than in matched pooled anti-diabetic comparators (47% of ascertainable deaths), with comparator deaths more often attributed to cancer progression involving metastatic breast cancer, leptomeningeal carcinomatosis, and cancer-driven organ failure. Overall, this study demonstrates that semaglutide use in patients with pre-existing breast cancer is associated with a dose correlated but weight loss independent improvement in overall survival. These findings motivate prospective trials of GLP-1 receptor agonists in breast cancer across various stages and treatment settings.

14
In Silico study of clinical implication of markers associated with PTHrP regulatory mechanisms and linked to angiogenesis and EMT program of colorectal cancer

Carriere, P. M.; Novoa Diaz, M. B.; Birkenstok, C.; Gentili, C.

2026-04-20 cancer biology 10.64898/2026.04.15.718767 medRxiv
Top 2%
3.5%
Show abstract

Parathyroid hormone-related peptide (PTHrP), encoded by PTHLH, has been implicated in tumor progression through its involvement in epithelial-mesenchymal transition (EMT), angiogenesis, and tumor cell migration. Previous experimental studies suggest that PTHrP may promote these processes in colorectal cancer (CRC), partly through the modulation of factors such as secreted protein acidic and rich in cysteine (SPARC) and vascular endothelial growth factor (VEGFA). These events play a key role in the acquisition of an aggressive phenotype in our experimental models. In this study, we performed an integrative in silico analysis of multiple transcriptomic datasets to investigate the potential role of PTHLH in CRC. Differential expression analysis identified a set of consistently dysregulated genes across independent datasets. Functional enrichment and network analyses revealed that PTHLH expression is associated with biological processes related to extracellular matrix remodeling, EMT, and angiogenesis. Correlation analyses showed a positive association between PTHLH and SPARC expression, while network-based approaches suggested a potential functional connection with VEGFA. To assess the clinical relevance of these findings, survival analysis was performed using publicly available datasets. High expression levels of PTHLH, SPARC, and VEGFA were significantly associated with reduced overall survival in patients. Notably, a combined gene signature based on these three factors demonstrated a stronger prognostic effect than individual genes, indicating enhanced predictive value. These findings suggest that PTHrP is associated with molecular pathways involved in tumor progression and, together with SPARC and VEGF, may contribute to a coordinated regulatory axis with prognostic relevance in CRC, warranting further experimental validation.

15
Elucidation of putative key genes involved in the regulation of triple negative breast cancer development and progression

Kumar, A.; Upadhyay, G. S.; Kashif, M.; Malik, M. Z.; Subbarao, N.; Rajala, M. S.

2026-04-20 cancer biology 10.64898/2026.04.15.718835 medRxiv
Top 2%
3.1%
Show abstract

The molecular basis of triple-negative breast cancer (TNBC), a highly aggressive and therapy-resistant subtype of breast cancer, is poorly understood. This study aims to identify key genes and pathways involved in TNBC development and progression using a systems biology approach followed by experimental validation. Here, two transcriptome microarray datasets from the GEO database were analysed using the R package LIMMA to detect differentially expressed genes (DEGs) in TNBC tumors. Gene Ontology (GO) and Kyoto Encyclopaedia of Genes and Genomes (KEGG) enrichment analyses using the DAVID database were performed to identify DEGs regulated biological functions and pathways. Further, a protein-protein interaction (PPI) network was constructed using the STRING online database, and the topological properties were determined using MCODE and Cytohubba plug-ins. The expression and the prognostic value of the hub genes were validated using the Cancer Genome Atlas (TCGA) survival analysis. We found 727 DEGs, of which 473 were downregulated and 254 were upregulated in TNBC vs. non-TNBC samples. The GO and KEGG analyses indicated that the DEGs were mainly related to cell adhesion, tumorigenesis, and cellular immunity. The PPI network had shown six hub genes, namely CCND1, CDH1, ESR1, FN1, IL6, and PPARG, as the top key regulators. All these genes were validated by quantitative real-time PCR in the TNBC cell line using non-TNBC cell line as a calibrator, and the obtained results were in accordance with the bioinformatics data. This information may contribute to understanding the various molecular mechanisms that drive the development and progression of TNBC tumors.

16
A catalogue of missense and nonsense mutation abundances for the U.S. cancer patient population

Arun, A.; Liarakos, D.; Mendiratta, G.; McFall, T.; Hargreaves, D. C.; Wahl, G. M.; Hu, J.; Stites, E. C.

2026-04-22 oncology 10.64898/2026.04.20.26351248 medRxiv
Top 2%
3.0%
Show abstract

Widespread genomic sequencing efforts have characterized the molecular foundations of the different cancers. By combining these genomic data in a manner proportional to the population-level abundances of these different cancers, we estimate the overall abundances of each observed missense and nonsense mutation within the U.S. cancer patient population. We find BRAF V600E (5.2%) is the most common mutation in the cancer patient population, TP53 R175H (1.5%) is the most common tumor suppressor mutation, and APC R876X (0.4%) is the most common nonsense mutation. These values differ largely and significantly from what would be found in a typical pan-cancer analysis, where different cancer types are included out of proportion to population level incidence. We present the full ordered lists of population-level abundances for specific missense and nonsense mutations, and we demonstrate the value of these data by further analyzing high priority genes (e.g., TP53, KRAS, BRAF) and pathways (e.g., RTK/RAS, PI3K, and WNT/{beta}-catenin). Overall, this information is a resource that should benefit the basic science, translational, and clinical cancer research communities.

17
Comparing Gleason Pattern 4 Measurement Approaches on Prostate Biopsy Using Machine Learning: A Proof-of-Principle Study

Buzoianu, M. M.; Yu, R.; Assel, M.; Bozkurt, A.; Aghdam, H.; Fine, S.; Vickers, A.

2026-04-24 oncology 10.64898/2026.04.23.26351615 medRxiv
Top 2%
1.9%
Show abstract

Objective: To demonstrate the proof of principle that machine learning (ML) can be used to quantify Gleason Pattern (GP) 4 on digitized biopsy slides using multiple measurement approaches, allowing direct comparison of their prognostic performance. Methods: We assembled a convenience sample of 726 patients with grade group 2-4 prostate cancer on systematic biopsy who underwent radical prostatectomy between 2014 and 2023. Digitized biopsy slides were analyzed using a machine-learning algorithm (PAIGE-AI) to quantify GP4 using multiple measurement approaches, particularly with respect to how gaps between cancer foci (interfocal stroma) were handled. GP4 extent was quantified using linear measurements or a pixel-based area metric. Discrimination of each GP4 quantification approach, along with Grade Group (GG), was assessed for adverse radical prostatectomy pathology and biochemical recurrence. Results: We identified 15 different quantification approaches and observed differences between their discrimination. The highest discrimination was in the pixel-counting method (AUC 0.648). GP4 quantification outperformed GG for predicting adverse pathology (AUC 0.627 vs 0.608). Amount of GP3 was non-predictive once GP4 was known. These findings were consistent for BCR. Conclusions: We were able to measure slides using 15 distinct measurement approaches and replicated prior findings using ML to quantify GP4. Our findings support the use of ML as a research tool to compare different GP4 quantification approaches. We intend to use our method on larger cohorts to determine with which measurement approach best predicts oncologic outcome.

18
Phase 1a Evaluation of LP-184 in Recurrent Glioblastoma: Safety, Pharmacokinetics, and Translational Optimization of CNS Exposure

Schreck, K.; Lal, B.; Zhou, J.; Lopez Bertoni, H.; Holdhoff, M.; Ewesudo, R.; Bhatia, K.; Chamberlain, M.; Laterra, J.

2026-04-24 oncology 10.64898/2026.04.21.26351406 medRxiv
Top 3%
1.8%
Show abstract

Purpose: Limited CNS bioavailability and pharmacodynamics are obstacles to effective systemic therapies for glioblastoma. One strategy to overcome these challenges is drug combinations enhancing CNS penetration and/or tumor chemosensitivity. LP-184, a synthetic acylfulvene class alkylator, induces DNA damage and inhibits glioblastoma cell viability in pre-clinical models. LP-184 is a prodrug converted to active metabolites by intracellular prostaglandin reductase 1 (PTGR1) that is over-expressed in >70% of glioblastoma. DNA damage induced by LP-184 is MGMT agnostic and reversed by transcription-dependent NER. Patients: LP-184 was evaluated in a Phase 1a study (NCT05933265) in 63 adult patients with advanced malignancies including 16 patients with recurrent glioblastoma. All patients with glioblastoma received prior standard-of-care therapy and most had received 1 or more additional therapies before enrollment. Results: Patients with glioblastoma experienced more frequent transaminitis, Grade 1-2 nausea and a trend towards more frequent and severe thrombocytopenia compared to the non-glioblastoma cohort. Otherwise, overall toxicity profiles were similar. Clinical pharmacokinetic analysis combined with published pre-clinical intra-tumoral bioavailability data (~20% penetration) predicted that LP-184 at the recommended dose for expansion (RDE) would achieve cytotoxic levels if combined with spironolactone, a BBB permeable ERCC3 degrader and TC-NER inhibitor that sensitizes glioblastoma cells to LP-184 3-6-fold. We show that three daily doses of spironolactone deplete orthotopic glioblastoma PDX ERCC3 protein by ~ 80% and increases tumor LP-184 cytotoxicity 2-fold. Conclusions: LP-184 is well tolerated at the RDE, and we establish a clinically translatable scheme for dosing spironolactone in combination with LP-184 for a future Phase 1b clinical trial.

19
Estimation of cancer cases in transgender and gender diverse people in England

Pasin, C.; Jackson, S. S.; Thynne, L.-E.; McWade, B.; Westerman, T.; Ball, R.; Kavanagh, J.; O'Callaghan, S.; Ring, K.; Orkin, C.; Berner, A. M.

2026-04-22 oncology 10.64898/2026.04.21.26351378 medRxiv
Top 3%
1.8%
Show abstract

ObjectivesTo estimate current, and 5- and 10-year projected, number of cases of cancer per year in transgender and gender diverse (TGD) people in England, overall and by tumour type, accounting for uptake of gender affirming care (GAC). DesignPopulation-based epidemiological modelling study using an age-stratified Monte Carlo simulations approach and the NORDPRED method for predictions. SettingModels estimating cancer case numbers for TGD people in England based on publicly available 2023 cancer surveillance data and survey-based 2025 GAC access, and predicted at 5 and 10 years hence. ParticipantsTGD people aged 15 years and above. Main outcome measuresPrimary cancer cases per year overall, by gender, age group, tumour type, and current and planned GAC. ResultsThe estimated TGD population size in England is 441547 (95% uncertainty interval (UI) 429207- 452890). Total cases per year of cancer in TGD people is expected to be 966 (95% UI 882-1069) excluding non-melanoma skin. Most cases are expected to occur in people aged 60-64. The top 5 expected cancers in TGD people are breast (19%, n = 187, 95% UI 149-241), colorectal (12%, n = 117, 95% UI 106-129), lung (11%, n = 108, 95% UI 96-122), melanoma (7.1%, n = 69, 95% UI 64-74) and urinary (6.2%, n = 60, 95% UI 54-67). Total cases of cancer in TGD people are estimated to be 1740 (95% UI 1584-1934) in 5 years and 2258 (95% UI 2066-2507) in 10 years (excluding non-melanoma skin). If TGD people were able to access their planned level of GAC, this would reduce these figures to 1555 (95% CI 1386-1766) and 2012 (95% CI 1797-2282) respectively. ConclusionsThis study provides prediction of cancer cases in TGD people in England, supporting the planning of service provision and training. This is vital, as with increasing disclosure, and long wait times for GAC, cancer cases in TGD people are predicted to increase. Summary BoxesO_ST_ABSWhat is already known on this topicC_ST_ABSThe annual number of cases of cancer in transgender and gender diverse (TGD) people in England is currently unknown as gender incongruence is not collected as part of the National Cancer Registration and Analysis Service. Some gender-affirming care (GAC) interventions are known to modulate cancer risk. Use of testosterone and chest reconstruction for transmasculine people is known to reduce their incidence of breast cancer compared to cisgender women. Use of oestradiol alongside medical or surgical androgen suppression has been shown to reduce the incidence of prostate cancer in transfeminine people while increasing their risk of breast cancer, compared to cisgender men. What this study addsThis study found that there are likely to be approximately 966 cases of cancer (excluding non-melanoma skin) in TGD people per year in the UK. Though total annual cases of cancer in TGD people are expected to be 2258 in 10 years, improved access to gender-affirming care could reduce total cases to 2012 (a 11% reduction). These figures provide additional justification for funding to improve access to GAC via the National Health Service (NHS), as well as for training on the oncological needs of this population.

20
Accessible and Reproducible Renal Cell Carcinoma Research Through Open-Sourcing Data and Annotations

de Boer, S.; Häntze, H.; Ziegelmayer, S.; van Ginneken, B.; Prokop, M.; Bressem, K. K.; Hering, A.

2026-04-23 radiology and imaging 10.64898/2026.04.22.26351451 medRxiv
Top 3%
1.7%
Show abstract

Background: Medical imaging, especially computed tomography and magnetic resonance imaging, is essential in clinical care of patients with renal cell carcinoma (RCC). Artificial intelligence (AI) research into computer-aided diagnosis, staging and treatment planning needs curated and annotated datasets. Across literature, The Cancer Genome Atlas (TCGA) datasets are widely used for model training and validation. However, re-annotation is often necessary due to limited access to public annotations, raising entry barriers and hindering comparison with prior work. Methods: We screened 1915 CT scans from three TCGA-RCC databases and employed a segmentation model to annotate kidney lesion. After a meta-data-based exclusion step, we hosted a reader study with all papillary (n=56), chromophobe (n=27) and 200 randomly selected clear cell RCC cases. Two students quality checked and corrected the data as well as annotated tumors and cysts. Uncertain cases were checked by a board-certified radiologist. Results: After data exclusion and quality control a total of 142 annotated CT scans from 101 patients (26 female, 75 male, mean age 56 years) remained. This includes 95 CTs with clear cell RCC, 29 with papillary RCC and 18 with chromophobe RCC. Images and voxel-level annotations of kidneys and lesions are open sourced at https://zenodo.org/records/19630298. Conclusion: By making the annotations open-source, we encourage accessible and reproducible AI research for renal cell carcinoma. We invite other researchers who have previously annotated any of these cohorts to share their annotations.