Oncogene
○ Springer Science and Business Media LLC
Preprints posted in the last 7 days, ranked by how well they match Oncogene's content profile, based on 76 papers previously published here. The average preprint has a 0.07% match score for this journal, so anything above that is already an above-average fit.
Velazquez, D.; Molnar, C.; Reina, J.; Mora, J.; Gonzalez, C.
Show abstract
Ewing sarcoma (EwS) is an aggressive, human-exclusive tumor typically driven by the EWS::FLI1 fusion protein. To assess whether the neomorphic functions of EWS::FLI1 are fundamentally dependent on evolutionarily recent cofactors such as ETS transcription factors (ETS-TFs), Plycomb group (PcG) proteins, CBP/p300, or specific subunits of the BAF complex, we expressed EWS::FLI1 in the model organism Saccharomyces cerevisiae. This minimal system was chosen because several key EWS::FLI 's cofactors possess greatly reduced sequence homology (e.g., BAF) or are lacking altogether (e.g., ETS-TFs, PcG, or CBP/p300). We used co-IP/MS to map the yeast interactome, Chip-Seq to identify gDNA binding sequences, RNA-Seq for global gene expression, and engineered reporters to test conversion of (GGAA) tandem repeats (GGAASat) into neoenhancers. We found that the yeast EWS::FLI1 interactome was more limited and qualitatively distinct from its human counterpart, sharing core machinery (e.g. RNA Polymerase II, FACT) but lacking the BAF/SWI-SNF and spliceosome complexes, and showing strong enrichment for the SAGA chromatin remodeling complex. We also found that EWS::FLI1 binds to hundreds of sites in the yeast genome with a clear preference for putative ETS-TF consensus sequences and (CA) dinucleotide repeats. Yet, EWS::FLI1 expressing cells presented only minimal transcriptional dysregulation, a stark contrast to the extensive changes observed in humans and Drosophila cells. Finally, we found that EWS::FLI1 successfully converted silent GGAASat sequences into active enhancers in yeast. This remarkable result occurs despite the absence of homologs for key human activators, such as CBP/p300, strongly suggesting that EWS::FLI1 can mobilize functionally related, non-homologous pathways to establish neoenhancers at GGAASat sites. Altogether, our results indicate that EWS::FLI1's core ability to drive GGAASat-dependent gene expression is a conserved, ancient property, while GGAASat-independent extensive transcriptome reprogramming is dependent on co-factors and pathways specific to animal cells.
Hoskins, J. W.; Christensen, T. A.; Eiser, D.; Char, E.; Mobaraki, M.; O'Brien, A.; Collins, I.; Zhong, J.; Patel, M. B.; Prasad, G.; Pancreatic Cancer Cohort Consortium and Pancreatic Cancer Case-Control Consortium (PanScan/PanC4), ; Arda, E.; Connelly, K. E.; Amundadottir, L. T.
Show abstract
Pancreatic ductal adenocarcinoma (PDAC) remains one of the deadliest human cancers. The current largest published PDAC Genome-Wide Association Study (GWAS) identified 23 genetic risk signals, but most lack sufficient characterization. This study aimed to functionally characterize the chr13q12.2 (PLUT/PDX1) PDAC GWAS risk locus. Fine-mapping, luciferase reporter assays, and electrophoretic mobility shift assays implicated rs9581943, a PDX1 promoter SNP, as a functional variant underlying this GWAS signal. GTEx expression QTL analyses identified rs9581943 as a significant PDX1 eQTL in pancreas, and CRISPR/Cas9 editing in PDAC-derived cell lines confirmed a functional relationship. PDX1 is a transcription factor involved in early pancreas development and {beta}-cell homeostasis, but its role in exocrine pancreatic cells is unclear. Single-nucleus RNA-seq analyses of pancreatic acinar and ductal cells from neonatal, adult, and chronic pancreatitis donors suggested PDX1 activity alleviates high secretory load and ER-stress in acinar and biases ducts toward homeostatic phenotypes. Similarly, scRNA-seq analyses of pancreatic tumors suggested PDX1 activity reduces biosynthetic and inflammatory stress and promotes epithelial differentiation. Our study therefore implicates rs9581943 as a causal variant for the chr13q12.2 PDAC GWAS signal wherein the risk allele reduces PDX1 expression, eroding PDX1's capacity to buffer stress and stabilize epithelial cell fate in the exocrine compartment.
Diaz, F. C.; Waldrup, B.; Carranza, F. G.; Manjarrez, S.; Velazquez-Villarreal, E.
Show abstract
BackgroundDespite extensive characterization of key oncogenic drivers, pancreatic ductal adenocarcinoma (PDAC) continues to exhibit profound molecular heterogeneity and inconsistent responses to standard therapies, including gemcitabine. The role of pathway-level alterations, particularly in the context of age at onset and therapeutic exposure, remains insufficiently defined. MethodsIn this study, we leveraged a conversational artificial intelligence framework (AI-HOPE-TP53 and AI-HOPE-PI3K) to enable precision oncology, driven interrogation of clinical and genomic data from 184 PDAC tumors, stratified by age at diagnosis and gemcitabine exposure. Using AI-enabled cohort construction and pathway-centric analyses, we evaluated alterations in TP53 and PI3K signaling networks, with findings validated through conventional statistical methods. ResultsTP53 pathway analysis revealed a significantly higher frequency of TP53 mutations in early-onset compared to late-onset PDAC among gemcitabine-treated patients (86.7% vs. 57.1%, p = 0.04), with a similar trend observed between treated and untreated early-onset cases (86.7% vs. 40%, p = 0.07). Notably, in late-onset PDAC patients not treated with gemcitabine, absence of TP53 pathway alterations was associated with improved overall survival (p = 0.011). Complementary analyses of the PI3K pathway demonstrated a higher prevalence of pathway alterations in late-onset gemcitabine-treated tumors compared to untreated counterparts (13.2% vs. 2.7%, p = 0.02). Importantly, among late-onset patients not receiving gemcitabine, those without PI3K pathway alterations exhibited significantly improved overall survival (p < 0.0001). ConclusionTogether, these findings identify distinct TP53 and PI3K pathway dependencies that are modulated by both age-of-onset and treatment exposure in PDAC. This work highlights the utility of conversational artificial intelligence in enabling rapid, integrative, and hypothesis-generating analyses within a precision oncology framework, supporting the identification of clinically relevant molecular stratification strategies for this aggressive disease.
Oszer, A.; Pastorczak, A.; Urbanska, Z.; Miarka, K.; Marschollek, P.; Richert-Przygonska, M.; Mielcarek-Siedziuk, M.; Baggott, C.; Schultz, L.; Moon, J.; Aftandilian, C.; Styczynski, J.; Kalwak, K.; Mlynarski, W.; Davis, K. L.
Show abstract
Chimeric antigen receptor T-cell (CAR-T) therapy targeting CD19 has transformed outcomes for children with relapsed or refractory (R/R) B-cell acute lymphoblastic leukemia (B-ALL), yet the influence of molecular subtype on outcomes remains unclear. We evaluated the impact of cytogenetic and molecular signatures on complete response (CR), overall survival (OS), and leukemia-free survival (LFS) after CD19 CAR-T therapy in eighty-six pediatric patients with R/R B-ALL treated with tisagenlecleucel. CR was assessed 30 days after infusion. Cytogenetic data were available for 84 patients and molecular profiling for 62. Survival analyses included 72 patients who received CD19 CAR-T as the sole cellular therapy. Seventy-seven patients achieved CR (89.5%). Pre-infusion bone marrow blasts of [≥]20% were associated with lower CR rates (53.8% vs 95.9%, p<0.0001) and significantly reduced OS and LFS (both p<0.0001). Among molecular markers, RAS mutations correlated with inferior OS (p=0.0222) and LFS (0.0402). In multivariate analysis, bone marrow blasts >20% and RAS mutations independently predicted inferior OS. Post CAR-T, CD19 negative relapses showed almost twice higher prevalence of RAS mutations (66% vs 37.5%). These findings highlight RAS mutations as a key molecular predictor of outcome after CD19 CAR-T therapy and suggest emergence of unique risk stratification for patients receiving CD19-targeting therapy.
Roy, R.; Patnaik, J.; Chakraborty, A.; Patnaik, S.; Parija, T.
Show abstract
Background: Stomach adenocarcinoma is driven by heterogeneity, limiting therapeutic success. Although ROS acts as a continuous redox rheostat for tumor evolution, it is categorized based on binary models that are masked by tumor-microenvironment (TME) confounders. Here, we have defined a continuous, TME-independent ROS axis to help identify intrinsic vulnerabilities and improve patient stratification. Methods: Non-negative matrix factorization (NMF) defined a ROS-Axis in TCGA-STAD which was validated in ACRG Cohort. Multivariate regression model isolated intrinsic signatures via residual ROS scores by adjusting for TME confounders. Survival was assessed using Cox hazard models. Drug sensitivities were mapped using GDSC2/ElasticNet modeling with cross-cohort replication. Results: Our results define a reproducible ROS gradient, driven by effectors like NQO1 and SOD1, characterizing ROS-high tumors as proliferative, epithelial and immune -cold. High residual ROS score was associated with an improved prognosis, regardless of TNM stage and age. Pharmacogenomic mapping revealed an overlapping sensitivity to mTOR inhibitors in ROS-high gastric cancer tumors which persisted after TME confounder adjustment. Conclusion: The continuous ROS axis provides a functional readout of metabolic dependency that refines traditional anatomical staging. By identifying mTOR dependent cold tumors, our framework offers a precision strategy for immunotherapy-resistant patients like those affected by microsatellite-stable gastric cancer.
Nguyen, D. H.; Majdi, A.; Marliot, F.; Houtart, V.; Kirilovsky, A.; Hijazi, A.; Fredriksen, T.; de Sousa Carvalho, N.; Bach, A.- S.; Gaultier, A.- L.; Fabiano, E.; Kreps, S.; Tartour, E.; Pere, H.; Veyer, D.; Blanchard, P.; Angell, H. K.; Pages, F.; Mirghani, H.; Galon, J.
Show abstract
BackgroundTreatment optimization in HPV-associated oropharyngeal cancer (OPSCC) remains challenging, as recent de-escalation trials have shown limited success. Current patient selection strategies based on smoking history and TNM classification are insufficient, highlighting the need for robust, standardized prognostic biomarkers. We report the first validation of the Immunoscore (IS) for prognostic stratification in HPV-associated OPSCC. Patients and methodsWe analyzed 191 HPV-associated (p16+ and HPV DNA/RNA+) OPSCC patients from an international multicenter cohort (2015-2024), comprising a French monocentric retrospective training cohort (N = 48) and three validation cohorts: French monocentric retrospective (N = 48), French multicenter prospective (N = 50), and US multicenter retrospective (N = 45). IS is a standardized digital pathology assay quantifying CD3lJ and CD8lJ densities in tumor cores and invasive margins, with cut-offs defined in the training cohort and validated across cohorts. Associations with disease-free survival (DFS), time to recurrence (TTR) and overall survival (OS) were assessed, alongside 3RNA-seq and sequential immunofluorescence profiling of immune composition. ResultsMedian age 65; 80% male; 74% smokers; 66% T1-2; 82% N0-1 (AJCC8th). IS-High patients demonstrated superior 3-year DFS in the training and validation cohorts 1-3 (all log-rank P < 0.05). Multivariable analysis identified IS-Low as the strongest independent risk factor for DFS (HR 9.03; 95% CI: 4.02-20.31; P < 0.001). The model combining IS with clinical factors showed higher predictive accuracy for DFS (C-index 0.82) than clinical variables alone (0.7; P < 0.0001). Similar findings were observed for TTR and OS. IS-High tumors showed markedly higher enrichment of lymphoid and myeloid immune cell populations, contrasting with immune-poor signatures in IS-Low tumors. ConclusionsIS is a robust biomarker that outperforms standard clinical variables in both prognostic and predictive accuracy. The enriched cytotoxic immune infiltrate in IS-High tumors explains favorable outcomes and supports their suitability for treatment de-escalation. Prospective validation is warranted.
Shen, Q.; Wang, G.; Fu, M.; Yao, K.; Yang, Y.; Zeng, Q.; Guo, Y.
Show abstract
Background: Lateral lymph node metastasis (LLNM) is associated with poor prognosis in patients with rectal cancer and may influence the indication for lateral lymph node dissection. Accurate preoperative identification of LLNM remains challenging. This study aimed to develop and internally validate a clinicoradiological model for preoperative prediction of LLNM in rectal cancer. Methods A retrospective cohort of 64 patients undergoing lateral lymph node dissection (LLND) for rectal cancer was analysed; 21 (32.8%) had pathological lateral lymph node metastasis (LLNM). A prespecified preoperative clinicoradiological model was fitted using penalised logistic regression with L2 regularisation (ridge), incorporating MRI-measured lateral lymph node short-axis diameter (LLN-SAD), dichotomised clinical T stage (T3-4 vs T1-2), dichotomised clinical N stage (N+ vs N0), and log(CA19-9+1). Model performance was evaluated using the area under the receiver operating characteristic curve (AUC), calibration analysis, and bootstrap internal validation. Results The model showed good discrimination (AUC 0.914), with an optimism-corrected AUC of 0.887 on bootstrap validation. Calibration remained acceptable after optimism correction (calibration intercept -0.127; slope 1.045). Decision curve analysis suggested net benefit across clinically relevant threshold probabilities, particularly between 0.10 and 0.30. The model was implemented as a web-based calculator to facilitate clinical use. Conclusion This clinicoradiological model showed good discrimination, acceptable calibration, and potential clinical utility for preoperative assessment of LLNM risk in rectal cancer. It may assist individualized risk stratification and treatment planning, although external validation is required before routine clinical implementation.
Fischer, J.; Spindler, M. P.; Britton, G. J.; Weiler, J.; Tankelevich, M.; Dai, D.; Canales-Herrerias, P.; Jha, D.; Rajpal, U.; Mehandru, S.; Faith, J. J.
Show abstract
Our understanding of human mucosal T cell clonotype distribution in health and disease has centered on immunodominant antigens. We performed single cell T cell receptor (TCR) and RNA sequencing as an untargeted approach to define distributions of T cell clonal groups in health and ulcerative colitis (UC) across 333,088 T cells in colon and peripheral blood. Healthy donor-specific TCR repertoires had limited blood-colon clonal sharing, which was highest in cytotoxic T effector memory (Tem) populations and lowest in regulatory T cells (Tregs), reflecting tissue-based compartmentalization. Within healthy colon, TCR repertoires showed high T cell clonal sharing independent of anatomic distance, associated with high intra-clonal phenotypic diversity. Colon cytotoxic and Th17 populations showed high dispersion across sites, while Tregs were compartmentalized. Clonal lineages dispersed across blood and colon upregulated trafficking markers, suggesting active movement between tissues, while those dispersed across colon sites upregulated residency markers, suggesting intra-colon repertoire sharing is mediated by long-term, slow moving clonal groups. In UC, Tregs were expanded across inflamed sites, and increased CD8 Tem clonal groups showed increased dispersion regardless of inflammation. These findings reveal principles of T cell clonal organization in the human colon during health and disease, identifying opposing patterns of clonal dispersion among Treg and Th17 clonal groups, high phenotypic diversity within dispersed clonal groups, and elevated cross-colon dispersion of CD8 Tem clonotypes in UC.
Shapiro, J. R.; Dorogy, A.; Science, M.; Gupta, S.; Alexander, S.; Bolotin, S.; Watts, T. H.
Show abstract
Children with acute lymphoblastic leukemia (ALL) are treated with multiagent chemotherapy that causes profound changes to the immune system. There are limited data on how disease and therapy impact antigen-specific immune memory, leading to inconsistent guidelines on best practices for revaccination of this population. Here, to inform vaccine guidance, we investigated whether immunity derived from routine childhood measles and varicella zoster virus (VZV) vaccines is maintained during and after therapy for childhood ALL. We report that antibodies against measles and VZV were significantly reduced in children with ALL (n=45) compared to healthy controls (n=13), particularly in older children in whom a longer time had passed since their most recent vaccine dose. However, the avidity of the measles and VZV-specific antibodies was indistinguishable between groups. Despite changes to the composition of the T cell compartment, both overall and antigen-specific T cell function were preserved in children with ALL. These data provide compelling evidence for revaccination of children following ALL treatment. Intact T cell responses suggest that post-treatment revaccination would be effective.
Pore, M.; Balamurugan, K.; Atkinson, A.; Breen, D.; Mallory, P.; Cardamone, A.; McKennett, L.; Newkirk, C.; Sharan, S.; Bocik, W.; Sterneck, E.
Show abstract
Circulating tumor cells (CTCs), and especially CTC-clusters, are linked to poor prognosis and may reveal mechanisms of metastasis and treatment resistance. Therefore, developing unbiased methods for the functional characterization of CTCs in liquid biopsies is an urgent need. Here, we present an evaluation of multiplex imaging mass cytometry (IMC) to analyze CTCs in mice with human xenograft tumors. In a single-step process, IMC uses metal-labeled antibodies to simultaneously detect a large number of proteins/modifications within minimally manipulated small volumes of blood from the tail vein or heart. We used breast cancer cell lines and a patient-derived xenograft (PDX) to assess antibodies for cross-species interpretation. Along with manual verification, HALO-AI-based cell segmentation was used to identify CTCs and quantify markers. Despite some limitations regarding human-specificity, this technology can be used to investigate the effect of genetic and pharmacological interventions on the properties of single and cluster CTCs in tumor-bearing mice.
Diaz, F. C.; Waldrup, B.; Carranza, F. G.; Manjarrez, S.; Velazquez-Villarreal, E.
Show abstract
Background: Sezary syndrome (SS) is an aggressive leukemic variant of cutaneous T-cell lymphoma (CTCL) with distinct clinical and biological features compared to rarer entities such as primary cutaneous CD8+ aggressive epidermotropic cytotoxic T-cell lymphoma (PCAECTCL). Although recurrent genomic alterations in CTCL have been described, comparative analyses at the pathway level across biologically divergent subtypes remain limited. Here, we leveraged a conversational artificial intelligence (AI) platform for precision oncology to enable rapid, integrative, and hypothesis-driven interrogation of publicly available genomic datasets. Methods: We conducted a secondary analysis of somatic mutation and clinical data from the Columbia University CTCL cohort accessed via cBioPortal. Cases were stratified into SS (n=26) and PCAECTCL (n=13). High-confidence coding variants were curated and mapped to biologically relevant signaling pathways and functional gene categories implicated in CTCL pathogenesis. Pathway-level mutation frequencies were compared using Chi-square or Fisher's exact tests, with effect sizes quantified as odds ratios. Tumor mutational burden (TMB) was compared using the Wilcoxon rank-sum test. Subtype-specific co-mutation patterns were evaluated using pairwise association analyses and visualized through oncoplots and network heatmaps. Conversational AI agents, AI-HOPE, were used to iteratively refine cohort definitions, prioritize pathway-level signals, and contextualize findings. Results: TMB was comparable between SS and PCAECTCL (p = 0.96), indicating no significant difference in global mutational load. In contrast, pathway-centric analyses revealed marked qualitative differences. SS demonstrated enrichment of alterations in epigenetic regulators, tumor suppressor and cell-cycle control pathways, NFAT signaling, and DNA damage response mechanisms, consistent with transcriptional dysregulation and immune modulation. PCAECTCL exhibited relatively higher frequencies of alterations involving epigenetic regulators and MAPK pathway signaling, suggesting distinct oncogenic dependencies. Co-mutation analysis revealed a more constrained and focused interaction landscape in SS, whereas PCAECTCL displayed broader and more heterogeneous co-mutation networks, indicative of divergent evolutionary trajectories. Notably, ERBB2 mutations were significantly enriched between subtypes (p = 0.031), highlighting a potential subtype-specific therapeutic vulnerability. Conclusions: This study demonstrates that SS is distinguished from PCAECTCL not by increased mutational burden but by distinct pathway-level architectures, particularly involving epigenetic regulation, immune signaling, and transcriptional control. These findings generate biologically grounded, testable hypotheses for subtype-specific therapeutic targeting and underscore the value of conversational AI as a scalable framework for accelerating discovery in translational cancer genomics.
Lahtinen, E.; Schigiltchoff, N.; Jia, K.; Kundrot, S.; Palchuk, M. B.; Warnick, J.; Chan, L.; Shigiltchoff, N.; Sawhney, M. S.; Rinard, M.; Appelbaum, L.
Show abstract
Background and aims: Pancreatic ductal adenocarcinoma (PDAC) surveillance is limited to individuals with familial or genetic risk although most future cases arise outside these groups. In a retrospective study, PRISM, an electronic health record (EHR)-based PDAC risk model, identified individuals in the general population at elevated near-term risk of PDAC. We aimed to prospectively evaluate whether PRISM can identify high-risk individuals beyond current surveillance groups across U.S. health systems. Methods: We performed a prospective multicenter cohort study after deployment of PRISM in April 2023 across 44 U.S. health care organizations. Eligible adults aged [≥]40 years without prior PDAC received a single baseline risk score and were assigned to prespecified risk tiers. Patients were followed for incident PDAC for 30 months. We estimated tier-specific 30-month cumulative incidence (positive predictive value, PPV), number needed to screen (NNS), standardized incidence ratios (SIRs), and time from deployment and first high-risk flag to diagnosis. Results: Among 6,282,123 adults assigned a PRISM score, 5,058,067 had follow-up; 3,609 developed PDAC. The highest-risk tier had 30-fold higher PDAC incidence than the study population. At the SIR 5 threshold, 30-month cumulative incidence was 0.35% (NNS, 284.2); at SIR 16, 1.14% (NNS, 87.4); and at SIR 30, 2.19% (NNS, 45.7). Median time from deployment to PDAC diagnosis was 9.5 months, and median time from first high-risk flag to diagnosis at SIR 5 was 3.5 years. Shapley additive explanations (SHAP) analyses supported patient- and tier-level interpretability. Conclusions: Prospective deployment of PRISM across multiple U.S. health care organizations identified individuals at elevated near-term risk for PDAC, with substantial risk enrichment and lead time before diagnosis. These findings support the real-world scalability and generalizability of EHRbased risk stratification for risk-adapted early detection. ClinicalTrials.gov identifier NCT05973331
Steffen, F. D.; Lissat, A.; Alten, J.; Kriston, A.; Scheidegger, N.; Eckert, C.; Bodmer, N.; Schori, L.; Schühle, S.; Arpagaus, A.; Gutnik, S.; Manioti, D.; Bruderer, N.; Zeckanovic, A.; Västrik, I.; Nyiri, G.; Kovacs, F.; Thorhauge Als-Nielsen, B. E.; Attarbaschi, A.; Rademacher, A.; Elitzur, S.; Jacoby, E.; De Moerloose, B.; Svenberg, P.; Ancliff, P.; Sramkova, L.; Buldini, B.; Balduzzi, A.; Boer, J. M.; Mielcarek, M.; Ceppi, F.; Ansari, M.; Halter, J.; Schmiegelow, K.; Locatelli, F.; DelBufalo, F.; Stanulla, M.; Kulozik, A. E.; Schrappe, M.; Rohrlich, P.; Cave, H.; Baruchel, A.; von Stack
Show abstract
Children with relapsed or refractory acute lymphoblastic leukemia (ALL) require more effective and less toxic therapies. We established a prospective, multicenter Drug Response Profiling (DRP) registry (NCT06550102) integrating functional testing into precision-guided treatment. DRP was performed for 340 patients from 17 European countries with a turn-around time of two-weeks. Image-based drug screening with over 135000 unique perturbations revealed a heterogeneous landscape of ex vivo responses to 88 drugs on average. Ranking drug responses across the patient cohort defined individual drug fingerprints, identifying "DRP twins" by similarity in sensitivity and resistance independent of genetic ALL subtypes. Of 239 high-risk patients with follow-up, DRP-informed interventions were reported for 63 patients (26%). Patients received combination therapies based on venetoclax, tyrosine kinase inhibitors, trametinib, bortezomib or selinexor, resulting in objective clinical responses in 43 cases (68%). Precision-guided treatments allowed bridging to cellular therapies in 42 patients among whom 28 (67%) were still alive with a median follow-up of 21 months after DRP (IQR: 14.7-26.6 months). Top responders to venetoclax, ranked within the first tertile of the cohort, had superior 1-year event-survival compared to venetoclax non-responders (0.57 [95% CI, 0.39-0.85] vs. 0.25 [95% CI, 0.11-0.58]). Collectively, these findings demonstrate the feasibility and clinical relevance of functional profiling within an international network. This scalable framework enables individualized therapy selection for enrolment in adaptive precision trials for high-risk pediatric ALL.
Chandra, S.
Show abstract
Background: Current deep learning models in computational pathology, radiology, and digital pathology produce opaque predictions that lack the explainable artificial intelligence (xAI) capabilities required for clinical adoption. Despite achieving radiologist-level performance in tasks from whole-slide image (WSI) classification to mammographic screening, these models function as black boxes: clinicians cannot trace predictions to specific biological features, verify outputs against established morphological criteria, or integrate AI reasoning into precision oncology workflows and tumor board decision-making. Methods: We present Virtual Spectral Decomposition (VSD), a modality-agnostic, interpretable-by-design framework that decomposes medical images into six biologically interpretable tissue composition channels using sigmoid threshold functions - the same mathematical structure as CT windowing. Unlike post-hoc xAI methods (Grad-CAM, SHAP, LIME) applied to black-box deep learning models, VSD channels have pre-defined biological meanings derived from tissue physics, providing inherent explainability without sacrificing quantitative rigor. For whole-slide image (WSI) analysis in digital pathology, we introduce the dendritic tile selection algorithm, a biologically-inspired hierarchical architecture achieving 70-80% computational reduction while preferentially sampling the tumor immune microenvironment. VSD is validated across three cancer types and imaging modalities: pancreatic ductal adenocarcinoma (PDAC) on CT imaging, lung adenocarcinoma (LUAD) on H&E-stained pathology slides using TCGA data, and breast cancer on screening mammography. Composition entropy of the six-channel vector is computed as a visual Biological Entropy Index (vBEI) - an imaging biomarker quantifying the diversity of active biological defense systems. Results: In pancreatic cancer, the fat-to-stroma ratio (a novel CT-derived radiomics biomarker) declines from >5.0 (normal) to <0.5 (advanced PDAC), enabling early detection of desmoplastic invasion before mass formation on standard imaging. In lung cancer, composition entropy from H&E whole-slide images correlates with tumor immune microenvironment markers from RNA-seq (CD3: rho=+0.57, p=0.009; CD8: rho=+0.54, p=0.015; PD-1: rho=+0.54, p=0.013) and predicts overall survival (low entropy immune-desert phenotype: 71% mortality vs 29%, p=0.032; n=20 TCGA-LUAD), providing immune phenotyping for checkpoint immunotherapy patient selection from a $5 H&E slide without molecular assays. In breast cancer, each lesion type produces a characteristic six-channel fingerprint functioning as an interpretable computer-aided diagnosis (CAD) system for quantitative BI-RADS assessment and subtype classification (IDC vs ILC vs DCIS vs IBC). A five-level xAI audit trail provides complete traceability from clinical decision support output to specific biological structures visible on the original images. Conclusion: VSD establishes a unified, interpretable-by-design mathematical framework for explainable tissue composition analysis across imaging modalities and cancer types. Unlike black-box deep learning and post-hoc xAI approaches, VSD provides inherently interpretable, clinically verifiable cancer detection and immune phenotyping from standard clinical imaging at existing costs - without requiring foundation model infrastructure, specialized hardware, or molecular assays. The open-source pipeline (Google Colab, Supplementary Material) enables immediate reproducibility and extension to additional cancer types across the pan-cancer TCGA atlas.
Boudreau, M. W.; Freire, V. F.; Corbett, S. C.; Martinez-Fructuoso, L.; Shenoy, S. R.; Yu, W.; Kumar, R.; Thornburg, C. C.; Akee, R. K.; Peyser, B. D.; Jiang, Q.; Splaine, J.; Pfaff, J. L.; Chandler, B. C.; Abeja, D. M.; Donovan, K. A.; Che, J.; Lampson, B. L.; Cooke, M.; Kazanietz, M. G.; Szajner, P.; Smith, J. A.; Koduri, V.; Grkovic, T.; OKeefe, B. R.; Kaelin, W. G.
Show abstract
Many genetically validated targets in cancer, including the transcription factor {beta}-catenin ({beta}-cat), have historically been viewed as undruggable. Cell-based phenotypic screening of chemical compounds can reveal new biological and pharmacological principles. Natural products are powerful probes because of their superior structural diversity, drug-like properties, and biological activities as compared to unoptimized synthetic compounds. We screened 326,304 natural product mixtures (40,744 extracts and 285,560 fractions derived from them) using mammalian cells expressing an oncogenic version of {beta}-cat fused to a suicide protein. Multiple fractions degraded the {beta}-cat fusion protein or drove it into a compartment where both fusion partners were apparently inactive. The active natural product from one of the latter specifically activates novel, but not classical, protein kinase Cs (PKCs) and thereby relocates {beta}-cat to juxtamembrane vacuolar structures. These findings suggest a path for inactivating oncogenic {beta}-cat and underscore the power of screening natural product collections with robust phenotypic assays.
Ramirez-Lopez, L.; Kang, P.
Show abstract
Irritable Bowel Syndrome (IBS) affects a substantial proportion of university students, yet its factors remain incompletely characterised in South Asian populations. We reanalysed a publicly available dataset of 550 Bangladeshi students from Hasan et al. (2025), conducting a data audit that identified implausible records, including males reporting menstrual symptoms, and reduced the analytic sample to 506 observations. Using Explainable Boosting Machines (EBMs), which capture non-linear effects and pairwise interactions without sacrificing interpretability, we found that psychological distress, elevated BMI and academic dissatisfaction were the strongest predictors of IBS (mean AUC = 0.852 across 100 stratified train-test splits). Critically, several findings diverged from the original logistic regression analysis. Physical activity showed a non-linear risk pattern only at high intensity, the association with gender was substantially weaker when we accounted for metabolic and psychological factors as well and malnourishment does not have a strong an impact as in the original study. These divergences likely arise because the machine-learning model captures non-linear effects and interactions that were not represented in the original regression specification. Our findings underscore the value of reanalysing existing datasets with methods suited to capturing complexity and highlight data quality verification as a necessary step in the secondary analysis.
Chandra, S.
Show abstract
Background. Pancreatic ductal adenocarcinoma (PDAC) has a five-year survival rate of approximately 12%, largely because it is typically diagnosed at an advanced stage. CT-based computational methods for early detection exist but rely on black-box deep learning or large texture feature sets without tissue-specific interpretability. Methods. We developed Virtual Spectral Decomposition (VSD), which applies six parameterized sigmoid functions S(HU) = 1/(1+exp(-alpha x (HU - mu))) to standard portal-venous CT, decomposing each pixel into tissue-specific response channels for fat (mu=-60), fluid (mu=10), parenchyma (mu=45), stroma (mu=75), vascular (mu=130), and calcification (mu=250). Dendritic Binary Gating identifies structural content per channel using morphological filtering, enabling co-firing analysis and lone firer identification. A 25-feature signature was extracted per patient. Three independent datasets were analyzed: NIH Pancreas-CT (n=78 healthy), Medical Segmentation Decathlon Task07 (n=281 PDAC, paired tumor/adjacent tissue), and CPTAC-PDA from The Cancer Imaging Archive (n=82, multi-institutional, with DICOM time point tags). The same six sigmoid parameters were used across all datasets without retraining. Results. VSD achieved AUC 0.943 for field effect detection (healthy vs cancer-adjacent parenchyma) and AUC 0.931 for patient-stratified tumor specification on MSD. On CPTAC-PDA, VSD achieved AUC 0.961 (6 features) and 0.979 (25 features) for distinguishing healthy from cancer-bearing pancreas on scans obtained prior to pathological diagnosis. All significant features replicated across datasets in the same direction: z_fat (d=-2.10, p=3.5e-27), z_fluid (d=-2.76, p=2.4e-38), fire_fat (d=+2.18, p=1.2e-28). Critically, VSD severity did not correlate with days-from-diagnosis (r=-0.008, p=0.944) across a range of day -1394 to day +249. Patient C3N-01375, scanned 3.8 years before pathological diagnosis, had VSD severity 1.87, well above the healthy mean of 0.94 +/- 0.33. The tissue transformation signature was temporally stable, indicating an early, persistent tissue state rather than a progressively worsening process. Conclusions. VSD with Dendritic Binary Gating detects a stable pancreatic tissue composition signature on standard CT that is present years before clinical diagnosis, validated across three independent datasets without parameter adjustment. The six sigmoid channels map to biologically meaningful tissue components through a fully transparent interpretability chain. The temporal stability of the signal implies a detection window of 3-7 years, consistent with known PanIN-3 microenvironment transformation timelines. VSD functions as a single-scan screening tool applicable to any abdominal CT performed during the pre-clinical window.
Ullman, T.; Krantz, D.; Avenel, C.; Lung, M.; Svedman, F. C.; Holmsten, K.; Ostling, P.; Ullen, A.; Stadler, C.
Show abstract
Effective predictive biomarkers for immune checkpoint inhibitor (ICI) therapy remain an unmet need across solid tumors. Here, we present an integrated spatial proteomics workflow that combines in situ proximity ligation assay with multiplexed immunofluorescence to directly resolve PD1/PDL1 signaling events at the level of defined cellular phenotypes and their spatial organization within intact tumor tissue. Applied as a proof of concept to tumor samples from patients with metastatic urothelial carcinoma treated with pembrolizumab, this approach reveals that PD1/PDL1 interactions specifically involving cytotoxic CD8CD3 T cells are significantly enriched in complete responders, while such interactions are rare in patients with progressive disease. This interaction defined T cell subset achieves superior discrimination of clinical response compared to single marker PDL1 expression or immune cell abundance alone. By integrating direct detection of protein protein interactions with high dimensional single cell phenotyping, our workflow provides a mechanistically informed, spatially resolved biomarker of functional immune engagement. Beyond urothelial carcinoma, this platform establishes a generalizable framework for translating spatial signaling biology into predictive tools for immunotherapy response across tumor types.
Ingold, N.; Frankcombe, S.; Bouttle, K.; Moro, E.; Canson, D.; Zoellner, S.; Patil, S.; Dzigurski, J.; Glubb, D. M.; Laisk, T.; O'Mara, T. A.
Show abstract
Female genital tract (FGT) polyps are common benign growths affecting up to half of all women. However, they carry malignant potential, and their genetic architecture remains poorly defined. We conducted a genome-wide association study (GWAS) meta-analysis across four biobanks (48,400 cases, 477,134 controls), identifying 26 risk loci for FGT polyps, 12 of which were previously unreported. Integrative gene prioritisation highlighted 193 candidate genes, revealing a potential convergent biological mechanism: where germline variation in DNA replication and maintenance (e.g., PRIM1, TERT and HMGA1) compromises genomic stability in the context of hormone-driven proliferation (e.g., ESR1 and GREB1). This susceptibility is further modulated by metabolic drivers of estrogen biosynthesis, underscored by specific adiposity-related loci (e.g. RSPO3 and PLCE1) and the aromatase gene CYP19A1. Mendelian randomisation demonstrated bidirectional causal relationships with endometriosis and fibroids, and endometrial cancer. Leveraging the shared genetic architecture of FGT polyps and other gynaecological disorders via multi-trait analysis revealed an additional 26 loci, validating sub-threshold regions encompassing HMGA1 and GREB1. In total, 52 risk loci were identified (36 novel), 39 of which replicated in an independent cohort. These findings reframe polyps not merely as local gynaecological overgrowths but as manifestations of a systemic proliferative syndrome characterised by dysregulated genome stability and estrogen signalling, which may also impact malignant transformation.
Hassan, S. S.; Nordqvist-Kleppe, S.; Asinger, N.; Wang, J.; Dillner, J.; Arroyo Muhr, L. S.
Show abstract
Human papillomavirus (HPV) testing is the primary method for cervical cancer screening, and a negative HPV test is associated with a very low subsequent risk of invasive cancer. Nevertheless, a small number of cervical cancers are diagnosed following an HPV-negative testing result, posing challenges within HPV-based screening pathways. Using nationwide Swedish registry data of HPV testing, we identified women diagnosed with invasive cervical cancer between 2019 and 2024 and reconstructed HPV testing histories from the National Cervical Screening Registry (NKCx). The most recent HPV test prior to diagnosis was defined as the index test, and longitudinal HPV testing trajectories were classified among women with an HPV-negative index test. Of 3,000 women diagnosed with invasive cancer, 243 (8.1%) had an HPV-negative index test. These women were older at diagnosis and more frequently diagnosed at advanced stages compared with women with an HPV-positive index test. Most HPV-negative index tests (66.3%) were performed in the peri-diagnostic period (+/- 30 days). Among women with an HPV-negative index test, 52.7% (128/243) had no prior HPV testing recorded, while the remainder had consistently HPV-negative histories (33.3%, 83/243) or evidence of prior HPV positivity before the index negative test (14%, 32/243). Possible recurrent HPV positivity following an intervening negative test was rare (0.4%, 1/243). HPV-negative screening results preceding invasive cancer reflect heterogeneous screening histories and cannot be explained solely by test failure. Findings highlighting the importance of reaching women earlier in screening programs and show that fluctuating HPV detectability is rare.