Back

Cell-type-specific nuclear morphology predicts genomic instability and prognosis in multiple cancer types

Abel, J.; Jain, S.; Rajan, D.; Padigela, H.; Leidal, K.; Prakash, A.; Conway, J.; Nercessian, M.; Kirkup, C.; Javed, S. A.; Egger, R.; Trotter, B.; Gerardin, Y.; Brosnan-Cashman, J. A.; Dhoot, A.; Montalto, M. C.; Wapinski, I.; Khosla, A.; Drage, M. G.; Yu, L.; Taylor-Weiner, A.

2023-05-15 cancer biology
10.1101/2023.05.15.539600 bioRxiv
Show abstract

While alterations in nucleus size, shape, and color are ubiquitous in cancer, comprehensive quantification of nuclear morphology across a whole-slide histologic image remains a challenge. Here, we describe the development of a pan-tissue, deep learning-based digital pathology pipeline for exhaustive nucleus detection, segmentation, and classification and the utility of this pipeline for nuclear morphologic biomarker discovery. Manually-collected nucleus annotations were used to train an object detection and segmentation model for identifying nuclei, which was deployed to segment nuclei in H&E-stained slides from the BRCA, LUAD, and PRAD TCGA cohorts. Interpretable features describing the shape, size, color, and texture of each nucleus were extracted from segmented nuclei and compared to measurements of genomic instability, gene expression, and prognosis. The nuclear segmentation and classification model trained herein performed comparably to previously reported models. Features extracted from the model revealed differences sufficient to distinguish between BRCA, LUAD, and PRAD. Furthermore, cancer cell nuclear area was associated with increased aneuploidy score and homologous recombination deficiency. In BRCA, increased fibroblast nuclear area was indicative of poor progression-free and overall survival and was associated with gene expression signatures related to extracellular matrix remodeling and anti-tumor immunity. Thus, we developed a powerful pan-tissue approach for nucleus segmentation and featurization, enabling the construction of predictive models and the identification of features linking nuclear morphology with clinically-relevant prognostic biomarkers across multiple cancer types.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
npj Precision Oncology
48 papers in training set
Top 0.1%
13.9%
2
Nature Communications
4913 papers in training set
Top 15%
12.0%
3
Genome Medicine
154 papers in training set
Top 0.5%
9.8%
4
Modern Pathology
21 papers in training set
Top 0.1%
4.2%
5
Cancer Research
116 papers in training set
Top 0.6%
4.0%
6
Advanced Science
249 papers in training set
Top 6%
3.5%
7
Scientific Reports
3102 papers in training set
Top 39%
3.5%
50% of probability mass above
8
Laboratory Investigation
13 papers in training set
Top 0.1%
3.5%
9
Cancers
200 papers in training set
Top 2%
3.2%
10
Clinical Cancer Research
58 papers in training set
Top 0.6%
3.0%
11
Cell Reports Medicine
140 papers in training set
Top 2%
3.0%
12
Communications Biology
886 papers in training set
Top 4%
2.5%
13
PLOS Computational Biology
1633 papers in training set
Top 17%
1.6%
14
PLOS ONE
4510 papers in training set
Top 56%
1.6%
15
Journal of Translational Medicine
46 papers in training set
Top 1%
1.3%
16
Breast Cancer Research
32 papers in training set
Top 0.4%
1.2%
17
Science Advances
1098 papers in training set
Top 27%
0.9%
18
Cell Death & Disease
126 papers in training set
Top 2%
0.8%
19
eBioMedicine
130 papers in training set
Top 4%
0.8%
20
Oncogene
76 papers in training set
Top 2%
0.8%
21
Cell Reports
1338 papers in training set
Top 34%
0.7%
22
Frontiers in Bioinformatics
45 papers in training set
Top 1%
0.7%
23
Frontiers in Oncology
95 papers in training set
Top 4%
0.7%
24
Cell Reports Methods
141 papers in training set
Top 6%
0.7%
25
Nucleic Acids Research
1128 papers in training set
Top 19%
0.7%
26
npj Breast Cancer
18 papers in training set
Top 0.2%
0.7%
27
Cancer Research Communications
46 papers in training set
Top 2%
0.6%
28
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.6%
29
Cancer Discovery
61 papers in training set
Top 2%
0.6%
30
iScience
1063 papers in training set
Top 38%
0.6%