Back

Learning Patient Similarity from Genomics for Precision Oncology

Shady, M.; Reardon, B.; Jiang, S.; Pimenta, E.; O'Meara, T.; Park, J.; kehl, K. L.; Elmarakeby, H. A.; Sunyaev, S. R.; Van Allen, E. M.

2025-12-18 oncology
10.64898/2025.12.17.25342480 medRxiv
Show abstract

IntroductionPrecision oncology has informed cancer care by enabling the discovery and application of diagnostic, prognostic, and/or predictive molecular biomarkers. However, many patients lack actionable biomarkers or fail to respond to biomarker-directed therapies. Patient similarity approaches can leverage comprehensive tumor profiling and prior clinical experiences from large cohorts for decision support, facilitating broader realization of precision oncology insights. MethodsWe developed a deep learning-based modeling framework using real-world clinicogenomic data from a tertiary cancer center to (i) measure patient similarity based on embedded tumor genomic profiles and (ii) evaluate the association of derived patient subgroups and neighborhoods with shared therapeutic outcomes in breast cancer-specific and histology-agnostic pan-cancer settings. ResultsThe model recovered clinically meaningful patient clusters reflecting both expected and previously unknown therapeutic associations, as well as patient-specific neighborhoods that could inform therapeutic trajectories more often than expected by chance in multiple clinical contexts. Moreover, model utility extended to patients without actionable genomic biomarkers and those with cancer of unknown primary (CUP) diagnoses, where neighborhoods aligned with independently predicted primary cancer type. These neighborhoods could also be examined over time in a continuously learning scenario. ConclusionThis similarity-based modeling framework distilled complex molecular and clinical data into concise, context-specific insights that augment clinician judgment, providing a foundation for a real-time learning, patient-centered decision support model in precision oncology.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.1%
37.1%
2
Clinical Cancer Research
58 papers in training set
Top 0.2%
6.8%
3
PLOS Computational Biology
1633 papers in training set
Top 8%
4.3%
4
npj Precision Oncology
48 papers in training set
Top 0.2%
3.6%
50% of probability mass above
5
npj Digital Medicine
97 papers in training set
Top 1%
3.1%
6
Nature Communications
4913 papers in training set
Top 43%
2.9%
7
Scientific Reports
3102 papers in training set
Top 53%
1.9%
8
Breast Cancer Research
32 papers in training set
Top 0.3%
1.9%
9
Cancer Cell
38 papers in training set
Top 0.8%
1.9%
10
JCO Precision Oncology
14 papers in training set
Top 0.2%
1.8%
11
Cancer Research
116 papers in training set
Top 2%
1.8%
12
European Journal of Cancer
10 papers in training set
Top 0.2%
1.7%
13
Frontiers in Oncology
95 papers in training set
Top 2%
1.7%
14
Nature Cancer
35 papers in training set
Top 0.7%
1.7%
15
Annals of Oncology
13 papers in training set
Top 0.5%
1.5%
16
JAMA Network Open
127 papers in training set
Top 3%
1.3%
17
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.4%
1.2%
18
Cancer Medicine
24 papers in training set
Top 1%
1.2%
19
JNCI: Journal of the National Cancer Institute
16 papers in training set
Top 0.5%
0.9%
20
Cell Reports Medicine
140 papers in training set
Top 6%
0.9%
21
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 43%
0.8%
22
BMC Infectious Diseases
118 papers in training set
Top 5%
0.7%
23
Biomedicines
66 papers in training set
Top 3%
0.7%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.7%
25
Database
51 papers in training set
Top 1.0%
0.7%
26
Frontiers in Immunology
586 papers in training set
Top 8%
0.7%
27
JAMIA Open
37 papers in training set
Top 2%
0.7%
28
Nature Machine Intelligence
61 papers in training set
Top 3%
0.7%
29
Modern Pathology
21 papers in training set
Top 0.5%
0.7%
30
Molecular Cancer Therapeutics
33 papers in training set
Top 0.7%
0.7%