Back

Machine Learning Analysis to Define Cell Lineage in Leiomyosarcoma

van IJzendoorn, D. G. P.; Przybyl, J.; Hastie, T.; Bovee, J. V. M. G.; Matusiak, M.; van de Rijn, M.

2026-05-12 cancer biology
10.64898/2026.05.08.723931 bioRxiv
Show abstract

IntroductionCellular differentiation and lineage commitment are known to be associated with differences in DNA methylation. Leiomyosarcoma (LMS) is a tumor thought to originate from smooth muscle cells in the walls of vessels in the soft tissue (STLMS) or from the uterine myometrium (ULMS). Here, we identify the methylation signatures of normal smooth muscle cells from blood vessels and the uterine wall and compare these with those found in STLMS and ULMS. We hypothesized that these methylation signatures could be used to assign a smooth muscle subtype of origin to individual leiomyosarcomas, and that tumors of different origin would show biological differences with potential therapeutic relevance. MethodsTo define methylation profiles for smooth muscle from vessel walls versus those found in myometrium, EPIC methylation profiling was performed on DNA from 49 formalin-fixed paraffin-embedded (FFPE) normal smooth muscle samples. A supervised machine learning algorithm (Random Forest) was used to distinguish the methylation patterns of normal smooth muscle cells in vessel walls from those in the myometrium. The resulting classifier was applied to methylation data on 67 cases of LMS with corresponding bulk RNAseq data to identify which tumors showed a methylation signature most consistent with either blood vessel wall (LMSvessel) or myometrial smooth muscle (LMSwall). A custom signature matrix derived from scRNAseq data from 6 samples of LMS was used in CIBERSORTx analysis to compare the cellular composition of LMS cases with a vessel or uterine wall methylation signature. ResultsA high degree of correlation was found between the known site of origin for LMS (STLMS vs ULMS) and the methylation signature derived from different types of normal smooth muscle. LMSwall tumors compared to LMSvessel tumors had significantly higher activation of the PD-1 checkpoint pathway in RNAseq analysis. Digital flow cytometry by CIBERSORTx analysis showed an increased expression of transcriptomic signatures of several immune cell subtypes in LMSvessel tumors. ConclusionUsing a supervised machine learning approach we classified LMS samples as either showing a high similarity in methylation patterns to normal smooth muscle cells of either the vessel wall or the myometrium. We found a correlation between LMS showing either a "vessel" or "muscle wall" methylation signature and their site of origin, but notably we also identified some exceptions. When classified based on their methylation signature LMSwall and LMSvessel differed in their PD-1 pathway activation and in their predicted immune cell populations, suggesting potential implications for immunotherapeutic approaches.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Cancers
200 papers in training set
Top 0.2%
14.5%
2
PLOS ONE
4510 papers in training set
Top 18%
10.2%
3
Scientific Reports
3102 papers in training set
Top 9%
8.5%
4
Frontiers in Oncology
95 papers in training set
Top 0.5%
6.4%
5
Frontiers in Immunology
586 papers in training set
Top 2%
4.0%
6
Clinical Epigenetics
53 papers in training set
Top 0.2%
3.7%
7
PeerJ
261 papers in training set
Top 2%
3.7%
50% of probability mass above
8
BMC Cancer
52 papers in training set
Top 0.6%
3.7%
9
F1000Research
79 papers in training set
Top 0.7%
2.8%
10
npj Precision Oncology
48 papers in training set
Top 0.4%
2.1%
11
PLOS Computational Biology
1633 papers in training set
Top 15%
1.8%
12
Journal of Translational Medicine
46 papers in training set
Top 0.9%
1.7%
13
Journal of Clinical Medicine
91 papers in training set
Top 3%
1.7%
14
Diagnostics
48 papers in training set
Top 1%
1.5%
15
Biochemistry and Biophysics Reports
28 papers in training set
Top 0.6%
1.5%
16
Frontiers in Physiology
93 papers in training set
Top 3%
1.3%
17
The Journal of Pathology
22 papers in training set
Top 0.2%
1.3%
18
Frontiers in Genetics
197 papers in training set
Top 7%
1.1%
19
British Journal of Cancer
42 papers in training set
Top 1%
1.0%
20
Frontiers in Bioinformatics
45 papers in training set
Top 0.6%
0.9%
21
Immunology
29 papers in training set
Top 0.8%
0.9%
22
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.5%
0.9%
23
Biology Methods and Protocols
53 papers in training set
Top 2%
0.9%
24
Journal of Clinical Pathology
12 papers in training set
Top 0.5%
0.8%
25
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.8%
26
Acta Neuropathologica Communications
81 papers in training set
Top 1%
0.8%
27
JNCI Cancer Spectrum
10 papers in training set
Top 0.5%
0.8%
28
BioData Mining
15 papers in training set
Top 0.9%
0.8%
29
Oncotarget
15 papers in training set
Top 0.5%
0.7%
30
Cancer Medicine
24 papers in training set
Top 2%
0.7%