Back

Filtering cells with high mitochondrial content removes viable metabolically altered malignant cell populations in cancer single-cell studies

Yates, J.; Kraft, A.; Boeva, V.

2024-10-25 bioinformatics
10.1101/2024.10.24.620025 bioRxiv
Show abstract

BackgroundSingle-cell transcriptomics has transformed our understanding of cellular diversity in biological systems. However, systematic noise, often introduced by low-quality cells, can obscure biological signals if not properly accounted for. Thus, one of the common quality control steps involves filtering out cells with a high percentage of mitochondrial RNA counts (pctMT), as high pctMT typically indicates cell death. Yet, commonly used filtering thresholds, primarily derived from studies on healthy tissues, may be overly stringent for malignant cells, which often naturally exhibit higher baseline mitochondrial gene expression. We analyzed public single-cell RNA-seq and spatial data to investigate if malignant cells with high pctMT are viable and functionally significant subpopulations. ResultsWe analyzed nine single-cell RNA-seq datasets from uveal melanoma, breast, lung, kidney, head and neck, prostate, and pancreatic cancers, including 439,507 cells from 151 patients. Malignant cells exhibited significantly higher pctMT than nonmalignant cells without a significant increase in dissociation-induced stress signature scores. Malignant cells with high pctMT showed metabolic dysregulation, including increased xenobiotic metabolism, which is implicated in cancer therapeutic response. Our analysis of pctMT in cancer cell lines uncovered associations with resistance and sensitivity to certain classes of drugs. Additionally, we observed a link between pctMT and malignant cell transcriptional heterogeneity as well as patient clinical features. ConclusionsThis study provides a detailed exploration of the functional characteristics of malignant cells with elevated pctMT, challenging current quality control practices in single-cell RNA-seq analyses of tumors. Our findings have the potential to improve data interpretation and refine the biological conclusions of future cancer studies.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.2%
22.5%
2
Cancer Research Communications
46 papers in training set
Top 0.1%
10.1%
3
PLOS ONE
4510 papers in training set
Top 34%
4.3%
4
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.5%
4.0%
5
Bioinformatics
1061 papers in training set
Top 5%
4.0%
6
Cancers
200 papers in training set
Top 2%
3.6%
7
Scientific Reports
3102 papers in training set
Top 37%
3.6%
50% of probability mass above
8
PLOS Computational Biology
1633 papers in training set
Top 12%
2.6%
9
PeerJ
261 papers in training set
Top 5%
2.1%
10
Genome Medicine
154 papers in training set
Top 3%
2.1%
11
npj Precision Oncology
48 papers in training set
Top 0.4%
2.1%
12
Nucleic Acids Research
1128 papers in training set
Top 9%
2.1%
13
GigaScience
172 papers in training set
Top 1%
1.8%
14
Biology Methods and Protocols
53 papers in training set
Top 0.8%
1.8%
15
iScience
1063 papers in training set
Top 13%
1.8%
16
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
17
Mitochondrion
11 papers in training set
Top 0.1%
1.5%
18
Journal of Translational Medicine
46 papers in training set
Top 1%
1.3%
19
NAR Cancer
36 papers in training set
Top 0.1%
1.3%
20
Scientific Data
174 papers in training set
Top 2%
1.2%
21
Nature Communications
4913 papers in training set
Top 56%
1.2%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 7%
1.1%
23
BMC Genomics
328 papers in training set
Top 4%
1.0%
24
Database
51 papers in training set
Top 0.8%
0.8%
25
Clinical Chemistry
22 papers in training set
Top 0.8%
0.7%
26
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
27
npj Genomic Medicine
33 papers in training set
Top 1%
0.7%
28
Cell Reports Medicine
140 papers in training set
Top 8%
0.7%
29
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.7%
0.6%
30
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.6%