Back

sc-REnF: An entropy guided robust feature selection for clustering of single-cell rna-seq data

Lall, S.; Ghosh, A.; Ray, S.; Bandyopadhyay, S.

2020-10-10 bioinformatics
10.1101/2020.10.10.334573 bioRxiv
Show abstract

Many single-cell typing methods require pure clustering of cells, which is susceptible towards the technical noise, and heavily dependent on high quality informative genes selected in the preliminary steps of downstream analysis. Techniques for gene selection in single-cell RNA sequencing (scRNA-seq) data are seemingly simple which casts problems with respect to the resolution of (sub-)types detection, marker selection and ultimately impacts towards cell annotation. We introduce sc-REnF, a novel and robust entropy based feature (gene) selection method, which leverages the landmark advantage of Renyi and Tsallis entropy achieved in their original application, in single cell clustering. Thereby, gene selection is robust and less sensitive towards the technical noise present in the data, producing a pure clustering of cells, beyond classifying independent and unknown sample with utmost accuracy. The corresponding software is available at: https://github.com/Snehalikalall/sc-REnF

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genome Research
409 papers in training set
Top 0.1%
21.8%
2
Genome Biology
555 papers in training set
Top 0.2%
12.1%
3
Bioinformatics
1061 papers in training set
Top 3%
9.8%
4
Nature Communications
4913 papers in training set
Top 31%
6.1%
5
Briefings in Bioinformatics
326 papers in training set
Top 1%
4.7%
50% of probability mass above
6
Nucleic Acids Research
1128 papers in training set
Top 4%
4.7%
7
Nature Biotechnology
147 papers in training set
Top 3%
3.5%
8
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.8%
3.5%
9
BMC Bioinformatics
383 papers in training set
Top 3%
3.5%
10
iScience
1063 papers in training set
Top 8%
2.6%
11
Bioinformatics Advances
184 papers in training set
Top 2%
2.3%
12
PLOS ONE
4510 papers in training set
Top 51%
1.8%
13
Cell Reports Methods
141 papers in training set
Top 2%
1.8%
14
Communications Biology
886 papers in training set
Top 10%
1.6%
15
PLOS Computational Biology
1633 papers in training set
Top 18%
1.4%
16
Nature Methods
336 papers in training set
Top 5%
1.3%
17
Genome Medicine
154 papers in training set
Top 6%
1.3%
18
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 4%
1.3%
19
BMC Genomics
328 papers in training set
Top 4%
1.1%
20
Nature Computational Science
50 papers in training set
Top 1%
0.9%
21
Frontiers in Genetics
197 papers in training set
Top 9%
0.8%
22
Scientific Reports
3102 papers in training set
Top 76%
0.7%
23
Cell Systems
167 papers in training set
Top 13%
0.7%
24
Advanced Science
249 papers in training set
Top 20%
0.7%
25
Nature Genetics
240 papers in training set
Top 8%
0.7%
26
Life Science Alliance
263 papers in training set
Top 3%
0.6%