Dr. Sim: Similarity Learning for Transcriptional Phenotypic Drug Discovery

Wei, Z.; Zhu, S.; Chen, X.; Zhu, C.; Duan, B.; Liu, Q.

2021-09-24 bioinformatics
10.1101/2021.09.23.461458 bioRxiv

Transcriptional phenotypic drug discovery has achieved great success, and various compound perturbation-based data resources, such as the Connectivity Map (CMap) and the Library of Integrated Network-Based Cellular Signatures (LINCS), have been released. Computational strategies that mine these resources for phenotypic drug discovery have been proposed; a fundamental issue common to them is how to define a proper similarity between transcriptional profiles in order to elucidate drug mechanisms of action and identify new drug indications. Traditionally, this similarity has been defined in an unsupervised way; because these high-throughput data are high-dimensional and noisy, such measures lack robustness and show limited performance. In this study, we present Dr. Sim, a general learning-based framework that infers the similarity measure automatically from data rather than relying on a manually designed one, and that can be used to characterize transcriptional phenotypic profiles for drug discovery with good, generalizable performance. We evaluated Dr. Sim on a comprehensive set of publicly available in vitro and in vivo datasets for drug annotation and repositioning using high-throughput transcriptional perturbation data. The results indicate that Dr. Sim significantly outperforms existing methods, and that learning transcriptional similarity is a conceptual improvement that facilitates the broad use of high-throughput transcriptional perturbation data for phenotypic drug discovery. The source code and usage instructions for Dr. Sim are available at https://github.com/bm2-lab/DrSim/.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1. Briefings in Bioinformatics: 326 papers in training set, Top 0.1%, 40.5%
2. Bioinformatics: 1061 papers in training set, Top 3%, 10.4%
(50% of probability mass above)
3. Genomics, Proteomics & Bioinformatics: 171 papers in training set, Top 0.5%, 10.4%
4. Journal of Chemical Information and Modeling: 207 papers in training set, Top 1.0%, 5.0%
5. BMC Bioinformatics: 383 papers in training set, Top 3%, 2.7%
6. PLOS Computational Biology: 1633 papers in training set, Top 13%, 2.1%
7. Computational and Structural Biotechnology Journal: 216 papers in training set, Top 4%, 1.8%
8. Journal of Cheminformatics: 25 papers in training set, Top 0.3%, 1.8%
9. PLOS ONE: 4510 papers in training set, Top 56%, 1.5%
10. Quantitative Biology: 11 papers in training set, Top 0.3%, 1.5%
11. IEEE Transactions on Computational Biology and Bioinformatics: 17 papers in training set, Top 0.3%, 1.5%
12. Scientific Reports: 3102 papers in training set, Top 69%, 1.0%
13. Patterns: 70 papers in training set, Top 2%, 0.9%
14. Frontiers in Molecular Biosciences: 100 papers in training set, Top 4%, 0.8%
15. IEEE Journal of Biomedical and Health Informatics: 34 papers in training set, Top 2%, 0.8%
16. Advanced Science: 249 papers in training set, Top 17%, 0.8%
17. Nucleic Acids Research: 1128 papers in training set, Top 17%, 0.8%
18. npj Systems Biology and Applications: 99 papers in training set, Top 2%, 0.8%
19. IEEE/ACM Transactions on Computational Biology and Bioinformatics: 32 papers in training set, Top 0.6%, 0.7%
20. Frontiers in Genetics: 197 papers in training set, Top 10%, 0.7%
21. Computers in Biology and Medicine: 120 papers in training set, Top 5%, 0.7%
22. iScience: 1063 papers in training set, Top 39%, 0.5%
23. Bioinformatics Advances: 184 papers in training set, Top 5%, 0.5%
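
The statement that the top 2 journals account for 50% of the predicted probability mass can be checked by accumulating the listed probabilities in rank order. The snippet below is a minimal sketch of that arithmetic in Python, not code from this page. The function name `journals_to_reach` is hypothetical, and only the five top-ranked probabilities from the list are copied in.

```python
from itertools import accumulate

# Predicted probabilities (%) of the five top-ranked journals,
# copied from the list above.
ranked = [
    ("Briefings in Bioinformatics", 40.5),
    ("Bioinformatics", 10.4),
    ("Genomics, Proteomics & Bioinformatics", 10.4),
    ("Journal of Chemical Information and Modeling", 5.0),
    ("BMC Bioinformatics", 2.7),
]


def journals_to_reach(ranked, threshold):
    """Hypothetical helper: number of top-ranked entries needed for the
    cumulative probability (in %) to reach `threshold`; None if the
    listed entries never reach it."""
    running_totals = accumulate(prob for _, prob in ranked)
    for count, total in enumerate(running_totals, start=1):
        if total >= threshold:
            return count
    return None


print(journals_to_reach(ranked, 50.0))  # prints 2
```

Here 40.5% + 10.4% = 50.9%, so the cumulative total first reaches 50% at the second journal.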