
selfRL: Two-Level Self-Supervised Transformer Representation Learning for Link Prediction of Heterogeneous Biomedical Networks

Wang, X.; Yang, Y.; Liao, X.; Li, K.; Li, F.; Peng, S.

2020-10-21 bioinformatics
10.1101/2020.10.20.347153 bioRxiv

Predicting potential links in heterogeneous biomedical networks (HBNs) can greatly benefit various important biomedical problems. However, self-supervised representation learning for link prediction in HBNs has been little explored in previous research. Therefore, this study proposes a two-level self-supervised representation learning method, namely selfRL, for link prediction in HBNs. A meta path detection-based self-supervised learning task is proposed to learn representation vectors that capture the global-level structural and semantic features of HBNs. A vertex entity mask-based self-supervised learning mechanism is designed to enhance the local associations of vertices. Finally, the representations from the two tasks are concatenated to generate high-quality representation vectors. Link prediction results on six datasets show that selfRL outperforms 25 state-of-the-art methods. In particular, selfRL achieves AUC and AUPR values close to 1 on the NeoDTI-net dataset. In addition, PubMed publications demonstrate that nine out of ten drugs screened by selfRL can inhibit the cytokine storm in COVID-19 patients. In summary, selfRL provides a general framework that develops self-supervised learning tasks with unlabeled data to obtain promising representations for improving link prediction.
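
As a rough illustration of two ideas named in the abstract, the sketch below concatenates per-vertex vectors from two sources and computes AUC for a set of scored candidate links. Everything in it is an assumption made for illustration: the vertex names, the toy vectors, the labels and scores, and the helper names `concat_representations` and `auc` are hypothetical and are not taken from the selfRL paper or its code.

```python
# Illustrative sketch only: toy vectors and labels, not data or code from the paper.

def concat_representations(global_vecs, local_vecs):
    """Join the per-vertex vectors from the two tasks into one vector each."""
    return {v: global_vecs[v] + local_vecs[v] for v in global_vecs}

def auc(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical global-level and local-level vectors for two vertices.
global_vecs = {"drug_a": [0.2, 0.7], "protein_b": [0.1, 0.4]}
local_vecs = {"drug_a": [0.9], "protein_b": [0.3]}
reps = concat_representations(global_vecs, local_vecs)

# Hypothetical link labels (1 = true link) and predicted scores.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.3, 0.1]
result = auc(labels, scores)
```

An AUC of 1 means every true link is scored above every non-link, which is the sense in which values "close to 1" indicate near-perfect ranking.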

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1. IEEE Journal of Biomedical and Health Informatics: 34 papers in training set; Top 0.1%; 26.0%
2. Briefings in Bioinformatics: 326 papers in training set; Top 0.9%; 6.3%
3. IEEE/ACM Transactions on Computational Biology and Bioinformatics: 32 papers in training set; Top 0.1%; 4.3%
4. Computers in Biology and Medicine: 120 papers in training set; Top 0.5%; 4.3%
5. Genomics, Proteomics & Bioinformatics: 171 papers in training set; Top 2%; 4.0%
6. Bioinformatics: 1061 papers in training set; Top 5%; 3.6%
7. Journal of Biomedical Informatics: 45 papers in training set; Top 0.4%; 3.6%
50% of probability mass above
8. IEEE Access: 31 papers in training set; Top 0.1%; 3.6%
9. Advanced Science: 249 papers in training set; Top 7%; 2.7%
10. PLOS ONE: 4510 papers in training set; Top 45%; 2.6%
11. IEEE Transactions on Computational Biology and Bioinformatics: 17 papers in training set; Top 0.1%; 2.4%
12. Computational and Structural Biotechnology Journal: 216 papers in training set; Top 3%; 2.1%
13. Scientific Reports: 3102 papers in training set; Top 58%; 1.7%
14. PLOS Computational Biology: 1633 papers in training set; Top 16%; 1.7%
15. BMC Medical Informatics and Decision Making: 39 papers in training set; Top 1%; 1.7%
16. Neurocomputing: 13 papers in training set; Top 0.2%; 1.7%
17. Journal of Chemical Information and Modeling: 207 papers in training set; Top 2%; 1.1%
18. Patterns: 70 papers in training set; Top 2%; 1.1%
19. Artificial Intelligence in Medicine: 15 papers in training set; Top 0.6%; 0.9%
20. Frontiers in Genetics: 197 papers in training set; Top 8%; 0.9%
21. Bioengineering: 24 papers in training set; Top 1%; 0.8%
22. BMC Bioinformatics: 383 papers in training set; Top 6%; 0.8%
23. iScience: 1063 papers in training set; Top 29%; 0.8%
24. Expert Systems with Applications: 11 papers in training set; Top 0.4%; 0.8%
25. Communications Biology: 886 papers in training set; Top 24%; 0.7%
26. Life: 27 papers in training set; Top 0.4%; 0.7%
27. Quantitative Biology: 11 papers in training set; Top 0.8%; 0.7%
28. Computational Biology and Chemistry: 23 papers in training set; Top 0.6%; 0.7%
29. JMIR Medical Informatics: 17 papers in training set; Top 2%; 0.6%
30. Journal of Molecular Biology: 217 papers in training set; Top 4%; 0.6%
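
The statement that the top 7 journals account for 50% of the predicted probability mass can be checked by accumulating the listed percentages in rank order. The sketch below does this for the first ten listed values; the function name `journals_to_reach` and the 50% threshold parameter are illustrative choices, not part of the page.

```python
# Predicted probabilities (percent) for ranks 1-10, copied from the list above.
probs_percent = [26.0, 6.3, 4.3, 4.3, 4.0, 3.6, 3.6, 3.6, 2.7, 2.6]

def journals_to_reach(probs, threshold=50.0):
    """Return (rank, cumulative percent) at which the running total first reaches threshold."""
    total = 0.0
    for rank, p in enumerate(probs, start=1):
        total += p
        if total >= threshold:
            return rank, total
    return None  # threshold not reached by the supplied values

rank, total = journals_to_reach(probs_percent)
```

With these values the running total is 48.5% after rank 6 and about 52.1% after rank 7, which is why the 50% marker sits between ranks 7 and 8.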