Back

Exact p-values for global network alignments via combinatorial analysis of shared GO terms

Hayes, W. B.

2020-10-09 molecular biology
10.1101/2020.10.08.332254 bioRxiv
Show abstract

Network alignment aims to uncover topologically similar regions in the protein-protein interaction (PPI) networks of two or more species under the assumption that topologically similar regions perform similar functions. Although there exist a plethora of both network alignment algorithms and measures of topological similarity, currently no "gold standard" exists for evaluating how well either is able to uncover functionally similar regions. Here we propose a formal, mathematically and statistically rigorous method for evaluating the statistical significance of shared GO terms in a global, 1-to-1 alignment between two PPI networks. We use combinatorics to precisely count the number of possible network alignments in which k proteins share a particular GO term. When divided by the number of all possible network alignments, this provides an explicit, exact p-value for a network alignment with respect to a particular GO term.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Entropy
20 papers in training set
Top 0.1%
19.1%
2
PLOS ONE
4510 papers in training set
Top 21%
8.6%
3
Scientific Reports
3102 papers in training set
Top 9%
8.6%
4
Physical Review Research
46 papers in training set
Top 0.1%
6.5%
5
PLOS Computational Biology
1633 papers in training set
Top 7%
5.0%
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 17%
4.1%
50% of probability mass above
7
BMC Bioinformatics
383 papers in training set
Top 3%
3.8%
8
Physical Review E
95 papers in training set
Top 0.3%
3.7%
9
Cell Systems
167 papers in training set
Top 5%
2.8%
10
Nature Communications
4913 papers in training set
Top 46%
2.1%
11
Journal of Theoretical Biology
144 papers in training set
Top 0.6%
2.1%
12
Communications Biology
886 papers in training set
Top 7%
1.8%
13
Journal of The Royal Society Interface
189 papers in training set
Top 2%
1.7%
14
Journal of Molecular Biology
217 papers in training set
Top 2%
1.4%
15
eLife
5422 papers in training set
Top 48%
1.3%
16
Chaos: An Interdisciplinary Journal of Nonlinear Science
16 papers in training set
Top 0.2%
1.0%
17
FEBS Letters
42 papers in training set
Top 0.2%
1.0%
18
Journal of Computational Chemistry
11 papers in training set
Top 0.2%
0.8%
19
Bioinformatics
1061 papers in training set
Top 9%
0.8%
20
iScience
1063 papers in training set
Top 28%
0.8%
21
Biomolecules
95 papers in training set
Top 2%
0.8%
22
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
23
Journal of Biosciences
12 papers in training set
Top 0.1%
0.7%
24
npj Systems Biology and Applications
99 papers in training set
Top 3%
0.7%
25
Statistics in Medicine
34 papers in training set
Top 0.4%
0.7%
26
Bioinformatics Advances
184 papers in training set
Top 5%
0.7%
27
Cell Reports
1338 papers in training set
Top 36%
0.5%
28
Cerebral Cortex
357 papers in training set
Top 3%
0.5%
29
Journal of the Royal Society Interface
18 papers in training set
Top 0.3%
0.5%
30
IEEE Transactions on Computational Biology and Bioinformatics
17 papers in training set
Top 0.9%
0.5%