Back

Dynamic, stage-course protein interaction network using high power CpG sites in Head and Neck Squamous Cell Carcinoma

Riaz, A.; Shah, M.; Zaheer, S.; Salam, A.; Khan, F. F.

2021-07-05 health informatics
10.1101/2021.06.30.21259548 medRxiv
Show abstract

Head and neck cancer is the sixth leading cause of cancer across the globe and is significantly more prevalent in South Asian countries, including Pakistan. Prediction of pathological stages of cancer can play a pivotal role in early diagnosis and personalized medicine. This project ventures into the prediction of different stages of head and neck squamous cell carcinoma (HNSCC) using prioritized DNA methylation patterns. DNA methylation profiles for each HNSCC stage (stage-I-IV) were used to extensively analyze 485,577 methylation CpG sites and prioritize them on the basis of the highest predictive power using a wrapper-based feature selection method, along with different classification models. We identified 68 high-power methylation sites which predicted the pathological stage of HNSCC samples with 90.62 % accuracy using a Random Forest classifier. We set out to construct a protein-protein interaction network for the proteins encoded by the 67 genes associated with these sites to study its network topology and also undertook enrichment analysis of nodes in their immediate neighborhood for GO and KEGG Pathway annotations which revealed their role in cancer-related pathways, cell differentiation, signal transduction, metabolic and biosynthetic processes. With information on the predictive power of each of the 67 genes in each HNSCC stage, we unveil a dynamic stage-course network for HNSCC. We also intend to further study these genes in light of functional datasets from CRISPR, RNAi, drug screens for their putative role in HNSCC initiation and progression.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
BioMed Research International
25 papers in training set
Top 0.1%
18.9%
2
Informatics in Medicine Unlocked
21 papers in training set
Top 0.1%
12.7%
3
PLOS ONE
4510 papers in training set
Top 25%
6.9%
4
Scientific Reports
3102 papers in training set
Top 30%
4.0%
5
Computers in Biology and Medicine
120 papers in training set
Top 0.8%
3.6%
6
Heliyon
146 papers in training set
Top 0.6%
2.9%
7
Journal of Personalized Medicine
28 papers in training set
Top 0.1%
2.8%
50% of probability mass above
8
Bioscience Reports
25 papers in training set
Top 0.3%
2.4%
9
Frontiers in Public Health
140 papers in training set
Top 3%
2.1%
10
PeerJ
261 papers in training set
Top 5%
1.9%
11
Advanced Biology
29 papers in training set
Top 0.3%
1.7%
12
Biology Methods and Protocols
53 papers in training set
Top 1.0%
1.7%
13
Frontiers in Medicine
113 papers in training set
Top 4%
1.7%
14
Diagnostics
48 papers in training set
Top 1%
1.2%
15
BMC Medical Informatics and Decision Making
39 papers in training set
Top 2%
1.2%
16
Frontiers in Genetics
197 papers in training set
Top 7%
1.2%
17
BMJ Open
554 papers in training set
Top 10%
1.2%
18
Frontiers in Microbiology
375 papers in training set
Top 7%
1.2%
19
JAMIA Open
37 papers in training set
Top 1%
1.0%
20
JMIR Public Health and Surveillance
45 papers in training set
Top 3%
1.0%
21
Frontiers in Oncology
95 papers in training set
Top 3%
1.0%
22
Cancer Medicine
24 papers in training set
Top 1%
1.0%
23
Frontiers in Molecular Biosciences
100 papers in training set
Top 3%
1.0%
24
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 7%
0.9%
25
Frontiers in Pharmacology
100 papers in training set
Top 4%
0.9%
26
Frontiers in Bioinformatics
45 papers in training set
Top 0.6%
0.9%
27
Cell Death & Disease
126 papers in training set
Top 2%
0.9%
28
Physical Biology
43 papers in training set
Top 2%
0.8%
29
International Journal of Environmental Research and Public Health
124 papers in training set
Top 6%
0.8%
30
Journal of Medical Virology
137 papers in training set
Top 4%
0.8%