Back

Genetic differences between extreme and composite constitution types from whole exome sequences reveal actionable variations

Abbas, T.; Kutum, R.; Pandey, R.; Dakle, P.; Narang, A.; Manchanda, V.; Patil, R.; Aggarwal, D.; Bansal, G.; Sharma, P.; Chaturvedi, G.; Girase, B.; Srivastava, A.; Juvekar, S.; Dash, D.; Prasher, B.; Mukerji, M.

2020-04-28 genomics
10.1101/2020.04.24.059006 bioRxiv
Show abstract

Personalized medicine relies on successful identification of genome-wide variations that governs inter-individual differences in phenotypes and system level outcomes. In Ayurveda, assessment of composite constitution types "Prakriti" forms the basis for risk stratification, predicting health and disease trajectories and personalized recommendations. Here, we report a novel method for identifying pleiotropic genes and variants that associate with healthy individuals of three extreme and contrasting "Prakriti" constitutions through exome sequencing and state-of-the-art computational methods. Exome Seq of three extreme Prakriti types from 108 healthy individuals 54 each from genetically homogeneous populations of North India (NI, Discovery cohort) and Western India (VADU, Replication cohort) were evaluated. Fishers Exact Test was applied between Prakriti types in both cohorts and further permutation based p-value was used for selection of exonic variants. To investigate the effect of sample size per genetic association test, we performed power analysis. Functional impact of differentiating genes and variations were inferred using diverse resources -Toppfun, GTEx, GWAS, PheWAS, UK Biobank and mouse knockdown/knockout phenotype (MGI). We also applied supervised machine learning approach to evaluate the association of exonic variants with multisystem phenotypes of Prakriti. Our targeted investigation into exome sequencing from NI (discovery) and VADU (validation) cohorts datasets provide ~7,000 differentiating SNPs. Closer inspection further identified a subset of SNPs (2407 (NI) and 2393 (VADU)), that mapped to an overlapping set of 1181 genes. This set can robustly stratify the Prakriti groups into three distinct clusters with distinct gene ontological (GO) enrichments. Functional analysis further strengthens the potential pleiotropic effects of these differentiating genes/variants and multisystem phenotypic consequences. Replicated SNPs map to some very prominent genes like FIG4, EDNRA, ANKLE1, BCKDHA, ATP5SL, EXOCS5, IFIT5, ZNF502, PNPLA3 and IL6R. Lastly, multivariate analysis using random forest uncovered rs7244213 within urea transporter SLC14A2, that associate with an ensemble of features linked to distinct constitutions. Our results reinforce the concept of integration of Prakriti based deep phenotypes for risk stratification of healthy individuals and provides markers for early actionable interventions.

Matching journals

The top 10 journals account for 50% of the predicted probability mass.

1
Frontiers in Genetics
197 papers in training set
Top 0.5%
8.1%
2
Scientific Reports
3102 papers in training set
Top 14%
6.8%
3
Computational and Structural Biotechnology Journal
216 papers in training set
Top 0.4%
6.8%
4
Nature Communications
4913 papers in training set
Top 29%
6.3%
5
Human Genomics
21 papers in training set
Top 0.1%
6.3%
6
Cell Genomics
162 papers in training set
Top 0.8%
4.8%
7
Genomics
60 papers in training set
Top 0.2%
3.9%
8
Advanced Science
249 papers in training set
Top 6%
3.6%
9
Communications Biology
886 papers in training set
Top 3%
2.7%
10
Science Advances
1098 papers in training set
Top 10%
2.6%
50% of probability mass above
11
iScience
1063 papers in training set
Top 9%
2.3%
12
PLOS Genetics
756 papers in training set
Top 7%
2.1%
13
eLife
5422 papers in training set
Top 36%
2.1%
14
BMC Medical Genomics
36 papers in training set
Top 0.4%
1.9%
15
Human Genetics and Genomics Advances
70 papers in training set
Top 0.3%
1.7%
16
Nucleic Acids Research
1128 papers in training set
Top 11%
1.7%
17
Human Molecular Genetics
130 papers in training set
Top 2%
1.5%
18
Briefings in Bioinformatics
326 papers in training set
Top 4%
1.5%
19
Journal of Translational Medicine
46 papers in training set
Top 1%
1.5%
20
International Journal of Biological Macromolecules
65 papers in training set
Top 2%
1.2%
21
PLOS Computational Biology
1633 papers in training set
Top 20%
1.2%
22
Journal of Personalized Medicine
28 papers in training set
Top 0.6%
1.2%
23
International Journal of Molecular Sciences
453 papers in training set
Top 11%
1.2%
24
PLOS ONE
4510 papers in training set
Top 62%
1.1%
25
BMC Genomics
328 papers in training set
Top 4%
0.9%
26
Journal of Genetics and Genomics
36 papers in training set
Top 2%
0.9%
27
Journal of Advanced Research
15 papers in training set
Top 0.6%
0.9%
28
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
29
Bioinformatics
1061 papers in training set
Top 9%
0.8%
30
PNAS Nexus
147 papers in training set
Top 2%
0.7%