Capturing India's phenotypic diversity: Health insights from the GenomeIndia project
Mondal, D.; Bhattacharyya, C.; Shekhawat, D. S.; Tada, N. G.; Rajial, T.; Parameswaran, A. S.; Jena, D.; Datta, S.; Swain, M.; Jena, S.; Mishra, A.; Mahapatra, S.; Sathi, S. N.; Alam, M.; Ali, A.; Choudhury, P.; Ghosh, P.; Tripathi, D.; Anilkumar, S.; Ashwath, D.; Chithimmaiah, M.; Hameed, S. K. S.; Gunasegaran, R.; Singh, N.; Mala, G.; De, T.; Reza, S.; Mukherjee, A.; Prajapati, B.; Dave, B.; Yumnam, S.; Vimi, K.; Sharma, G. N.; Malik, A.; Sarma, R. J.; Vanlallawma, A.; Samartha, D. K.; G, T. S.; Kavya, P. V.; Deshpande, S.; GenomeIndia Consortium, ; Singh, K.; Sharma, P.; Raghav, S. K.; Pra
Show abstract
Background India represents 18% of the global population yet remains underrepresented in health research. Moreover, existing national surveys miss critical variation across its 4,600 ethnolinguistic groups. We present a comprehensive phenotypic characterisation of 81 populations from the GenomeIndia project. Methods We analysed 67 sociodemographic, anthropometric, and blood biochemistry variables from 17,777 individuals sampled across 81 ethnolinguistic populations from India, examining population-level variation, disease reporting fractions, and age- and sex-specific life-course trends. Findings Ethnolinguistic identity predicted health outcomes independently of administrative state, improving phenotypic variance explained by an average of 7.4%. 95% of participants had at least one abnormal biochemical or anthropometric marker, driven by low HDL (52.2%) and elevated triglycerides (43.6%). Metabolic risk, however, was highly stratified: adjusted prevalence for low HDL ranged four-fold across ancestry groups from 17.2% to 67.7%. We also identified an "awareness gap"; only 17.6% of people with hypertension and 2.2% of people with dyslipidemia were aware of their condition. This awareness gap was higher in tribal populations, in which women did not show the higher HDL levels typically seen compared to men, pointing to distinct metabolic profiles and healthcare access barriers across India. Interpretation The Indian phenotypic landscape is highly structured along ethnolinguistic lines, where ancestry and environment both influence risk. The high systemic burden of abnormalities necessitates population-specific reference intervals. GenomeIndia provides a foundational map for precision public health, shifting the focus from state-level averages to population-specific risk profiles. Funding This work was funded by the Department of Biotechnology, Ministry of Science and Technology, Government of India.
Matching journals
The top 6 journals account for 50% of the predicted probability mass.