Back

Investigating the Y chromosome in complex disease: Phenome-wide scan across 104,334 Finnish men

Preussner, A.; Leinonen, J. T.; FinnGen, ; Pirinen, M.; Tukiainen, T.

2026-06-10 genetic and genomic medicine
10.64898/2026.06.09.26355235 medRxiv
Show abstract

Although the Y chromosome represents roughly 2% of the male genome, it is often ignored in genome-wide association studies (GWAS). Subsequently, the potential health impacts of Y-chromosomal genetic variation remain incompletely understood. To fill this gap, we performed a phenome-wide association study (PheWAS) in FinnGen across 1,426 binary and quantitative traits using Y-chromosomal variation (frequency [&ge;] 1%) in 104,334 genotyped men. As Y chromosome variation is prone to population stratification, we performed carefully adjusted association analyses and further examined these through kin-based validation in 19,275 female and 24,712 male 1st degree relatives. We found 121 suggestive (p < 5.6x10-3) phenotypic associations in the Y chromosome, yet none of these were strong enough to reach phenome-wide significance (p < 3.9x10-6). While only 38 associations were supported in the kin-based validation, intriguingly we found support for a previously suggested link between haplogroup I1 and coronary heart disease (CHD; OR=1.06, 95%CI=1.02-1.11, p=3.7x10-3; male validation OR=1.05; female validation OR=0.97). The I1-CHD association was detected across distinct geographical areas within Finland and was independent from Loss of Y (LOY) and the autosomal risk to CHD, proposing a link between germline Y-chromosomal variation and heart disease risk. Overall, this study presents a comprehensive phenome-wide analysis of Y-chromosomal associations, highlighting the potential relevance of Y-chromosomal variation beyond sex determination. Our findings further emphasize the need for improved capture of Y-chromosomal variants and further analyses in biobank-scale data to allow for deeper exploration of male-specific genetic architecture of complex diseases.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
European Journal of Human Genetics
49 papers in training set
Top 0.1%
37.1%
2
The American Journal of Human Genetics
206 papers in training set
Top 0.3%
14.5%
50% of probability mass above
3
Scientific Reports
3102 papers in training set
Top 20%
6.2%
4
Human Genetics
25 papers in training set
Top 0.1%
6.2%
5
Human Molecular Genetics
130 papers in training set
Top 0.6%
3.9%
6
Genome Medicine
154 papers in training set
Top 2%
3.5%
7
Frontiers in Genetics
197 papers in training set
Top 2%
3.5%
8
Nature Communications
4913 papers in training set
Top 46%
2.4%
9
Genetic Epidemiology
46 papers in training set
Top 0.5%
1.7%
10
Nature Genetics
240 papers in training set
Top 5%
1.7%
11
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 0.8%
1.6%
12
PLOS Genetics
756 papers in training set
Top 10%
1.5%
13
Human Genomics
21 papers in training set
Top 0.1%
1.5%
14
Human Genetics and Genomics Advances
70 papers in training set
Top 0.5%
1.1%
15
PLOS ONE
4510 papers in training set
Top 66%
0.8%
16
npj Genomic Medicine
33 papers in training set
Top 0.8%
0.8%
17
eLife
5422 papers in training set
Top 56%
0.8%
18
International Journal of Epidemiology
74 papers in training set
Top 2%
0.8%
19
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
20
Life Science Alliance
263 papers in training set
Top 2%
0.7%
21
Medicine & Science in Sports & Exercise
15 papers in training set
Top 0.5%
0.7%
22
Atherosclerosis
29 papers in training set
Top 1%
0.7%
23
BMC Medical Genomics
36 papers in training set
Top 2%
0.7%
24
Genes
126 papers in training set
Top 4%
0.6%