
Integrative Identification and Characterization of PCOS-Associated lncRNAs From the Interface of Genetic Association, Transcriptomics, and Gene Structure Evolution

He, Z.; Li, Y.; Shkurat, T. P.; Butenko, E. V.; Derevyanchuk, E. G.; Lomteva, S. V.; Chen, L.; Lipovich, L.

2026-04-02 genomics
10.64898/2026.03.31.715548 bioRxiv

Background: Polycystic ovary syndrome (PCOS) is a prevalent endocrine disorder and a leading cause of female infertility, with complex genetic, metabolic, and hormonal etiologies. Long non-coding RNAs (lncRNAs) have emerged as important regulators of diverse biological processes, yet their roles in PCOS remain underexplored. Here, we identified and characterized PCOS differentially expressed gene-associated lncRNAs (PDEGAL) with an integrative approach combining expression data, genetic association, and evolutionary analysis.

Methods: Thirty-three PCOS-associated protein-coding genes were obtained from our prior study, and all their nearby and overlapping lncRNAs were annotated. These candidates were analyzed using UCSC Genome Browser-mapped annotations and datasets, including NCBI RefSeq, GENCODE, GTEx, GWAS SNPs, and conservation, as well as the FANTOM5 cap analysis of gene expression (CAGE) promoter data, to assess their expression, regulatory potential, genetic variant overlaps, and evolutionary conservation.

Results: Twenty-three PDEGALs (18 antisense to, and 5 sharing bidirectional promoters with, known PCOS-associated protein-coding genes) were identified. 17 PDEGALs contained GWAS SNPs with statistically significant disease associations, 9 of which were associated with PCOS-related traits. 5 PDEGALs demonstrated expression in the KGN granulosa cell model of PCOS. Key gene structure element (KGSE) analysis revealed that most PDEGALs are primate-specific. Integrating four criteria (GTEx expression, GWAS SNPs, FANTOM promoterome, and KGSE conservation) highlighted HELLPAR as the only lncRNA fulfilling all four, while five others (PGR-AS1, MTOR-AS1, ENSG00000265179, ENSG00000256218, and LOC105377276) fulfilled three of the four criteria.

Conclusions: We have systematically identified candidate PCOS regulatory lncRNAs with convergent genetic, expression, and evolutionary evidence. These results provide a framework for functional validation and highlight lncRNAs as potential biomarkers and therapeutic targets in PCOS that function by regulating their nearby and overlapping protein-coding genes.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

Rank | Journal | Papers in training set | Percentile | Probability
1 | Frontiers in Genetics | 197 | Top 0.2% | 10.5%
2 | Genomics, Proteomics & Bioinformatics | 171 | Top 0.8% | 8.3%
3 | eLife | 5422 | Top 13% | 6.4%
4 | Biology of Sex Differences | 29 | Top 0.1% | 6.4%
5 | Frontiers in Endocrinology | 53 | Top 0.4% | 4.3%
6 | Endocrinology | 38 | Top 0.1% | 4.0%
7 | Genes | 126 | Top 0.2% | 4.0%
8 | Frontiers in Cell and Developmental Biology | 218 | Top 2% | 3.6%
9 | Scientific Reports | 3102 | Top 40% | 3.3%
(50% of probability mass above this line)
10 | Genomics | 60 | Top 0.4% | 3.3%
11 | PLOS ONE | 4510 | Top 42% | 3.1%
12 | Computational and Structural Biotechnology Journal | 216 | Top 3% | 2.1%
13 | Physiological Genomics | 15 | Top 0.1% | 2.1%
14 | Molecular Human Reproduction | 11 | Top 0.1% | 1.7%
15 | Cell Genomics | 162 | Top 3% | 1.7%
16 | The Journal of Clinical Endocrinology & Metabolism | 35 | Top 0.8% | 1.5%
17 | Journal of Clinical Medicine | 91 | Top 4% | 1.5%
18 | Journal of Genetics and Genomics | 36 | Top 1% | 1.5%
19 | BMC Genomics | 328 | Top 3% | 1.3%
20 | Gene | 41 | Top 1% | 1.1%
21 | International Journal of Molecular Sciences | 453 | Top 12% | 1.0%
22 | Molecular Genetics and Genomics | 11 | Top 0.3% | 0.9%
23 | BMC Bioinformatics | 383 | Top 6% | 0.9%
24 | Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease | 25 | Top 0.8% | 0.8%
25 | PeerJ | 261 | Top 15% | 0.8%
26 | Bioinformatics | 1061 | Top 9% | 0.8%
27 | iScience | 1063 | Top 32% | 0.8%
28 | PLOS Genetics | 756 | Top 16% | 0.7%
29 | Journal of Personalized Medicine | 28 | Top 1% | 0.7%
30 | Journal of Translational Medicine | 46 | Top 3% | 0.6%
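
The statement that the top 9 journals account for 50% of the predicted probability mass can be checked with a running sum over the listed probabilities. The following is a minimal sketch, assuming the displayed percentages (rounded to one decimal place) are close enough to the underlying values; the function name `journals_to_reach` and the `threshold` parameter are hypothetical and not part of the page.

```python
from itertools import accumulate

# Displayed probabilities (percent) for ranks 1-12, copied from the ranking.
# Assumption: these rounded values approximate the model's actual outputs.
probabilities = [10.5, 8.3, 6.4, 6.4, 4.3, 4.0, 4.0, 3.6, 3.3, 3.3, 3.1, 2.1]


def journals_to_reach(probs, threshold=50.0):
    """Return (rank, cumulative_percent) at which the running sum first
    reaches the threshold, or None if the threshold is never reached."""
    for rank, total in enumerate(accumulate(probs), start=1):
        if total >= threshold:
            return rank, total
    return None


result = journals_to_reach(probabilities)
if result is not None:
    rank, total = result
    print(f"Top {rank} journals cover {total:.1f}% of the probability mass")
```

With the listed values, the running sum is 47.5% after eight journals and crosses 50% at the ninth, which agrees with the cutoff marked in the ranking.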