Back

Identifying disease-causing mechanisms and fundamental biology of neuromuscular disorder genes through genomic feature analysis

Martin, A.; Llanes-Cuesta, M. A.; Hartley, J. N.; Frosk, P.; Drogemoller, B. I.; Wright, G. E. B.

2026-04-22 genetics
10.64898/2026.04.21.719902 bioRxiv
Show abstract

IntroductionNeuromuscular disorders (NMDs) encompass a broad group of conditions that primarily affect the peripheral nervous system. They are often caused by genetic alterations that impair skeletal muscle function and result in debilitating symptoms. Obtaining an accurate molecular diagnosis remains a challenge, potentially because variants in genes that have yet to be identified as causal. We therefore used advanced computational methods to study the genetic architecture of NMDs and to identify key features that distinguish NMD genes from other genes in the broader genome. MethodsCurated genes implicated in NMDs (n = 639; GeneTable of NMDs) were obtained and merged with a comprehensive set of genomic features for human autosomal protein-coding genes. Machine-learning-based feature selection and ranking were performed using Boruta, along with complementary analytical approaches. These analyses were used to identify the most important genic features (n = 134, subcategories: gene complexity, genetic variation, expression patterns, and other general gene traits) for discriminating NMD genes from other genes in the genome ResultsNMD genes exhibit enriched expression in disease-relevant tissues, including skeletal muscle and heart. Additionally, compared with other protein-coding genes, these genes exhibit increased transcriptomic complexity (e.g., longer transcripts and more unique isoforms), contain more short tandem repeats, and show greater variation in conservation across model organisms. ConclusionsThis study identified several key genomic features that may distinguish NMD genes from the rest of the genome. This may enhance the identification of novel causal genes and could ultimately facilitate earlier diagnosis and medical management for affected individuals.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Genetics in Medicine
69 papers in training set
Top 0.1%
17.3%
2
Human Molecular Genetics
130 papers in training set
Top 0.1%
14.2%
3
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.1%
10.0%
4
Human Mutation
29 papers in training set
Top 0.1%
6.3%
5
Annals of Clinical and Translational Neurology
29 papers in training set
Top 0.1%
6.3%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 32%
4.8%
7
Scientific Reports
3102 papers in training set
Top 28%
4.3%
8
Human Genetics and Genomics Advances
70 papers in training set
Top 0.1%
3.6%
9
Journal of Medical Genetics
28 papers in training set
Top 0.1%
3.6%
10
American Journal of Medical Genetics Part A
17 papers in training set
Top 0.1%
3.6%
11
Frontiers in Genetics
197 papers in training set
Top 4%
2.1%
12
European Journal of Human Genetics
49 papers in training set
Top 0.6%
1.8%
13
Human Genetics
25 papers in training set
Top 0.3%
1.2%
14
The American Journal of Human Genetics
206 papers in training set
Top 3%
0.9%
15
npj Genomic Medicine
33 papers in training set
Top 0.7%
0.9%
16
PLOS Genetics
756 papers in training set
Top 13%
0.9%
17
Disease Models & Mechanisms
119 papers in training set
Top 2%
0.8%
18
International Journal of Molecular Sciences
453 papers in training set
Top 17%
0.7%
19
eLife
5422 papers in training set
Top 60%
0.7%
20
BMC Bioinformatics
383 papers in training set
Top 7%
0.7%
21
Frontiers in Neurology
91 papers in training set
Top 5%
0.7%
22
Journal of Cachexia, Sarcopenia and Muscle
27 papers in training set
Top 0.4%
0.6%
23
BMC Methods
11 papers in training set
Top 0.2%
0.6%
24
Gigabyte
60 papers in training set
Top 2%
0.6%
25
Genes
126 papers in training set
Top 4%
0.6%
26
Neuropathology and Applied Neurobiology
14 papers in training set
Top 0.9%
0.6%