Back

A Course-Undergraduate Research Experience (CURE) to explore the effect of structural variants on gene expression in C. elegans balancers

Maroilley, T.; Barbosa, V. R. A.; Mascarenhas, R.; Ferris, S.; Diao, C.; AlAwadhi, F.; Aldakheel, S.; Ali, A.; Alkanderi, D.; Alshatti, M.; Alsuwaileh, S.; Asghar, K.; Bui, R.; Chai, B.; Dsouza, L.; Nezhad, P. E.; Garcia-Volk, E.; Haq, Z.; Hossain, S.; Johnson, G.; Kotikalapudi, N.; Lalani, I.; Lenz, C.; Louie, T.; Moore, S.; Patel, S.; Prasai, S.; Qureshi, R.; Rahmani, F.; Shakir, B.; Ahamed, S. S.; Tran, H. A.; Waziha, R.; Wood, C. M.; Zbinden, S.; Anderson, D.; Tarailo-Graovac, M.

2026-01-23 bioinformatics
10.64898/2026.01.21.700799 bioRxiv
Show abstract

Bioinformatics, a discipline at the crossroads of Biology and Computational Sciences, also referred to as Computational Biology, is nowadays widely spread in research programs. However, implementing any Bioinformatics projects requires the ability to comprehend biological concepts and apply computational approaches, and rare are the undergraduate programs offering such multi-disciplinary training. In addition, understanding the dynamic between Biology research projects and Bioinformatics analyses is challenging with no real-life experience. Course-based undergraduate research experience (CURE) courses are innovative programs that allow more students to acquire research experience and provide the perfect setting to introduce students to applied bioinformatics. As a part of the Bachelor of Health Sciences of the Cumming School of Medicine at the University of Calgary (Canada), a CURE applied bioinformatics was implemented in the Winter of 2023 to 2025. Students investigated the effect of structural variants (SVs, genetic variants larger than 50 bp) on gene expression in the model organism Caenorhabditis elegans (a hermaphrodite 1-mm long roundworm). The students detected and characterized SVs by analyzing genome and transcriptome sequencing data of C. elegans strains called balancers, as they are known to carry large genomic variations balancing regions of the genome by limiting recombination and allowing maintenance of lethal mutations. They used Galaxy, a public web-based supercomputing resource, but also a local High-Performance computing system, and R, to report different effects of SVs on gene expression and splicing. Students research explained the molecular mechanism behind the uncoordinated phenotype caused by the reciprocal translocation eT1(III;V) and uncovered unexpected effects on gene expression on an understudied gene. We evaluated the courses impact on student learning journeys and showed that the CURE favored students understanding of the Bioinformatics field and fostered their research interest. We provide here guidelines to facilitate the CURE implementations to improve access for undergraduate students to bioinformatics research experiences.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
BMC Bioinformatics
383 papers in training set
Top 0.3%
18.6%
2
PLOS Computational Biology
1633 papers in training set
Top 4%
8.4%
3
GigaScience
172 papers in training set
Top 0.2%
6.3%
4
Frontiers in Genetics
197 papers in training set
Top 2%
4.0%
5
Briefings in Bioinformatics
326 papers in training set
Top 2%
4.0%
6
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.6%
7
Nucleic Acids Research
1128 papers in training set
Top 7%
3.1%
8
F1000Research
79 papers in training set
Top 0.9%
2.4%
50% of probability mass above
9
Bioinformatics
1061 papers in training set
Top 6%
2.1%
10
iScience
1063 papers in training set
Top 10%
2.1%
11
PeerJ
261 papers in training set
Top 5%
2.1%
12
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 3%
2.1%
13
PLOS ONE
4510 papers in training set
Top 50%
1.9%
14
Frontiers in Molecular Biosciences
100 papers in training set
Top 1%
1.8%
15
Bioinformatics Advances
184 papers in training set
Top 3%
1.8%
16
Scientific Reports
3102 papers in training set
Top 58%
1.7%
17
Quantitative Biology
11 papers in training set
Top 0.3%
1.7%
18
BMC Genomics
328 papers in training set
Top 3%
1.3%
19
Journal of Bioinformatics and Systems Biology
14 papers in training set
Top 0.3%
1.3%
20
eLife
5422 papers in training set
Top 47%
1.3%
21
BioData Mining
15 papers in training set
Top 0.5%
1.2%
22
Biology
43 papers in training set
Top 2%
0.9%
23
NAR Genomics and Bioinformatics
214 papers in training set
Top 3%
0.8%
24
G3 Genes|Genomes|Genetics
351 papers in training set
Top 2%
0.8%
25
Patterns
70 papers in training set
Top 2%
0.8%
26
Genes
126 papers in training set
Top 3%
0.7%
27
Database
51 papers in training set
Top 1.0%
0.7%
28
eneuro
389 papers in training set
Top 10%
0.7%
29
Molecular Plant
36 papers in training set
Top 2%
0.6%
30
Peer Community Journal
254 papers in training set
Top 4%
0.6%