Back

A gap-free, telomere-to-telomere genome assembly for the Caenorhabditis briggsae reference strain AF16

O'Connor, L. M.; Moya, N. D.; Jhaveri, N. S.; Tanny, R. E.; Khorshidian, A.; Lyu, H.; Chamberlin, H. M.; Baird, S. E.; Andersen, E. C.

2026-05-04 genomics
10.64898/2026.04.30.721887 bioRxiv
Show abstract

The nematode Caenorhabditis elegans was the first metazoan to have its genome completely sequenced and assembled. Since that time, researchers have continuously updated the reference genome and manually curated its approximately 20,000 genes. The closely related species, Caenorhabditis briggsae, has served as a comparative model because of its similar morphology, mode of reproduction, and patterns of intra-species genetic variation. However, the genomic resources for C. briggsae lag behind C. elegans, hindering comparative genomics studies between the species. Decades of experimentation have been performed in the AF16 reference strain genetic background, so we obtained high-coverage long-read sequencing and high-throughput chromosome conformation capture data to create an updated reference genome for an isogenic derivative of AF16, named CGC2. The CGC2 genome is vastly improved relative to the existing AF16 assemblies, with no unplaced sequence, no gaps, and telomere-to-telomere contiguity. To provide genomic resources for CGC2, we exploited deep RNA-seq libraries from all developmental stages to predict protein-coding gene annotations comparable in accuracy and completeness to the existing AF16 gene models. We also performed lift-over of 108 validated insertion-deletion variants to the updated coordinate system of the CGC2 genome to facilitate future mappings of mutations. In summary, we present an updated reference genome for the canonical AF16 reference strain with improved genomic resources to enable high-quality intra- and inter-species comparative studies.

Matching journals

The top 8 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 0.9%
14.3%
2
Nature Communications
4913 papers in training set
Top 26%
7.0%
3
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 0.9%
6.6%
4
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.4%
6.1%
5
BMC Genomics
328 papers in training set
Top 0.3%
6.1%
6
Genome Research
409 papers in training set
Top 0.6%
4.7%
7
Scientific Data
174 papers in training set
Top 0.4%
4.2%
8
Scientific Reports
3102 papers in training set
Top 29%
4.2%
50% of probability mass above
9
BMC Biology
248 papers in training set
Top 0.2%
3.8%
10
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.2%
3.5%
11
DNA Research
23 papers in training set
Top 0.1%
3.5%
12
PLOS Genetics
756 papers in training set
Top 5%
3.5%
13
Cell Genomics
162 papers in training set
Top 2%
3.0%
14
Frontiers in Genetics
197 papers in training set
Top 4%
2.0%
15
Genome Biology
555 papers in training set
Top 4%
2.0%
16
Communications Biology
886 papers in training set
Top 10%
1.6%
17
Genome Biology and Evolution
280 papers in training set
Top 1%
1.6%
18
Gigabyte
60 papers in training set
Top 0.8%
1.4%
19
GigaScience
172 papers in training set
Top 2%
1.2%
20
Genetics
225 papers in training set
Top 3%
1.2%
21
eLife
5422 papers in training set
Top 54%
0.9%
22
PLOS Biology
408 papers in training set
Top 17%
0.9%
23
Developmental Dynamics
50 papers in training set
Top 0.7%
0.8%
24
mBio
750 papers in training set
Top 11%
0.8%
25
NAR Genomics and Bioinformatics
214 papers in training set
Top 4%
0.8%
26
Genomics
60 papers in training set
Top 2%
0.8%
27
Journal of Genetics and Genomics
36 papers in training set
Top 2%
0.7%
28
Current Biology
596 papers in training set
Top 14%
0.7%
29
Science
429 papers in training set
Top 21%
0.7%
30
PLOS ONE
4510 papers in training set
Top 70%
0.7%