Back

Chromosome-scale scaffolding of the fungus gnat genome (Diptera: Bradysia coprophila)

Urban, J. M.; Gerbi, S. A.; Spradling, A. C.

2022-11-04 genomics
10.1101/2022.11.03.515061 bioRxiv
Show abstract

BackgroundThe fungus gnat, Bradysia (Sciara) coprophila, has compelling chromosome biology. Paternal chromosomes are eliminated during spermatogenesis whereas both maternal X sister chromatids are retained. Embryos start with three copies of the X chromosome, but 1-2 copies are eliminated from somatic cells as part of sex determination, and one is eliminated in the germline to restore diploidy. These developmentally normal events present opportunities to study chromosome movements that are unusual in other systems. To support such studies, we previously generated a highly contiguous optical-map-scaffolded long-read assembly (Bcop_v1) of the male somatic genome. However, the scaffolds were not chromosome-scale, the majority of the assembly lacked chromosome assignments, and the order and orientation of the contigs along chromosomes remained unknown. FindingsMale pupae Hi-C data was used to correct, order, and orient the contigs from Bcop_v1 into chromosome-scale scaffolds, producing the updated assembly, Bcop_v2. Several orthogonal analyses allowed us to (i) identify the corresponding chromosome for each scaffold, (ii) orient them with respect to polytene maps, and (iii) determine that they were highly concordant with the chromosomes they represent. Gene annotations produced for Bcop_v1 were lifted over to Bcop_v2. Chromosomal repeat distributions highlight a potential telomeric sequence. Finally, the Hi-C data shed new light on three "fold-back regions" seen to physically interact in images of polytene X chromosomes. ConclusionsStudies of the unusual chromosome movements in Bradysia coprophila will benefit from the updated assembly (Bcop_v2) where each somatic chromosome is represented by a single scaffold.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
G3
33 papers in training set
Top 0.1%
22.3%
2
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.1%
14.2%
3
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.3%
6.3%
4
BMC Biology
248 papers in training set
Top 0.1%
6.3%
5
Genetics
225 papers in training set
Top 1%
3.9%
50% of probability mass above
6
Gigabyte
60 papers in training set
Top 0.4%
2.9%
7
Molecular Ecology Resources
161 papers in training set
Top 0.4%
2.7%
8
Scientific Reports
3102 papers in training set
Top 51%
2.1%
9
Chromosome Research
18 papers in training set
Top 0.1%
1.9%
10
GENETICS
189 papers in training set
Top 0.5%
1.9%
11
BMC Genomics
328 papers in training set
Top 3%
1.7%
12
Genome Biology and Evolution
280 papers in training set
Top 1%
1.7%
13
PLOS ONE
4510 papers in training set
Top 59%
1.3%
14
Journal of Heredity
35 papers in training set
Top 0.1%
1.2%
15
Frontiers in Marine Science
55 papers in training set
Top 0.8%
1.2%
16
Biology Open
130 papers in training set
Top 2%
1.1%
17
BMC Bioinformatics
383 papers in training set
Top 6%
1.1%
18
PLOS Genetics
756 papers in training set
Top 12%
1.1%
19
Frontiers in Genetics
197 papers in training set
Top 8%
0.9%
20
Frontiers in Fungal Biology
10 papers in training set
Top 0.1%
0.9%
21
Developmental Dynamics
50 papers in training set
Top 0.6%
0.9%
22
Bioinformatics
1061 papers in training set
Top 9%
0.9%
23
Nucleic Acids Research
1128 papers in training set
Top 16%
0.9%
24
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
25
Insects
36 papers in training set
Top 1%
0.7%
26
Insect Science
11 papers in training set
Top 0.2%
0.7%
27
PeerJ
261 papers in training set
Top 16%
0.7%
28
mBio
750 papers in training set
Top 13%
0.6%
29
PLOS Computational Biology
1633 papers in training set
Top 27%
0.6%
30
Insect Molecular Biology
19 papers in training set
Top 0.2%
0.6%