A near chromosome-scale genome assembly of the Common pine sawfly (Diprion pini, Linnaeus, 1758)

Wutke, S.; Michell, C.; Lindstedt, C.

2026-03-21 genomics
10.64898/2026.03.19.712881 bioRxiv
The common pine sawfly, Diprion pini, is a widespread defoliator of pine forests across Europe and Asia, with outbreaks causing substantial ecological and economic damage. However, genomic resources for this species have been limited, hindering advances in molecular ecology and pest management. Here, we present a near chromosome-level reference genome for D. pini, generated using PacBio HiFi reads, Oxford Nanopore MinION long reads, and 10x Genomics linked reads. The final assembly is organized into mostly chromosome-sized scaffolds. It spans 268 Mb, comprises 81 scaffolds, and has a scaffold N50 of 18.7 Mb. BUSCO analysis (hymenoptera_odb10) indicates a high genome completeness of 97.2%. At 22.7 kb, the mitochondrial genome is unusually large owing to an extended non-coding control region (6,874 bp). Gene prediction identified 26,335 protein-coding genes, of which 12,769 were functionally annotated. Comparative analyses with other sawflies and Apocrita identified 2,472 proteins unique to D. pini, some of which are putatively associated with the processing of plant secondary metabolites. Notably, our genome assembly highlights that, when a closely related, high-quality reference genome is available, chromosome-scale assemblies can be generated without the need for Hi-C sequencing. The genome provides a valuable foundation for the development of improved monitoring and management strategies for D. pini outbreaks and contributes to advancing fundamental research on Hymenoptera evolution.
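For readers unfamiliar with the scaffold N50 statistic quoted in the abstract, the following is a minimal sketch of how it is conventionally computed: sort the scaffolds from longest to shortest and report the length of the scaffold at which the running total first reaches half of the assembly size. The function name `n50` and the toy scaffold lengths are assumptions made for illustration; they are not the D. pini scaffold lengths, which are not given here.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    The N50 is the length of the sequence at which the running total,
    taken from longest to shortest, first reaches half of the total length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical scaffold lengths in Mb (illustrative only, not the real assembly).
toy_scaffolds = [30, 25, 20, 15, 5, 3, 2]
print(n50(toy_scaffolds))  # prints 25: 30 + 25 = 55 reaches half of 100
```

A high N50 relative to the assembly size, as reported in the abstract, means that most of the sequence sits in a small number of long scaffolds.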

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1. Nature Communications (4913 papers in training set; Top 7%): 18.0%
2. Scientific Data (174 papers in training set; Top 0.2%): 9.8%
3. The Plant Journal (197 papers in training set; Top 0.6%): 7.0%
4. New Phytologist (309 papers in training set; Top 1.0%): 6.6%
5. Molecular Ecology Resources (161 papers in training set; Top 0.2%): 6.1%
6. Scientific Reports (3102 papers in training set; Top 25%): 4.7%
50% of probability mass above
7. DNA Research (23 papers in training set; Top 0.1%): 3.8%
8. Communications Biology (886 papers in training set; Top 2%): 3.6%
9. Horticulture Research (43 papers in training set; Top 0.7%): 3.0%
10. Frontiers in Plant Science (240 papers in training set; Top 3%): 2.5%
11. BMC Biology (248 papers in training set; Top 0.5%): 2.5%
12. Plant Biotechnology Journal (56 papers in training set; Top 0.5%): 2.0%
13. Molecular Ecology (304 papers in training set; Top 3%): 1.6%
14. Peer Community Journal (254 papers in training set; Top 2%): 1.6%
15. BMC Genomics (328 papers in training set; Top 3%): 1.6%
16. Genomics, Proteomics & Bioinformatics (171 papers in training set; Top 4%): 1.3%
17. Proceedings of the National Academy of Sciences (2130 papers in training set; Top 37%): 1.3%
18. Gigabyte (60 papers in training set; Top 0.9%): 1.3%
19. Current Biology (596 papers in training set; Top 11%): 1.2%
20. Microbiology Resource Announcements (22 papers in training set; Top 0.6%): 1.2%
21. PLOS ONE (4510 papers in training set; Top 62%): 1.1%
22. PLOS Genetics (756 papers in training set; Top 13%): 0.9%
23. NAR Genomics and Bioinformatics (214 papers in training set; Top 4%): 0.7%
24. Science Advances (1098 papers in training set; Top 31%): 0.7%
25. PLANTS, PEOPLE, PLANET (21 papers in training set; Top 0.8%): 0.7%
26. GigaScience (172 papers in training set; Top 4%): 0.6%
27. Science (429 papers in training set; Top 22%): 0.6%
28. G3: Genes, Genomes, Genetics (222 papers in training set; Top 1%): 0.6%
29. International Journal of Biological Macromolecules (65 papers in training set; Top 4%): 0.6%
30. Nature Genetics (240 papers in training set; Top 9%): 0.6%
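
The statement that the top 6 journals account for 50% of the predicted probability mass can be checked directly from the listed percentages. The sketch below accumulates the probabilities of the ten highest-ranked journals, copied from the list above, and reports the first rank at which the running total reaches the threshold. The helper name `journals_to_reach` is an assumption introduced for illustration and is not part of the site.

```python
# Predicted probabilities (percent) for ranks 1-10, copied from the list above.
probabilities = [18.0, 9.8, 7.0, 6.6, 6.1, 4.7, 3.8, 3.6, 3.0, 2.5]


def journals_to_reach(probs, threshold):
    """Return (rank, cumulative mass) at which the running total first reaches threshold.

    Returns None if the threshold is never reached.
    """
    running = 0.0
    for rank, p in enumerate(probs, start=1):
        running += p
        if running >= threshold:
            return rank, running
    return None


rank, mass = journals_to_reach(probabilities, 50.0)
print(rank, round(mass, 1))  # prints: 6 52.2
```

The first five journals sum to 47.5%, so the sixth (Scientific Reports, 4.7%) is the one that carries the total past 50%.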