Back

Pan-genome Analysis in Sorghum Highlights the Extent of Genomic Variation and Sugarcane Aphid Resistance Genes

Wang, B.; Jiao, Y.; Chougule, K.; Olson, A.; Huang, J.; Llaca, V.; Fengler, K.; Wei, X.; Wang, L.; Wang, X.; Regulski, M.; Drenkow, J.; Gingeras, T. R.; Hayes, C.; Armstrong, J. S.; Huang, Y.; Xin, Z.; Ware, D.

2021-01-04 genomics
10.1101/2021.01.03.424980 bioRxiv
Show abstract

Sorghum bicolor, one of the most important grass crops around the world, harbors a high degree of genetic diversity. We constructed chromosome-level genome assemblies for two important sorghum inbred lines, Tx2783 and RTx436. The final high-quality reference assemblies consist of 19 and 18 scaffolds, respectively, with contig N50 values of 25.6 and 20.3 Mb. Genes were annotated using evidence-based and de novo gene predictors, and RAMPAGE data demonstrate that transcription start sites were effectively captured. Together with other public sorghum genomes, BTx623, RTx430, and Rio, extensive structural variations (SVs) of various sizes were characterized using Tx2783 as a reference. Genome-wide scanning for disease resistance (R) genes revealed high levels of diversity among these five sorghum accessions. To characterize sugarcane aphid (SCA) resistance in Tx2783, we mapped the resistance region on chromosome 6 using a recombinant inbred line (RIL) population and found a SV of 191 kb containing a cluster of R genes in Tx2783. Using Tx2783 as a backbone, along with the SVs, we constructed a pan-genome to support alignment of resequencing data from 62 sorghum accessions, and then identified core and dispensable genes using this population. This study provides the first overview of the extent of genomic structural variations and R genes in the sorghum population, and reveals potential targets for breeding of SCA resistance.

Matching journals

The top 3 journals account for 50% of the predicted probability mass.

1
Plant Biotechnology Journal
56 papers in training set
Top 0.1%
25.8%
2
The Plant Journal
197 papers in training set
Top 0.1%
14.7%
3
The Plant Genome
53 papers in training set
Top 0.1%
10.1%
50% of probability mass above
4
Plant Communications
35 papers in training set
Top 0.3%
4.3%
5
Frontiers in Plant Science
240 papers in training set
Top 2%
3.6%
6
Horticulture Research
43 papers in training set
Top 0.5%
3.6%
7
PLOS ONE
4510 papers in training set
Top 40%
3.6%
8
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 2%
2.7%
9
G3 Genes|Genomes|Genetics
351 papers in training set
Top 0.9%
2.6%
10
PLOS Genetics
756 papers in training set
Top 6%
2.4%
11
Scientific Reports
3102 papers in training set
Top 51%
2.1%
12
Nature Communications
4913 papers in training set
Top 48%
1.9%
13
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
14
Genetics
225 papers in training set
Top 2%
1.7%
15
BMC Plant Biology
47 papers in training set
Top 0.6%
1.2%
16
Plant Direct
81 papers in training set
Top 2%
0.9%
17
New Phytologist
309 papers in training set
Top 4%
0.9%
18
Communications Biology
886 papers in training set
Top 17%
0.9%
19
G3: Genes, Genomes, Genetics
222 papers in training set
Top 0.7%
0.9%
20
Journal of Experimental Botany
195 papers in training set
Top 2%
0.9%
21
Cell Genomics
162 papers in training set
Top 6%
0.7%
22
Genome Biology
555 papers in training set
Top 7%
0.7%
23
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 46%
0.7%
24
Science Advances
1098 papers in training set
Top 31%
0.7%
25
Molecular Plant
36 papers in training set
Top 1%
0.7%