Rice Annotation Project Database (RAP-DB): literature-curated gene annotation and integrated omics resources for rice functional genomics and molecular breeding
Kawahara, Y.; Kishikawa, T. H.; Hirata, R.; Wang, X.; Tamagaki, Y.; Kumagai, M.; Tabei, N.; Sakai, H.; Itoh, T.
Show abstract
High-throughput sequencing technologies have enabled the generation of high-quality reference genomes for numerous rice cultivars. However, inferring gene functions, associated phenotypes, and causal variants from these sequences remains challenging. The Rice Annotation Project Database (RAP-DB; https://rapdb.dna.affrc.go.jp) is a curated genomic resource that provides comprehensive gene annotations for the reference genome of Oryza sativa ssp. japonica cv. Nipponbare. Since its major update in 2013, gene models and functional annotations have been continuously revised through expert manual curation of newly published literature related to rice genes. As of March 2025, a total of 6,631 transcripts corresponding to 6,371 loci have been curated based on 4,699 peer-reviewed publications. These curated genes are functionally characterized and are frequently associated with agronomic traits, including yield components, stress tolerance, and disease resistance. To support molecular breeding, RAP-DB now provides a curated catalogue of 904 agronomically important loci, including gene symbols, functional descriptions, and associated traits, together with more than 1,000 functionally characterized alleles compiled from the literature. In addition to in-house expert curation, RAP-DB integrates community-curated datasets for major gene families, such as WRKY transcription factors, S-domain receptor-like kinases, and leucine-rich repeat-containing receptors, thereby expanding coverage of key regulatory and defense-related genes. RAP-DB also incorporates reanalyzed RNA sequencing expression profiles alongside microarray-based expression data and co-expression networks, offering gene-centric views of expression patterns across tissues, conditions, and developmental stages. Furthermore, RAP-DB is linked to genome-wide variation datasets from diverse rice varieties through the TASUKE+ genome browser, enabling exploration of allelic diversity across varieties. To enhance annotation quality and long-term sustainability, AI-assisted literature screening and a web-based feedback system have been introduced, allowing users to submit corrections to gene models and report newly characterized genes or relevant publications. Together, these developments strengthen RAP-DB as a primary, literature-based gene annotation resource and provide a practical foundation for molecular breeding in rice.
Matching journals
The top 4 journals account for 50% of the predicted probability mass.