Oryza glaberrima CG14_v1 Assembly & Annotation

Overview

Analysis Name Oryza glaberrima CG14_v1 Assembly & Annotation
Sequencing technology PacBio
Assembly method CANU v. 1.5
Release Date 2021-03-10
Reference Publication(s)

Wang M, Yu Y, Haberer G, Marri PR, Fan C, Goicoechea JL, Zuccolo A, Song X, Kudrna D, Ammiraju JS, Cossu RM, Maldonado C, Chen J, Lee S, Sisneros N, de Baynast K, Golser W, Wissotski M, Kim W, Sanchez P, Ndjiondjop MN, Sanni K, Long M, Carney J, Panaud O, Wicker T, Machado CA, Chen M, Mayer KF, Rounsley S, Wing RA. The genome sequence of African rice (Oryza glaberrima) and evidence for independent domestication. Nat Genet. 2014 Sep;46(9):982-8. doi: 10.1038/ng.3044.

Abstract

The cultivation of rice in Africa dates back more than 3,000 years. Interestingly, African rice is not of the same origin as Asian rice (Oryza sativa L.) but rather is an entirely different species (i.e., Oryza glaberrima Steud.). Here we present a high-quality assembly and annotation of the O. glaberrima genome and detailed analyses of its evolutionary history of domestication and selection. Population genomics analyses of 20 O. glaberrima and 94 Oryza barthii accessions support the hypothesis that O. glaberrima was domesticated in a single region along the Niger river as opposed to noncentric domestication events across Africa. We detected evidence for artificial selection at a genome-wide scale, as well as with a set of O. glaberrima genes orthologous to O. sativa genes that are known to be associated with domestication, thus indicating convergent yet independent selection of a common set of genes during two geographically and culturally distinct domestication processes.

Assembly statistics

Genome size347.3 Mb
Total ungapped length347.3 Mb
Gaps between scaffolds13
Number of chromosomes12
Number of scaffolds72
Scaffold N5015.2 Mb
Scaffold L5010
Number of contigs72
Contig N5015.2 Mb
Contig L5010
GC percent43
Genome coverage130.0x
Assembly levelChromosome

Assembly

The Oryza glaberrima CG14_v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_000147395.3_OglaRS2_genomic.fna.gz

Gene Predictions

The Oryza glaberrima CG14_v1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Oryza_glaberrima.maker.gff.gz
CDS sequences (FASTA file) Oryza_glaberrima.coding.fasta.gz
Protein sequences (FASTA file) Oryza_glaberrima.protein.fasta.gz

Functional Analysis

Functional annotation for the Oryza glaberrima CG14_v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Oryza_glaberrima.Pfam.tsv.gz

S genes

Summary

QueryChromosomeSize(bp)CoordinatestBLASTn HittBLASTn %IDDomain
DUF247I-SΨCM029712.1262740455165649-5165798LpSDUF247-I_chromosome167DUF247
DUF247II-SΨCM029712.1262740455153275-5153913LpSDUF247-II_chromosome161DUF247
HPS10-SCM029712.1262740455156869-5156971,
5157074-5157210
LpsS_contig1102938-
DUF247I-ZΨCM029711.13157283429134773-29135126Olongistaminata62DUF247
DUF247II-ZΨCM029711.13157283429145676-29146596AlongiglumisDUF247II-Z75DUF247
HPS10-ZCM029711.13157283429143428-29143525,
29143655-29143775
Osativa64-

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences