Brassica carinata PGLv1 Assembly & Annotation

Overview

Analysis Name: Brassica carinata PGLv1 Assembly & Annotation
Sequencing technology: PacBio, Illumina, Hi-C
Assembly method: Canu v2.0
Release Date: 2022-08-12
Reference Publication(s):

Yim WC, Swain ML, Ma D, An H, Bird KA, Curdie DD, Wang S, Ham HD, Luzuriaga-Neira A, Kirkwood JS, Hur M, Solomon JKQ, Harper JF, Kosma DK, Alvarez-Ponce D, Cushman JC, Edger PP, Mason AS, Pires JC, Tang H, Zhang X. The final piece of the Triangle of U: Evolution of the tetraploid Brassica carinata genome. Plant Cell. 2022 Oct 27;34(11):4143-4172. doi: 10.1093/plcell/koac249.

Abstract

Ethiopian mustard (Brassica carinata) is an ancient crop with remarkable stress resilience and a desirable seed fatty acid profile for biofuel uses. Brassica carinata is one of six Brassica species that share three major genomes from three diploid species (AA, BB, and CC) that spontaneously hybridized in a pairwise manner to form three allotetraploid species (AABB, AACC, and BBCC). Of the genomes of these species, that of B. carinata is the least understood. Here, we report a chromosome-scale 1.31-Gbp genome assembly with 156.9-fold sequencing coverage for B. carinata, completing the reference genomes comprising the classic Triangle of U, a classical theory of the evolutionary relationships among these six species. Our assembly provides insights into the hybridization event that led to the current B. carinata genome and the genomic features that gave rise to the superior agronomic traits of B. carinata. Notably, we identified an expansion of transcription factor networks and agronomically important gene families. Completion of the Triangle of U comparative genomics platform has allowed us to examine the dynamics of polyploid evolution and the role of subgenome dominance in the domestication and continuing agronomic improvement of B. carinata and other Brassica species.

Assembly

The Brassica carinata PGLv1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file): Brassica_carinata.fna.gz
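
As a quick sanity check after downloading, the sequence lengths can be tallied directly from the compressed FASTA file. The following is a minimal sketch using only the Python standard library. The function names `read_fasta` and `sequence_lengths` are illustrative and are not part of any tool distributed with this release, and the sketch assumes the download is an ordinary gzip-compressed FASTA file.

```python
import gzip
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from an iterable of FASTA lines.

    The name is the first whitespace-delimited token after '>'.
    """
    name = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            tokens = line[1:].split()
            name = tokens[0] if tokens else ""
            chunks = []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def sequence_lengths(path: str) -> Dict[str, int]:
    """Return {sequence name: length} for a gzip-compressed FASTA file."""
    with gzip.open(path, "rt") as handle:
        return {name: len(seq) for name, seq in read_fasta(handle)}
```

Calling `sequence_lengths("Brassica_carinata.fna.gz")` on the downloaded file returns a dictionary keyed by sequence name. Summing its values gives a total that can be compared against the 1.31-Gbp assembly size reported in the abstract.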

Gene Predictions

The Brassica carinata PGLv1 gene prediction files are available in GFF3 and FASTA formats.

Downloads

Genes (GFF3 file): Brassica_carinata.gff.gz
CDS sequences (FASTA file): Brassica_carinata.cds.fa.gz
Protein sequences (FASTA file): Brassica_carinata.protein.fa.gz
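
To get a first overview of the gene predictions, the feature types in the GFF3 file can be counted. The sketch below relies only on the general GFF3 layout (nine tab-separated columns, with the feature type in column 3, `#` comment lines, and an optional `##FASTA` section at the end). The function names are illustrative, and which feature types actually occur in this particular file is not stated on this page.

```python
import gzip
from collections import Counter
from typing import Iterable


def count_feature_types(lines: Iterable[str]) -> Counter:
    """Count GFF3 records by feature type (column 3)."""
    counts = Counter()
    for line in lines:
        if line.startswith("##FASTA"):
            break  # anything after this directive is sequence, not annotation
        if line.startswith("#") or not line.strip():
            continue  # skip comments, directives, and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9:
            continue  # ignore malformed records rather than guessing
        counts[fields[2]] += 1
    return counts


def count_feature_types_in_file(path: str) -> Counter:
    """Convenience wrapper for a gzip-compressed GFF3 file."""
    with gzip.open(path, "rt") as handle:
        return count_feature_types(handle)
```

For example, `count_feature_types_in_file("Brassica_carinata.gff.gz")` returns a `Counter` from which the number of records of a given type, such as `gene`, can be read, provided the file uses that type name.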

Functional Analysis

Functional annotation for the Brassica carinata PGLv1 genome is available for download below. The proteins were analyzed with InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan: Brassica_carinata_PGLv1.Pfam.tsv.gz
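
The domain table can be collapsed into one set of Pfam accessions per protein. The sketch below assumes the file follows the standard InterProScan TSV layout, in which column 1 is the protein accession, column 4 the analysis name, and column 5 the signature accession. This page does not confirm the column layout of the download, so the indices may need adjusting. The function name `pfam_domains_by_protein` is illustrative.

```python
import gzip
from collections import defaultdict
from typing import Dict, Iterable, Set


def pfam_domains_by_protein(lines: Iterable[str]) -> Dict[str, Set[str]]:
    """Map each protein accession to its set of Pfam signature accessions.

    Assumes standard InterProScan TSV columns:
    0 = protein accession, 3 = analysis name, 4 = signature accession.
    """
    domains = defaultdict(set)
    for line in lines:
        if line.startswith("#"):
            continue  # tolerate comment lines if present
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue  # not a full annotation row
        protein, analysis, accession = fields[0], fields[3], fields[4]
        if analysis == "Pfam":
            domains[protein].add(accession)
    return dict(domains)


def pfam_domains_in_file(path: str) -> Dict[str, Set[str]]:
    """Convenience wrapper for a gzip-compressed TSV file."""
    with gzip.open(path, "rt") as handle:
        return pfam_domains_by_protein(handle)
```

Using a set per protein means that repeated hits of the same domain within one protein are counted once, which is usually what is wanted when comparing domain content between gene families.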

S genes

Summary

Query: SRK1
Chromosome: Bca_GomB07
Size (bp): 78904307
Coordinates: 18854762-18856109,18856191-18856301,18856410-18856573,18856654-18856864,18856962-18857199,18857278-18857428,18857516-18857809
BLASTp Hit: SRKb|AB052756.1_prot_BAB40987.1_1
BLASTp %ID: 32

Query: SCR1
Chromosome: Bca_GomB07
Size (bp): 78904307
Coordinates: 19480562-19480631,19480242-19480462
BLASTp Hit: XP_018438641.1
BLASTp %ID: 65

Query: SRK2
Chromosome: Bca_GomC06
Size (bp): 64278046
Coordinates: 52033016-52034333,52036552-52036698,52036802-52036983,52039167-52039377,52039469-52039706,52039803-52039953,52040027-52040344
BLASTp Hit: sp|Q09092|SRK6_BRAOV
BLASTp %ID: 65

Query: SCR2
Chromosome: Bca_GomC06
Size (bp): 64278046
Coordinates: 52017412-52017496,52017129-52017322
BLASTp Hit: BAD29945.1
BLASTp %ID: 80
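
The coordinate strings in the summary above can be turned into segment lists to check their combined lengths. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state explicitly. It also flags entries whose segments are listed in descending order, which may indicate a reverse-strand gene, although the summary gives no strand column. All function names are illustrative.

```python
from typing import Dict, List, Tuple

# Coordinate strings copied from the S genes summary.
S_GENE_COORDINATES: Dict[str, str] = {
    "SRK1": "18854762-18856109,18856191-18856301,18856410-18856573,"
            "18856654-18856864,18856962-18857199,18857278-18857428,"
            "18857516-18857809",
    "SCR1": "19480562-19480631,19480242-19480462",
    "SRK2": "52033016-52034333,52036552-52036698,52036802-52036983,"
            "52039167-52039377,52039469-52039706,52039803-52039953,"
            "52040027-52040344",
    "SCR2": "52017412-52017496,52017129-52017322",
}


def parse_segments(coordinates: str) -> List[Tuple[int, int]]:
    """Split 'start-end,start-end,...' into a list of (start, end) tuples."""
    segments = []
    for part in coordinates.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate trailing commas
        start, end = (int(value) for value in part.split("-"))
        segments.append((start, end))
    return segments


def total_length(segments: List[Tuple[int, int]]) -> int:
    """Sum segment lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


def listed_in_descending_order(segments: List[Tuple[int, int]]) -> bool:
    """True if each segment starts before (upstream of) the previous one."""
    return len(segments) > 1 and all(
        first[0] > second[0] for first, second in zip(segments, segments[1:])
    )


for gene, coordinates in S_GENE_COORDINATES.items():
    segments = parse_segments(coordinates)
    print(gene, len(segments), total_length(segments),
          listed_in_descending_order(segments))
```

Under the inclusive-coordinate assumption the four totals come out as 2517, 291, 2565, and 279 bp for SRK1, SCR1, SRK2, and SCR2 respectively. Each is a multiple of three, which is consistent with the segments being coding exons. Only SCR1 and SCR2 list their segments in descending order.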


© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences