Fragaria orientalis FOR_r1.1 Assembly & Annotation

Overview

Analysis Name Fragaria orientalis FOR_r1.1 Assembly & Annotation
Sequencing technology Roche 454 GS FLX+ and Illumina GAIIx/Hiseq 1000 platforms
Assembly method Newbler 2.7, SOAPdenovo v1.05
Release Date 2015-09-09
Reference Publication(s)

Hirakawa H, Shirasawa K, Kosugi S, Tashiro K, Nakayama S, Yamada M, Kohara M, Watanabe A, Kishida Y, Fujishiro T, Tsuruoka H, Minami C, Sasamoto S, Kato M, Nanri K, Komaki A, Yanagi T, Guoxin Q, Maeda F, Ishikawa M, Kuhara S, Sato S, Tabata S, Isobe SN. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species. DNA Res. 2014;21(2):169-81. doi: 10.1093/dnares/dst049.

Abstract

Cultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis by de novo whole-genome sequencing on an Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb with the N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species.

Assembly statistics

Genome size (bp)214,184,023
Chromosomes, organellas and plasmids sequnece No.0
Genome sequence No.323,163
Scaffold sequence No.323,163
Scaffold L5085,403
Scaffold N50 (bp)722
Scaffold N75 (bp)465
Scaffold N90 (bp)358
Contig sequence No.323,675
Contig L5085,534
Contig N50 (bp)721
Unique components No.323,675
No. of regions with alternate loci or patches0
Ns gap sequence No. (bp)512
Ns gap sequence region length (bp)14,119
Gaps No. between scaffolds0
Assembly level Scaffold

Assembly

The Fragaria orientalis FOR_r1.1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Fragaria_orientalis_FOR_r1.1_scaffolds.fasta.gz

Gene Predictions

The Fragaria orientalis FOR_r1.1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Fragaria_orientalis_FOR_r1.1_gene.gff3.gz
CDS sequences (FASTA file) Fragaria_orientalis_FOR_r1.1_cds.fasta.gz
Protein sequences (FASTA file) Fragaria_orientalis_FOR_r1.1_pep.fasta.gz

Functional Analysis

Functional annotation for the Fragaria orientalis FOR_r1.1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Fragaria_orientalis_FOR_r1.1.Pfam.tsv.gz
© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences