Analysis Name | Fragaria iinumae FII_r1.1 Assembly & Annotation |
Sequencing technology | Roche 454 GS FLX+ and Illumina GAIIx/Hiseq 1000 platforms |
Assembly method | Newbler 2.7 (Roche Diagnostics) and SOAPdenovo v1.05 |
Release Date | 2015-09-09 |
Hirakawa H, Shirasawa K, Kosugi S, Tashiro K, Nakayama S, Yamada M, Kohara M, Watanabe A, Kishida Y, Fujishiro T, Tsuruoka H, Minami C, Sasamoto S, Kato M, Nanri K, Komaki A, Yanagi T, Guoxin Q, Maeda F, Ishikawa M, Kuhara S, Sato S, Tabata S, Isobe SN. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species. DNA Res. 2014;21(2):169-81. doi: 10.1093/dnares/dst049.
AbstractCultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis by de novo whole-genome sequencing on an Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb with the N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species.
Assembly statistics
Genome size (bp) | 199,627,509 |
Chromosomes, organellas and plasmids sequnece No. | 0 |
Genome sequence No. | 117,822 |
Scaffold sequence No. | 117,822 |
Scaffold L50 | 15,738 |
Scaffold N50 (bp) | 3,309 |
Scaffold N75 (bp) | 1,352 |
Scaffold N90 (bp) | 633 |
Contig sequence No. | 118,549 |
Contig L50 | 15,770 |
Contig N50 (bp) | 3,302 |
Unique components No. | 118,549 |
No. of regions with alternate loci or patches | 0 |
Ns gap sequence No. (bp) | 727 |
Ns gap sequence region length (bp) | 19,264 |
Gaps No. between scaffolds | 0 |
Assembly level | Scaffold |
The Fragaria iinumae FII_r1.1 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | Fragaria_iinumae_Genome_v1.0_FII_r1.1_scaffolds.fasta.gz |
The Fragaria iinumae FII_r1.1 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | Fragaria_iinumae_Genome_v1.0_FII_r1.1_gene.gff3.gz |
CDS sequences (FASTA file) | Fragaria_iinumae_Genome_v1.0_FII_r1.1_cds.fasta.gz |
Protein sequences (FASTA file) | Fragaria_iinumae_Genome_v1.0_FII_r1.1_pep.fasta.gz |
Functional annotation for the Fragaria iinumae FII_r1.1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | TSV |
Fragaria S genes Nucleotide
Fragaria S genes Protein