Fragaria nilgerrensis Genome v1.0 Assembly & Annotation

Overview

Analysis Name Fragaria nilgerrensis Genome v1.0 Assembly & Annotation
Sequencing technology PacBio, Illumina
Assembly method Falcon and Canu
Release Date 2020-07-01
Reference Publication(s)

Zhang J, Lei Y, Wang B, Li S, Yu S, Wang Y, Li H, Liu Y, Ma Y, Dai H, Wang J, Zhang Z. The high-quality genome of diploid strawberry (Fragaria nilgerrensis) provides new insights into anthocyanin accumulation. Plant Biotechnol J. 2020 Sep;18(9):1908-1924. doi: 10.1111/pbi.13351.

Abstract

Fragaria nilgerrensis is a wild diploid strawberry species endemic to East and Southeast Asia that provides a rich source of genetic variation for strawberry improvement. Here, we present a chromosome-scale assembly of F. nilgerrensis using single-molecule real-time (SMRT) Pacific Biosciences sequencing and chromosome conformation capture (Hi-C) genome scaffolding. The genome assembly size was 270.3 Mb, with a contig N50 of ∼8.5 Mb. A total of 28 780 genes and 117.2 Mb of transposable elements were annotated for this genome. Next, a detailed comparative genomic analysis against the high-quality F. vesca reference genome was conducted to identify differences in transposable elements, SNPs, indels and other variants. The F. nilgerrensis genome is around 50 Mb larger than that of F. vesca, mainly due to the expansion of transposable elements. In comparison with the F. vesca genome, we identified 4 561 825 SNPs, 846 301 indels, 4243 inversions, 35 498 translocations and 10 099 relocations. We also found a marked expansion of genes involved in phenylpropanoid biosynthesis, starch and sucrose metabolism, cyanoamino acid metabolism, plant–pathogen interaction, brassinosteroid biosynthesis and plant hormone signal transduction in F. nilgerrensis, which may account for its specific phenotypes and considerable environmental adaptability. Interestingly, we found that sequence variations in the upstream regulatory region of FnMYB10, a core transcriptional activator of anthocyanin biosynthesis, resulted in a low expression level of the FnMYB10 gene, which is likely responsible for the white fruit phenotype of F. nilgerrensis. The high-quality F. nilgerrensis genome will be a valuable resource for biological research and comparative genomics research.

Assembly statistics

Total assembly size 270.3 Mb
Assembly % of genome 97.91
Repeats region % of assembly 43.40
Predicted gene models 28 780
Number of contigs 430
Contig N50 8.51 Mb
Number of scaffolds 257
Scaffold N50 38.3 Mb
Assembly level Scaffold
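
For readers unfamiliar with the N50 values quoted above, the sketch below shows the commonly used definition: the length of the sequence at which the running total, counted from the longest sequence down, first reaches half of the total length. The function name `n50` and the toy lengths are illustrative only; this is not necessarily the exact tool or convention used to produce the figures in this record.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Uses the common definition: sort lengths from longest to shortest and
    return the length at which the cumulative sum first reaches half of
    the total.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


# Made-up contig lengths, not taken from this assembly.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))  # total 290, half 145; 80 + 70 = 150 reaches it -> 70
```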

Assembly

The Fragaria nilgerrensis Genome v1.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHABKC00000000.genome.fasta.gz
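
As a minimal sketch of how the gzipped FASTA download might be inspected, the function below streams a file and records the length of each sequence using only the Python standard library. The function name `fasta_lengths` is hypothetical, and the code assumes the file has been saved locally under the name listed above; the sequence identifiers inside the file are not documented here, so none are assumed.

```python
import gzip


def fasta_lengths(path):
    """Map each sequence ID in a gzipped FASTA file to its length in bases."""
    lengths = {}
    current = None
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Use the first whitespace-delimited token of the header as ID.
                parts = line[1:].split()
                current = parts[0] if parts else ""
                lengths[current] = 0
            elif current is not None:
                lengths[current] += len(line)
    return lengths


# Example usage (assumes the file is in the working directory):
# lengths = fasta_lengths("GWHABKC00000000.genome.fasta.gz")
# print(len(lengths), "sequences,", sum(lengths.values()), "bases in total")
```

The total reported this way can be compared against the assembly size listed in the statistics section as a quick integrity check after downloading.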

Gene Predictions

The Fragaria nilgerrensis Genome v1.0 gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHABKC00000000.gff.gz
CDS sequences (FASTA file) GWHABKC00000000.RNA.fasta.gz
Protein sequences (FASTA file) GWHABKC00000000.Protein.faa.gz
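
The GFF3 download can be summarized with a short script such as the one below, which tallies the feature types found in the third column. This is a sketch under the assumption that the file follows the standard nine-column, tab-separated GFF3 layout; the function name `count_feature_types` is hypothetical, and the feature type names actually used in this annotation (for example `gene` or `mRNA`) are not stated in this record.

```python
import gzip
from collections import Counter


def count_feature_types(path):
    """Count GFF3 feature types (column 3) in a gzipped annotation file."""
    counts = Counter()
    with gzip.open(path, "rt") as handle:
        for line in handle:
            # Skip directives, comments and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            fields = line.rstrip("\n").split("\t")
            # Standard GFF3 records have exactly nine tab-separated columns.
            if len(fields) == 9:
                counts[fields[2]] += 1
    return counts


# Example usage (assumes the file is in the working directory):
# for feature_type, n in count_feature_types("GWHABKC00000000.gff.gz").most_common():
#     print(feature_type, n)
```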

Functional Analysis

Functional annotation for the Fragaria nilgerrensis Genome v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains (Pfam).

Downloads

Domain from InterProScan Fragaria_nilgerrensis_Genome_v1.0.Pfam.tsv.gz
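
The column layout of the Pfam file is not described in this record. The sketch below assumes it follows the standard InterProScan TSV output, in which column 1 is the protein accession, column 4 the analysis name, column 5 the signature accession, and columns 7 and 8 the match start and stop. If the file differs from that layout, the column indices need adjusting. The function name `load_pfam_domains` is hypothetical.

```python
import csv
import gzip
from collections import defaultdict


def load_pfam_domains(path):
    """Group Pfam matches by protein from a gzipped InterProScan-style TSV.

    ASSUMPTION: standard InterProScan TSV columns (0 = protein accession,
    3 = analysis, 4 = signature accession, 6 = start, 7 = stop).
    """
    domains = defaultdict(list)
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            # Ignore short rows and matches from analyses other than Pfam.
            if len(row) < 8 or row[3] != "Pfam":
                continue
            domains[row[0]].append((row[4], int(row[6]), int(row[7])))
    return dict(domains)


# Example usage (assumes the file is in the working directory):
# domains = load_pfam_domains("Fragaria_nilgerrensis_Genome_v1.0.Pfam.tsv.gz")
# print(len(domains), "proteins with at least one Pfam match")
```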
© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences