Analysis Name | Fragaria nilgerrensis Genome v1.0 Assembly & Annotation |
Sequencing technology | PacBio, Illumina |
Assembly method | Falcon and Canu |
Release Date | 2020-07-01 |
Zhang J, Lei Y, Wang B, Li S, Yu S, Wang Y, Li H, Liu Y, Ma Y, Dai H, Wang J, Zhang Z. The high-quality genome of diploid strawberry (Fragaria nilgerrensis) provides new insights into anthocyanin accumulation. Plant Biotechnol J. 2020 Sep;18(9):1908-1924. doi: 10.1111/pbi.13351.
AbstractFragaria nilgerrensis is a wild diploid strawberry species endemic to east and southeast region in Asia and provides a rich source of genetic variations for strawberry improvement. Here, we present a chromosome-scale assembly of F. nilgerrensis using single-molecule real-time (SMRT) Pacific Biosciences sequencing and chromosome conformation capture (Hi-C) genome scaffolding. The genome assembly size was 270.3 Mb, with a contig N50 of ∼8.5 Mb. A total of 28 780 genes and 117.2 Mb of transposable elements were annotated for this genome. Next, detailed comparative genomics with the high-quality F. vesca reference genome was conducted to obtain the difference among transposable elements, SNPs, Indels, and so on. The genome size of F. nilgerrensis was enhanced by around 50 Mb relatively to F. vesca, which is mainly due to expansion of transposable elements. In comparison with the F. vesca genome, we identified 4 561 825 SNPs, 846 301 Indels, 4243 inversions, 35 498 translocations and 10 099 relocations. We also found a marked expansion of genes involved in phenylpropanoid biosynthesis, starch and sucrose metabolism, cyanoamino acid metabolism, plant–pathogen interaction, brassinosteroid biosynthesis and plant hormone signal transduction in F. nilgerrensis, which may account for its specific phenotypes and considerable environmental adaptability. Interestingly, we found sequence variations in the upstream regulatory region of FnMYB10, a core transcriptional activator of anthocyanin biosynthesis, resulted in the low expression level of the FnMYB10 gene, which is likely responsible for white fruit phenotype of F. nilgerrensis. The high-quality F. nilgerrensis genome will be a valuable resource for biological research and comparative genomics research.
Assembly statistics
Total assembly size | 270.3 Mb |
Assembly % of genome | 97.91 |
Repeats region % of assembly | 43.40 |
Predicted gene models | 28 780 |
Number of contigs | 430 |
Contig N50 | 8.51 Mb |
Number of scaffolds | 257 |
Scaffold N50 | 38.3 Mb |
Assembly level | Scaffold |
The Fragaria nilgerrensis Genome v1.0 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GWHABKC00000000.genome.fasta.gz |
The Fragaria nilgerrensis Genome v1.0 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | GWHABKC00000000.gff.gz |
CDS sequences (FASTA file) | GWHABKC00000000.RNA.fasta.gz |
Protein sequences (FASTA file) | GWHABKC00000000.Protein.faa.gz |
Functional annotation for the Fragaria nilgerrensis Genome v1.0 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | Fragaria_nilgerrensis_Genome_v1.0.Pfam.tsv.gz |
Fragaria S genes Nucleotide
Fragaria S genes Protein