Prunus salicina Sanyueli_v2.0 Assembly & Annotation

Overview

Analysis Name Prunus salicina Sanyueli_v2.0 Assembly & Annotation
Sequencing technology PacBio and Illumina reads
Assembly method FALCON (v0.3.0)
Release Date 2020-10-10
Reference Publication(s)

Liu C, Feng C, Peng W, Hao J, Wang J, Pan J, He Y. Chromosome-level draft genome of a diploid plum (Prunus salicina). Gigascience. 2020 Dec 10;9(12):giaa130. doi: 10.1093/gigascience/giaa130.

Abstract

Background: Plums are one of the most economically important Rosaceae fruit crops and comprise dozens of species distributed across the world. Until now, only limited genomic information has been available for the genetic studies and breeding programs of plums. Prunus salicina, an important diploid plum species, plays a predominant role in modern commercial plum production. Here we selected P. salicina for whole-genome sequencing and present a chromosome-level genome assembly through the combination of Pacific Biosciences sequencing, Illumina sequencing, and Hi-C technology. Findings: The assembly had a total size of 284.2 Mb, with contig N50 of 1.78 Mb and scaffold N50 of 32.32 Mb. A total of 96.56% of the assembled sequences were anchored onto 8 pseudochromosomes, and 24,448 protein-coding genes were identified. Phylogenetic analysis showed that P. salicina had a close relationship with Prunus mume and Prunus armeniaca, with P. salicina diverging from their common ancestor ∼9.05 million years ago. During P. salicina evolution 146 gene families were expanded, and some cell wall–related GO terms were significantly enriched. It was noteworthy that members of the DUF579 family, a new class involved in xylan biosynthesis, were significantly expanded in P. salicina, which provided new insight into the xylan metabolism in plums. Conclusions: We constructed the first high-quality chromosome-level plum genome using Pacific Biosciences, Illumina, and Hi-C technologies. This work provides a valuable resource for facilitating plum breeding programs and studying the genetic diversity mechanisms of plums and Prunus species.

Assembly statistics

Scaffolds total length (bp)284,209,110
Scaffolds No.75
Scaffolds N50 (bp)32,324,625
Contigs total length (bp)284,189,410
Contigs No.272
Contigs N50 (bp)1,777,944

Assembly

The Prunus salicina Sanyueli_v2.0 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) psalicina_v2.0.fasta.gz

Gene Predictions

The Prunus salicina Sanyueli_v2.0 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) psalicina_v2.0.genes.gff3.gz
CDS sequences (FASTA file) psalicina_v2.0.transcripts.fasta.gz
Protein sequences (FASTA file) psalicina_v2.0.protein.fasta.gz

S genes

Summary

QueryChrSize(bp)CoordinatesDomain
PsaSLF7.4Chr63629247932829807-32830955F-box
PsaSLF7.2Chr63629247932836287-32837765F-box
PsaSLF7.1Chr63629247933268448-33269941F-box; F_box_assoc
PsaSLF7.3Chr63629247933274938-33276416F-box
PsaSLF4Chr63629247933699243-33697873F-box; F_box_assoc
PsaSLF2Chr63629247933784750-33786024F-box; F_box_assoc
PsaSLF1Chr63629247933822191-33820980F-box; F_box_assoc
PsaSFBChr63629247933824425-33825555F-box; F_box_assoc
PsaSLF3Chr63629247933856713-33857936F-box; F_box_assoc
PsaSLF5Chr63629247933900514-33899336F-box; F_box_assoc
S-RNaseChr63629247933830243-33830168,33829876-33829692,33826993-33826583RNase_T2

Prunus salicina Sanyueli_v2.0 S genes Nucleotide

Prunus salicina Sanyueli_v2.0 S genes Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences