Malus x domestica HFTH1 Assembly & Annotation

Overview

Analysis Name Malus x domestica HFTH1 Assembly & Annotation
Sequencing technology Illumina PE, Pacific Biosciences SMRT
Assembly method Falcon45 (v0.4)
Release Date 2019-04-09
Reference Publication(s)

Zhang L, Hu J, Han X, Li J, Gao Y, Richards CM, Zhang C, Tian Y, Liu G, Gul H, Wang D, Tian Y, Yang C, Meng M, Yuan G, Kang G, Wu Y, Wang K, Zhang H, Wang D, Cong P. A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour. Nat Commun. 2019 Apr 2;10(1):1494. doi: 10.1038/s41467-019-09518-x.

Abstract

A complete and accurate genome sequence provides a fundamental tool for functional genomics and DNA-informed breeding. Here, we assemble a high-quality genome (contig N50 of 6.99 Mb) of the apple anther-derived homozygous line HFTH1, including 22 telomere sequences, using a combination of PacBio single-molecule real-time (SMRT) sequencing, chromosome conformation capture (Hi-C) sequencing, and optical mapping. In comparison to the Golden Delicious reference genome, we identify 18,047 deletions, 12,101 insertions and 14 large inversions. We reveal that these extensive genomic variations are largely attributable to activity of transposable elements. Interestingly, we find that a long terminal repeat (LTR) retrotransposon insertion upstream of MdMYB1, a core transcriptional activator of anthocyanin biosynthesis, is associated with red-skinned phenotype. This finding provides insights into the molecular mechanisms underlying red fruit coloration, and highlights the utility of this high-quality genome assembly in deciphering agriculturally important trait in apple.

Assembly statistics

Sequence coverage (×)183
Sequenced genome size (Mb)660.5
Contig Number502
Contig N50 (Mb)6.99

Assembly

The Malus x domestica HFTH1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) HFTH1.all.chr.fa.gz

Gene Predictions

The Malus x domestica HFTH1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) HFTH1.gene.gff3.gz
CDS sequences (FASTA file) HFTH1.gene.cds.fa.gz
Protein sequences (FASTA file) HFTH1.gene.pep.fa.gz

S genes

Summary

QueryChrSize(bp)CoordinatesDomain
MdSFBB.XVI-S9Chr 173399882529860443-29861630F-box; F_box_assoc
MdSFBB.XVII-S9Chr 173399882529880819-29879614F-box; F_box_assoc
MdSFBB.XIV-S9Chr 173399882529883107-29884312F-box; F_box_assoc
MdSFBB.Ib-S9Chr 173399882529943756-29942554F-box; F_box_assoc
MdSFBB.VI-S9Chr 173399882529999810-29998632F-box; F_box_assoc
MdSFBB.III-S9Chr 173399882530121504-30122685F-box; F_box_assoc
MdSFBB.II-S9Chr 173399882530224230-30225423F-box; F_box_assoc
MdSFBB.IV-S9Chr 173399882530236989-30238173F-box; F_box_assoc
MdSFBB.XV-S9Chr 173399882530289634-30290848F-box; F_box_assoc
MdSFBB.Ia-S9Chr 173399882530392432-30393634F-box; F_box_assoc
MdSFBB.X-S9Chr 173399882530477857-30476676F-box; F_box_assoc
MdSFBB.XI-S9Chr 173399882530520884-30519700F-box; F_box_assoc
MdSFBB.XII-S9Chr 173399882530649328-30648156F-box; F_box_assoc
MdSFBB.XVIII-S9Chr 173399882530716312-30715128F-box; F_box_assoc
MdSFBB.VII-S9Chr 173399882530787708-30786530F-box; F_box_assoc
S9-RNaseChr 173399882530891703-30891452,30891307-30890873RNase_T2
MdSFBB.V-S9Chr 173399882530933865-30935043F-box; F_box_assoc
MdSFBB.VIII.2-S9Chr 173399882531068766-31067573F-box; F_box_assoc
MdSFBB.VIII.1-S9Chr 173399882531116911-31115721F-box; F_box_assoc
MdSFBB.XXI-S9Chr 173399882531827099-31828448F-box; F_box_assoc

Malus x domestica HFTH1 S genes Nucleotide

Malus x domestica HFTH1 S genes Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences