Solanum lycopersicum var. cerasiforme (cherry tomato) 'ZY65 (cultivar)' ASM2770512v1 Assembly & Annotation

Overview

Analysis Name Solanum lycopersicum var. cerasiforme (cherry tomato) 'ZY65 (cultivar)' ASM2770512v1 Assembly & Annotation
Sequencing technology PacBio
Assembly method Canu v. 1.5
Release Date 2023-01-11
Reference Publication(s)

Li N, He Q, Wang J, Wang B, Zhao J, Huang S, Yang T, Tang Y, Yang S, Aisimutuola P, Xu R, Hu J, Jia C, Ma K, Li Z, Jiang F, Gao J, Lan H, Zhou Y, Zhang X, Huang S, Fei Z, Wang H, Li H, Yu Q. Super-pangenome analyses highlight genomic diversity and structural variation across wild and cultivated tomato species. Nat Genet. 2023 May;55(5):852-860. doi: 10.1038/s41588-023-01340-y.

Abstract

Effective utilization of wild relatives is key to overcoming challenges in genetic improvement of cultivated tomato, which has a narrow genetic basis; however, current efforts to decipher high-quality genomes for tomato wild species are insufficient. Here, we report chromosome-scale tomato genomes from nine wild species and two cultivated accessions, representative of Solanum section Lycopersicon, the tomato clade. Together with two previously released genomes, we elucidate the phylogeny of Lycopersicon and construct a section-wide gene repertoire. We reveal the landscape of structural variants and provide entry to the genomic diversity among tomato wild relatives, enabling the discovery of a wild tomato gene with the potential to increase yields of modern cultivated tomatoes. Construction of a graph-based genome enables structural-variant-based genome-wide association studies, identifying numerous signals associated with tomato flavor-related traits and fruit metabolites. The tomato super-pangenome resources will expedite biological studies and breeding of this globally important crop.

Assembly statistics

Genome size 777.8 Mb
Number of scaffolds 329
Scaffold N50 62.6 Mb
Scaffold L50 6
Number of contigs 720
Contig N50 2.5 Mb
Contig L50 97
Assembly level Scaffold

Assembly

The Solanum lycopersicum var. cerasiforme (cherry tomato) 'ZY65 (cultivar)' ASM2770512v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_027704885.1_ASM2770512v1_genomic.fna.gz

Gene Predictions

The Solanum lycopersicum var. cerasiforme (cherry tomato) 'ZY65 (cultivar)' ASM2770512v1 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) S.lycopersicum.ZY65.cds.fa.gz
Protein sequences (FASTA file) S.lycopersicum.ZY65.pep.fa.gz

Functional Analysis

Functional annotation for the Solanum lycopersicum var. cerasiforme (cherry tomato) 'ZY65 (cultivar)' ASM2770512v1 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF15JALGYU01
0000318.1
899830422143976-2142717SL2.31ch01:2198500-2196501_SLF1599.8F-box domain
SLF16JALGYU01
0000318.1
899830422673210-2672029SL2.31ch01:2723400-2721301_SLF16100F-box domain
SLF17ΨJALGYU01
0000318.1
8998304241098243-41097158SL2.31ch01:40853100-40851001_SLF17Ψ99.9-
SLF1JALGYU01
0000318.1
8998304243942054-43943223NM_001301439.2, SLF199.8F-box domain
S-RNaseJALGYU01
0000318.1
8998304244751111-44750872
44750774-44750349
XM_004229015.1,
Ribonuclease S-3
100Ribonuclease T2 family
SLF2ΨJALGYU01
0000318.1
8998304245627679-45626498KJ814870.1, SLF2100-
SLF12ΨJALGYU01
0000318.1
8998304245684241-45685372SL2.31ch01:45516501-45518600_SLF12Ψ99.9-
SLF4ΨJALGYU01
0000318.1
8998304245751544-45750378KJ814943.1, SLF4100-
SLF5ΨJALGYU01
0000318.1
8998304245832209-45831041KJ814872.1, SLF5100-
SLF6ΨJALGYU01
0000318.1
8998304245849744-45848599KJ814944.1, SLF699.9-
SLF8ΨJALGYU01
0000318.1
8998304246407094-46405926SL2.31ch01:46243000-46240701_SLF8Ψ99.9-
SLF7ΨJALGYU01
0000318.1
8998304246431873-46430776SL2.31ch01:46267800-46265701_SLF7Ψ100-
SLF9JALGYU01
0000318.1
8998304248617004-48615940NM_001329461.2, SLF9100F-box domain
SLF10ΨJALGYU01
0000318.1
8998304249062969-49064200KJ814899.1, SLF1099.8-
SLF11JALGYU01
0000318.1
8998304251013941-51015113KJ814877.1, SLF11100F-box associated
SLF12JALGYU01
0000318.1
8998304252787154-52785991NM_001301441.1, SLF12100F-box associated
SLF13JALGYU01
0000318.1
8998304253639114-53637921NM_001301435.1, SLF13100F-box associated
SLF14ΨJALGYU01
0000318.1
8998304256373065-56371895KJ814903.1, SLF14100-
SLF18JALGYU01
0000318.1
8998304267518937-67520052SL2.31ch01:67739501-67741500_SLF18100F-box domain
SLF19JALGYU01
0000318.1
8998304267537963-67536854SL2.31ch01:67757501-67759600_SLF19100F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences