Analysis Name | Solanum lycopersicum 'M82 (cultivar)' ASM2770488v1 Assembly & Annotation |
Sequencing technology | PacBio |
Assembly method | Canu v. 1.5 |
Release Date | 2023-01-11 |
Li N, He Q, Wang J, Wang B, Zhao J, Huang S, Yang T, Tang Y, Yang S, Aisimutuola P, Xu R, Hu J, Jia C, Ma K, Li Z, Jiang F, Gao J, Lan H, Zhou Y, Zhang X, Huang S, Fei Z, Wang H, Li H, Yu Q. Super-pangenome analyses highlight genomic diversity and structural variation across wild and cultivated tomato species. Nat Genet. 2023 May;55(5):852-860. doi: 10.1038/s41588-023-01340-y.
AbstractEffective utilization of wild relatives is key to overcoming challenges in genetic improvement of cultivated tomato, which has a narrow genetic basis; however, current efforts to decipher high-quality genomes for tomato wild species are insufficient. Here, we report chromosome-scale tomato genomes from nine wild species and two cultivated accessions, representative of Solanum section Lycopersicon, the tomato clade. Together with two previously released genomes, we elucidate the phylogeny of Lycopersicon and construct a section-wide gene repertoire. We reveal the landscape of structural variants and provide entry to the genomic diversity among tomato wild relatives, enabling the discovery of a wild tomato gene with the potential to increase yields of modern cultivated tomatoes. Construction of a graph-based genome enables structural-variant-based genome-wide association studies, identifying numerous signals associated with tomato flavor-related traits and fruit metabolites. The tomato super-pangenome resources will expedite biological studies and breeding of this globally important crop.
Assembly statistics
Genome size | 880.3 Mb |
Number of scaffolds | 5,156 |
Scaffold N50 | 54.6 Mb |
Scaffold L50 | 8 |
Number of contigs | 6,409 |
Contig N50 | 600 kb |
Contig L50 | 402 |
Assembly level | Scaffold |
The Solanum lycopersicum 'M82 (cultivar)' ASM2770488v1 Assembly file is available in FASTA format.
Downloads
Chromosomes (FASTA file) | GCA_027704885.1_ASM2770488v1_genomic.fna.gz |
The Solanum lycopersicum 'M82 (cultivar)' ASM2770488v1 genome gene prediction files are available in GFF3 and FASTA format.
Downloads
Genes (GFF3 file) | - |
CDS sequences (FASTA file) | S.lycopersicum.M82.cds.fa.gz |
Protein sequences (FASTA file) | S.lycopersicum.M82.pep.fa.gz |
Functional annotation for the Solanum lycopersicum 'M82 (cultivar)' ASM2770488v1 is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).
Downloads
Domain from InterProScan | - |
Summary
Query | Chr | Size(bp) | Coordinates | BLASTn Hit | BLASTn %ID | Domain |
SLF15 | JALGYV010005145.1 | 81465427 | 2239093-2237834 | SL2.31ch01:2198500-2196501_SLF15 | 100 | F-box domain |
SLF16 | JALGYV010005145.1 | 81465427 | 2763360-2762179 | SL2.31ch01:2723400-2721301_SLF16 | 100 | F-box domain |
SLF17Ψ | JALGYV010005145.1 | 81465427 | 35184227-35183142 | SL2.31ch01:40853100-40851001_SLF17Ψ | 100 | - |
SLF1 | JALGYV010005145.1 | 81465427 | 37706188-37707357 | NM_001301439.2, SLF1 | 100 | F-box domain |
S-RNase | JALGYV010005145.1 | 81465427 | 38318180-38317941,38317843-38317418 | XM_004229015.1, Ribonuclease S-3 | 100 | Ribonuclease T2 family |
SLF2Ψ | JALGYV010005145.1 | 81465427 | 38787133-38785952 | KJ814870.1, SLF2 | 100 | - |
SLF12Ψ | JALGYV010005145.1 | 81465427 | 38843710-38844841 | SL2.31ch01:45516501-45518600_SLF12Ψ | 100 | - |
SLF4Ψ | JALGYV010005145.1 | 81465427 | 38911008-38909842 | KJ814943.1, SLF4 | 100 | - |
SLF5Ψ | JALGYV010005145.1 | 81465427 | 38991724-38990556 | KJ814872.1, SLF5 | 100 | - |
SLF6Ψ | JALGYV010005145.1 | 81465427 | 39009259-39008114 | KJ814944.1, SLF6 | 100 | - |
SLF8Ψ | JALGYV010005145.1 | 81465427 | 39566604-39565436 | SL2.31ch01:46243000-46240701_SLF8Ψ | 100 | - |
SLF7Ψ | JALGYV010005145.1 | 81465427 | 39591376-39590279 | SL2.31ch01:46267800-46265701_SLF7Ψ | 100 | - |
SLF9 | JALGYV010005145.1 | 81465427 | 41756801-41755737 | NM_001329461.2, SLF9 | 100 | F-box domain |
SLF10Ψ | JALGYV010005145.1 | 81465427 | 42199796-42201027 | KJ814899.1, SLF10 | 100 | - |
SLF11 | JALGYV010005145.1 | 81465427 | 44144229-44145401 | KJ814877.1, SLF11 | 100 | F-box associated |
SLF12 | JALGYV010005145.1 | 81465427 | 45897944-45896781 | NM_001301441.1, SLF12 | 100 | F-box associated |
SLF13 | JALGYV010005145.1 | 81465427 | 46621357-46620164 | NM_001301435.1, SLF13 | 100 | F-box associated |
SLF14Ψ | JALGYV010005145.1 | 81465427 | 49503017-49501847 | KJ814903.1, SLF14 | 100 | - |
SLF18 | JALGYV010005145.1 | 81465427 | 59494816-59495931 | SL2.31ch01:67739501-67741500_SLF18 | 100 | F-box domain |
SLF19 | JALGYV010005145.1 | 81465427 | 59513833-59512724 | SL2.31ch01:67757501-67759600_SLF19 | 100 | F-box domain |
Nucleotide
Protein