Solanum sitiens 'LA1974 (cultivar)' ASM1680187v1 Assembly & Annotation

Overview

Analysis Name Solanum sitiens 'LA1974 (cultivar)' ASM1680187v1 Assembly & Annotation
Sequencing technology Illumina HiSeq; PacBio RSII; PacBio Sequel;
BioNano Irys; 10x Genomics Chromium
Assembly method MaSuRCA v. 3.2.2
Release Date 2021-02-02
Reference Publication(s)

Molitor C, Kurowski TJ, Fidalgo de Almeida PM, Eerolla P, Spindlow DJ, Kashyap SP, Singh B, Prasanna H, Thompson AJ, Mohareb FR. De Novo Genome Assembly Of Solanum Sitiens Reveals Structural Variation Associated With Drought And Salinity Tolerance. Bioinformatics. 2021 Jan 30;37(14):1941–5. doi: 10.1093/bioinformatics/btab048.

Abstract

Motivation: Solanum sitiens is a self-incompatible wild relative of tomato, characterised by salt and drought resistance traits, with the potential to contribute through breeding programmes to crop improvement in cultivated tomato. This species has a distinct morphology, classification and ecotype compared to other stress resistant wild tomato relatives such as S. pennellii and S. chilense. Therefore, the availability of a reference genome for S. sitiens will facilitate the genetic and molecular understanding of salt and drought resistance.

Results: A high-quality de novo genome and transcriptome assembly for S. sitiens (Accession LA1974) has been developed. A hybrid assembly strategy was followed using Illumina short reads (∼159X coverage) and PacBio long reads (∼44X coverage), generating a total of ∼262 Gbp of DNA sequence. A reference genome of 1,245 Mbp, arranged in 1,483 scaffolds with a N50 of 1.826 Mbp was generated. Genome completeness was estimated at 95% using the Benchmarking Universal Single-Copy Orthologs (BUSCO) and the K-mer Analysis Tool (KAT). In addition, ∼63 Gbp of RNA-Seq were generated to support the prediction of 31,164 genes from the assembly, and to perform a de novo transcriptome. Lastly, we identified three large inversions compared to S. lycopersicum, containing several drought resistance related genes, such as beta-amylase 1 and YUCCA7.

Availability: S. sitiens (LA1974) raw sequencing, transcriptome and genome assembly have been deposited at the NCBI's Sequence Read Archive, under the BioProject number "PRJNA633104".All the commands and scripts necessary to generate the assembly are available at the following github repository: https://github.com/MCorentin/Solanum_sitiens_assembly.

Assembly statistics

Genome size 1.2 Gb
Number of scaffolds 1,478
Scaffold N50 1.8 Mb
Scaffold L50 186
Number of contigs 3,997
Contig N50 844.5 kb
Contig L50 366
Assembly level Scaffold

Assembly

The Solanum sitiens 'LA1974 (cultivar)' ASM1680187v1 Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_016801875.1_ASM1680187v1_genomic.fna.gz

Gene Predictions

The Solanum sitiens 'LA1974 (cultivar)' ASM1680187v1 genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Solanum sitiens 'LA1974 (cultivar)' ASM1680187v1 is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryScaffoldSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF11JABWTJ01
0000016.1
47212001398330-1397158Solanum pennellii NM_001323461.1, SLF1197.5 F-box domain
SLF16JABWTJ01
0000101.1
43712033439774-3440955Solanum lycopersicum SL2.31, SLF1696.7 F-box domain
SLF15JABWTJ01
0000101.1
43712034009713-4010972Solanum lycopersicum SL2.31, SLF1595.6 F-box domain
SLF7JABWTJ01
0000122.1
35241822100380-2101549Solanum peruvianum KJ814851.1, SLF796.8 F-box domain
SLF6JABWTJ01
0000122.1
35241822221175-2222320Solanum lycopersicoides KU987626.1, SLF698.1 F-box domain
SLF5JABWTJ01
0000122.1
35241822242634-2243803Solanum chilense KJ814884.1, SLF597.9 F-box domain
SLF4JABWTJ01
0000122.1
35241822268140-2269306Solanum peruvianum KJ814848.1, SLF496.8 F-box domain
SLF2JABWTJ01
0000137.1
1814728233034-231847Solanum pennellii BK009231.1, SLF298.1 F-box domain
SLF10ΨJABWTJ01
0000160.1
21389472023891-2025121Solanum chilense KJ814888.1, SLF1094.8 -
SLF5-2JABWTJ01
0000168.1
211808757472-58659Solanum lycopersicoides KU987627.2, SLF597.2 F-box domain
SLF17JABWTJ01
0000199.1
2111152569184-568003Solanum lycopersicoides KU960921.1, SLF1796.4 F-box domain
SLF22ΨJABWTJ01
0000209.1
1536909854576-853450Solanum lycopersicoides KU960924.1, SLF2299.4 -
SLF9JABWTJ01
0000219.1
1342457621184-620039Solanum lycopersicoides KU987631.1, SLF999.6 F-box domain
SLF21JABWTJ01
0000251.1
1194399321642-320332Solanum lycopersicoides KU960923.1, SLF2199.6 F-box domain
SLF6-2JABWTJ01
0000251.1
1194399808296-809441Solanum lycopersicoides KU987626.1, SLF699.0 F-box domain
SLF5-3JABWTJ01
0000251.1
1194399837504-838673Solanum chilense KJ814884.1, SLF597.7 F-box domain
SLF4-2JABWTJ01
0000251.1
1194399892061-893224Solanum peruvianum KJ814848.1, SLF496.6 F-box domain
SLF2-2JABWTJ01
0000251.1
11943991051318-1052505Solanum habrochaites KJ814908.1, SLF2-S197.1 F-box domain
SLF20JABWTJ01
0000322.1
979256587603-588769Solanum lycopersicoides KU960922.1, SLF2099.5 F-box domain
SLF6-3JABWTJ01
0000332.1
12771441075590-1076735Solanum tuberosum DM8.1, SLF691.8 F-box domain
S-RNaseJABWTJ01
0000348.1
1340021338583-338350,
338263-337850
Solanum tuberosum
MZ561415.1, SRNase-S12
92.8 Ribonuclease T2 family
SLF22-2JABWTJ01
0000382.1
1553066393596-394729Solanum lycopersicoides KU960924.1, SLF2299.4 F-box domain
SLF3JABWTJ01
0000442.1
957390855517-856680Solanum pennellii BK009230.1, SLF397.9 F-box domain
SLF18JABWTJ01
0000458.1
665964318027-319136Solanum lycopersicum SL2.31, SLF1895.4 F-box domain
SLF19JABWTJ01
0000458.1
665964358640-357531Solanum lycopersicum SL2.31, SLF1996.0 F-box domain
SLF1ΨJABWTJ01
0000490.1
623224477117-475947Solanum peruvianum KJ814846.1, SLF195.8 -
SLF17-2JABWTJ01
0000745.1
70594846693-47874Solanum lycopersicoides KU960921.1, SLF1797.8 F-box domain
SLF1-2JABWTJ01
0000757.1
317718185148-186302Solanum peruvianum KJ814846.1, SLF194.8 F-box domain
SLF23JABWTJ01
0000828.1
265193102516-103673Solanum lycopersicoides KU960925.1, SLF2398.7 F-box domain
SLF7-2ΨJABWTJ01
0001027.1
16105127713-26544Solanum peruvianum KJ814851.1, SLF797.2 -
SLF20-2ΨJABWTJ01
0001043.1
15380643558-42391Solanum lycopersicoides KU960922.1, SLF2099.8 -

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences