Physalis pubescens gwh_assembly P. floridana Assembly & Annotation

Overview

Analysis Name Physalis pubescens gwh_assembly P. floridana Assembly & Annotation
Sequencing technology PacBio
Assembly method FALCON v1.0
Release Date 2021-09-25
Reference Publication(s)

Lu J, Luo M, Wang L, Li K, Yu Y, Yang W, Gong P, Gao H, Li Q, Zhao J, Wu L, Zhang M, Liu X, Zhang X, Zhang X, Kang J, Yu T, Li Z, Jiao Y, Wang H, He C. The Physalis floridana genome provides insights into the biochemical and morphological evolution of Physalis fruits. Hortic Res. 2021 Nov 18;8(1):244. doi: 10.1038/s41438-021-00705-w.

Abstract

The fruits of Physalis (Solanaceae) have a unique structure, a lantern-like fruiting calyx known as inflated calyx syndrome (ICS) or the Chinese lantern, and are rich in steroid-related compounds. However, the genetic variations underlying the origin of these characteristic traits and diversity in Physalis remain largely unknown. Here, we present a high-quality chromosome-level reference genome assembly of Physalis floridana (~1.40 Gb in size) with a contig N50 of ~4.87 Mb. Through evolutionary genomics and experimental approaches, we found that the loss of the SEP-like MADS-box gene MBP21 subclade is likely a key mutation that, together with the previously revealed mutation affecting floral MPF2 expression, might have contributed to the origination of ICS in Physaleae, suggesting that the origination of a morphological novelty may have resulted from an evolutionary scenario in which one mutation compensated for another deleterious mutation. Moreover, the significant expansion of squalene epoxidase genes is potentially associated with the natural variation of steroid-related compounds in Physalis fruits. The results reveal the importance of gene gains (duplication) and/or subsequent losses as genetic bases of the evolution of distinct fruit traits, and the data serve as a valuable resource for the evolutionary genetics and breeding of solanaceous crops.

Assembly statistics

Genome size (bp) 1,389,299,811
GC content 41.96%
Chromosomes sequence No. 12
Genome sequence No. 327
Maximum genome sequence length (bp) 149,502,688
Minimum genome sequence length (bp) 5,852
Average genome sequence length (bp) 4,248,623
Genome sequence N50 (bp) 113,373,559
Genome sequence N90 (bp) 96,490,955
Assembly level Chromosome

Assembly

The Physalis pubescens gwh_assembly P. floridana Assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) GWHANUX00000000.genome.fasta.gz

Gene Predictions

The Physalis pubescens gwh_assembly P. floridana genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) GWHANUX00000000.gff.gz
CDS sequences (FASTA file) GWHANUX00000000.CDS.fasta.gz
Protein sequences (FASTA file) GWHANUX00000000.Protein.faa.gz

Functional Analysis

Functional annotation for the Physalis pubescens gwh_assembly P. floridana is available for download below. The proteins were analyzed using InterProScan to assign InterPro domains(Pfam).

Downloads

Domain from InterProScan Physalis_pubescens_GWH.Pfam.tsv.gz

S genes

Summary

QueryContigSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF18GWHANUX000000091192923868210963-8209851Solanum tuberosum DM8.1, SLF1882.379F-box domain
SLF19GWHANUX0000001112649098811832337-11831225Solanum tuberosum DM8.1, SLF1987.928F-box domain

Nucleotide

Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences