Citrus garrawayi UN-2024d Assembly & Annotation

Overview

Analysis Name Citrus garrawayi UN-2024d Assembly & Annotation
Sequencing technology PacBio Sequel
Assembly method hifiasm v. v.0.19.8
Release Date 2024-12-17
Reference Publication(s)

Nakandala U, Furtado A, Masouleh AK, Smith MW, Mason P, Williams DC, Henry RJ. The genomes of Australian wild limes. Plant Mol Biol. 2024 Sep 24;114(5):102. doi: 10.1007/s11103-024-01502-4.

Abstract

Australian wild limes occur in highly diverse range of environments and are a unique genetic resource within the genus Citrus. Here we compare the haplotype-resolved genome assemblies of six Australian native limes, including four new assemblies generated using PacBio HiFi and Hi-C sequencing data. The size of the genomes was between 315 and 391 Mb with contig N50s from 29.5 to 35 Mb. Gene completeness of the assemblies was estimated to be from 98.4 to 99.3% and the annotations from 97.7 to 98.9% based upon BUSCO, confirming the high contiguity and completeness of the assembled genomes. High collinearity was observed among the genomes and the two haplotype assemblies for each species. Gene duplication and evolutionary analysis demonstrated that the Australian citrus have undergone only one ancient whole-genome triplication event during evolution. The highest number of species-specific and expanded gene families were found in C. glauca and they were primarily enriched in purine, thiamine metabolism, amino acids and aromatic amino acids metabolism which might help C. glauca to mitigate drought, salinity, and pathogen attacks in the drier environments in which this species is found. Unique genes related to terpene biosynthesis, glutathione metabolism, and toll-like receptors in C. australasica, and starch and sucrose metabolism genes in both C. australis and C. australasica might be important candidate genes for HLB tolerance in these species. Expanded gene families were not lineage specific, however, a greater number of genes related to plant-pathogen interactions, predominantly disease resistant protein, was found in C. australasica and C. australis.

Assembly statistics

Assembly

The Citrus garrawayi UN-2024d Assembly files are available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_046118895.1_ASM4611889v1_genomic.fna.gz
Chromosomes (FASTA file) GCA_046118905.1_ASM4611890v1_genomic.fna.gz

Gene Predictions

The Citrus garrawayi UN-2024d genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Citrus garrawayi UN-2024d is not available.

Downloads

Domain from InterProScan -

S genes

Summary

QueryChrSize(bp)CoordinatesBLASTn HitBLASTn %IDDomain
SLF1CM100405.12885663127715359-27716462PP719840.1, S30-SLF197F-box; F_box_assoc
SLF2CM100405.12885663127725389-27726513PP719841.1, S30-SLF291F-box; F_box_assoc
SLF3CM100405.12885663127740654-27741775PP719842.1, S30-SLF391F-box; F_box_assoc
SLF4CM100405.12885663127753425-27754537PP719829.1, S2-SLF491F-box; F_box_assoc
SLF5CM100405.12885663127834714-27835838PP719830.1, S2-SLF588F-box; F_box_assoc
SLF6CM100405.12885663127844664-27843540PP719831.1, S2-SLF688F-box; F_box_assoc
SLF7CM100405.12885663127847950-27846817PP719832.1, S2-SLF790F-box; F_box_assoc
SLF8CM100405.12885663127876374-27877513PP719847.1, S30-SLF8a74F-box; F_box_assoc
SLF9CM100405.12885663127882534-27883679PP719834.1, S2-SLF974F-box; F_box_assoc
SLF10CM100405.12885663127927607-27928752PP719835.1, S2-SLF1099F-box; F_box_assoc
SLF12CM100405.12885663127935352-27934237PP719852.1, S30-SLF1299F-box; F_box_assoc
S-RNaseCM100405.12885663127776523-27776278,27776184-27775750MN652898.1, S2-RNase71RNase_T2
SLF1CM100423.12897672727889157-27890257PP719840.1, S30-SLF197F-box; F_box_assoc
SLF2CM100423.12897672727892119-27893243PP719841.1, S30-SLF296F-box; F_box_assoc
SLF3CM100423.12897672727917781-27918902PP719842.1, S30-SLF393F-box; F_box_assoc
SLF4CM100423.12897672727928053-27929180PP719843.1, S30-SLF491F-box; F_box_assoc
SLFxCM100423.12897672727966743-27965637PP719849.1, S30-SLF966F-box; F_box_assoc
SLF10CM100423.12897672727984155-27983028 PP719835.1, S2-SLF1076F-box; F_box_assoc
SLF6CM100423.12897672727992937-27991792PP719845.1, S30-SLF691F-box; F_box_assoc
SLF7CM100423.12897672728007848-28006718PP719832.1, S2-SLF791F-box; F_box_assoc
SLF8CM100423.12897672728015074-28016213PP719847.1, S30-SLF8a74F-box; F_box_assoc
SLF9CM100423.12897672728018749-28019891PP719834.1, S2-SLF974F-box; F_box_assoc
SLF10-2CM100423.12897672728041290-28042435PP719835.1, S2-SLF1099F-box; F_box_assoc
SLF12CM100423.12897672728049056-28047941PP719852.1, S30-SLF1299F-box; F_box_assoc
S-RNaseCM100423.12897672727964189-27964419,27964765-27965193MN652909.1, S13-RNase85RNase_T2

Citrus garrawayi UN-2024d S genes Nucleotide

Citrus garrawayi UN-2024d S genes Protein

© 2023 National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences