Citrus glauca UN-2024c Assembly & Annotation

Overview

Analysis Name Citrus glauca UN-2024c Assembly & Annotation
Sequencing technology PacBio Sequel
Assembly method hifiasm v0.19.8
Release Date 2024-12-17
Reference Publication(s)

Nakandala U, Furtado A, Masouleh AK, Smith MW, Mason P, Williams DC, Henry RJ. The genomes of Australian wild limes. Plant Mol Biol. 2024 Sep 24;114(5):102. doi: 10.1007/s11103-024-01502-4.

Abstract

Australian wild limes occur in a highly diverse range of environments and are a unique genetic resource within the genus Citrus. Here we compare the haplotype-resolved genome assemblies of six Australian native limes, including four new assemblies generated using PacBio HiFi and Hi-C sequencing data. The size of the genomes was between 315 and 391 Mb with contig N50s from 29.5 to 35 Mb. Gene completeness of the assemblies was estimated to be from 98.4 to 99.3% and the annotations from 97.7 to 98.9% based upon BUSCO, confirming the high contiguity and completeness of the assembled genomes. High collinearity was observed among the genomes and the two haplotype assemblies for each species. Gene duplication and evolutionary analysis demonstrated that the Australian citrus have undergone only one ancient whole-genome triplication event during evolution. The highest number of species-specific and expanded gene families were found in C. glauca and they were primarily enriched in purine, thiamine metabolism, amino acids and aromatic amino acids metabolism which might help C. glauca to mitigate drought, salinity, and pathogen attacks in the drier environments in which this species is found. Unique genes related to terpene biosynthesis, glutathione metabolism, and toll-like receptors in C. australasica, and starch and sucrose metabolism genes in both C. australis and C. australasica might be important candidate genes for HLB tolerance in these species. Expanded gene families were not lineage specific; however, a greater number of genes related to plant-pathogen interactions, predominantly disease resistant protein, was found in C. australasica and C. australis.

Assembly statistics

Assembly

The Citrus glauca UN-2024c Assembly files are available in FASTA format.

Downloads

Chromosomes (FASTA file) GCA_046118965.1_ASM4611896v1_genomic.fna.gz
Chromosomes (FASTA file) GCA_046118885.1_ASM4611888v1_genomic.fna.gz
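
Once a FASTA file has been downloaded, per-sequence lengths and an N50 value can be checked locally. The following is a minimal sketch using only the Python standard library; the function names (read_fasta_lengths, n50, fasta_lengths_from_gz) are illustrative and not part of any tool used for this assembly, and the short in-memory demo sequence stands in for a real download.

```python
import gzip
import io


def read_fasta_lengths(handle):
    """Return {sequence_id: length} for a FASTA text stream."""
    lengths = {}
    current = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The sequence ID is the first whitespace-delimited token of the header.
            current = line[1:].split()[0]
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def n50(lengths):
    """Largest length L such that sequences of length >= L cover half the total."""
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if running * 2 >= total:
            return length
    return 0


def fasta_lengths_from_gz(path):
    """Convenience wrapper for a gzip-compressed FASTA file such as the .fna.gz downloads."""
    with gzip.open(path, "rt") as handle:
        return read_fasta_lengths(handle)


# Tiny in-memory demo (made-up sequences, not real assembly data).
demo = io.StringIO(">seq1 demo\nACGT\nACGT\n>seq2\nAC\n")
demo_lengths = read_fasta_lengths(demo)
print(demo_lengths)
print(n50(demo_lengths.values()))
```

For a real file, fasta_lengths_from_gz would be called with the local path of one of the downloaded .fna.gz files.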

Gene Predictions

The Citrus glauca UN-2024c genome gene prediction files are not available.

Downloads

Genes (GFF3 file) -
CDS sequences (FASTA file) -
Protein sequences (FASTA file) -

Functional Analysis

Functional annotation for the Citrus glauca UN-2024c genome is not available.

Downloads

Domain from InterProScan -

S genes

Summary

Query    Chr         Size (bp)  Coordinates                          BLASTn Hit             BLASTn %ID  Domain
SLF1     CM100414.1  26291514   25114196-25115296                    PP719840.1, S30-SLF1   97          F-box; F_box_assoc
SLF2     CM100414.1  26291514   25117285-25118409                    PP719841.1, S30-SLF2   96          F-box; F_box_assoc
SLF3     CM100414.1  26291514   25123306-25124406                    PP719842.1, S30-SLF3   95          F-box; F_box_assoc
SLF4     CM100414.1  26291514   25151078-25152193                    PP719843.1, S30-SLF4   92          F-box; F_box_assoc
SLF6     CM100414.1  26291514   25220246-25219119                    PP719845.1, S30-SLF6   89          F-box; F_box_assoc
SLF7     CM100414.1  26291514   25241685-25240561                    PP719832.1, S2-SLF7    89          F-box; F_box_assoc
SLF8     CM100414.1  26291514   25256000-25257139                    PP719847.1, S30-SLF8a  75          F-box; F_box_assoc
SLF9     CM100414.1  26291514   25261046-25262185                    PP719834.1, S2-SLF9    73          F-box; F_box_assoc
SLF10    CM100414.1  26291514   25284384-25285529                    PP719835.1, S2-SLF10   98          F-box; F_box_assoc
SLF11    CM100414.1  26291514   25287981-25286806                    PP719851.1, S30-SLF11  97          F-box; F_box_assoc
SLF12    CM100414.1  26291514   25310341-25309220                    PP719837.1, S2-SLF12   98          F-box; F_box_assoc
SLF1     CM100432.1  26569145   25461429-25462529                    PP719840.1, S30-SLF1   97          F-box; F_box_assoc
SLF2     CM100432.1  26569145   25464497-25465621                    PP719841.1, S30-SLF2   97          F-box; F_box_assoc
SLF3     CM100432.1  26569145   25470520-25471641                    PP719842.1, S30-SLF3   95          F-box; F_box_assoc
SLF4     CM100432.1  26569145   25483881-25485005                    PP719843.1, S30-SLF4   91          F-box; F_box_assoc
SLF7     CM100432.1  26569145   25516806-25517936                    PP719832.1, S2-SLF7    92          F-box; F_box_assoc
SLF5     CM100432.1  26569145   25551312-25552433                    PP719830.1, S2-SLF5    84          F-box; F_box_assoc
SLF8     CM100432.1  26569145   25619567-25620709                    PP719847.1, S30-SLF8a  74          F-box; F_box_assoc
SLF9     CM100432.1  26569145   25624453-25625592                    PP719834.1, S2-SLF9    73          F-box; F_box_assoc
SLF10    CM100432.1  26569145   25647765-25648910                    PP719835.1, S2-SLF10   98          F-box; F_box_assoc
SLF12    CM100432.1  26569145   25655506-25654385                    PP719837.1, S2-SLF12   98          F-box; F_box_assoc
S-RNase  CM100432.1  26569145   25608038-25608268,25608371-25608823  MN652903.1, S7-RNase   88          RNase T2
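
The coordinate strings in the S genes summary can be turned into strand and length values with a few lines of code. The sketch below rests on assumptions not stated on this page: that coordinates are 1-based and inclusive, that a start greater than the end marks a reverse-strand feature, and that comma-separated ranges (as for S-RNase) are segments of a single feature. The function names parse_coordinates and total_length are hypothetical; the three example rows are copied from the summary.

```python
def parse_coordinates(text):
    """Parse 'a-b' or 'a-b,c-d' into (strand, [(start, end), ...]) with start <= end.

    Assumption: a range written high-to-low denotes the reverse strand.
    """
    segments = []
    strands = set()
    for part in text.split(","):
        first, second = (int(value) for value in part.strip().split("-"))
        strands.add("+" if first <= second else "-")
        segments.append((min(first, second), max(first, second)))
    if len(strands) != 1:
        raise ValueError(f"mixed segment orientations in {text!r}")
    return strands.pop(), segments


def total_length(segments):
    """Summed segment length, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


# Three rows taken from the summary: (query, chromosome, coordinates).
rows = [
    ("SLF1", "CM100414.1", "25114196-25115296"),
    ("SLF6", "CM100414.1", "25220246-25219119"),
    ("S-RNase", "CM100432.1", "25608038-25608268,25608371-25608823"),
]

for query, chrom, coords in rows:
    strand, segments = parse_coordinates(coords)
    print(query, chrom, strand, total_length(segments))
```

Under these assumptions each of the three example features has a summed length divisible by three, which is consistent with the ranges describing coding sequence, although the page itself does not say so.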

Citrus glauca UN-2024c S genes Nucleotide

Citrus glauca UN-2024c S genes Protein
