(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Eukaryota: NE > Viridiplantae: NE > Streptophyta: NE > Streptophytina: NE > Embryophyta: NE > Tracheophyta: NE > Euphyllophyta: NE > Spermatophyta: NE > Magnoliophyta: NE > Mesangiospermae: NE > eudicotyledons: NE > Gunneridae: NE > Pentapetalae: NE > asterids: NE > Ericales: NE > Actinidiaceae: NE > Actinidia: NE > Actinidia deliciosa: NE
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MNSSEVTHDFPPFFRVYKDGRIERYVAIGYVPPVVDPQTGVESKDVTISQ ETDLKARIFIPKINSSDPKIPLVVHYHGGAFCIGSPFDALSHSFLTSLAS KARAIVVSVDYRLAPEHPLPIAYDDSWSALQWIAAHSTGQGPDPWLNQHV DFGRVFLAGESAGANIAHHVAVRAGLAGPGYLQVHGLILVHPFFANNEPD EIIRFLYPGSSWSDNDPRLSPLEDPDLDKLGCSQVIVFVAGKDWLKSRGV GYCEILKNRGWEGTVELVESEGEDHCYPLVQSPSEKAVLLVQSLGFFHQS RLMQCNYMESLHHAL
BACKGROUND: Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. RESULTS: A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. CONCLUSIONS: Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.
Carboxylesterases hydrolyze esters of short-chain fatty acids and have roles in animals ranging from signal transduction to xenobiotic detoxification. In plants, however, little is known of their roles. We have systematically mined the genome from the model plant Arabidopsis thaliana for carboxylesterase genes and studied their distribution in the genome and expression profile across a range of tissues. Twenty carboxylesterase genes (AtCXE) were identified. The AtCXE family shares conserved sequence motifs and secondary structure characteristics with carboxylesterases and other members of the larger alpha/beta hydrolase fold superfamily of enzymes. Phylogenetic analysis of the AtCXE genes together with other plant carboxylesterases distinguishes seven distinct clades, with an Arabidopsis thaliana gene represented in six of the seven clades. The AtCXE genes are widely distributed across the genome (present in four of five chromosomes), with the exception of three clusters of tandemly duplicated genes. Of the interchromosomal duplication events, two have been mediated through newly identified partial chromosomal duplication events that also include other genes surrounding the AtCXE loci. Eighteen of the 20 AtCXE genes are expressed over a broad range of tissues, while the remaining 2 (unrelated) genes are expressed only in the flowers and siliques. Finally, hypotheses for the functional roles of the AtCXE family members are presented based on the phylogenetic relationships with other plant carboxylesterases of known function, their expression profile, and knowledge of likely esterase substrates found in plants.