CS-917 (MB06322) is a selective small-molecule inhibitor of fructose 1,6-bisphosphatase (FBPase) that is expected to be a novel drug for the treatment of type 2 diabetes, acting by inhibiting gluconeogenesis. CS-917 is a bisamidate prodrug, and its activation requires a two-step, enzyme-catalyzed reaction. The first-step enzyme, an esterase, catalyzes the conversion of CS-917 into the intermediate form (R-134450), and the second-step enzyme, a phosphoramidase, catalyzes the conversion of R-134450 into the active form (R-125338). In this study, we biochemically purified the CS-917 esterase activity from monkey small intestine and liver. We identified cathepsin A (CTSA) and elastase 3B (ELA3B) as CS-917 esterases in the small intestine by mass spectrometry, whereas CTSA and carboxylesterase 1 (CES1) were identified in monkey liver. We also purified R-134450 phosphoramidase activity from monkey liver and identified sphingomyelin phosphodiesterase, acid-like 3A (SMPDL3A), a protein not previously reported to have any enzymatic activity, as an R-134450 phosphoramidase. Recombinant human CTSA, ELA3B, and CES1 showed CS-917 esterase activity, and recombinant human SMPDL3A showed R-134450 phosphoramidase activity, confirming the identification of these enzymes. Identifying the metabolic enzymes responsible for activation is the requisite first step toward understanding the activation process, pharmacodynamics, and pharmacokinetics of CS-917 at the molecular level. This is the first identification of a phosphoramidase other than histidine triad nucleotide-binding protein (HINT) family enzymes, and SMPDL3A might contribute generally to the activation of other bisamidate prodrugs.
We collected and completely sequenced 28,469 full-length complementary DNA clones from Oryza sativa L. ssp. japonica cv. Nipponbare. Through homology searches of publicly available sequence data, we assigned tentative protein functions to 21,596 clones (75.86%). Mapping of the cDNA clones to genomic DNA revealed that there are 19,000 to 20,500 transcription units in the rice genome. Protein informatics analysis against the InterPro database revealed the existence of proteins present in rice but not in Arabidopsis. Sixty-four percent of our cDNAs are homologous to Arabidopsis proteins.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
Full-length complementary DNAs (cDNAs) are essential for the correct annotation of genomic sequences and for the functional analysis of genes and their products. We isolated 155,144 RIKEN Arabidopsis full-length (RAFL) cDNA clones. The 3'-end expressed sequence tags (ESTs) of 155,144 RAFL cDNAs were clustered into 14,668 nonredundant cDNA groups, about 60% of predicted genes. We also obtained 5' ESTs from 14,034 nonredundant cDNA groups and constructed a promoter database. The sequence database of the RAFL cDNAs is useful for promoter analysis and correct annotation of predicted transcription units and gene products. Furthermore, the full-length cDNAs are useful resources for analyses of the expression profiles, functions, and structures of plant proteins.
The RIKEN Mouse Gene Encyclopaedia Project, a systematic approach to determining the full coding potential of the mouse genome, involves collection and sequencing of full-length complementary DNAs and physical mapping of the corresponding genes to the mouse genome. We organized an international functional annotation meeting (FANTOM) to annotate the first 21,076 cDNAs to be analysed in this project. Here we describe the first RIKEN clone collection, which is one of the largest described for any organism. Analysis of these cDNAs extends known gene families and identifies new ones.