Tabata S

References (31)

Title : Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation - Hori_2014_Nat.Commun_5_3978
Author(s) : Hori K , Maruyama F , Fujisawa T , Togashi T , Yamamoto N , Seo M , Sato S , Yamada T , Mori H , Tajima N , Moriyama T , Ikeuchi M , Watanabe M , Wada H , Kobayashi K , Saito M , Masuda T , Sasaki-Sekimoto Y , Mashiguchi K , Awai K , Shimojima M , Masuda S , Iwai M , Nobusawa T , Narise T , Kondo S , Saito H , Sato R , Murakawa M , Ihara Y , Oshima-Yamada Y , Ohtaka K , Satoh M , Sonobe K , Ishii M , Ohtani R , Kanamori-Sato M , Honoki R , Miyazaki D , Mochizuki H , Umetsu J , Higashi K , Shibata D , Kamiya Y , Sato N , Nakamura Y , Tabata S , Ida S , Kurokawa K , Ohta H
Ref : Nat Commun , 5 :3978 , 2014
Abstract : The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments.
ESTHER : Hori_2014_Nat.Commun_5_3978
PubMedSearch : Hori_2014_Nat.Commun_5_3978
PubMedID: 24865297
Gene_locus related to this paper: kleni-a0a1y1i5c5 , kleni-a0a1y1i3f9 , kleni-a0a1y1hnk2 , kleni-a0a1y1hsz2 , kleni-a0a1y1hva2 , kleni-a0a1y1i2g3 , kleni-a0a1y1i4h5 , kleni-a0a1y1i9h9

Title : Identification of a novel salt tolerance gene in wild soybean by whole-genome sequencing - Qi_2014_Nat.Commun_5_4340
Author(s) : Qi X , Li MW , Xie M , Liu X , Ni M , Shao G , Song C , Kay-Yuen Yim A , Tao Y , Wong FL , Isobe S , Wong CF , Wong KS , Xu C , Li C , Wang Y , Guan R , Sun F , Fan G , Xiao Z , Zhou F , Phang TH , Tong SW , Chan TF , Yiu SM , Tabata S , Wang J , Xu X , Lam HM
Ref : Nat Commun , 5 :4340 , 2014
Abstract : Using a whole-genome-sequencing approach to explore germplasm resources can serve as an important strategy for crop improvement, especially in investigating wild accessions that may contain useful genetic resources that have been lost during the domestication process. Here we sequence and assemble a draft genome of wild soybean and construct a recombinant inbred population for genotyping-by-sequencing and phenotypic analyses to identify multiple QTLs relevant to traits of interest in agriculture. We use a combination of de novo sequencing data from this work and our previous germplasm re-sequencing data to identify a novel ion transporter gene, GmCHX1, and relate its sequence alterations to salt tolerance. Rapid gain-of-function tests show the protective effects of GmCHX1 towards salt stress. This combination of whole-genome de novo sequencing, high-density-marker QTL mapping by re-sequencing and functional analyses can serve as an effective strategy to unveil novel genomic information in wild soybean to facilitate crop improvement.
ESTHER : Qi_2014_Nat.Commun_5_4340
PubMedSearch : Qi_2014_Nat.Commun_5_4340
PubMedID: 25004933
Gene_locus related to this paper: soybn-i1k636 , soybn-i1j4c6 , glyso-a0a0b2sjw6 , soybn-a0a0r0i9y7 , soybn-a0a0r0j241 , soybn-i1kfz9 , glyso-a0a0b2rre9 , soybn-i1jx17

Title : Draft genome sequence of Arthrospira platensis C1 (PCC9438) - Cheevadhanarak_2012_Stand.Genomic.Sci_6_43
Author(s) : Cheevadhanarak S , Paithoonrangsarid K , Prommeenate P , Kaewngam W , Musigkain A , Tragoonrung S , Tabata S , Kaneko T , Chaijaruwanich J , Sangsrakru D , Tangphatsornruang S , Chanprasert J , Tongsima S , Kusonmano K , Jeamton W , Dulsawat S , Klanchui A , Vorapreeda T , Chumchua V , Khannapho C , Thammarongtham C , Plengvidhya V , Subudhi S , Hongsthong A , Ruengjitchatchawalya M , Meechai A , Senachak J , Tanticharoen M
Ref : Stand Genomic Sci , 6 :43 , 2012
Abstract : Arthrospira platensis is a cyanobacterium that is extensively cultivated outdoors on a large commercial scale for consumption as a food for humans and animals. It can be grown in monoculture under highly alkaline conditions, making it attractive for industrial production. Here we describe the complete genome sequence of A. platensis C1 strain and its annotation. The A. platensis C1 genome contains 6,089,210 bp including 6,108 protein-coding genes and 45 RNA genes, and no plasmids. The genome information has been used for further comparative analysis, particularly of metabolic pathways, photosynthetic efficiency and barriers to gene transfer.
ESTHER : Cheevadhanarak_2012_Stand.Genomic.Sci_6_43
PubMedSearch : Cheevadhanarak_2012_Stand.Genomic.Sci_6_43
PubMedID: 22675597
Gene_locus related to this paper: artma-b5w5h8 , artma-b5w601 , artpn-d5a0k5

Title : Genomic structure of the cyanobacterium Synechocystis sp. PCC 6803 strain GT-S - Tajima_2011_DNA.Res_18_393
Author(s) : Tajima N , Sato S , Maruyama F , Kaneko T , Sasaki NV , Kurokawa K , Ohta H , Kanesaki Y , Yoshikawa H , Tabata S , Ikeuchi M , Sato N
Ref : DNA Research , 18 :393 , 2011
Abstract : Synechocystis sp. PCC 6803 is the most popular cyanobacterial strain, serving as a standard in the research fields of photosynthesis, stress response, metabolism and so on. A glucose-tolerant (GT) derivative of this strain was used for genome sequencing at Kazusa DNA Research Institute in 1996, which established a hallmark in the study of cyanobacteria. However, apparent differences in sequences deviating from the database have been noticed among different strain stocks. For this reason, we analysed the genomic sequence of another GT strain (GT-S) by 454 and partial Sanger sequencing. We found 22 putative single nucleotide polymorphisms (SNPs) in comparison to the published sequence of the Kazusa strain. However, Sanger sequencing of 36 direct PCR products of the Kazusa strains stored in small aliquots resulted in their identity with the GT-S sequence at 21 of the 22 sites, excluding the possibility of their being SNPs. In addition, we were able to combine five split open reading frames present in the database sequence, and to remove the C-terminus of an ORF. Aside from these, two of the Insertion Sequence elements were not present in the GT-S strain. We have thus become able to provide an accurate genomic sequence of Synechocystis sp. PCC 6803 for future studies on this important cyanobacterial strain.
ESTHER : Tajima_2011_DNA.Res_18_393
PubMedSearch : Tajima_2011_DNA.Res_18_393
PubMedID: 21803841
Gene_locus related to this paper: synsp-ester

Title : The genome of the mesopolyploid crop species Brassica rapa - Wang_2011_Nat.Genet_43_1035
Author(s) : Wang X , Wang H , Wang J , Sun R , Wu J , Liu S , Bai Y , Mun JH , Bancroft I , Cheng F , Huang S , Li X , Hua W , Freeling M , Pires JC , Paterson AH , Chalhoub B , Wang B , Hayward A , Sharpe AG , Park BS , Weisshaar B , Liu B , Li B , Tong C , Song C , Duran C , Peng C , Geng C , Koh C , Lin C , Edwards D , Mu D , Shen D , Soumpourou E , Li F , Fraser F , Conant G , Lassalle G , King GJ , Bonnema G , Tang H , Belcram H , Zhou H , Hirakawa H , Abe H , Guo H , Jin H , Parkin IA , Batley J , Kim JS , Just J , Li J , Xu J , Deng J , Kim JA , Yu J , Meng J , Min J , Poulain J , Hatakeyama K , Wu K , Wang L , Fang L , Trick M , Links MG , Zhao M , Jin M , Ramchiary N , Drou N , Berkman PJ , Cai Q , Huang Q , Li R , Tabata S , Cheng S , Zhang S , Sato S , Sun S , Kwon SJ , Choi SR , Lee TH , Fan W , Zhao X , Tan X , Xu X , Wang Y , Qiu Y , Yin Y , Li Y , Du Y , Liao Y , Lim Y , Narusaka Y , Wang Z , Li Z , Xiong Z , Zhang Z
Ref : Nat Genet , 43 :1035 , 2011
Abstract : We report the annotation and analysis of the draft genome sequence of Brassica rapa accession Chiifu-401-42, a Chinese cabbage. We modeled 41,174 protein coding genes in the B. rapa genome, which has undergone genome triplication. We used Arabidopsis thaliana as an outgroup for investigating the consequences of genome triplication, such as structural and functional evolution. The extent of gene loss (fractionation) among triplicated genome segments varies, with one of the three copies consistently retaining a disproportionately large fraction of the genes expected to have been present in its ancestor. Variation in the number of members of gene families present in the genome may contribute to the remarkable morphological plasticity of Brassica species. The B. rapa genome sequence provides an important resource for studying the evolution of polyploid genomes and underpins the genetic improvement of Brassica oil and vegetable crops.
ESTHER : Wang_2011_Nat.Genet_43_1035
PubMedSearch : Wang_2011_Nat.Genet_43_1035
PubMedID: 21873998
Gene_locus related to this paper: braol-Q8GTM3 , braol-Q8GTM4 , brarp-m4ei94 , brarp-m4c988 , brana-a0a078j4a9 , brana-a0a078e1m0 , brana-a0a078cd75 , brarp-m4dwa6 , brana-a0a078j4f0 , brana-a0a078cus4 , brana-a0a078f8c2 , brana-a0a078jql1 , brana-a0a078dgj3 , brana-a0a078hw50 , brana-a0a078cuu0 , brana-a0a078dfa9 , brana-a0a078ic91 , brarp-m4ctw3 , brana-a0a078ca65 , brana-a0a078ctc8 , brana-a0a078h021 , brana-a0a078jx23 , brarp-m4da84 , brarp-m4dwr7 , brana-a0a078dh94 , brana-a0a078h612 , brana-a0a078j2t3 , braol-a0a0d3dpb2 , braol-a0a0d3dx76 , brana-a0a078jxa8 , brana-a0a078i2k3 , brarp-m4cwq4 , brarp-m4dcj8 , brarp-m4eh17 , brarp-m4eey4 , brarp-m4dnj8 , brarp-m4ey83 , brarp-m4ey84

Title : Complete genomic structure of the cultivated rice endophyte Azospirillum sp. B510 - Kaneko_2010_DNA.Res_17_37
Author(s) : Kaneko T , Minamisawa K , Isawa T , Nakatsukasa H , Mitsui H , Kawaharada Y , Nakamura Y , Watanabe A , Kawashima K , Ono A , Shimizu Y , Takahashi C , Minami C , Fujishiro T , Kohara M , Katoh M , Nakazaki N , Nakayama S , Yamada M , Tabata S , Sato S
Ref : DNA Research , 17 :37 , 2010
Abstract : We determined the nucleotide sequence of the entire genome of a diazotrophic endophyte, Azospirillum sp. B510. Strain B510 is an endophytic bacterium isolated from stems of rice plants (Oryza sativa cv. Nipponbare). The genome of B510 consisted of a single chromosome (3,311,395 bp) and six plasmids, designated as pAB510a (1,455,109 bp), pAB510b (723,779 bp), pAB510c (681,723 bp), pAB510d (628,837 bp), pAB510e (537,299 bp), and pAB510f (261,596 bp). The chromosome bears 2893 potential protein-encoding genes, two sets of rRNA gene clusters (rrns), and 45 tRNA genes representing 37 tRNA species. The genomes of the six plasmids contained a total of 3416 protein-encoding genes, seven sets of rrns, and 34 tRNAs representing 19 tRNA species. Eight genes for plasmid-specific tRNA species are located on either pAB510a or pAB510d. Two out of eight genomic islands are inserted in the plasmids, pAB510b and pAB510e, and one of the islands is inserted into trnfM-CAU in the rrn located on pAB510e. Genes other than the nif gene cluster that are involved in N(2) fixation and are homologues of Bradyrhizobium japonicum USDA110 include fixABCX, fixNOQP, fixHIS, fixG, and fixLJK. Three putative plant hormone-related genes encoding tryptophan 2-monooxytenase (iaaM) and indole-3-acetaldehyde hydrolase (iaaH), which are involved in IAA biosynthesis, and ACC deaminase (acdS), which reduces ethylene levels, were identified. Multiple gene-clusters for tripartite ATP-independent periplasmic-transport systems and a diverse set of malic enzymes were identified, suggesting that B510 utilizes C(4)-dicarboxylate during its symbiotic relationship with the host plant.
ESTHER : Kaneko_2010_DNA.Res_17_37
PubMedSearch : Kaneko_2010_DNA.Res_17_37
PubMedID: 20047946
Gene_locus related to this paper: azos1-d3nrk5 , azos1-d3ns85 , azos1-d3nsh5 , azos1-d3nt15 , azos1-d3ntk4 , azos1-d3nv04 , azos1-d3nx23 , azos1-d3ny33 , azos1-d3p0k9 , azos1-d3p0n8 , azos1-d3p1p1 , azos1-d3p2z0 , azos1-d3p4h2 , azos1-d3p4k9 , azos1-d3p5e7 , azos1-d3p281 , azos1-d3p626

Title : Complete genomic structure of the bloom-forming toxic cyanobacterium Microcystis aeruginosa NIES-843 - Kaneko_2007_DNA.Res_14_247
Author(s) : Kaneko T , Nakajima N , Okamoto S , Suzuki I , Tanabe Y , Tamaoki M , Nakamura Y , Kasai F , Watanabe A , Kawashima K , Kishida Y , Ono A , Shimizu Y , Takahashi C , Minami C , Fujishiro T , Kohara M , Katoh M , Nakazaki N , Nakayama S , Yamada M , Tabata S , Watanabe MM
Ref : DNA Research , 14 :247 , 2007
Abstract : The nucleotide sequence of the complete genome of a cyanobacterium, Microcystis aeruginosa NIES-843, was determined. The genome of M. aeruginosa is a single, circular chromosome of 5,842,795 base pairs (bp) in length, with an average GC content of 42.3%. The chromosome comprises 6312 putative protein-encoding genes, two sets of rRNA genes, 42 tRNA genes representing 41 tRNA species, and genes for tmRNA, the B subunit of RNase P, SRP RNA, and 6Sa RNA. Forty-five percent of the putative protein-encoding sequences showed sequence similarity to genes of known function, 32% were similar to hypothetical genes, and the remaining 23% had no apparent similarity to reported genes. A total of 688 kb of the genome, equivalent to 11.8% of the entire genome, were composed of both insertion sequences and miniature inverted-repeat transposable elements. This is indicative of a plasticity of the M. aeruginosa genome, through a mechanism that involves homologous recombination mediated by repetitive DNA elements. In addition to known gene clusters related to the synthesis of microcystin and cyanopeptolin, novel gene clusters that may be involved in the synthesis and modification of toxic small polypeptides were identified. Compared with other cyanobacteria, a relatively small number of genes for two component systems and a large number of genes for restriction-modification systems were notable characteristics of the M. aeruginosa genome.
ESTHER : Kaneko_2007_DNA.Res_14_247
PubMedSearch : Kaneko_2007_DNA.Res_14_247
PubMedID: 18192279
Gene_locus related to this paper: micae-a8yde8 , micae-a8yen2 , micae-a8yma5 , micae-MCYC , mican-b0jqg0 , mican-b0jqq0 , mican-b0jsa2 , mican-b0jxh1 , mican-b0jux6 , mican-b0jyg0

Title : Complete genome structure of Gloeobacter violaceus PCC 7421, a cyanobacterium that lacks thylakoids - Nakamura_2003_DNA.Res_10_137
Author(s) : Nakamura Y , Kaneko T , Sato S , Mimuro M , Miyashita H , Tsuchiya T , Sasamoto S , Watanabe A , Kawashima K , Kishida Y , Kiyokawa C , Kohara M , Matsumoto M , Matsuno A , Nakazaki N , Shimpo S , Takeuchi C , Yamada M , Tabata S
Ref : DNA Research , 10 :137 , 2003
Abstract : The nucleotide sequence of the entire genome of a cyanobacterium Gloeobacter violaceus PCC 7421 was determined. The genome of G. violaceus was a single circular chromosome 4,659,019 bp long with an average GC content of 62%. No plasmid was detected. The chromosome comprises 4430 potential protein-encoding genes, one set of rRNA genes, 45 tRNA genes representing 44 tRNA species and genes for tmRNA, B subunit of RNase P, SRP RNA and 6Sa RNA. Forty-one percent of the potential protein-encoding genes showed sequence similarity to genes of known function, 37% to hypothetical genes, and the remaining 22% had no apparent similarity to reported genes. Comparison of the assigned gene components with those of other cyanobacteria has unveiled distinctive features of the G. violaceus genome. Genes for PsaI, PsaJ, PsaK, and PsaX for Photosystem I and PsbY, PsbZ and Psb27 for Photosystem II were missing, and those for PsaF, PsbO, PsbU, and PsbV were poorly conserved. cpcG for a rod core linker peptide for phycobilisomes and nblA related to the degradation of phycobilisomes were also missing. Potential signal peptides of the presumptive products of petJ and petE for soluble electron transfer catalysts were less conserved than the remaining portions. These observations may be related to the fact that photosynthesis in G. violaceus takes place not in thylakoid membranes but in the cytoplasmic membrane. A large number of genes for sigma factors and transcription factors in the LuxR, LysR, PadR, TetR, and MarR families could be identified, while those for major elements for circadian clock, kaiABC were not found. These differences may reflect the phylogenetic distance between G. violaceus and other cyanobacteria.
ESTHER : Nakamura_2003_DNA.Res_10_137
PubMedSearch : Nakamura_2003_DNA.Res_10_137
PubMedID: 14621292
Gene_locus related to this paper: glovi-GLL0053 , glovi-GLL0778 , glovi-GLL1217 , glovi-GLL1281 , glovi-GLL1310 , glovi-GLL1435 , glovi-GLL2002 , glovi-gll2009 , glovi-GLL2335 , glovi-GLL2500 , glovi-GLL3208 , glovi-GLL3677 , glovi-GLL4259 , glovi-GLR0796 , glovi-GLR1368 , glovi-GLR1422 , glovi-GLR2241 , glovi-GLR2809 , glovi-GLR3058 , glovi-GLR3329 , glovi-GLR3546 , glovi-GLR3833 , glovi-q7ncx6 , glovi-q7nd10 , glovi-q7ndi8 , glovi-q7ndy7 , glovi-q7nek8 , glovi-q7net1 , glovi-q7new7 , glovi-q7ney7 , glovi-q7nfx3 , glovi-q7nga2 , glovi-q7ngw1 , glovi-q7nj78 , glovi-q7nj91 , glovi-q7nj98 , glovi-q7njx8 , glovi-q7njz2 , glovi-q7nkk7 , glovi-q7nly3 , glovi-q7nm82 , glovi-q7nmt4 , glovi-q7nmz0 , glovi-q7nn33 , glovi-q7nn46 , glovi-q7nn64 , glovi-q7nny4 , glovi-q7np81 , glovi-q7npc6

Title : Complete genome structure of Gloeobacter violaceus PCC 7421, a cyanobacterium that lacks thylakoids (supplement) -
Author(s) : Nakamura Y , Kaneko T , Sato S , Mimuro M , Miyashita H , Tsuchiya T , Sasamoto S , Watanabe A , Kawashima K , Kishida Y , Kiyokawa C , Kohara M , Matsumoto M , Matsuno A , Nakazaki N , Shimpo S , Takeuchi C , Yamada M , Tabata S
Ref : DNA Research , 10 :181 , 2003
PubMedID: 14621296
Gene_locus related to this paper: glovi-gll2009

Title : Structural analysis of four large plasmids harboring in a unicellular cyanobacterium, Synechocystis sp. PCC 6803 - Kaneko_2003_DNA.Res_10_221
Author(s) : Kaneko T , Nakamura Y , Sasamoto S , Watanabe A , Kohara M , Matsumoto M , Shimpo S , Yamada M , Tabata S
Ref : DNA Research , 10 :221 , 2003
Abstract : The genome of the unicellular cyanobacterium Synechocystis sp. PCC 6803 consists of a single chromosome and several plasmids of different sizes, and the nucleotide sequences of the chromosome and three small plasmids (5.2 kb, 2.4 kb, and 2.3 kb) have already been sequenced. We newly determined the nucleotide sequences of four large plasmids, which have been identified in our laboratory (pSYSM:120 kb, pSYSX:106 kb, pSYSA:103 kb, and pSYSG:44 kb). Computer-aided analysis was performed to explore the genetic information carried by these plasmids. A total of 397 potential protein-encoding genes were predicted, but little information was obtained about the functional relationship of plasmids to host cell, as a large portion of the predicted genes (77%) were of unknown function. The occurrence of the potential genes on plasmids was divergent, and parA was the only gene common to all four large plasmids. The distribution data of a Cyanobacterium-specific sequence (HIP1: 5'-GCGATCGC-3') suggested that respective plasmids could have originated from different cyanobacterial strains.
ESTHER : Kaneko_2003_DNA.Res_10_221
PubMedSearch : Kaneko_2003_DNA.Res_10_221
PubMedID: 14686584
Gene_locus related to this paper: syny3-q6zeq0 , 9sync-m1mb45

Title : Complete genome structure of the thermophilic cyanobacterium Thermosynechococcus elongatus BP-1 - Nakamura_2002_DNA.Res_9_123
Author(s) : Nakamura Y , Kaneko T , Sato S , Ikeuchi M , Katoh H , Sasamoto S , Watanabe A , Iriguchi M , Kawashima K , Kimura T , Kishida Y , Kiyokawa C , Kohara M , Matsumoto M , Matsuno A , Nakazaki N , Shimpo S , Sugimoto M , Takeuchi C , Yamada M , Tabata S
Ref : DNA Research , 9 :123 , 2002
Abstract : The entire genome of a thermophilic unicellular cyanobacterium, Thermosynechococcus elongatus BP-1, was sequenced. The genome consisted of a circular chromosome 2,593,857 bp long, and no plasmid was detected. A total of 2475 potential protein-encoding genes, one set of rRNA genes, 42 tRNA genes representing 42 tRNA species and 4 genes for small structural RNAs were assigned to the chromosome by similarity search and computer prediction. The translated products of 56% of the potential protein-encoding genes showed sequence similarity to experimentally identified and predicted proteins of known function, and the products of 34% of these genes showed sequence similarity to the translated products of hypothetical genes. The remaining 10% lacked significant similarity to genes for predicted proteins in the public DNA databases. Sixty-three percent of the T. elongatus genes showed significant sequence similarity to those of both Synechocystis sp. PCC 6803 and Anabaena sp. PCC 7120, while 22% of the genes were unique to this species, indicating a high degree of divergence of the gene information among cyanobacterial strains. The lack of genes for typical fatty acid desaturases and the presence of more genes for heat-shock proteins in comparison with other mesophilic cyanobacteria may be genomic features of thermophilic strains. A remarkable feature of the genome is the presence of 28 copies of group II introns, 8 of which contained a presumptive gene for maturase/reverse transcriptase. A trace of genome rearrangement mediated by the group II introns was also observed.
ESTHER : Nakamura_2002_DNA.Res_9_123
PubMedSearch : Nakamura_2002_DNA.Res_9_123
PubMedID: 12240834
Gene_locus related to this paper: theeb-q8dg48 , theeb-TLL0292 , theeb-TLL0340 , theeb-TLL0918 , theeb-TLL0989 , theeb-TLL1717 , theeb-TLL2163 , theeb-TLR0492 , theeb-TLR1045 , theeb-TLR1157 , theeb-TLR1274 , theeb-TLR1393 , theeb-TLR1423 , theeb-TLR1685 , theeb-TLR1725 , theeb-TLR1892 , theeb-TLR1982 , theeb-TLR2038 , theeb-TLR2066

Title : Complete genomic sequence of nitrogen-fixing symbiotic bacterium Bradyrhizobium japonicum USDA110 - Kaneko_2002_DNA.Res_9_189
Author(s) : Kaneko T , Nakamura Y , Sato S , Minamisawa K , Uchiumi T , Sasamoto S , Watanabe A , Idesawa K , Iriguchi M , Kawashima K , Kohara M , Matsumoto M , Shimpo S , Tsuruoka H , Wada T , Yamada M , Tabata S
Ref : DNA Research , 9 :189 , 2002
Abstract : The complete nucleotide sequence of the genome of a symbiotic bacterium Bradyrhizobium japonicum USDA110 was determined. The genome of B. japonicum was a single circular chromosome 9,105,828 bp in length with an average GC content of 64.1%. No plasmid was detected. The chromosome comprises 8317 potential protein-coding genes, one set of rRNA genes and 50 tRNA genes. Fifty-two percent of the potential protein genes showed sequence similarity to genes of known function and 30% to hypothetical genes. The remaining 18% had no apparent similarity to reported genes. Thirty-four percent of the B. japonicum genes showed significant sequence similarity to those of both Mesorhizobium loti and Sinorhizobium meliloti, while 23% were unique to this species. A presumptive symbiosis island 681 kb in length, which includes a 410-kb symbiotic region previously reported by Gottfert et al., was identified. Six hundred fifty-five putative protein-coding genes were assigned in this region, and the functions of 301 genes, including those related to symbiotic nitrogen fixation and DNA transmission, were deduced. A total of 167 genes for transposases/104 copies of insertion sequences were identified in the genome. It was remarkable that 100 out of 167 transposase genes are located in the presumptive symbiotic island. DNA segments of 4 to 97 kb inserted into tRNA genes were found at 14 locations in the genome, which generates partial duplication of the target tRNA genes. These observations suggest plasticity of the B. japonicum genome, which is probably due to complex genome rearrangements such as horizontal transfer and insertion of various DNA elements, and to homologous recombination.
ESTHER : Kaneko_2002_DNA.Res_9_189
PubMedSearch : Kaneko_2002_DNA.Res_9_189
PubMedID: 12597275
Gene_locus related to this paper: braja-BLL0118 , braja-BLL0272 , braja-BLL0546 , braja-BLL0558 , braja-BLL0837 , braja-BLL0839 , braja-BLL1016 , braja-BLL1128 , braja-BLL1234 , braja-BLL1350 , braja-BLL1401 , braja-BLL2323 , braja-BLL2387 , braja-BLL2443 , braja-BLL2527 , braja-BLL2884 , braja-BLL2902 , braja-BLL3246 , braja-BLL3359 , braja-BLL3416 , braja-BLL3418 , braja-BLL3470 , braja-BLL3759 , braja-BLL3777 , braja-BLL4001 , braja-BLL4189 , braja-BLL4284 , braja-BLL4335 , braja-BLL4360 , braja-BLL4548 , braja-BLL4985 , braja-BLL4989 , braja-BLL4997 , braja-BLL5160 , braja-BLL5514 , braja-BLL5588 , braja-BLL5740 , braja-bll6073 , braja-BLL6264 , braja-BLL6275 , braja-BLL6428 , braja-bll6463 , braja-BLL6574 , braja-BLL6577 , braja-BLL6614 , braja-bll6820 , braja-BLL6841 , braja-BLL6890 , braja-BLL6974 , braja-BLL7123 , braja-BLL7368 , braja-BLL7370 , braja-BLL7497 , braja-BLL7506 , braja-BLL7509 , braja-BLL7545 , braja-BLL7692 , braja-BLL7862 , braja-BLL8011 , braja-BLL8277 , braja-BLR0230 , braja-BLR0418 , braja-BLR0711 , braja-BLR0899 , braja-BLR0908 , braja-BLR1078 , braja-BLR1197 , braja-BLR1251 , braja-BLR2261 , braja-BLR2321 , braja-BLR2487 , braja-BLR2879 , braja-BLR2885 , braja-BLR2889 , braja-BLR2982 , braja-BLR3322 , braja-BLR3456 , braja-BLR3519 , braja-blr3732 , braja-BLR3878 , braja-BLR4157 , braja-BLR4181 , braja-BLR5346 , braja-BLR5359 , braja-BLR6083 , braja-BLR6127 , braja-BLR6186 , braja-BLR6271 , braja-BLR6465 , braja-BLR6576 , braja-BLR6677 , braja-BLR6703 , braja-BLR6960 , braja-BLR6965 , braja-BLR7068 , braja-BLR7144 , braja-BLR7443 , braja-BLR7556 , braja-BLR7622 , braja-BLR7741 , braja-BLR7809 , braja-BLR7810 , braja-BLR7894 , braja-BLR8188 , braja-dhaa , braja-EPHA , braja-EPHB , braja-ID587 , braja-ID930 , braja-METX , braja-pcaD , braja-PIP , braja-PLDB , braja-PTRB , braja-q89c18 , braja-q89c36 , braja-q89mj3 , braja-q89ql9 , braja-q89y23

Title : Complete genomic sequence of nitrogen-fixing symbiotic bacterium Bradyrhizobium japonicum USDA110 (supplement) -
Author(s) : Kaneko T , Nakamura Y , Sato S , Minamisawa K , Uchiumi T , Sasamoto S , Watanabe A , Idesawa K , Iriguchi M , Kawashima K , Kohara M , Matsumoto M , Shimpo S , Tsuruoka H , Wada T , Yamada M , Tabata S
Ref : DNA Research , 9 :225 , 2002
PubMedID: 12597279
Gene_locus related to this paper: braja-bll6463 , braja-bll6820 , braja-EPHB , braja-ID930 , braja-pcaD

Title : Complete genomic sequence of the filamentous nitrogen-fixing cyanobacterium Anabaena sp. strain PCC 7120 - Kaneko_2001_DNA.Res_8_205
Author(s) : Kaneko T , Nakamura Y , Wolk CP , Kuritz T , Sasamoto S , Watanabe A , Iriguchi M , Ishikawa A , Kawashima K , Kimura T , Kishida Y , Kohara M , Matsumoto M , Matsuno A , Muraki A , Nakazaki N , Shimpo S , Sugimoto M , Takazawa M , Yamada M , Yasuda M , Tabata S
Ref : DNA Research , 8 :205 , 2001
Abstract : The nucleotide sequence of the entire genome of a filamentous cyanobacterium, Anabaena sp. strain PCC 7120, was determined. The genome of Anabaena consisted of a single chromosome (6,413,771 bp) and six plasmids, designated pCC7120alpha (408,101 bp), pCC7120beta (186,614 bp), pCC7120gamma (101,965 bp), pCC7120delta (55,414 bp), pCC7120epsilon (40,340 bp), and pCC7120zeta (5,584 bp). The chromosome bears 5368 potential protein-encoding genes, four sets of rRNA genes, 48 tRNA genes representing 42 tRNA species, and 4 genes for small structural RNAs. The predicted products of 45% of the potential protein-encoding genes showed sequence similarity to known and predicted proteins of known function, and 27% to translated products of hypothetical genes. The remaining 28% lacked significant similarity to genes for known and predicted proteins in the public DNA databases. More than 60 genes involved in various processes of heterocyst formation and nitrogen fixation were assigned to the chromosome based on their similarity to the reported genes. One hundred and ninety-five genes coding for components of two-component signal transduction systems, nearly 2.5 times as many as those in Synechocystis sp. PCC 6803, were identified on the chromosome. Only 37% of the Anabaena genes showed significant sequence similarity to those of Synechocystis, indicating a high degree of divergence of the gene information between the two cyanobacterial strains.
ESTHER : Kaneko_2001_DNA.Res_8_205
PubMedSearch : Kaneko_2001_DNA.Res_8_205
PubMedID: 11759840
Gene_locus related to this paper: anasp-ALL0111 , anasp-ALL0193 , anasp-ALL0254 , anasp-ALL0316 , anasp-ALL0955 , anasp-ALL0969 , anasp-ALL1161 , anasp-ALL1205 , anasp-ALL1353 , anasp-ALL1695 , anasp-ALL2050 , anasp-ALL2056 , anasp-ALL2058 , anasp-ALL2068 , anasp-ALL2529 , anasp-ALL2533 , anasp-ALL2753 , anasp-ALL2761 , anasp-ALL3898 , anasp-ALL4221 , anasp-ALL4875 , anasp-ALL8511 , anasp-ALR0039 , anasp-ALR0079 , anasp-ALR0130 , anasp-ALR0235 , anasp-ALR0851 , anasp-ALR1077 , anasp-ALR1270 , anasp-ALR1352 , anasp-ALR1362 , anasp-ALR1709 , anasp-ALR2045 , noss1-ALR3140 , anasp-ALR3514 , anasp-ALR3685 , anasp-ALR3911 , anasp-ALR4625 , anasp-ALR5028 , anasp-AROE , anasp-q8ymv5 , anasp-q8yxx2 , anasp-y1448 , noss1-ALL3113 , noss1-ALL4967 , noss1-ALR4786 , noss1-q8yrg0 , noss1-y2406

Title : Structural analysis of Arabidopsis thaliana chromosome 3. I. Sequence features of the regions of 4,504,864 bp covered by sixty P1 and TAC clones - Sato_2000_DNA.Res_7_131
Author(s) : Sato S , Nakamura Y , Kaneko T , Katoh T , Asamizu E , Tabata S
Ref : DNA Research , 7 :131 , 2000
Abstract : Based on the physical map of Arabidopsis thaliana chromosome 3 previously constructed with CIC YAC, TAC, P1 and BAC clones (Sato, S. et al., DNA Res., 5, 163-168, 1998), a total of 60 P1 and TAC clones were sequenced, and the sequence features of the resulting 4,504,864 bp regions were analyzed by applying various computer programs for similarity search and gene modeling. As a result, a total of 1054 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4066 bp. Introns were observed in 77% of the genes, and the average number per gene and the average length of the introns were 3.9 and 156 bp, respectively. These sequence features are essentially identical to those of chromosome 5 in our previous reports, but the gene density was slightly higher than that observed for chromosomes 2 and 4. The regions also contained 10 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http:\/\/www.kazusa.or.jp/kaos/.
ESTHER : Sato_2000_DNA.Res_7_131
PubMedSearch : Sato_2000_DNA.Res_7_131
PubMedID: 10819329
Gene_locus related to this paper: arath-At3g14360 , arath-At3g27325 , arath-Y3684 , arath-Q9LI62 , arath-Q9LUG8 , arath-Q9LUH1 , arath-Q6NKN2 , arath-Q9LW14 , arath-Q9LW28 , arath-SCP21 , arath-SCP33

Title : Sequence and analysis of chromosome 5 of the plant Arabidopsis thaliana - Tabata_2000_Nature_408_823
Author(s) : Tabata S , Kaneko T , Nakamura Y , Kotani H , Kato T , Asamizu E , Miyajima N , Sasamoto S , Kimura T , Hosouchi T , Kawashima K , Kohara M , Matsumoto M , Matsuno A , Muraki A , Nakayama S , Nakazaki N , Naruo K , Okumura S , Shinpo S , Takeuchi C , Wada T , Watanabe A , Yamada M , Yasuda M , Sato S , de la Bastide M , Huang E , Spiegel L , Gnoj L , O'Shaughnessy A , Preston R , Habermann K , Murray J , Johnson D , Rohlfing T , Nelson J , Stoneking T , Pepin K , Spieth J , Sekhon M , Armstrong J , Becker M , Belter E , Cordum H , Cordes M , Courtney L , Courtney W , Dante M , Du H , Edwards J , Fryman J , Haakensen B , Lamar E , Latreille P , Leonard S , Meyer R , Mulvaney E , Ozersky P , Riley A , Strowmatt C , Wagner-McPherson C , Wollam A , Yoakum M , Bell M , Dedhia N , Parnell L , Shah R , Rodriguez M , See LH , Vil D , Baker J , Kirchoff K , Toth K , King L , Bahret A , Miller B , Marra M , Martienssen R , McCombie WR , Wilson RK , Murphy G , Bancroft I , Volckaert G , Wambutt R , Dusterhoft A , Stiekema W , Pohl T , Entian KD , Terryn N , Hartley N , Bent E , Johnson S , Langham SA , McCullagh B , Robben J , Grymonprez B , Zimmermann W , Ramsperger U , Wedler H , Balke K , Wedler E , Peters S , van Staveren M , Dirkse W , Mooijman P , Lankhorst RK , Weitzenegger T , Bothe G , Rose M , Hauf J , Berneiser S , Hempel S , Feldpausch M , Lamberth S , Villarroel R , Gielen J , Ardiles W , Bents O , Lemcke K , Kolesov G , Mayer K , Rudd S , Schoof H , Schueller C , Zaccaria P , Mewes HW , Bevan M , Fransz P
Ref : Nature , 408 :823 , 2000
Abstract : The genome of the model plant Arabidopsis thaliana has been sequenced by an international collaboration, The Arabidopsis Genome Initiative. Here we report the complete sequence of chromosome 5. This chromosome is 26 megabases long; it is the second largest Arabidopsis chromosome and represents 21% of the sequenced regions of the genome. The sequence of chromosomes 2 and 4 have been reported previously and that of chromosomes 1 and 3, together with an analysis of the complete genome sequence, are reported in this issue. Analysis of the sequence of chromosome 5 yields further insights into centromere structure and the sequence determinants of heterochromatin condensation. The 5,874 genes encoded on chromosome 5 reveal several new functions in plants, and the patterns of gene organization provide insights into the mechanisms and extent of genome evolution in plants.
ESTHER : Tabata_2000_Nature_408_823
PubMedSearch : Tabata_2000_Nature_408_823
PubMedID: 11130714
Gene_locus related to this paper: arath-At5g11650 , arath-At5g16120 , arath-at5g18630 , arath-AT5G20520 , arath-At5g21950 , arath-AT5G27320 , arath-CXE15 , arath-F1N13.220 , arath-F14F8.240 , arath-q3e9e4 , arath-q8lae9 , arath-Q8LFB7 , arath-q9ffg7 , arath-q9fij5 , arath-Q9LVU7 , arath-q66gm8 , arath-SCPL34 , arath-B9DFR3 , arath-a0a1p8bcz0

Title : Structural analysis of Arabidopsis thaliana chromosome 3. II. Sequence features of the 4,251,695 bp regions covered by 90 P1, TAC and BAC clones - Kaneko_2000_DNA.Res_7_217
Author(s) : Kaneko T , Katoh T , Sato S , Nakamura A , Asamizu E , Tabata S
Ref : DNA Research , 7 :217 , 2000
Abstract : To deduce the entire sequence of the top arm of the Arabidopsis thaliana chromosome 3, the sequence determination was performed on a total of 90 P1, TAC and BAC clones chosen according to our sequencing strategy. Sequence features of the resulting 4,251,695 bp regions were analyzed with various computer programs for similarity search and gene modeling. As a result, a total of 941 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4210 bp. Introns were observed in 73% of the genes, and the average number per gene and the average length of the introns were 3.6 and 159 bp, respectively. These sequence features are essentially identical to those of chromosomes 3 and 5 in our previous reports. The regions also contained 14 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http:\/\/www.kazusa.or.jp/kaos/.
ESTHER : Kaneko_2000_DNA.Res_7_217
PubMedSearch : Kaneko_2000_DNA.Res_7_217
PubMedID: 10907853
Gene_locus related to this paper: arath-SCP16 , arath-CXE11 , arath-q9lhe8 , arath-Q9LJI1 , arath-Q9LK01 , arath-SCP15

Title : Structural analysis of Arabidopsis thaliana chromosome 5. X. Sequence features of the regions of 3,076,755 bp covered by sixty P1 and TAC clones - Sato_2000_DNA.Res_7_31
Author(s) : Sato S , Nakamura Y , Kaneko T , Katoh T , Asamizu E , Kotani H , Tabata S
Ref : DNA Research , 7 :31 , 2000
Abstract : In our ongoing project to deduce the nucleotide sequence of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced on the basis of the fine physical map, and as of January, 2000, the sequences of 16.6 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. Along with the sequence determination, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and we already predicted a total of 2697 potential protein coding genes in the 11,166,130 bp regions, which are covered by 159 P1 and TAC clones. In this paper, we describe the structural features of the 3,076,755 bp regions covered by newly analyzed 60 P1 and TAC clones. A total of 715 potential protein coding genes were identified, giving an average density of the genes identified of 1 gene per 4001 bp. Introns were observed in 80% of the genes, and the average number per gene and the average length of the introns were 4.5 and 147 bp, respectively. These sequence features are nearly identical to those in our latest report in which the data were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 12 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http:\/\/www.kazusa.or.jp/kaos/.
ESTHER : Sato_2000_DNA.Res_7_31
PubMedSearch : Sato_2000_DNA.Res_7_31
PubMedID: 10718197
Gene_locus related to this paper: arath-At5g47330 , arath-AT5G51180 , arath-AT5G62180 , arath-At5g67050 , arath-MES18 , arath-CXE18 , arath-Q9LTX5 , arath-Q9LVS4 , arath-Q9LVS5 , arath-Q9LVU7 , arath-Q9LVX9 , arath-SCP41 , arath-SCP42

Title : Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana - Salanoubat_2000_Nature_408_820
Author(s) : Salanoubat M , Lemcke K , Rieger M , Ansorge W , Unseld M , Fartmann B , Valle G , Blocker H , Perez-Alonso M , Obermaier B , Delseny M , Boutry M , Grivell LA , Mache R , Puigdomenech P , de Simone V , Choisne N , Artiguenave F , Robert C , Brottier P , Wincker P , Cattolico L , Weissenbach J , Saurin W , Quetier F , Schafer M , Muller-Auer S , Gabel C , Fuchs M , Benes V , Wurmbach E , Drzonek H , Erfle H , Jordan N , Bangert S , Wiedelmann R , Kranz H , Voss H , Holland R , Brandt P , Nyakatura G , Vezzi A , D'Angelo M , Pallavicini A , Toppo S , Simionati B , Conrad A , Hornischer K , Kauer G , Lohnert TH , Nordsiek G , Reichelt J , Scharfe M , Schon O , Bargues M , Terol J , Climent J , Navarro P , Collado C , Perez-Perez A , Ottenwalder B , Duchemin D , Cooke R , Laudie M , Berger-Llauro C , Purnelle B , Masuy D , de Haan M , Maarse AC , Alcaraz JP , Cottet A , Casacuberta E , Monfort A , Argiriou A , Flores M , Liguori R , Vitale D , Mannhaupt G , Haase D , Schoof H , Rudd S , Zaccaria P , Mewes HW , Mayer KF , Kaul S , Town CD , Koo HL , Tallon LJ , Jenkins J , Rooney T , Rizzo M , Walts A , Utterback T , Fujii CY , Shea TP , Creasy TH , Haas B , Maiti R , Wu D , Peterson J , Van Aken S , Pai G , Militscher J , Sellers P , Gill JE , Feldblyum TV , Preuss D , Lin X , Nierman WC , Salzberg SL , White O , Venter JC , Fraser CM , Kaneko T , Nakamura Y , Sato S , Kato T , Asamizu E , Sasamoto S , Kimura T , Idesawa K , Kawashima K , Kishida Y , Kiyokawa C , Kohara M , Matsumoto M , Matsuno A , Muraki A , Nakayama S , Nakazaki N , Shinpo S , Takeuchi C , Wada T , Watanabe A , Yamada M , Yasuda M , Tabata S
Ref : Nature , 408 :820 , 2000
Abstract : Arabidopsis thaliana is an important model system for plant biologists. In 1996 an international collaboration (the Arabidopsis Genome Initiative) was formed to sequence the whole genome of Arabidopsis and in 1999 the sequence of the first two chromosomes was reported. The sequence of the last three chromosomes and an analysis of the whole genome are reported in this issue. Here we present the sequence of chromosome 3, organized into four sequence segments (contigs). The two largest (13.5 and 9.2 Mb) correspond to the top (long) and the bottom (short) arms of chromosome 3, and the two small contigs are located in the genetically defined centromere. This chromosome encodes 5,220 of the roughly 25,500 predicted protein-coding genes in the genome. About 20% of the predicted proteins have significant homology to proteins in eukaryotic genomes for which the complete sequence is available, pointing to important conserved cellular functions among eukaryotes.
ESTHER : Salanoubat_2000_Nature_408_820
PubMedSearch : Salanoubat_2000_Nature_408_820
PubMedID: 11130713
Gene_locus related to this paper: arath-MES17 , arath-AT3G12150 , arath-At3g61680 , arath-AT3g62590 , arath-CXE12 , arath-eds1 , arath-SCP25 , arath-F1P2.110 , arath-F1P2.140 , arath-F11F8.28 , arath-F14D17.80 , arath-F16B3.4 , arath-SCP27 , arath-At3g50790 , arath-At3g05600 , arath-PAD4 , arath-At3g51000 , arath-SCP16 , arath-gid1 , arath-GID1B , arath-Q9LUG8 , arath-Q84JS1 , arath-Q9SFF6 , arath-q9m236 , arath-q9sr22 , arath-q9sr23 , arath-SCP7 , arath-SCP14 , arath-SCP15 , arath-SCP17 , arath-SCP36 , arath-SCP37 , arath-SCP39 , arath-SCP40 , arath-SCP49 , arath-T19F11.2

Title : Complete genome structure of the nitrogen-fixing symbiotic bacterium Mesorhizobium loti - Kaneko_2000_DNA.Res_7_331
Author(s) : Kaneko T , Nakamura Y , Sato S , Asamizu E , Kato T , Sasamoto S , Watanabe A , Idesawa K , Ishikawa A , Kawashima K , Kimura T , Kishida Y , Kiyokawa C , Kohara M , Matsumoto M , Matsuno A , Mochizuki Y , Nakayama S , Nakazaki N , Shimpo S , Sugimoto M , Takeuchi C , Yamada M , Tabata S
Ref : DNA Research , 7 :331 , 2000
Abstract : The complete nucleotide sequence of the genome of a symbiotic bacterium Mesorhizobium loti strain MAFF303099 was determined. The genome of M. loti consisted of a single chromosome (7,036,071 bp) and two plasmids, designated as pMLa (351,911 bp) and pMLb (208, 315 bp). The chromosome comprises 6752 potential protein-coding genes, two sets of rRNA genes and 50 tRNA genes representing 47 tRNA species. Fifty-four percent of the potential protein genes showed sequence similarity to genes of known function, 21% to hypothetical genes, and the remaining 25% had no apparent similarity to reported genes. A 611-kb DNA segment, a highly probable candidate of a symbiotic island, was identified, and 30 genes for nitrogen fixation and 24 genes for nodulation were assigned in this region. Codon usage analysis suggested that the symbiotic island as well as the plasmids originated and were transmitted from other genetic systems. The genomes of two plasmids, pMLa and pMLb, contained 320 and 209 potential protein-coding genes, respectively, for a variety of biological functions. These include genes for the ABC-transporter system, phosphate assimilation, two-component system, DNA replication and conjugation, but only one gene for nodulation was identified.
ESTHER : Kaneko_2000_DNA.Res_7_331
PubMedSearch : Kaneko_2000_DNA.Res_7_331
PubMedID: 11214968
Gene_locus related to this paper: meslo-acoc , meslo-EphB , meslo-est , meslo-lipest , meslo-MLL0014 , meslo-MLL0351 , meslo-MLL0537 , meslo-mll0601 , meslo-MLL0618 , meslo-MLL1209 , meslo-MLL1226 , meslo-mll1328 , meslo-MLL1329 , meslo-MLL1495 , meslo-MLL1869 , meslo-mll1900 , meslo-MLL2018 , meslo-MLL2072 , meslo-mll2481 , meslo-mll2689 , meslo-MLL2788 , meslo-MLL3556 , meslo-MLL3568 , meslo-MLL3682 , meslo-mll3776 , meslo-MLL4497 , meslo-MLL4552 , meslo-MLL5128 , meslo-mll5179 , meslo-mll5392 , meslo-MLL5717 , meslo-mll5743 , meslo-MLL6746 , meslo-MLL6752 , meslo-mll6871 , meslo-MLL7643 , meslo-mll7742 , meslo-MLL9722 , meslo-MLR0094 , meslo-mlr0145 , meslo-mlr0170 , meslo-MLR0240 , meslo-mlr0493 , meslo-MLR0937 , meslo-mlr0978 , meslo-MLR0992 , meslo-MLR1612 , meslo-mlr1789 , meslo-mlr1864 , meslo-mlr2176 , meslo-MLR2262 , meslo-mlr2612 , meslo-mlr2710 , meslo-mlr3034 , meslo-MLR3538 , meslo-mlr3816 , meslo-mlr4436 , meslo-MLR4903 , meslo-MLR5045 , meslo-MLR5063 , rhilo-dhaa , meslo-MLR6087 , meslo-MLR6657 , meslo-mlr6682 , meslo-mlr6683 , meslo-MLR6684 , meslo-MLR6787 , meslo-MLR6993 , meslo-mlr6999 , meslo-mlr7206 , meslo-mlr7232 , meslo-mlr7803 , meslo-MLR9053 , meslo-mlr9622 , meslo-mlr9641 , rhilo-MLL0076 , rhilo-MLL1824 , rhilo-MLL7123 , rhilo-MLL8374 , rhilo-MLR1247 , rhilo-MLR2444 , rhilo-MLR4383 , rhilo-MLR8175 , rhilo-q98nf6 , rhilo-q98nf8 , rhilo-q988i9

Title : Structural analysis of Arabidopsis thaliana chromosome 5. IX. Sequence features of the regions of 1,011,550 bp covered by seventeen P1 and TAC clones - Kaneko_1999_DNA.Res_6_183
Author(s) : Kaneko T , Katoh T , Sato S , Nakamura Y , Asamizu E , Kotani H , Miyajima N , Tabata S
Ref : DNA Research , 6 :183 , 1999
Abstract : In this series of projects sequencing the entire genome of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced according to the fine physical map, and as of May 7, 1999, the sequences of 16.2 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. In parallel, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and to date we have predicted a total of 2380 potential protein-coding genes in the 10,154,580 bp regions, which are covered by 142 P1 and TAC clones. In this paper, we newly analyzed the structural features of the 1,011,550 bp regions covered by additional 17 P1 and TAC clones, and predicted 298 protein-coding genes. The average density of the genes identified was 1 gene per 3394 bp. Introns were observed in 67% of the genes, and the average number per gene and the average length of the introns were 3.2 and 159 bp, respectively. The gene density became higher than the value estimated in the previously analyzed regions (1 gene per 4,267 bp), as the data in this paper were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 8 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available on the database KAOS (Kazusa Arabidopsis data Opening Site) at http:\/\/www.kazusa.or.jp/arabi/.
ESTHER : Kaneko_1999_DNA.Res_6_183
PubMedSearch : Kaneko_1999_DNA.Res_6_183
PubMedID: 10470850
Gene_locus related to this paper: arath-At5g37710 , arath-SCP19 , arath-Q9FI59

Title : Structural analysis of Arabidopsis thaliana chromosome 5. VIII. Sequence features of the regions of 1,081,958 bp covered by seventeen physically assigned P1 and TAC clones. -
Author(s) : Asamizu E , Sato S , Kaneko T , Nakamura Y , Kotani H , Miyajima N , Tabata S
Ref : DNA Research , 5 :379 , 1998
PubMedID: 10048488
Gene_locus related to this paper: arath-At5g14930 , arath-at5g18630 , arath-AT5G19050 , arath-At5g24180 , arath-AT5G24190 , arath-At5g24200 , arath-F2G14.100 , arath-F6H11.120 , arath-F9G14.280 , arath-F14F8.240 , arath-HNL , arath-At5g14310

Title : Structural analysis of Arabidopsis thaliana chromosome 5. VII. Sequence features of the regions of 1,013,767 bp covered by sixteen physically assigned P1 and TAC clones - Nakamura_1998_DNA.Res_5_297
Author(s) : Nakamura Y , Sato S , Asamizu E , Kaneko T , Kotani H , Miyajima N , Tabata S
Ref : DNA Research , 5 :297 , 1998
Abstract : Sixteen P1 and TAC clones assigned to Arabidopsis thaliana chromosome 5 were sequenced, and their sequence features were analyzed using various computer programs. The total length of the sequences determined was 1,013,767 bp. Together with the nucleotide sequences of 109 clones previously reported, the regions of chromosome 5 sequenced so far now total 9,072,622 bp, which presumably covers approximately one-third of the chromosome. A similarity search against the reported gene sequences predicted the presence of a total of 225 protein-coding genes and/or gene segments in the newly sequenced regions, indicating an average gene density of one gene per 4.5 kb. Introns were identified in 72.4% of the potential protein genes for which the entire gene structure was predicted, and the average number per gene and the average length of the introns were 3.3 and 163 bp, respectively. These sequence features are essentially identical to those in the previously reported sequences. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.
ESTHER : Nakamura_1998_DNA.Res_5_297
PubMedSearch : Nakamura_1998_DNA.Res_5_297
PubMedID: 9872454
Gene_locus related to this paper: arath-At5g41900 , arath-Q9FJ29

Title : Structural analysis of Arabidopsis thaliana chromosome 5. V. Sequence features of the regions of 1,381,565 bp covered by twenty one physically assigned P1 and TAC clones - Kaneko_1998_DNA.Res_5_131
Author(s) : Kaneko T , Kotani H , Nakamura Y , Sato S , Asamizu E , Miyajima N , Tabata S
Ref : DNA Research , 5 :131 , 1998
Abstract : The nucleotide sequences of 21 P1 and TAC clones which have been precisely localized to the fine physical map of the Arabidopsis thaliana chromosome 5, were determined, and their sequence features were analyzed. The total length of the regions sequenced in this study were 1,381,565 bp, bringing the total length of the chromosome 5 sequences determined so far to 6,691,670 bp together with the regions of the 69 clones previously reported. By computer-aided analyses including similarity search against protein and EST databases and gene modeling with computer programs, a total of 337 potential protein-coding genes and/or gene segments were identified on the basis of similarity to the reported gene sequences. An average density of the genes and/or gene segments thus assigned was 1 gene/4,100 bp. Introns were identified in 76.7% of the potential protein genes for which the entire gene structure were predicted, and the average number per gene and the average length of the introns were 3.9 and 176 bp, respectively. These sequence features are essentially identical to those in the previously reported sequences. The numbers of the Arabidopsis ESTs matched to each of the predicted genes have been counted to monitor the transcription level. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http:@www.kazusa.or.jp@arabi
ESTHER : Kaneko_1998_DNA.Res_5_131
PubMedSearch : Kaneko_1998_DNA.Res_5_131
PubMedID: 9679202
Gene_locus related to this paper: arath-AT5G39220 , arath-Q9FKP9

Title : Structural analysis of Arabidopsis thaliana chromosome 5. IV. Sequence features of the regions of 1,456,315 bp covered by nineteen physically assigned P1 and TAC clones - Sato_1998_DNA.Res_5_41
Author(s) : Sato S , Kaneko T , Kotani H , Nakamura Y , Asamizu E , Miyajima N , Tabata S
Ref : DNA Research , 5 :41 , 1998
Abstract : Nineteen P1 and TAC clones, which have been precisely localized to the fine physical map of Arabidopsis thaliana chromosome 5, were newly sequenced, and their sequence features were analysed. The total length of the clones sequenced was 1,456,315 bp. Together with the previously reported sequences, the regions of chromosome 5 that have been sequenced to date is now 5,310,105 bp. When the sequences determined in this study were subjected to similarity search against protein and expressed sequence tag (EST) databases and analysis with computer programs for gene modeling, a total of 354 potential protein-coding genes and/or gene segments were identified. The average density of the assigned genes and/or gene segments was one gene per 4,114 bp. Introns were identified in 75% of the potential protein genes, and the average number per gene and the average length of the introns were 3.7 and 194 bp, respectively. These sequence features are essentially identical to those in the previously reported sequences. The numbers of the Arabidopsis ESTs matched to each of the predicted genes have been counted to monitor the transcription level. The sequence data and gene information are available on the World Wide Web database KAOS (the Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.
ESTHER : Sato_1998_DNA.Res_5_41
PubMedSearch : Sato_1998_DNA.Res_5_41
PubMedID: 9628582
Gene_locus related to this paper: arath-AT5G41120 , arath-At5g41130 , arath-F6H11.120

Title : Structural analysis of Arabidopsis thaliana chromosome 5. VI. Sequence features of the regions of 1,367,185 bp covered by 19 physically assigned P1 and TAC clones. -
Author(s) : Kotani H , Nakamura Y , Sato S , Asamizu E , Kaneko T , Miyajima N , Tabata S
Ref : DNA Research , 5 :203 , 1998
PubMedID: 9734815
Gene_locus related to this paper: arath-At5g56750

Title : Structural analysis of Arabidopsis thaliana chromosome 5. II. Sequence features of the regions of 1,044,062 bp covered by thirteen physically assigned P1 clones - Kotani_1997_DNA.Res_4_291
Author(s) : Kotani H , Nakamura Y , Sato S , Kaneko T , Asamizu E , Miyajima N , Tabata S
Ref : DNA Research , 4 :291 , 1997
Abstract : A total of 13 P1 clones, each containing a marker(s) specifically mapped on chromosome 5, were isolated from a P1 library of the Arabidopsis thaliana Columbia genome, and their nucleotide sequences were determined according to the shot gun based strategy and precisely located on the physical map of chromosome 5. The total length of the sequenced regions was 1,044,062 bp. Since we have previously reported the sequence of 1,621,245 bp by analysis of 20 non-redundant P1 clones, the total length of the sequences of chromosome 5 determined so far reached 2,665,307 bp. The regions sequenced in this study were analysed by comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling; a total of 225 potential protein-coding genes and/or gene segments with known or predicted functions were identified. The positions of exons which do not exhibit similarity to known genes were also predicted by computer-aided analysis. An average density of the genes and/or gene was 1 gene/4,640 bp. Introns were identified in approximately 84% of the potential genes, and the average number and length of the introns per gene were 5.3 and 184 bp, respectively. These sequence features are essentially identical to those for the previously sequenced regions. The transcription level of the predicted genes has been roughly monitored by counting the numbers of matched Arabidopsis ESTs. The sequence data and gene information are available through the World Wide Web at https://www.kazusa.or.jp/kaos/.
ESTHER : Kotani_1997_DNA.Res_4_291
PubMedSearch : Kotani_1997_DNA.Res_4_291
PubMedID: 9405937
Gene_locus related to this paper: arath-At5g13640 , arath-AT5G24210 , arath-At5g24220 , arath-AT5G24230 , arath-At5g17670 , arath-Q9FN74 , arath-Q9FN79 , arath-Q9FNF6

Title : Structural analysis of Arabidopsis thaliana chromosome 5. III. Sequence features of the regions of 1,191,918 bp covered by seventeen physically assigned P1 clones - Nakamura_1997_DNA.Res_4_401
Author(s) : Nakamura Y , Sato S , Kaneko T , Kotani H , Asamizu E , Miyajima N , Tabata S
Ref : DNA Research , 4 :401 , 1997
Abstract : A total of 17 P1 and TAC clones each containing a marker(s) specifically mapped on chromosome 5 were isolated from P1 and TAC libraries of the Arabidopsis thaliana Columbia genome, and their nucleotide sequences were determined according to the shot gun-based strategy and precisely located on the physical map of chromosome 5. The total length of the clones sequenced in this study was 1,191,918 bp. As we have previously reported the sequence of 2,662,078 bp by analysis of 33 P1 clones, the total length of the sequences of chromosome 5 determined so far is now 3,853,996 bp. The sequences determined in this study were subjected to similarity search against protein and EST databases and analysis with computer programs for gene modeling, and a total of 310 potential protein-coding genes and/or gene segments with known or predicted functions were identified. The positions of exons which do not show apparent similarity to known genes were also predicted by computer-aided analysis. An average density of the assigned genes and/or gene segments was 1 gene/3,845 bp. Introns were identified in 78% of the potential protein genes, and the average number per gene and the average length of the introns were 3.7 and 185 bp, respectively. The numbers of the Arabidopsis ESTs matched to each of the predicted genes have been counted to monitor the transcription level. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.
ESTHER : Nakamura_1997_DNA.Res_4_401
PubMedSearch : Nakamura_1997_DNA.Res_4_401
PubMedID: 9501997
Gene_locus related to this paper: arath-AT5G22460 , arath-At5g42930 , arath-LIP2 , arath-SCPL34

Title : Structural analysis of Arabidopsis thaliana chromosome 5. I. Sequence features of the 1.6 Mb regions covered by twenty physically assigned P1 clones - Sato_1997_DNA.Res_4_215
Author(s) : Sato S , Kotani H , Nakamura Y , Kaneko T , Asamizu E , Fukami M , Miyajima N , Tabata S
Ref : DNA Research , 4 :215 , 1997
Abstract : A total of 20 P1 clones with an average insert size of 80 kb and each containing a marker(s) specifically mapped on chromosome 5 were isolated from a P1 library of the Arabidopsis thaliana genome, and their nucleotide sequences were determined according to a shotgun-based strategy and precisely located on the physical map of chromosome 5 separately constructed. The total length of the sequenced regions were summed up to 1,621,245 bp. By comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling, a total of 347 potential protein-coding genes and/or gene segments with known or predicted functions were identified. The positions of exons which do not exhibit any similarity to known genes were also predicted. An average density of the genes and/or gene segments assigned so far as 1 gene/4,672 bp. Introns were identified in approximately 78% of the potential genes, and the average number and length of the introns per gene were 3.7 and 161 bp. The transcription level of the predicted genes was roughly monitored by counting the numbers of identified Arabidopsis ESTs. The sequence data and gene information are available through the World Wide Web at http:/(/)www.kazusa.or.jp/arabi/.
ESTHER : Sato_1997_DNA.Res_4_215
PubMedSearch : Sato_1997_DNA.Res_4_215
PubMedID: 9330910
Gene_locus related to this paper: arath-AT5G38220 , arath-At5g38520 , arath-Q8LFB7 , arath-Q9FF27 , arath-q9ffg7 , arath-Q9FFZ1 , arath-SCP47 , arath-B9DFR3

Title : Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions - Kaneko_1996_DNA.Res_3_109
Author(s) : Kaneko T , Sato S , Kotani H , Tanaka A , Asamizu E , Nakamura Y , Miyajima N , Hirosawa M , Sugiura M , Sasamoto S , Kimura T , Hosouchi T , Matsuno A , Muraki A , Nakazaki N , Naruo K , Okumura S , Shimpo S , Takeuchi C , Wada T , Watanabe A , Yamada M , Yasuda M , Tabata S
Ref : DNA Research , 3 :109 , 1996
Abstract : The sequence determination of the entire genome of the Synechocystis sp. strain PCC6803 was completed. The total length of the genome finally confirmed was 3,573,470 bp, including the previously reported sequence of 1,003,450 bp from map position 64% to 92% of the genome. The entire sequence was assembled from the sequences of the physical map-based contigs of cosmid clones and of lambda clones and long PCR products which were used for gap-filling. The accuracy of the sequence was guaranteed by analysis of both strands of DNA through the entire genome. The authenticity of the assembled sequence was supported by restriction analysis of long PCR products, which were directly amplified from the genomic DNA using the assembled sequence data. To predict the potential protein-coding regions, analysis of open reading frames (ORFs), analysis by the GeneMark program and similarity search to databases were performed. As a result, a total of 3,168 potential protein genes were assigned on the genome, in which 145 (4.6%) were identical to reported genes and 1,257 (39.6%) and 340 (10.8%) showed similarity to reported and hypothetical genes, respectively. The remaining 1,426 (45.0%) had no apparent similarity to any genes in databases. Among the potential protein genes assigned, 128 were related to the genes participating in photosynthetic reactions. The sum of the sequences coding for potential protein genes occupies 87% of the genome length. By adding rRNA and tRNA genes, therefore, the genome has a very compact arrangement of protein- and RNA-coding regions. A notable feature on the gene organization of the genome was that 99 ORFs, which showed similarity to transposase genes and could be classified into 6 groups, were found spread all over the genome, and at least 26 of them appeared to remain intact. The result implies that rearrangement of the genome occurred frequently during and after establishment of this species.
ESTHER : Kaneko_1996_DNA.Res_3_109
PubMedSearch : Kaneko_1996_DNA.Res_3_109
PubMedID: 8905231
Gene_locus related to this paper: synsp-ester , synsp-PHBC , synsp-prxc , synsp-Q55130 , synsp-SLL0482 , synsp-sll0553 , synsp-SLL0992 , synsp-sll1305 , synsp-SLL1969 , synsp-SLR0825 , synsp-slr1235 , synsp-SLR1506 , synsp-SLR1771 , synsp-SLR1807 , synsp-slr1827 , synsp-slr1916 , synsp-slr1917 , synsp-slr1932 , synsp-SLR1944 , synsp-SLR2053 , synsp-todF , syny3-dlhh , syny3-P73192 , syny3-p73194 , syny3-y249 , syny3-y264

Title : Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. I. Sequence features in the 1 Mb region from map positions 64\% to 92\% of the genome - Kaneko_1995_DNA.Res_2_153
Author(s) : Kaneko T , Tanaka A , Sato S , Kotani H , Sazuka T , Miyajima N , Sugiura M , Tabata S
Ref : DNA Research , 2 :153 , 1995
Abstract : The contiguous sequence of 1,003,450 bp spanning map positions 64% to 92% of the genome of Synechocystis sp. strain PCC6803 has been deduced. Computer analysis of the sequence predicts that this region contains at least 818 potential ORFs, in which 255 (31%) were either genes that had already been identified or their homologues, 84 (10%) were homologues to registered hypothetical genes, and 149 (18%) showed weak similarities to reported genes. The remaining 330 ORFs showed no apparent similarity to any reported genes or carried no significant protein motifs. The potential ORFs as a whole occupied 86% of the sequenced region, implying compact arrangement of genes in the genome. As to the structural RNA genes, one rRNA operon consisting of 5,028 bp and at least 11 species of tRNA genes were identified. It is noteworthy that 10 out of the 11 tRNA species showed significant sequence similarities to tRNAs reported in plant chloroplasts. As other notable unique sequences, three classes of IS-like elements each with characteristics typical of IS elements were identified, and a typical unit of WD(Trp-Asp)-repeats which have only been detected in the regulatory proteins of eukaryotes was identified within the large 5,079-bp ORF located at map position 69%.
ESTHER : Kaneko_1995_DNA.Res_2_153
PubMedSearch : Kaneko_1995_DNA.Res_2_153
PubMedID: 8590279
Gene_locus related to this paper: synsp-prxc , synsp-Q55130 , synsp-SLL0482 , synsp-sll0553 , synsp-SLR0825