N link to NCBI taxonomic web page and E link to ESTHER gene locus found in this strain. > cellular organisms: NE > Bacteria: NE > Terrabacteria group: NE > Firmicutes: NE > Bacilli: NE > Bacillales: NE > Bacillaceae: NE > Bacillus: NE > Bacillus subtilis group: NE > Bacillus subtilis: NE
Warning: This entry is a compilation of different species or line or strain with more than 90% amino acide identity. You can retrieve all strain data
N link to NCBI taxonomic web page and E link to ESTHER gene locus found in this strain. Bacillus subtilis subsp. spizizenii ATCC 6633: N, E.
Bacillus subtilis subsp. spizizenii: N, E.
Bacillus subtilis BSn5: N, E.
Bacillus subtilis subsp. spizizenii str. W23: N, E.
Bacillus subtilis subsp. natto BEST195: N, E.
Bacillus subtilis subsp. subtilis str. 168: N, E.
Bacillus subtilis subsp. subtilis str. SC-8: N, E.
Bacillus subtilis subsp. spizizenii TU-B-10: N, E.
Bacillus subtilis subsp. subtilis str. RO-NN-1: N, E.
Bacillus subtilis QH-1: N, E.
Bacillus subtilis PY79: N, E.
Bacillus subtilis QB928: N, E.
Bacillus subtilis XF-1: N, E.
Bacillus subtilis subsp. subtilis str. BSP1: N, E.
Bacillus subtilis subsp. subtilis str. BAB-1: N, E.
Bacillus subtilis BEST7003: N, E.
Bacillus subtilis MB73/2: N, E.
Bacillus subtilis BEST7613: N, E.
Bacillus subtilis subsp. subtilis 6051-HGW: N, E.
Bacillus subtilis subsp. subtilis str. JH642 substr. AG174: N, E.
Bacillus subtilis subsp. subtilis str. AG1839: N, E.
Bacillus subtilis subsp. subtilis str. OH 131.1: N, E.
Bacillus subtilis E1: N, E.
Bacillus subtilis TO-A: N, E.
Bacillus subtilis Miyagi-4: N, E.
Bacillus subtilis subsp. subtilis: N, E.
Bacillus subtilis subsp. niger: N, E.
Bacillus subtilis subsp. inaquosorum KCTC 13429: N, E.
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MDGVKCQFVNTNGITLHVAAAGREDGPLIVLLHGFPEFWYGWKNQIKPLV DAGYRVIAPDQRGYNLSDKPEGIDSYRIDTLRDDIIGLITQFTDEKAIVI GHDWGGAVAWHLASTRPEYLEKLIAINIPHPHVMKTVTPLYPPQWLKSSY IAYFQLPDIPEASLRENDYDTLDKAIGLSDRPALFTSEDVSRYKEAWKQP GALTAMLNWYRALRKGSLAEKPSYETVPYRMIWGMEDRFLSRKLAKETER HCPNGHLIFVDEASHWINHEKPAIVNQLILEYLKNQ
BACKGROUND: Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticated B. subtilis 168 derivatives has already been attempted using short read sequencing data, the assembly of the whole genome sequence of a closely related strain, B. subtilis natto, from very short read data is more challenging, particularly with our aim to assemble one fully connected scaffold from short reads around 35 bp in length. RESULTS: We applied a comparative genome assembly method, which combines de novo assembly and reference guided assembly, to one of the B. subtilis natto strains. We successfully assembled 28 scaffolds and managed to avoid substantial fragmentation. Completion of the assembly through long PCR experiments resulted in one connected scaffold for B. subtilis natto. Based on the assembled genome sequence, our orthologous gene analysis between natto BEST195 and Marburg 168 revealed that 82.4% of 4375 predicted genes in BEST195 are one-to-one orthologous to genes in 168, with two genes in-paralog, 3.2% are deleted in 168, 14.3% are inserted in BEST195, and 5.9% of genes present in 168 are deleted in BEST195. The natto genome contains the same alleles in the promoter region of degQ and the coding region of swrAA as the wild strain, RO-FF-1. These are specific for gamma-PGA production ability, which is related to natto production. Further, the B. subtilis natto strain completely lacked a polyketide synthesis operon, disrupted the plipastatin production operon, and possesses previously unidentified transposases. CONCLUSIONS: The determination of the whole genome sequence of Bacillus subtilis natto provided detailed analyses of a set of genes related to natto production, demonstrating the number and locations of insertion sequences that B. subtilis natto harbors but B. subtilis 168 lacks. Multiple genome-level comparisons among five closely related Bacillus species were also carried out. The determined genome sequence of B. subtilis natto and gene annotations are available from the Natto genome browser http:\/\/natto-genome.org/.
Bacillus subtilis is the best-characterized member of the Gram-positive bacteria. Its genome of 4,214,810 base pairs comprises 4,100 protein-coding genes. Of these protein-coding genes, 53% are represented once, while a quarter of the genome corresponds to several gene families that have been greatly expanded by gene duplication, the largest family containing 77 putative ATP-binding transport proteins. In addition, a large proportion of the genetic capacity is devoted to the utilization of a variety of carbon sources, including many plant-derived molecules. The identification of five signal peptidase genes, as well as several genes for components of the secretion apparatus, is important given the capacity of Bacillus strains to secrete large amounts of industrially important enzymes. Many of the genes are involved in the synthesis of secondary metabolites, including antibiotics, that are more typically associated with Streptomyces species. The genome contains at least ten prophages or remnants of prophages, indicating that bacteriophage infection has played an important evolutionary role in horizontal gene transfer, in particular in the propagation of bacterial pathogenesis.
        
Title: Cloning and sequencing of a 27.8-kb nucleotide sequence of the 79 degrees-81 degrees region of the Bacillus subtilis genome containing the sspE locus Yamamoto H, Uchiyama S, Sekiguchi J Ref: DNA Research, 3:257, 1996 : PubMed
The nucleotide sequence of a 27830-bp DNA segment in the 79 degrees-81 degrees region of the Bacillus subtilis genome has been determined. This region contains 29 complete ORFs including the sspE gene, which encodes a small acid-soluble spore protein gamma and locates on the one side terminal of our assigned region. A homology search for the products deduced from the 29 ORFs revealed that nine of them exhibit significant similarity to known proteins, e.g. proteins involved in an iron uptake system, a multidrug resistance protein, a chloramphenicol resistance protein, epoxide hydrolase, adenine glycosylase, and a glucose-1-dehydrogenase homolog.
BACKGROUND: Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticated B. subtilis 168 derivatives has already been attempted using short read sequencing data, the assembly of the whole genome sequence of a closely related strain, B. subtilis natto, from very short read data is more challenging, particularly with our aim to assemble one fully connected scaffold from short reads around 35 bp in length. RESULTS: We applied a comparative genome assembly method, which combines de novo assembly and reference guided assembly, to one of the B. subtilis natto strains. We successfully assembled 28 scaffolds and managed to avoid substantial fragmentation. Completion of the assembly through long PCR experiments resulted in one connected scaffold for B. subtilis natto. Based on the assembled genome sequence, our orthologous gene analysis between natto BEST195 and Marburg 168 revealed that 82.4% of 4375 predicted genes in BEST195 are one-to-one orthologous to genes in 168, with two genes in-paralog, 3.2% are deleted in 168, 14.3% are inserted in BEST195, and 5.9% of genes present in 168 are deleted in BEST195. The natto genome contains the same alleles in the promoter region of degQ and the coding region of swrAA as the wild strain, RO-FF-1. These are specific for gamma-PGA production ability, which is related to natto production. Further, the B. subtilis natto strain completely lacked a polyketide synthesis operon, disrupted the plipastatin production operon, and possesses previously unidentified transposases. CONCLUSIONS: The determination of the whole genome sequence of Bacillus subtilis natto provided detailed analyses of a set of genes related to natto production, demonstrating the number and locations of insertion sequences that B. subtilis natto harbors but B. subtilis 168 lacks. Multiple genome-level comparisons among five closely related Bacillus species were also carried out. The determined genome sequence of B. subtilis natto and gene annotations are available from the Natto genome browser http:\/\/natto-genome.org/.
        
Title: Diversity and biocatalytic potential of epoxide hydrolases identified by genome analysis van Loo B, Kingma J, Arand M, Wubbolts MG, Janssen DB Ref: Applied Environmental Microbiology, 72:2905, 2006 : PubMed
Epoxide hydrolases play an important role in the biodegradation of organic compounds and are potentially useful in enantioselective biocatalysis. An analysis of various genomic databases revealed that about 20% of sequenced organisms contain one or more putative epoxide hydrolase genes. They were found in all domains of life, and many fungi and actinobacteria contain several putative epoxide hydrolase-encoding genes. Multiple sequence alignments of epoxide hydrolases with other known and putative alpha/beta-hydrolase fold enzymes that possess a nucleophilic aspartate revealed that these enzymes can be classified into eight phylogenetic groups that all contain putative epoxide hydrolases. To determine their catalytic activities, 10 putative bacterial epoxide hydrolase genes and 2 known bacterial epoxide hydrolase genes were cloned and overexpressed in Escherichia coli. The production of active enzyme was strongly improved by fusion to the maltose binding protein (MalE), which prevented inclusion body formation and facilitated protein purification. Eight of the 12 fusion proteins were active toward one or more of the 21 epoxides that were tested, and they converted both terminal and nonterminal epoxides. Four of the new epoxide hydrolases showed an uncommon enantiopreference for meso-epoxides and/or terminal aromatic epoxides, which made them suitable for the production of enantiopure (S,S)-diols and (R)-epoxides. The results show that the expression of epoxide hydrolase genes that are detected by analyses of genomic databases is a useful strategy for obtaining new biocatalysts.
Bacillus subtilis is the best-characterized member of the Gram-positive bacteria. Its genome of 4,214,810 base pairs comprises 4,100 protein-coding genes. Of these protein-coding genes, 53% are represented once, while a quarter of the genome corresponds to several gene families that have been greatly expanded by gene duplication, the largest family containing 77 putative ATP-binding transport proteins. In addition, a large proportion of the genetic capacity is devoted to the utilization of a variety of carbon sources, including many plant-derived molecules. The identification of five signal peptidase genes, as well as several genes for components of the secretion apparatus, is important given the capacity of Bacillus strains to secrete large amounts of industrially important enzymes. Many of the genes are involved in the synthesis of secondary metabolites, including antibiotics, that are more typically associated with Streptomyces species. The genome contains at least ten prophages or remnants of prophages, indicating that bacteriophage infection has played an important evolutionary role in horizontal gene transfer, in particular in the propagation of bacterial pathogenesis.
        
Title: Cloning and sequencing of a 27.8-kb nucleotide sequence of the 79 degrees-81 degrees region of the Bacillus subtilis genome containing the sspE locus Yamamoto H, Uchiyama S, Sekiguchi J Ref: DNA Research, 3:257, 1996 : PubMed
The nucleotide sequence of a 27830-bp DNA segment in the 79 degrees-81 degrees region of the Bacillus subtilis genome has been determined. This region contains 29 complete ORFs including the sspE gene, which encodes a small acid-soluble spore protein gamma and locates on the one side terminal of our assigned region. A homology search for the products deduced from the 29 ORFs revealed that nine of them exhibit significant similarity to known proteins, e.g. proteins involved in an iron uptake system, a multidrug resistance protein, a chloramphenicol resistance protein, epoxide hydrolase, adenine glycosylase, and a glucose-1-dehydrogenase homolog.