(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Bacteria: NE > Terrabacteria group: NE > Firmicutes: NE > Bacilli: NE > Bacillales: NE > Bacillaceae: NE > Bacillus: NE > Bacillus subtilis group: NE > Bacillus subtilis: NE
Warning: This entry is a compilation of different species or line or strain with more than 90% amino acide identity. You can retrieve all strain data
(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) Bacillus subtilis subsp. spizizenii ATCC 6633: N, E.
Bacillus subtilis subsp. spizizenii: N, E.
Bacillus subtilis subsp. natto BEST195: N, E.
Bacillus subtilis subsp. spizizenii str. W23: N, E.
Bacillus subtilis BSn5: N, E.
Bacillus subtilis QH-1: N, E.
Bacillus subtilis QB928: N, E.
Bacillus subtilis subsp. subtilis str. BAB-1: N, E.
Bacillus subtilis BEST7613: N, E.
Bacillus subtilis subsp. subtilis str. SC-8: N, E.
Bacillus subtilis MB73/2: N, E.
Bacillus subtilis BEST7003: N, E.
Bacillus subtilis XF-1: N, E.
Bacillus subtilis subsp. spizizenii TU-B-10: N, E.
Bacillus subtilis subsp. subtilis str. 168: N, E.
Bacillus subtilis subsp. subtilis str. RO-NN-1: N, E.
Bacillus subtilis PY79: N, E.
Bacillus subtilis subsp. subtilis str. BSP1: N, E.
Bacillus subtilis subsp. subtilis 6051-HGW: N, E.
Bacillus subtilis subsp. subtilis str. JH642 substr. AG174: N, E.
Bacillus subtilis subsp. subtilis str. AG1839: N, E.
Bacillus subtilis subsp. subtilis str. OH 131.1: N, E.
Bacillus subtilis E1: N, E.
Bacillus subtilis TO-A: N, E.
Bacillus subtilis Miyagi-4: N, E.
Bacillus subtilis subsp. subtilis: N, E.
Bacillus subtilis subsp. niger: N, E.
Bacillus subtilis subsp. inaquosorum KCTC 13429: N, E.
Bacillus subtilis subsp. globigii: N, E.
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MIVEKRRFPSPSQHVRLYTICYLSNGLRVKGLLAEPAEPGQYDGFLYLRG GIKSVGMVRPGRIIQFASQGFVVFAPFYRGNQGGEGNEDFAGEDREDAFS AFRLLQQHPNVKKDRIHIFGFSRGGIMGMLTAIEMGGQAASFVSWGGVSD MILTYEERQDLRRMMKRVIGGTPKKVPEEYQWRTPFDQVNKIQAPVLLIH GEKDQNVSIQHSYLLEEKLKQLHKPVETWYYSTFTHYFPPKENRRIVRQL TQWMKNR
BACKGROUND: Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and functions as a starter for the production of the traditional Japanese food "natto" made from soybeans. Although re-sequencing whole genomes of several laboratory domesticated B. subtilis 168 derivatives has already been attempted using short read sequencing data, the assembly of the whole genome sequence of a closely related strain, B. subtilis natto, from very short read data is more challenging, particularly with our aim to assemble one fully connected scaffold from short reads around 35 bp in length. RESULTS: We applied a comparative genome assembly method, which combines de novo assembly and reference guided assembly, to one of the B. subtilis natto strains. We successfully assembled 28 scaffolds and managed to avoid substantial fragmentation. Completion of the assembly through long PCR experiments resulted in one connected scaffold for B. subtilis natto. Based on the assembled genome sequence, our orthologous gene analysis between natto BEST195 and Marburg 168 revealed that 82.4% of 4375 predicted genes in BEST195 are one-to-one orthologous to genes in 168, with two genes in-paralog, 3.2% are deleted in 168, 14.3% are inserted in BEST195, and 5.9% of genes present in 168 are deleted in BEST195. The natto genome contains the same alleles in the promoter region of degQ and the coding region of swrAA as the wild strain, RO-FF-1. These are specific for gamma-PGA production ability, which is related to natto production. Further, the B. subtilis natto strain completely lacked a polyketide synthesis operon, disrupted the plipastatin production operon, and possesses previously unidentified transposases. CONCLUSIONS: The determination of the whole genome sequence of Bacillus subtilis natto provided detailed analyses of a set of genes related to natto production, demonstrating the number and locations of insertion sequences that B. subtilis natto harbors but B. subtilis 168 lacks. Multiple genome-level comparisons among five closely related Bacillus species were also carried out. The determined genome sequence of B. subtilis natto and gene annotations are available from the Natto genome browser http:\/\/natto-genome.org/.
Bacillus subtilis is the best-characterized member of the Gram-positive bacteria. Its genome of 4,214,810 base pairs comprises 4,100 protein-coding genes. Of these protein-coding genes, 53% are represented once, while a quarter of the genome corresponds to several gene families that have been greatly expanded by gene duplication, the largest family containing 77 putative ATP-binding transport proteins. In addition, a large proportion of the genetic capacity is devoted to the utilization of a variety of carbon sources, including many plant-derived molecules. The identification of five signal peptidase genes, as well as several genes for components of the secretion apparatus, is important given the capacity of Bacillus strains to secrete large amounts of industrially important enzymes. Many of the genes are involved in the synthesis of secondary metabolites, including antibiotics, that are more typically associated with Streptomyces species. The genome contains at least ten prophages or remnants of prophages, indicating that bacteriophage infection has played an important evolutionary role in horizontal gene transfer, in particular in the propagation of bacterial pathogenesis.