We report the complete and annotated genome sequence of Bacillus cereus NC7401, a representative of the strain group that causes emetic-type food poisoning. The emetic toxin, cereulide, is produced by a nonribosomal peptide synthetase (NRPS) system that is encoded by a gene cluster on a large resident plasmid, pNCcld.
        
Title: Bacillus subtilis RghR (YvaN) represses rapG and rapH, which encode inhibitors of expression of the srfA operon. Hayashi K, Kensuke T, Kobayashi K, Ogasawara N, Ogura M Ref: Molecular Microbiology, 59:1714, 2006 : PubMed
Rap proteins regulate the activity of response regulators including Spo0F, DegU and ComA. We found that overexpression of either RapG or RapH severely downregulated the expression of srfA, which belongs to the ComA regulon. Disruption of those genes, however, had only small effects on srfA expression. These observations suggested that Bacillus subtilis cells possess a repressor for rapG and rapH. To identify candidate repressors we developed a novel transcription factor array (TF array) assay, in which disruptions of 287 genes encoding regulatory proteins were independently transformed into a strain carrying rapH-lacZ and the resultant transformants were grown on agar plates containing X-gal to detect beta-galactosidase activity. We identified a yvaN disruptant that showed a rapH-overproducing phenotype. DNA microarray analysis of the yvaN mutant suggested that both rapG and rapH were overproduced, leading to inhibition of srfA expression. In a gel retardation assay, purified His-tagged YvaN specifically bound to promoter sequences of rapG and rapH. Further footprint and gel retardation analyses using various deleted probes uncovered critical sequences for YvaN binding. In addition, a lacZ fusion analysis confirmed the significance of YvaN binding for transcriptional regulation of rapG and rapH. Thus, YvaN was renamed RghR (rapG and rapH repressor). As the rapH gene is activated by ComK and RapH inhibits comK indirectly, this constitutes an autoregulatory loop modulated by RghR.
The genome of Aspergillus oryzae, a fungus important for the production of traditional fermented foods and beverages in Japan, has been sequenced. The ability to secrete large amounts of proteins and the development of a transformation system have facilitated the use of A. oryzae in modern biotechnology. Although both A. oryzae and Aspergillus flavus belong to the section Flavi of the subgenus Circumdati of Aspergillus, A. oryzae, unlike A. flavus, does not produce aflatoxin, and its long history of use in the food industry has proved its safety. Here we show that the 37-megabase (Mb) genome of A. oryzae contains 12,074 genes and is expanded by 7-9 Mb in comparison with the genomes of Aspergillus nidulans and Aspergillus fumigatus. Comparison of the three aspergilli species revealed the presence of syntenic blocks and A. oryzae-specific blocks (lacking synteny with A. nidulans and A. fumigatus) in a mosaic manner throughout the genome of A. oryzae. The blocks of A. oryzae-specific sequence are enriched for genes involved in metabolism, particularly those for the synthesis of secondary metabolites. Specific expansion of genes for secretory hydrolytic enzymes, amino acid metabolism and amino acid/sugar uptake transporters supports the idea that A. oryzae is an ideal microorganism for fermentation.
Small, compact genomes of ultrasmall unicellular algae provide information on the basic and essential genes that support the lives of photosynthetic eukaryotes, including higher plants. Here we report the 16,520,305-base-pair sequence of the 20 chromosomes of the unicellular red alga Cyanidioschyzon merolae 10D as the first complete algal genome. We identified 5,331 genes in total, of which at least 86.3% were expressed. Unique characteristics of this genomic structure include: a lack of introns in all but 26 genes; only three copies of ribosomal DNA units that maintain the nucleolus; and two dynamin genes that are involved only in the division of mitochondria and plastids. The conserved mosaic origin of Calvin cycle enzymes in this red alga and in green plants supports the hypothesis of a single primary plastid endosymbiosis. The lack of a myosin gene, in addition to the unexpressed actin gene, suggests a simpler system of cytokinesis. These results indicate that the C. merolae genome provides a model system with a simple gene composition for studying the origin, evolution and fundamental mechanisms of eukaryotic cells.
Clostridium perfringens is a Gram-positive anaerobic spore-forming bacterium that causes life-threatening gas gangrene and mild enterotoxaemia in humans, although it also colonizes humans and animals as part of the normal intestinal flora. The organism is known to produce a variety of toxins and enzymes that are responsible for the severe myonecrotic lesions. Here we report the complete 3,031,430-bp sequence of C. perfringens strain 13, which comprises 2,660 protein coding regions and 10 rRNA genes and has a markedly low overall G + C content (28.6%). The genome contains typical anaerobic fermentation enzymes leading to gas production but no enzymes for the tricarboxylic acid cycle or respiratory chain. Various saccharolytic enzymes were found, but many enzymes for amino acid biosynthesis were lacking in the genome. Twenty genes were newly identified as putative virulence factors of C. perfringens, and we found a total of five hyaluronidase genes that will also contribute to virulence. The genome analysis also proved an efficient method for finding four members of the two-component VirR/VirS regulon that coordinately regulates the pathogenicity of C. perfringens. Clearly, C. perfringens obtains various essential materials from the host by producing several degradative enzymes and toxins, resulting in massive destruction of the host tissues.
Escherichia coli O157:H7 is a major food-borne infectious pathogen that causes diarrhea, hemorrhagic colitis, and hemolytic uremic syndrome. Here we report the complete chromosome sequence of an O157:H7 strain isolated from the Sakai outbreak, and the results of genomic comparison with a benign laboratory strain, K-12 MG1655. The chromosome is 5.5 Mb in size, 859 kb larger than that of K-12. We identified a 4.1-Mb sequence highly conserved between the two strains, which may represent the fundamental backbone of the E. coli chromosome. The remaining 1.4-Mb sequence comprises O157:H7-specific sequences, most of which are horizontally transferred foreign DNAs. The predominant role of bacteriophages in the emergence of O157:H7 is evident from the presence of 24 prophages and prophage-like elements that occupy more than half of the O157:H7-specific sequences. The O157:H7 chromosome encodes 1632 proteins and 20 tRNAs that are not present in K-12. Among these, at least 131 proteins are assumed to have virulence-related functions. Genome-wide codon usage analysis suggested that the O157:H7-specific tRNAs are involved in the efficient expression of the strain-specific genes. The complete set of O157:H7-specific genes presented here provides new insight into the pathogenicity and the physiology of O157:H7, and will open a way to fully understand the molecular mechanisms underlying O157:H7 infection.
BACKGROUND: Staphylococcus aureus is one of the major causes of community-acquired and hospital-acquired infections. It produces numerous toxins including superantigens that cause unique disease entities such as toxic-shock syndrome and staphylococcal scarlet fever, and has acquired resistance to practically all antibiotics. Whole genome analysis is a necessary step towards future development of countermeasures against this organism. METHODS: Whole genome sequences of two related S aureus strains (N315 and Mu50) were determined by shot-gun random sequencing. N315 is a meticillin-resistant S aureus (MRSA) strain isolated in 1982, and Mu50 is an MRSA strain with vancomycin resistance isolated in 1997. The open reading frames were identified by use of GAMBLER and GLIMMER programs, and annotation of each was done with a BLAST homology search, motif analysis, and protein localisation prediction. FINDINGS: The Staphylococcus genome was composed of a complex mixture of genes, many of which seem to have been acquired by lateral gene transfer. Most of the antibiotic resistance genes were carried either by plasmids or by mobile genetic elements including a unique resistance island. Three classes of new pathogenicity islands were identified in the genome: a toxic-shock-syndrome toxin island family, exotoxin islands, and enterotoxin islands. In the latter two pathogenicity islands, clusters of exotoxin and enterotoxin genes were found closely linked with other gene clusters encoding putative pathogenic factors. The analysis also identified 70 candidates for new virulence factors. INTERPRETATION: The remarkable ability of S aureus to acquire useful genes from various organisms was revealed through the observation of genome complexity and evidence of lateral gene transfer. Repeated duplication of genes encoding superantigens explains why S aureus is capable of infecting humans of diverse genetic backgrounds, eliciting severe immune reactions. Investigation of many newly identified gene products, including the 70 putative virulence factors, will greatly improve our understanding of the biology of staphylococci and the processes of infectious diseases caused by S aureus.
The 4 202 353 bp genome of the alkaliphilic bacterium Bacillus halodurans C-125 contains 4066 predicted protein coding sequences (CDSs), 2141 (52.7%) of which have functional assignments, 1182 (29%) of which are conserved CDSs with unknown function and 743 (18.3%) of which have no match to any protein database. Among the total CDSs, 8.8% match sequences of proteins found only in Bacillus subtilis and 66.7% are widely conserved in comparison with the proteins of various organisms, including B. subtilis. The B. halodurans genome contains 112 transposase genes, indicating that transposases have played an important evolutionary role in horizontal gene transfer and also in internal genetic rearrangement in the genome. Strain C-125 lacks some of the necessary genes for competence, such as comS, srfA and rapC, supporting the fact that competence has not been demonstrated experimentally in C-125. There is no paralog of tupA, encoding teichuronopeptide, which contributes to alkaliphily, in the C-125 genome and an ortholog of tupA cannot be found in the B. subtilis genome. Out of 11 sigma factors which belong to the extracytoplasmic function family, 10 are unique to B. halodurans, suggesting that they may have a role in the special mechanism of adaptation to an alkaline environment.
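The CDS breakdown quoted for B. halodurans C-125 can be checked with a few lines of arithmetic. The sketch below is illustrative only: the counts are taken from the abstract, while the category labels and the helper name `percentages` are assumptions introduced here, not part of the original work.

```python
# CDS counts quoted in the B. halodurans C-125 abstract.
total_cds = 4066
categories = {
    "functional assignment": 2141,
    "conserved, unknown function": 1182,
    "no database match": 743,
}


def percentages(counts, total):
    """Return each category's share of the total, in percent, to one decimal."""
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


# The three categories should partition the full set of predicted CDSs.
assert sum(categories.values()) == total_cds

shares = percentages(categories, total_cds)
for name, pct in shares.items():
    print(f"{name}: {pct}%")
```

Under these assumptions the three categories add up to the 4066 predicted CDSs, and the computed shares agree with the quoted figures to within rounding (the abstract gives the middle share as a whole number).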
In the course of the Bacillus subtilis genome sequencing project, we identified an open reading frame encoding a putative 16.4 kDa protein. This protein shows, respectively, 34% and 25% identity with the Escherichia coli regulatory proteins Lrp and AsnC. Phylogenetic analysis suggests that it represents a new group in the AsnC-Lrp family. Sequence comparisons, as well as immunodetection experiments, lead to the conclusion that the product of this B. subtilis lrp-like gene is a bona fide Lrp protein, the first one to be detected in gram-positive bacteria. When expressed in E. coli, the B. subtilis Lrp-like protein is able to repress, by about two-fold, the expression of the ilvIH operon which is normally regulated by E. coli Lrp, indicating functional similarity in their regulatory targets. Vegetative growth of a B. subtilis lrp-like mutant is not affected in rich medium. However, the lrp-like mutation causes a transitory inhibition of growth in minimal medium in the presence of valine and isoleucine, which is relieved by leucine. This points to a possible role in regulation of amino acid metabolism. In addition, sporogenesis occurs earlier in the lrp-like mutant than in the reference strain, implying that the B. subtilis Lrp-like protein plays a role in the growth phase transition.
        
Title: Sequence analysis of the groESL-cotA region of the Bacillus subtilis genome, containing the restriction/modification system genes. Kasahara Y, Nakai S, Ogasawara N, Yata K, Sadaie Y Ref: DNA Research, 4:335, 1997 : PubMed
We have determined a 35-kb sequence of the groESL-gutR-cotA (45°-52°) region of the Bacillus subtilis genome. In addition to the groESL, gutRB and cotA genes reported previously, we have newly identified 24 ORFs including the gutA and fruC genes, encoding glucitol permease and fructokinase, respectively. The inherent restriction/modification system genes, hsdMR and hsdMM, were mapped between groESL and gutRB, and we have identified two open reading frames (ORFs) encoding 5-methylcytosine-forming DNA methyltransferase and an operon probably encoding a restriction enzyme complex. The unusual genome structure, with few ORFs and lower GC content around the restriction/modification genes, strongly suggests that the region originated from a bacteriophage integrated during evolution.
Bacillus subtilis is the best-characterized member of the Gram-positive bacteria. Its genome of 4,214,810 base pairs comprises 4,100 protein-coding genes. Of these protein-coding genes, 53% are represented once, while a quarter of the genome corresponds to several gene families that have been greatly expanded by gene duplication, the largest family containing 77 putative ATP-binding transport proteins. In addition, a large proportion of the genetic capacity is devoted to the utilization of a variety of carbon sources, including many plant-derived molecules. The identification of five signal peptidase genes, as well as several genes for components of the secretion apparatus, is important given the capacity of Bacillus strains to secrete large amounts of industrially important enzymes. Many of the genes are involved in the synthesis of secondary metabolites, including antibiotics, that are more typically associated with Streptomyces species. The genome contains at least ten prophages or remnants of prophages, indicating that bacteriophage infection has played an important evolutionary role in horizontal gene transfer, in particular in the propagation of bacterial pathogenesis.
        
Title: Systematic sequencing of the 180 kilobase region of the Bacillus subtilis chromosome containing the replication origin. Ogasawara N, Nakai S, Yoshikawa H Ref: DNA Research, 1:1, 1994 : PubMed