Here we report the use of a multi-genome DNA microarray to elucidate the genomic events associated with the emergence of the clonal variants of Haemophilus influenzae biogroup aegyptius causing Brazilian Purpuric Fever (BPF), an important pediatric disease with a high mortality rate. We performed directed genome sequencing of loci unique to strain HK1212 to construct a species DNA microarray. Comparative genome hybridization using this microarray enabled us to determine and compare gene complements, and to infer reliable phylogenomic relationships among members of the species. The higher genomic variability observed in the genomes of BPF-related strains (clones) and their close relatives may be characterized by significant gene flux related to a subset of functional role categories. We found that the BPF genomic variants are characterized by the acquisition of a large number of virulence determinants, featuring numerous cell membrane proteins, coupled with the loss of genes involved in transport, central biosynthetic pathways and, in particular, energy production pathways.
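The step from gene complements to relationships among strains can be illustrated with a minimal sketch. The abstract does not state which distance measure or tree-building method was used, so the Jaccard distance below is an assumption chosen for simplicity, and the strain and gene names are hypothetical placeholders rather than data from the study.

```python
from itertools import combinations


def jaccard_distance(a: set[str], b: set[str]) -> float:
    """Return 1 - |a & b| / |a | b| for two gene-presence sets."""
    union = a | b
    if not union:
        # Two empty gene sets are treated as identical.
        return 0.0
    return 1.0 - len(a & b) / len(union)


def pairwise_distances(profiles: dict[str, set[str]]) -> dict[tuple[str, str], float]:
    """Compute the Jaccard distance for every unordered pair of strains."""
    return {
        (s1, s2): jaccard_distance(profiles[s1], profiles[s2])
        for s1, s2 in combinations(sorted(profiles), 2)
    }


# Hypothetical toy profiles: each strain maps to the genes called "present".
profiles = {
    "strain_A": {"g1", "g2", "g3", "g4"},
    "strain_B": {"g1", "g2", "g3", "g5"},
    "strain_C": {"g1", "g6"},
}

distances = pairwise_distances(profiles)
for pair, d in distances.items():
    print(pair, round(d, 3))
```

In this toy input, strain_A and strain_B share three of five genes (distance 0.4), while strain_C shares only one gene with either (distance 0.8). A distance matrix of this kind could then be passed to a clustering or tree-building method of the reader's choice.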
Bacillus anthracis is the etiologic agent of anthrax, an acute fatal disease among mammals. It was thought to differ from Bacillus cereus, an opportunistic pathogen and cause of food poisoning, by the presence of plasmids pXO1 and pXO2, which encode the lethal toxin complex and the poly-gamma-d-glutamic acid capsule, respectively. This work describes a non-B. anthracis isolate that possesses the anthrax toxin genes and is capable of causing a severe inhalation anthrax-like illness. Although initial phenotypic and 16S rRNA analysis identified this isolate as B. cereus, the rapid generation and analysis of a high-coverage draft genome sequence revealed the presence of a circular plasmid, named pBCXO1, with 99.6% similarity to the B. anthracis toxin-encoding plasmid, pXO1. Although homologues of the pXO2-encoded capsule genes were not found, a polysaccharide capsule cluster is encoded on a second, previously unidentified plasmid, pBC218. Challenge of A/J mice with B. cereus G9241 confirmed the virulence of this strain. These findings represent an example of how genomics could rapidly assist public health experts responding not only to clearly identified select agents but also to novel agents with similar pathogenic potential. In this study, we combined a public health approach with genome analysis to provide insight into the correlation of phenotypic characteristics and their genetic basis.
Bacillus anthracis is an endospore-forming bacterium that causes inhalational anthrax. Key virulence genes are found on plasmids (extra-chromosomal, circular, double-stranded DNA molecules) pXO1 (ref. 2) and pXO2 (ref. 3). To identify additional genes that might contribute to virulence, we analysed the complete sequence of the chromosome of B. anthracis Ames (about 5.23 megabases). We found several chromosomally encoded proteins that may contribute to pathogenicity (including haemolysins, phospholipases and iron acquisition functions) and identified numerous surface proteins that might be important targets for vaccines and drugs. Almost all these putative chromosomal virulence and surface proteins have homologues in Bacillus cereus, highlighting the similarity of B. anthracis to near-neighbours that are not associated with anthrax. By performing a comparative genome hybridization of 19 B. cereus and Bacillus thuringiensis strains against a B. anthracis DNA microarray, we confirmed the general similarity of chromosomal genes among this group of close relatives. However, we found that the gene sequences of pXO1 and pXO2 were more variable between strains, suggesting plasmid mobility in the group. The complete sequence of B. anthracis is a step towards a better understanding of anthrax pathogenesis.
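The comparison of chromosomal and plasmid genes by comparative genome hybridization can be sketched as follows. This is an illustration only: the abstract does not describe how hybridization signals were scored, so the log2 ratio representation, the cutoff of -1.0, and every probe value below are assumptions invented for the example, not values from the study.

```python
from collections import defaultdict


def call_divergent(log2_ratio: float, cutoff: float = -1.0) -> bool:
    """Flag a probe as absent/divergent when the test/reference log2 ratio
    falls below the cutoff. The cutoff is an assumed, illustrative value."""
    return log2_ratio < cutoff


def divergent_fraction_by_replicon(
    probes: list[tuple[str, float]], cutoff: float = -1.0
) -> dict[str, float]:
    """Return, per replicon, the fraction of probes called absent/divergent."""
    totals: dict[str, int] = defaultdict(int)
    divergent: dict[str, int] = defaultdict(int)
    for replicon, log2_ratio in probes:
        totals[replicon] += 1
        if call_divergent(log2_ratio, cutoff):
            divergent[replicon] += 1
    return {r: divergent[r] / totals[r] for r in totals}


# Hypothetical toy measurements for one test strain: (replicon, log2 ratio).
probes = [
    ("chromosome", 0.1),
    ("chromosome", -0.2),
    ("chromosome", -1.5),
    ("chromosome", 0.0),
    ("pXO1", -2.0),
    ("pXO1", -1.2),
    ("pXO1", 0.1),
    ("pXO2", -3.0),
    ("pXO2", -2.5),
]

fractions = divergent_fraction_by_replicon(probes)
for replicon, fraction in fractions.items():
    print(f"{replicon}: {fraction:.2f} of probes called absent/divergent")
```

Summarising calls per replicon in this way makes it straightforward to compare how conserved the chromosome is relative to each plasmid across a panel of strains.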
The complete nucleotide sequence (580,070 base pairs) of the Mycoplasma genitalium genome, the smallest known genome of any free-living organism, has been determined by whole-genome random sequencing and assembly. A total of only 470 predicted coding regions were identified that include genes required for DNA replication, transcription and translation, DNA repair, cellular transport, and energy metabolism. Comparison of this genome to that of Haemophilus influenzae suggests that differences in genome content are reflected as profound differences in physiology and metabolic capacity between these two organisms.
A total of 508 random clones from five Mycoplasma genitalium genomic libraries were partially sequenced and analyzed. This resulted in the identification of 291 unique contigs. Sequence information from these clones (100,993 nucleotides), representing approximately 17% of this pathogen's genome, was analyzed by comparison to the DNA and protein sequence databases. The frequency with which clones could be identified, by virtue of possessing homology to another database entry, was 46%. Sequence analysis indicated the following. (i) The M. genitalium genome contains many genes involved in various metabolic processes. (ii) Repetitive DNA may comprise as much as 4% of this genome. (iii) The MgPa adhesin gene may be the result of horizontal transfer from an unknown origin. (iv) Not all dinucleotide pairs are present in this genome at the expected frequency. (v) This genome potentially encodes approximately 390 proteins and makes very efficient use of its limited amount of DNA. In addition, this study allowed us to estimate the number of genes involved in various cellular functions.
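Point (iv) above, on dinucleotide frequencies, can be illustrated with a short sketch. The abstract does not say how the expected frequencies were derived; the code assumes the common convention that the expected frequency of a dinucleotide is the product of its two mononucleotide frequencies, and the function name and input sequence are hypothetical.

```python
from collections import Counter
from itertools import product


def dinucleotide_odds_ratios(seq: str) -> dict[str, float]:
    """Return observed/expected ratios for all 16 dinucleotides.

    Expected frequency is assumed to be f(a) * f(b), the product of the
    mononucleotide frequencies. Characters other than A, C, G and T are
    counted in the sequence length but otherwise ignored.
    """
    seq = seq.upper()
    n = len(seq)
    if n < 2:
        raise ValueError("sequence must contain at least two bases")
    base_counts = Counter(seq)
    # Count overlapping dinucleotides along one strand.
    pair_counts = Counter(seq[i:i + 2] for i in range(n - 1))
    ratios: dict[str, float] = {}
    for a, b in product("ACGT", repeat=2):
        expected = (base_counts[a] / n) * (base_counts[b] / n)
        observed = pair_counts[a + b] / (n - 1)
        ratios[a + b] = observed / expected if expected > 0 else float("nan")
    return ratios


# Hypothetical toy sequence; a real analysis would use the sequenced clones.
ratios = dinucleotide_odds_ratios("ACGTACGTACGT")
for pair in sorted(ratios, key=ratios.get):
    print(pair, round(ratios[pair], 2))
```

A ratio near 1 means the dinucleotide occurs about as often as base composition alone predicts; values well below or above 1 indicate under- or over-representation.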