Here we report the complete and finished genome sequence of Streptomyces glaucescens GLA.O (DSM 40922), a natural producer of the alpha-glucosidase inhibitor acarbose, which is used in the treatment of type-2 diabetes mellitus. The genome of S. glaucescens GLA.O consists of two replicons, the chromosome with a size of 7,453,200bp and a G+C content of 73.0% as well as a plasmid named pSglau1 with a size of 170,574bp and a G+C content of 69.06%.
Actinoplanes friuliensis HAG 010964 (DSM 7358) was isolated from a soil sample from the Friuli region in Italy and characterized as a producer of the antibiotic friulimycin. The complete genome sequence includes genomic information of secondary metabolite biosynthesis and of its lifestyle. Genbank/EMBL/DDBJ Accession Nr: CP006272 (chromosome).
Pseudomonas aeruginosa is a major nosocomial bacterial pathogen causing complicated catheter-associated urinary tract infections (CAUTIs). Here, we present the 6.9-Mb draft genome sequence of P. aeruginosa MH38 isolated from an acute nosocomial CAUTI. It exhibits resistance to several antibiotics but revealed low-level production of virulence factors.
Cyanobacteria forged two major evolutionary transitions with the invention of oxygenic photosynthesis and the bestowal of photosynthetic lifestyle upon eukaryotes through endosymbiosis. Information germane to understanding those transitions is imprinted in cyanobacterial genomes, but deciphering it is complicated by lateral gene transfer (LGT). Here, we report genome sequences for the morphologically most complex true-branching cyanobacteria, and for Scytonema hofmanni PCC 7110, which with 12,356 proteins is the most gene-rich prokaryote currently known. We investigated components of cyanobacterial evolution that have been vertically inherited, horizontally transferred, and donated to eukaryotes at plastid origin. The vertical component indicates a freshwater origin for water-splitting photosynthesis. Networks of the horizontal component reveal that 60% of cyanobacterial gene families have been affected by LGT. Plant nuclear genes acquired from cyanobacteria define a lower bound frequency of 611 multigene families that, in turn, specify diazotrophic cyanobacterial lineages as having a gene collection most similar to that possessed by the plastid ancestor.
Streptomyces collinus Tu 365 (DSMZ 40733), isolated from Kouroussa (Guinea), is the producer of the elfamycin family antibiotic kirromycin, which inhibits bacterial protein biosynthesis by interfering with elongation factor EF-Tu. Here, we report on the Streptomyces collinus Tu 365 complete genome sequence of the 8.27 MB chromosome and the two plasmids SCO1 and SCO2.
Anastomosis group AG1-IB isolates of the anamorphic basidiomycetous fungus Rhizoctonia solani Kuhn affect various agricultural and horticultural important crops including bean, rice, soybean, figs, hortensia, cabbage and lettuce. To gain insights into the genome structure and content, the first draft genome sequence of R. solani AG1-IB isolate 7/3/14 was established. Four complete runs on the Genome Sequencer (GS) FLX platform (Roche Applied Science) yielding approx. a 25-fold coverage of the R. solani genome were accomplished. Assembly of the sequence reads by means of the gsAssembler software version 2.6 applying the heterozygotic mode resulted in numerous contigs and scaffolds and a predicted size of 87.1 Mb for the diploid status of the genome. 'Contig-length vs. read-count' analysis revealed that the assembled contigs can be classified into five different groups. Detailed BLAST-analysis revealed that most contigs of group II feature high-scoring matches to other contigs of the same group suggesting that distinguishable allelic variants exist for many genes. Due to the supposed diploid and heterokaryotic nature of R. solani AG1-IB 7/3/14, this result has been anticipated. However, the heterokaryotic character of the isolate is not really supported by sequencing data obtained for the isolate R. solani AG1-IB 7/3/14. Coverage of group III contigs is twice as high as for group II contigs which can also be explained by the diploid status of the genome and indistinguishable alleles on homologous chromosomes. Assembly of sequence data led to the identification of the rRNA unit (group V contigs) and the mitochondrial (mt) genome (group IV contigs) which is a circular molecule of 162,751 bp in size featuring a GC-content of 36.4%. The R. solani 7/3/14 mt-genome is one of the largest fungal mitochondrial genomes known to date. Its large size essentially is due to the presence of numerous non-conserved hypothetical ORFs and introns. Gene prediction for the R. solani AG1-IB 7/3/14 genome was conducted by the Augustus Gene Prediction Software for Eukaryotes (version 2.6.) applying the parameter set for the fungus Coprinopsis cinerea okayama 7 130. Gene prediction and annotation provided first insights into the R. solani AG1-IB 7/3/14 gene structure and content. In total, 12,422 genes were predicted. The average number of exons per gene is five. Exons have a mean length of 214 bp, whereas introns on average are 66 bp in length. Annotation of the genome revealed that 4169 of 12,422 genes could be assigned to KOG functional categories.
BACKGROUND: Listeria monocytogenes is a food-borne pathogen that causes infections with a high-mortality rate and has served as an invaluable model for intracellular parasitism. Here, we report complete genome sequences for two L. monocytogenes strains belonging to serotype 4a (L99) and 4b (CLIP80459), and transcriptomes of representative strains from lineages I, II, and III, thereby permitting in-depth comparison of genome- and transcriptome -based data from three lineages of L. monocytogenes. Lineage III, represented by the 4a L99 genome is known to contain strains less virulent for humans. RESULTS: The genome analysis of the weakly pathogenic L99 serotype 4a provides extensive evidence of virulence gene decay, including loss of several important surface proteins. The 4b CLIP80459 genome, unlike the previously sequenced 4b F2365 genome harbours an intact inlB invasion gene. These lineage I strains are characterized by the lack of prophage genes, as they share only a single prophage locus with other L. monocytogenes genomes 1/2a EGD-e and 4a L99. Comparative transcriptome analysis during intracellular growth uncovered adaptive expression level differences in lineages I, II and III of Listeria, notable amongst which was a strong intracellular induction of flagellar genes in strain 4a L99 compared to the other lineages. Furthermore, extensive differences between strains are manifest at levels of metabolic flux control and phosphorylated sugar uptake. Intriguingly, prophage gene expression was found to be a hallmark of intracellular gene expression. Deletion mutants in the single shared prophage locus of lineage II strain EGD-e 1/2a, the lma operon, revealed severe attenuation of virulence in a murine infection model. CONCLUSION: Comparative genomics and transcriptome analysis of L. monocytogenes strains from three lineages implicate prophage genes in intracellular adaptation and indicate that gene loss and decay may have led to the emergence of attenuated lineages.
Lactobacillus buchneri belongs to the group of heterofermentative lactic acid bacteria and is a common member of the silage microbiome. Here we report the completely annotated genomic sequence of L. buchneri CD034, a strain isolated from stable grass silage. The whole genome of L. buchneri CD034 was sequenced on the Roche Genome Sequencer FLX platform. It was found to consist of four replicons, a circular chromosome, and three plasmids. The circular chromosome was predicted to encode 2319 proteins and contains a genomic island and two prophages which significantly differ in G+C-content from the remaining chromosome. It possesses all genes for enzymes of a complete phosphoketolase pathway, whereas two enzymes necessary for glycolysis are lacking. This confirms the classification of L. buchneri CD034 as an obligate heterofermentative lactic acid bacterium. A set of genes considered to be involved in the lactate degradation pathway and genes putatively involved in the breakdown of plant cell wall polymers were identified. Moreover, several genes encoding putative S-layer proteins and two CRISPR systems, belonging to the subclasses I-E and II-A, are located on the chromosome. The largest plasmid pCD034-3 was predicted to encode 57 genes, including a putative polysaccharide synthesis gene cluster, whereas the functions of the two smaller plasmids, pCD034-1 and pCD034-2, remain cryptic. Phylogenetic analysis based on sequence comparison of the conserved marker gene rpoA reveals that L. buchneri CD034 is more closely related to Lactobacillus hilgardii strains than to Lactobacillus brevis and Lactobacillus plantarum strains. Comparison of the L. buchneri CD034 core genome to other fully sequenced and closely related members of the genus Lactobacillus disclosed a high degree of conservation between L. buchneri CD034 and the recently sequenced L. buchneri strain NRRL B-30929 and a more distant relationship to L. buchneri ATCC 11577 and L. brevis ssp. gravesensis ATCC 27305, which cluster together with L. hilgardii type strain ATCC 8290. L. buchneri CD034 genome information will certainly provide the basis for further postgenome studies with the objective to optimize application of the strain in silage production.
Wickerhamomyces ciferrii is a microorganism characterized by the production and secretion of large amounts of acetylated sphingoid bases, in particular tetraacetyl phytosphingosine. Here, we present the 15.90-Mbp draft genome sequence of W. ciferrii NRRL Y-1031 F-60-10 generated by pyrosequencing and de novo assembly. The draft genome sequence comprising 364 contigs in 150 scaffolds was annotated and covered 6,702 protein-coding sequences. This information will contribute to the metabolic engineering of this yeast to improve the yield and spectrum of acetylated sphingoid bases in biotechnological production.
Neisseria meningitidis is a commensal and accidental pathogen exclusively of humans. Although the production of polysaccharide capsules is considered to be essential for meningococcal virulence, there have been reports of constitutively unencapsulated strains causing invasive meningococcal disease (IMD). Here we report the genome sequence of a capsule null locus (cnl) strain of sequence type 198 (ST-198), which is found in half of the reported cases of IMD caused by cnl meningococcal strains.
BACKGROUND: The genus Saccharothrix is a representative of the family Pseudonocardiaceae, known to include producer strains of a wide variety of potent antibiotics. Saccharothrix espanaensis produces both saccharomicins A and B of the promising new class of heptadecaglycoside antibiotics, active against both bacteria and yeast. RESULTS: To better assess its capabilities, the complete genome sequence of S. espanaensis was established. With a size of 9,360,653 bp, coding for 8,501 genes, it stands alongside other Pseudonocardiaceae with large genomes. Besides a predicted core genome of 810 genes shared in the family, S. espanaensis has a large number of accessory genes: 2,967 singletons when compared to the family, of which 1,292 have no clear orthologs in the RefSeq database. The genome analysis revealed the presence of 26 biosynthetic gene clusters potentially encoding secondary metabolites. Among them, the cluster coding for the saccharomicins could be identified. CONCLUSION: S. espanaensis is the first completely sequenced species of the genus Saccharothrix. The genome discloses the cluster responsible for the biosynthesis of the saccharomicins, the largest oligosaccharide antibiotic currently identified. Moreover, the genome revealed 25 additional putative secondary metabolite gene clusters further suggesting the strain's potential for natural product synthesis.
Sinorhizobium fredii HH103 is a fast-growing rhizobial strain that is able to nodulate legumes that develop determinate nodules, e.g., soybean, and legumes that form nodules of the indeterminate type. Here we present the genome of HH103, which consists of one chromosome and five plasmids with a total size of 7.22 Mb.
The whole-genome-sequenced rhizobacterium Bacillus amyloliquefaciens FZB42(T) (Chen et al., 2007) and other plant-associated strains of the genus Bacillus described as belonging to the species Bacillus amyloliquefaciens or Bacillus subtilis are used commercially to promote the growth and improve the health of crop plants. Previous investigations revealed that a group of strains represented a distinct ecotype related to B. amyloliquefaciens; however, the exact taxonomic position of this group remains elusive (Reva et al., 2004). In the present study, we demonstrated the ability of a group of Bacillus strains closely related to strain FZB42(T) to colonize Arabidopsis roots. On the basis of their phenotypic traits, the strains were similar to Bacillus amyloliquefaciens DSM 7(T) but differed considerably from this type strain in the DNA sequences of genes encoding 16S rRNA, gyrase subunit A (gyrA) and histidine kinase (cheA). Phylogenetic analysis performed with partial 16S rRNA, gyrA and cheA gene sequences revealed that the plant-associated strains of the genus Bacillus, including strain FZB42(T), formed a lineage, which could be distinguished from the cluster of strains closely related to B. amyloliquefaciens DSM 7(T). DNA-DNA hybridizations (DDH) performed with genomic DNA from strains DSM 7(T) and FZB42(T) yielded relatedness values of 63.7-71.2 %. Several methods of genomic analysis, such as direct whole-genome comparison, digital DDH and microarray-based comparative genomichybridization (M-CGH) were used as complementary tests. The group of plant-associated strains could be distinguished from strain DSM 7(T) and the type strain of B. subtilis by differences in the potential to synthesize non-ribosomal lipopeptides and polyketides. Based on the differences found in the marker gene sequences and the whole genomes of these strains, we propose two novel subspecies, designated B. amyloliquefaciens subsp. plantarum subsp. nov., with the type strain FZB42(T) ( = DSM 23117(T) = BGSC 10A6(T)), and B. amyloliquefaciens subsp. amyloliquefaciens subsp. nov., with the type strain DSM 7(T)( = ATCC 23350(T) = Fukumoto Strain F(T)), for plant-associated and non-plant-associated representatives, respecitvely. This is in agreement with results of DDH and M-CGH tests and the MALDI-TOF MS of cellular components, all of which suggested that the ecovars represent two different subspecies.
The methylotrophic yeast Pichia pastoris (Komagataella phaffii) CBS7435 is the parental strain of commonly used P. pastoris recombinant protein production hosts making it well suited for improving the understanding of associated genomic features. Here, we present a 9.35 Mbp high-quality genome sequence of P. pastoris CBS7435 established by a combination of 454 and Illumina sequencing. An automatic annotation of the genome sequence yielded 5007 protein-coding genes, 124 tRNAs and 29 rRNAs. Moreover, we report the complete DNA sequence of the first mitochondrial genome of a methylotrophic yeast. Fifteen genes encoding proteins, 2 rRNA and 25 tRNA loci were identified on the 35.7 kbp circular, mitochondrial DNA. Furthermore, the architecture of the putative alpha mating factor protein of P. pastoris CBS7435 turned out to be more complex than the corresponding protein of Saccharomyces cerevisiae.
Isolates of the symbiotic nitrogen-fixing species Sinorhizobium meliloti usually contain a chromosome and two large megaplasmids encoding functions that are absolutely required for the specific interaction of the microsymbiont with corresponding host plants leading to an effective symbiosis. The complete genome sequence, including the megaplasmids pSmeSM11c (related to pSymA) and pSmeSM11d (related to pSymB), was established for the dominant, indigenous S. meliloti strain SM11 that had been isolated during a long-term field release experiment with genetically modified S. meliloti strains. The chromosome, the largest replicon of S. meliloti SM11, is 3,908,022bp in size and codes for 3785 predicted protein coding sequences. The size of megaplasmid pSmeSM11c is 1,633,319bp and it contains 1760 predicted protein coding sequences whereas megaplasmid pSmeSM11d is 1,632,395bp in size and comprises 1548 predicted coding sequences. The gene content of the SM11 chromosome is quite similar to that of the reference strain S. meliloti Rm1021. Comparison of pSmeSM11c to pSymA of the reference strain revealed that many gene regions of these replicons are variable, supporting the assessment that pSymA is a major hot-spot for intra-specific differentiation. Plasmids pSymA and pSmeSM11c both encode unique genes. Large gene regions of pSmeSM11c are closely related to corresponding parts of Sinorhizobium medicae WSM419 plasmids. Moreover, pSmeSM11c encodes further novel gene regions, e.g. additional plasmid survival genes (partition, mobilisation and conjugative transfer genes), acdS encoding 1-aminocyclopropane-1-carboxylate deaminase involved in modulation of the phytohormone ethylene level and genes having predicted functions in degradative capabilities, stress response, amino acid metabolism and associated pathways. In contrast to Rm1021 pSymA and pSmeSM11c, megaplasmid pSymB of strain Rm1021 and pSmeSM11d are highly conserved showing extensive synteny with only few rearrangements. Most remarkably, pSmeSM11b contains a new gene cluster predicted to be involved in polysaccharide biosynthesis. Compilation of the S. meliloti SM11 genome sequence contributes to an extension of the S. meliloti pan-genome.
Agrobacterium sp. H13-3, formerly known as Rhizobium lupini H13-3, is a soil bacterium that was isolated from the rhizosphere of Lupinus luteus. The isolate has been established as a model system for studying novel features of flagellum structure, motility and chemotaxis within the family Rhizobiaceae. The complete genome sequence of Agrobacterium sp. H13-3 has been established and the genome structure and phylogenetic assignment of the organism was analysed. For de novo sequencing of the Agrobacterium sp. H13-3 genome, a combined strategy comprising 454-pyrosequencing on the Genome Sequencer FLX platform and PCR-based amplicon sequencing for gap closure was applied. The finished genome consists of three replicons and comprises 5,573,770 bases. Based on phylogenetic analyses, the isolate could be assigned to the genus Agrobacterium biovar I and represents a genomic species G1 strain within this biovariety. The highly conserved circular chromosome (2.82 Mb) of Agrobacterium sp. H13-3 mainly encodes housekeeping functions characteristic for an aerobic, heterotrophic bacterium. Agrobacterium sp. H13-3 is a motile bacterium driven by the rotation of several complex flagella. Its behaviour towards external stimuli is regulated by a large chemotaxis regulon and a total of 17 chemoreceptors. Comparable to the genome of Agrobacterium tumefaciens C58, Agrobacterium sp. H13-3 possesses a linear chromosome (2.15 Mb) that is related to its reference replicon and features chromosomal and plasmid-like properties. The accessory plasmid pAspH13-3a (0.6 Mb) is only distantly related to the plasmid pAtC58 of A. tumefaciens C58 and shows a mosaic structure. A tumor-inducing Ti-plasmid is missing in the sequenced strain H13-3 indicating that it is a non-virulent isolate.
We report the complete and annotated genome sequence of the nonpathogenic Listeria seeligeri SLCC3954 serovar 1/2b type strain harboring the smallest completely sequenced genome of the genus Listeria.
BACKGROUND: Corynebacterium aurimucosum is a slightly yellowish, non-lipophilic, facultative anaerobic member of the genus Corynebacterium and predominantly isolated from human clinical specimens. Unusual black-pigmented variants of C. aurimucosum (originally named as C. nigricans) continue to be recovered from the female urogenital tract and they are associated with complications during pregnancy. C. aurimucosum ATCC 700975 (C. nigricans CN-1) was originally isolated from a vaginal swab of a 34-year-old woman who experienced a spontaneous abortion during month six of pregnancy. For a better understanding of the physiology and lifestyle of this potential urogenital pathogen, the complete genome sequence of C. aurimucosum ATCC 700975 was determined. RESULTS: Sequencing and assembly of the C. aurimucosum ATCC 700975 genome yielded a circular chromosome of 2,790,189 bp in size and the 29,037-bp plasmid pET44827. Specific gene sets associated with the central metabolism of C. aurimucosum apparently provide enhanced metabolic flexibility and adaptability in aerobic, anaerobic and low-pH environments, including gene clusters for the uptake and degradation of aromatic amines, L-histidine and L-tartrate as well as a gene region for the formation of selenocysteine and its incorporation into formate dehydrogenase. Plasmid pET44827 codes for a non-ribosomal peptide synthetase that plays the pivotal role in the synthesis of the characteristic black pigment of C. aurimucosum ATCC 700975. CONCLUSIONS: The data obtained by the genome project suggest that C. aurimucosum could be both a resident of the human gut and possibly a pathogen in the female genital tract causing complications during pregnancy. Since hitherto all black-pigmented C. aurimucosum strains have been recovered from female genital source, biosynthesis of the pigment is apparently required for colonization by protecting the bacterial cells against the high hydrogen peroxide concentration in the vaginal environment. The location of the corresponding genes on plasmid pET44827 explains why black-pigmented (formerly C. nigricans) and non-pigmented C. aurimucosum strains were isolated from clinical specimens.
Sulfate-reducing bacteria (SRB) belonging to the metabolically versatile Desulfobacteriaceae are abundant in marine sediments and contribute to the global carbon cycle by complete oxidation of organic compounds. Desulfobacterium autotrophicum HRM2 is the first member of this ecophysiologically important group with a now available genome sequence. With 5.6 megabasepairs (Mbp) the genome of Db. autotrophicum HRM2 is about 2 Mbp larger than the sequenced genomes of other sulfate reducers (SRB). A high number of genome plasticity elements (> 100 transposon-related genes), several regions of GC discontinuity and a high number of repetitive elements (132 paralogous genes Mbp(-1)) point to a different genome evolution when comparing with Desulfovibrio spp. The metabolic versatility of Db. autotrophicum HRM2 is reflected in the presence of genes for the degradation of a variety of organic compounds including long-chain fatty acids and for the Wood-Ljungdahl pathway, which enables the organism to completely oxidize acetyl-CoA to CO(2) but also to grow chemolithoautotrophically. The presence of more than 250 proteins of the sensory/regulatory protein families should enable Db. autotrophicum HRM2 to efficiently adapt to changing environmental conditions. Genes encoding periplasmic or cytoplasmic hydrogenases and formate dehydrogenases have been detected as well as genes for the transmembrane TpII-c(3), Hme and Rnf complexes. Genes for subunits A, B, C and D as well as for the proposed novel subunits L and F of the heterodisulfide reductases are present. This enzyme is involved in energy conservation in methanoarchaea and it is speculated that it exhibits a similar function in the process of dissimilatory sulfate reduction in Db. autotrophicum HRM2.
Clavibacter michiganensis subsp. michiganensis is a plant-pathogenic actinomycete that causes bacterial wilt and canker of tomato. The nucleotide sequence of the genome of strain NCPPB382 was determined. The chromosome is circular, consists of 3.298 Mb, and has a high G+C content (72.6%). Annotation revealed 3,080 putative protein-encoding sequences; only 26 pseudogenes were detected. Two rrn operons, 45 tRNAs, and three small stable RNA genes were found. The two circular plasmids, pCM1 (27.4 kbp) and pCM2 (70.0 kbp), which carry pathogenicity genes and thus are essential for virulence, have lower G+C contents (66.5 and 67.6%, respectively). In contrast to the genome of the closely related organism Clavibacter michiganensis subsp. sepedonicus, the genome of C. michiganensis subsp. michiganensis lacks complete insertion elements and transposons. The 129-kb chp/tomA region with a low G+C content near the chromosomal origin of replication was shown to be necessary for pathogenicity. This region contains numerous genes encoding proteins involved in uptake and metabolism of sugars and several serine proteases. There is evidence that single genes located in this region, especially genes encoding serine proteases, are required for efficient colonization of the host. Although C. michiganensis subsp. michiganensis grows mainly in the xylem of tomato plants, no evidence for pronounced genome reduction was found. C. michiganensis subsp. michiganensis seems to have as many transporters and regulators as typical soil-inhabiting bacteria. However, the apparent lack of a sulfate reduction pathway, which makes C. michiganensis subsp. michiganensis dependent on reduced sulfur compounds for growth, is probably the reason for the poor survival of C. michiganensis subsp. michiganensis in soil.
BACKGROUND: Bordetella petrii is the only environmental species hitherto found among the otherwise host-restricted and pathogenic members of the genus Bordetella. Phylogenetically, it connects the pathogenic Bordetellae and environmental bacteria of the genera Achromobacter and Alcaligenes, which are opportunistic pathogens. B. petrii strains have been isolated from very different environmental niches, including river sediment, polluted soil, marine sponges and a grass root. Recently, clinical isolates associated with bone degenerative disease or cystic fibrosis have also been described. RESULTS: In this manuscript we present the results of the analysis of the completely annotated genome sequence of the B. petrii strain DSMZ12804. B. petrii has a mosaic genome of 5,287,950 bp harboring numerous mobile genetic elements, including seven large genomic islands. Four of them are highly related to the clc element of Pseudomonas knackmussii B13, which encodes genes involved in the degradation of aromatics. Though being an environmental isolate, the sequenced B. petrii strain also encodes proteins related to virulence factors of the pathogenic Bordetellae, including the filamentous hemagglutinin, which is a major colonization factor of B. pertussis, and the master virulence regulator BvgAS. However, it lacks all known toxins of the pathogenic Bordetellae. CONCLUSION: The genomic analysis suggests that B. petrii represents an evolutionary link between free-living environmental bacteria and the host-restricted obligate pathogenic Bordetellae. Its remarkable metabolic versatility may enable B. petrii to thrive in very different ecological niches.
Corynebacterium kroppenstedtii is a lipophilic corynebacterial species that lacks in the cell envelope the characteristic alpha-alkyl-beta-hydroxy long-chain fatty acids, designated mycolic acids. We report here the bioinformatic analysis of genome data obtained by pyrosequencing of the type strain C. kroppenstedtii DSM44385 that was initially isolated from human sputum. A single run with the Genome Sequencer FLX system revealed 560,248 shotgun reads with 110,018,974 detected bases that were assembled into a contiguous genomic sequence with a total size of 2,446,804bp. Automatic annotation of the complete genome sequence resulted in the prediction of 2122 coding sequences, of which 29% were considered as specific for C. kroppenstedtii when compared with predicted proteins from hitherto sequenced pathogenic corynebacteria. This comparative content analysis of the genome data revealed a large repertoire of genes involved in sugar uptake and central carbohydrate metabolism and the presence of the mevalonate route for isoprenoid biosynthesis. The lack of mycolic acids and the lipophilic lifestyle of C. kroppenstedtii are apparently caused by gene loss, including a condensase gene cluster, a mycolate reductase gene, and a microbial type I fatty acid synthase gene. A complete beta-oxidation pathway involved in the degradation of fatty acids is present in the genome. Evaluation of the genomic data indicated that lipophilism is the dominant feature involved in pathogenicity of C. kroppenstedtii.
Corynebacterium urealyticum is a lipid-requiring, urealytic bacterium of the human skin flora that has been recognized as causative agent of urinary tract infections. We report the analysis of the complete genome sequence of C. urealyticum DSM7109, which was initially recovered from a patient with alkaline-encrusted cystitis. The genome sequence was determined by a combination of pyrosequencing and Sanger technology. The chromosome of C. urealyticum DSM7109 has a size of 2,369,219bp and contains 2024 predicted coding sequences, of which 78% were considered as orthologous with genes in the Corynebacterium jeikeium K411 genome. Metabolic analysis of the lipid-requiring phenotype revealed the absence of a fatty acid synthase gene and the presence of a beta-oxidation pathway along with a large repertoire of auxillary genes for the degradation of exogenous fatty acids. A urease locus with the gene order ureABCEFGD may play a pivotal role in virulence of C. urealyticum by the alkalinization of human urine and the formation of struvite stones. Multidrug resistance of C. urealyticum DSM7109 is mediated by transposable elements, conferring resistances to macrolides, lincosamides, ketolides, aminoglycosides, chloramphenicol, and tetracycline. The complete genome sequence of C. urealyticum revealed a detailed picture of the lifestyle of this opportunistic human pathogen.
The complete genome sequence of the Xanthomonas campestris pv. campestris strain B100 was established. It consisted of a chromosome of 5,079,003bp, with 4471 protein-coding genes and 62 RNA genes. Comparative genomics showed that the genes required for the synthesis of xanthan and xanthan precursors were highly conserved among three sequenced X. campestris pv. campestris genomes, but differed noticeably when compared to the remaining four Xanthomonas genomes available. For the xanthan biosynthesis genes gumB and gumK earlier translational starts were proposed, while gumI and gumL turned out to be unique with no homologues beyond the Xanthomonas genomes sequenced. From the genomic data the biosynthesis pathways for the production of the exopolysaccharide xanthan could be elucidated. The first step of this process is the uptake of sugars serving as carbon and energy sources wherefore genes for 15 carbohydrate import systems could be identified. Metabolic pathways playing a role for xanthan biosynthesis could be deduced from the annotated genome. These reconstructed pathways concerned the storage and metabolization of the imported sugars. The recognized sugar utilization pathways included the Entner-Doudoroff and the pentose phosphate pathway as well as the Embden-Meyerhof pathway (glycolysis). The reconstruction indicated that the nucleotide sugar precursors for xanthan can be converted from intermediates of the pentose phosphate pathway, some of which are also intermediates of glycolysis or the Entner-Doudoroff pathway. Xanthan biosynthesis requires in particular the nucleotide sugars UDP-glucose, UDP-glucuronate, and GDP-mannose, from which xanthan repeat units are built under the control of the gum genes. The updated genome annotation data allowed reconsidering and refining the mechanistic model for xanthan biosynthesis.
The genus Sorangium synthesizes approximately half of the secondary metabolites isolated from myxobacteria, including the anti-cancer metabolite epothilone. We report the complete genome sequence of the model Sorangium strain S. cellulosum So ce56, which produces several natural products and has morphological and physiological properties typical of the genus. The circular genome, comprising 13,033,779 base pairs, is the largest bacterial genome sequenced to date. No global synteny with the genome of Myxococcus xanthus is apparent, revealing an unanticipated level of divergence between these myxobacteria. A large percentage of the genome is devoted to regulation, particularly post-translational phosphorylation, which probably supports the strain's complex, social lifestyle. This regulatory network includes the highest number of eukaryotic protein kinase-like kinases discovered in any organism. Seventeen secondary metabolite loci are encoded in the genome, as well as many enzymes with potential utility in industry.
        
Title: Sequence analysis of the 181-kb accessory plasmid pSmeSM11b, isolated from a dominant Sinorhizobium meliloti strain identified during a long-term field release experiment Stiens M, Schneiker S, Puhler A, Schluter A Ref: FEMS Microbiology Letters, 271:297, 2007 : PubMed
The 181 251 bp accessory plasmid pSmeSM11b of Sinorhizobium meliloti strain SM11, belonging to a dominant indigenous S. meliloti subpopulation identified during a long-term field release experiment, was sequenced. This plasmid has 166 coding sequences (CDSs), 42% of which encode proteins with homology to proteins of known function. Plasmid pSmeSM11b is a member of the repABC replicon family and contains a large gene region coding for a conjugation system similar to that of other self-transmissible plasmids in Rhizobium and Agrobacterium. Another pSmeSM11b gene region, possibly involved in sugar metabolism and polysaccharide catabolism, resembled a region of S. meliloti 1021 megaplasmid pSymB and in the genome of Sinorhizobium medicae WSM419. Another module of plasmid pSmeSM11b encodes proteins similar to those of the nitrogen-fixing actinomycete Frankia CcI3, and which are likely to be involved in the synthesis of a secondary metabolite. Several ORFs of pSmeSM11b were predicted to play a role in nonribosomal peptide synthesis. Plasmid pSmeSM11b has many mobile genetic elements, which contribute to the mosaic composition of the plasmid.
We present the complete genome sequence of Listeria welshimeri, a nonpathogenic member of the genus Listeria. Listeria welshimeri harbors a circular chromosome of 2,814,130 bp with 2,780 open reading frames. Comparative genomic analysis of chromosomal regions between L. welshimeri, Listeria innocua, and Listeria monocytogenes shows strong overall conservation of synteny, with the exception of the translocation of an F(o)F(1) ATP synthase. The smaller size of the L. welshimeri genome is the result of deletions in all of the genes involved in virulence and of "fitness" genes required for intracellular survival, transcription factors, and LPXTG- and LRR-containing proteins as well as 55 genes involved in carbohydrate transport and metabolism. In total, 482 genes are absent from L. welshimeri relative to L. monocytogenes. Of these, 249 deletions are commonly absent in both L. welshimeri and L. innocua, suggesting similar genome evolutionary paths from an ancestor. We also identified 311 genes specific to L. welshimeri that are absent in the other two species, indicating gene expansion in L. welshimeri, including horizontal gene transfer. The species L. welshimeri appears to have been derived from early evolutionary events and an ancestor more compact than L. monocytogenes that led to the emergence of nonpathogenic Listeria spp.
Azoarcus sp. strain BH72, a mutualistic endophyte of rice and other grasses, is of agrobiotechnological interest because it supplies biologically fixed nitrogen to its host and colonizes plants in remarkably high numbers without eliciting disease symptoms. The complete genome sequence is 4,376,040-bp long and contains 3,992 predicted protein-coding sequences. Genome comparison with the Azoarcus-related soil bacterium strain EbN1 revealed a surprisingly low degree of synteny. Coding sequences involved in the synthesis of surface components potentially important for plant-microbe interactions were more closely related to those of plant-associated bacteria. Strain BH72 appears to be 'disarmed' compared to plant pathogens, having only a few enzymes that degrade plant cell walls; it lacks type III and IV secretion systems, related toxins and an N-acyl homoserine lactones-based communication system. The genome contains remarkably few mobile elements, indicating a low rate of recent gene transfer that is presumably due to adaptation to a stable, low-stress microenvironment.
Alcanivorax borkumensis is a cosmopolitan marine bacterium that uses oil hydrocarbons as its exclusive source of carbon and energy. Although barely detectable in unpolluted environments, A. borkumensis becomes the dominant microbe in oil-polluted waters. A. borkumensis SK2 has a streamlined genome with a paucity of mobile genetic elements and energy generation-related genes, but with a plethora of genes accounting for its wide hydrocarbon substrate range and efficient oil-degradation capabilities. The genome further specifies systems for scavenging of nutrients, particularly organic and inorganic nitrogen and oligo-elements, biofilm formation at the oil-water interface, biosurfactant production and niche-specific stress responses. The unique combination of these features provides A. borkumensis SK2 with a competitive edge in oil-polluted environments. This genome sequence provides the basis for the future design of strategies to mitigate the ecological damage caused by oil spills.
        
Title: Sequence analysis of the 144-kilobase accessory plasmid pSmeSM11a, isolated from a dominant Sinorhizobium meliloti strain identified during a long-term field release experiment Stiens M, Schneiker S, Keller M, Kuhn S, Puhler A, Schluter A Ref: Applied Environmental Microbiology, 72:3662, 2006 : PubMed
The genome of Sinorhizobium meliloti type strain Rm1021 consists of three replicons: the chromosome and two megaplasmids, pSymA and pSymB. Additionally, many indigenous S. meliloti strains possess one or more smaller plasmids, which represent the accessory genome of this species. Here we describe the complete nucleotide sequence of an accessory plasmid, designated pSmeSM11a, that was isolated from a dominant indigenous S. meliloti subpopulation in the context of a long-term field release experiment with genetically modified S. meliloti strains. Sequence analysis of plasmid pSmeSM11a revealed that it is 144,170 bp long and has a mean G+C content of 59.5 mol%. Annotation of the sequence resulted in a total of 160 coding sequences. Functional predictions could be made for 43% of the genes, whereas 57% of the genes encode hypothetical or unknown gene products. Two plasmid replication modules, one belonging to the repABC replicon family and the other belonging to the plasmid type A replicator region family, were identified. Plasmid pSmeSM11a contains a mobilization (mob) module composed of the type IV secretion system-related genes traG and traA and a putative mobC gene. A large continuous region that is about 42 kb long is very similar to a corresponding region located on S. meliloti Rm1021 megaplasmid pSymA. Single-base-pair deletions in the homologous regions are responsible for frameshifts that result in nonparalogous coding sequences. Plasmid pSmeSM11a carries additional copies of the nodulation genes nodP and nodQ that are responsible for Nod factor sulfation. Furthermore, a tauD gene encoding a putative taurine dioxygenase was identified on pSmeSM11a. An acdS gene located on pSmeSM11a is the first example of such a gene in S. meliloti. The deduced acdS gene product is able to deaminate 1-aminocyclopropane-1-carboxylate and is proposed to be involved in reducing the phytohormone ethylene, thus influencing nodulation events. The presence of numerous insertion sequences suggests that these elements mediated acquisition of accessory plasmid modules.
Corynebacterium jeikeium is a "lipophilic" and multidrug-resistant bacterial species of the human skin flora that has been recognized with increasing frequency as a serious nosocomial pathogen. Here we report the genome sequence of the clinical isolate C. jeikeium K411, which was initially recovered from the axilla of a bone marrow transplant patient. The genome of C. jeikeium K411 consists of a circular chromosome of 2,462,499 bp and the 14,323-bp bacteriocin-producing plasmid pKW4. The chromosome of C. jeikeium K411 contains 2,104 predicted coding sequences, 52% of which were considered to be orthologous with genes in the Corynebacterium glutamicum, Corynebacterium efficiens, and Corynebacterium diphtheriae genomes. These genes apparently represent the chromosomal backbone that is conserved between the four corynebacteria. Among the genes that lack an ortholog in the known corynebacterial genomes, many are located close to transposable elements or revealed an atypical G+C content, indicating that horizontal gene transfer played an important role in the acquisition of genes involved in iron and manganese homeostasis, in multidrug resistance, in bacterium-host interaction, and in virulence. Metabolic analyses of the genome sequence indicated that the "lipophilic" phenotype of C. jeikeium most likely originates from the absence of fatty acid synthase and thus represents a fatty acid auxotrophy. Accordingly, both the complete gene repertoire and the deduced lifestyle of C. jeikeium K411 largely reflect the strict dependence of growth on the presence of exogenous fatty acids. The predicted virulence factors of C. jeikeium K411 are apparently involved in ensuring the availability of exogenous fatty acids by damaging the host tissue.
The gram-negative plant-pathogenic bacterium Xanthomonas campestris pv. vesicatoria is the causative agent of bacterial spot disease in pepper and tomato plants, which leads to economically important yield losses. This pathosystem has become a well-established model for studying bacterial infection strategies. Here, we present the whole-genome sequence of the pepper-pathogenic Xanthomonas campestris pv. vesicatoria strain 85-10, which comprises a 5.17-Mb circular chromosome and four plasmids. The genome has a high G+C content (64.75%) and signatures of extensive genome plasticity. Whole-genome comparisons revealed a gene order similar to both Xanthomonas axonopodis pv. citri and Xanthomonas campestris pv. campestris and a structure completely different from Xanthomonas oryzae pv. oryzae. A total of 548 coding sequences (12.2%) are unique to X. campestris pv. vesicatoria. In addition to a type III secretion system, which is essential for pathogenicity, the genome of strain 85-10 encodes all other types of protein secretion systems described so far in gram-negative bacteria. Remarkably, one of the putative type IV secretion systems encoded on the largest plasmid is similar to the Icm/Dot systems of the human pathogens Legionella pneumophila and Coxiella burnetii. Comparisons with other completely sequenced plant pathogens predicted six novel type III effector proteins and several other virulence factors, including adhesins, cell wall-degrading enzymes, and extracellular polysaccharides.
        
Title: Identification and functional analysis of six mycolyltransferase genes of Corynebacterium glutamicum ATCC 13032: the genes cop1, cmt1, and cmt2 can replace each other in the synthesis of trehalose dicorynomycolate, a component of the mycolic acid layer of the cell envelope Brand S, Niehaus K, Puhler A, Kalinowski J Ref: Arch Microbiol, 180:33, 2003 : PubMed
By data mining in the sequence of the Corynebacterium glutamicum ATCC 13032 genome, six putative mycolyltransferase genes were identified that code for proteins with similarity to the N-terminal domain of the mycolic acid transferase PS1 of the related C. glutamicum strain ATCC 17965. The genes identified were designated cop1, cmt1, cmt2, cmt3, cmt4, and cmt5 ( cmt from corynebacterium mycolyl transferases). cop1 encodes a protein of 657 amino acids, which is larger than the proteins encoded by the cmt genes with 365, 341, 483, 483, and 411 amino acids. Using bioinformatics tools, it was shown that all six gene products are equipped with signal peptides and esterase domains. Proteome analyses of the cell envelope of C. glutamicum ATCC 13032 resulted in identification of the proteins Cop1, Cmt1, Cmt2, and Cmt4. All six mycolyltransferase genes were used for mutational analysis. cmt4 could not be mutated and is considered to be essential. cop1 was found to play an additional role in cell shape formation. A triple mutant carrying mutations in cop1, cmt1, and cmt2 aggregated when cultivated in MM1 liquid medium. This mutant was also no longer able to synthesize trehalose di coryno mycolate (TDCM). Since single and double mutants of the genes cop1, cmt1, and cmt2 could form TDCM, it is concluded that the three genes, cop1, cmt1, and cmt2, are involved in TDCM biosynthesis. The presence of the putative esterase domain makes it highly possible that cop1, cmt1, and cmt2 encode enzymes synthesizing TDCM from trehalose monocorynomycolate.
The complete genomic sequence of Corynebacterium glutamicum ATCC 13032, well-known in industry for the production of amino acids, e.g. of L-glutamate and L-lysine was determined. The C. glutamicum genome was found to consist of a single circular chromosome comprising 3282708 base pairs. Several DNA regions of unusual composition were identified that were potentially acquired by horizontal gene transfer, e.g. a segment of DNA from C. diphtheriae and a prophage-containing region. After automated and manual annotation, 3002 protein-coding genes have been identified, and to 2489 of these, functions were assigned by homologies to known proteins. These analyses confirm the taxonomic position of C. glutamicum as related to Mycobacteria and show a broad metabolic diversity as expected for a bacterium living in the soil. As an example for biotechnological application the complete genome sequence was used to reconstruct the metabolic flow of carbon into a number of industrially important products derived from the amino acid L-aspartate.
        
Title: The alanine racemase gene alr is an alternative to antibiotic resistance genes in cloning systems for industrial Corynebacterium glutamicum strains Tauch A, Gotker S, Puhler A, Kalinowski J, Thierbach G Ref: J Biotechnol, 99:79, 2002 : PubMed
The potential of the alanine racemase gene alr from Corynebacterium glutamicum ATCC 13032 to substitute for antibiotic resistance determinants in cloning systems has been investigated. The alr gene was identified by a PCR technique and its nucleotide sequence was determined. The deduced protein revealed the highest amino acid sequence similarity to the Alr protein from Mycobacterium smegmatis with 45% identical and 58% similar amino acids. A defined alr deletion mutant of C. glutamicum displayed a strict dependence on the presence of D-alanine for growth on complex and minimal medium. The alr gene was placed on a novel C. glutamicum vector which is completely free of antibiotic resistance genes. In vivo complementation of the chromosomal alr deletion with alr-carrying vectors permitted growth of the mutant strain in the absence of external D-alanine and provided strong selective pressure to maintain the plasmid. The alr gene enabled the selection of C. glutamicum transformants with a similar efficiency as the tetracycline resistance gene tetA(33). These data provided experimental evidence that the alr gene can be applied as an alternative selection marker to antibiotic resistance genes in industrial C. glutamicum strains. In an application example, the novel deltaalr host-alr(+) vector-system for C. glutamicum was used to overproduce the vitamin D-pantothenic acid.
Sinorhizobium meliloti is an alpha-proteobacterium that forms agronomically important N(2)-fixing root nodules in legumes. We report here the complete sequence of the largest constituent of its genome, a 62.7% GC-rich 3,654,135-bp circular chromosome. Annotation allowed assignment of a function to 59% of the 3,341 predicted protein-coding ORFs, the rest exhibiting partial, weak, or no similarity with any known sequence. Unexpectedly, the level of reiteration within this replicon is low, with only two genes duplicated with more than 90% nucleotide sequence identity, transposon elements accounting for 2.2% of the sequence, and a few hundred short repeated palindromic motifs (RIME1, RIME2, and C) widespread over the chromosome. Three regions with a significantly lower GC content are most likely of external origin. Detailed annotation revealed that this replicon contains all housekeeping genes except two essential genes that are located on pSymB. Amino acid/peptide transport and degradation and sugar metabolism appear as two major features of the S. meliloti chromosome. The presence in this replicon of a large number of nucleotide cyclases with a peculiar structure, as well as of genes homologous to virulence determinants of animal and plant pathogens, opens perspectives in the study of this bacterium both as a free-living soil microorganism and as a plant symbiont.
Analysis of the 1,683,333-nt sequence of the pSymB megaplasmid from the symbiotic N(2)-fixing bacterium Sinorhizobium meliloti revealed that the replicon has a high gene density with a total of 1,570 protein-coding regions, with few insertion elements and regions duplicated elsewhere in the genome. The only copies of an essential arg-tRNA gene and the minCDE genes are located on pSymB. Almost 20% of the pSymB sequence carries genes encoding solute uptake systems, most of which were of the ATP-binding cassette family. Many previously unsuspected genes involved in polysaccharide biosynthesis were identified and these, together with the two known distinct exopolysaccharide synthesis gene clusters, show that 14% of the pSymB sequence is dedicated to polysaccharide synthesis. Other recognizable gene clusters include many involved in catabolic activities such as protocatechuate utilization and phosphonate degradation. The functions of these genes are consistent with the notion that pSymB plays a major role in the saprophytic competence of the bacteria in the soil environment.
The scarcity of usable nitrogen frequently limits plant growth. A tight metabolic association with rhizobial bacteria allows legumes to obtain nitrogen compounds by bacterial reduction of dinitrogen (N2) to ammonium (NH4+). We present here the annotated DNA sequence of the alpha-proteobacterium Sinorhizobium meliloti, the symbiont of alfalfa. The tripartite 6.7-megabase (Mb) genome comprises a 3.65-Mb chromosome, and 1.35-Mb pSymA and 1.68-Mb pSymB megaplasmids. Genome sequence analysis indicates that all three elements contribute, in varying degrees, to symbiosis and reveals how this genome may have emerged during evolution. The genome sequence will be useful in understanding the dynamics of interkingdom associations and of life in soil environments.
        
Title: Molecular analysis of the Corynebacterium glutamicum lysl gene involved in lysine uptake Seep-Feldhaus AH, Kalinowski J, Puhler A Ref: Molecular Microbiology, 5:2995, 1991 : PubMed
Two Corynebacterium glutamicum mutants defective in lysine uptake were identified by analysing mutants resistant to S-(2-aminoethyl)-cysteine (AEC). A 5.6 kb genomic DNA fragment restoring AEC sensitivity and lysine uptake was isolated. A 4.2 kb subfragment was sequenced and three open reading frames were identified. Subcloning and gene disruption experiments showed that only the first open reading frame, termed lysl, is involved in lysine uptake. Lysl consists of 501 amino acids with a Mr of 53600. The hydrophobicity profile suggests that the lysl gene product is an integral membrane protein with 13 transmembrane segments. The amino acid sequence of lysl displays strong homology to that of the arcD gene product of Pseudomonas aeruginosa, which is proposed to act as an arginine-ornithine antiporter. Investigation of the influence of the lysl gene on lysine secretion suggests the existence of a separate lysine efflux system in C. glutamicum.
        
Title: Nucleotide sequence of the phosphinothricin N-acetyltransferase gene from Streptomyces viridochromogenes Tu494 and its expression in Nicotiana tabacum Wohlleben W, Arnold W, Broer I, Hillemann D, Strauch E, Puhler A Ref: Gene, 70:25, 1988 : PubMed
The phosphinothricin (Pt) N-acetyltransferase gene (pat) of Streptomyces viridochromogenes Tu494 is located on a 0.8-kb BglII fragment [Strauch et al., Gene 63 (1988) 65-74]. By sequencing a 1.3-kb BglII-SstII fragment, an open reading frame representing the pat gene was found. It encodes a polypeptide of 183 amino acids with an Mr of 20,621. The base composition of the pat gene is typical for Streptomyces [70.1 mol% (G + C) in total and 93.5 mol% (G + C) in the third position]. Translation of pat is initiated by a GTG codon which was identified by frameshift mutations in Escherichia coli as well as in Streptomyces lividans. Significant homology of the pat gene was found to the bialaphos-resistance gene (bar) of Streptomyces hygroscopicus [Thompson et al., EMBO J. 9 (1987) 2519-2523]. However, variations were detected in the 5'-noncoding region of the two resistance genes which may reflect differences in regulation. Since Pt is a potent herbicide, the pat gene was modified and introduced into Nicotiana tabacum by Agrobacterium-mediated leaf-disc transformation. The GTG start codon of pat was replaced by ATG. Subsequently the modified pat-coding region was fused to the 35S promoter of the cauliflower mosaic virus. Transgenic plants could directly be selected on Pt-containing medium.