(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Bacteria: NE > Terrabacteria group: NE > Actinobacteria [phylum]: NE > Actinobacteria [class]: NE > Corynebacteriales: NE > Mycobacteriaceae: NE > Mycobacterium: NE > Mycobacterium tuberculosis complex: NE > Mycobacterium tuberculosis: NE
Warning: This entry is a compilation of different species or line or strain with more than 90% amino acid identity. You can retrieve all strain data
(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) Mycobacterium bovis BCG str. Tokyo 172: N, E.
Mycobacterium bovis BCG str. Pasteur 1173P2: N, E.
Mycobacterium bovis: N, E.
Mycobacterium bovis BCG: N, E.
Mycobacterium bovis BCG str. Mexico: N, E.
Mycobacterium bovis AN5: N, E.
Mycobacterium bovis BCG str. Korea 1168P: N, E.
Mycobacterium bovis BCG str. ATCC 35743: N, E.
Mycobacterium bovis AF2122/97: N, E.
Mycobacterium bovis 04-303: N, E.
Mycobacterium bovis BCG str. Moreau RDJ: N, E.
Mycobacterium tuberculosis TKK-01-0051: N, E.
Mycobacterium tuberculosis EAS054: N, E.
Mycobacterium tuberculosis F11: N, E.
Mycobacterium tuberculosis KZN 1435: N, E.
Mycobacterium tuberculosis H37Ra: N, E.
Mycobacterium tuberculosis T17: N, E.
Mycobacterium tuberculosis T85: N, E.
Mycobacterium tuberculosis 94_M4241A: N, E.
Mycobacterium tuberculosis 02_1987: N, E.
Mycobacterium tuberculosis T46: N, E.
Mycobacterium tuberculosis C: N, E.
Mycobacterium tuberculosis GM 1503: N, E.
Mycobacterium tuberculosis CPHL_A: N, E.
Mycobacterium tuberculosis K85: N, E.
Mycobacterium tuberculosis CDC1551: N, E.
Mycobacterium tuberculosis SUMu011: N, E.
Mycobacterium tuberculosis SUMu010: N, E.
Mycobacterium tuberculosis SUMu009: N, E.
Mycobacterium tuberculosis SUMu008: N, E.
Mycobacterium tuberculosis SUMu007: N, E.
Mycobacterium tuberculosis SUMu006: N, E.
Mycobacterium tuberculosis SUMu003: N, E.
Mycobacterium tuberculosis SUMu012: N, E.
Mycobacterium tuberculosis SUMu005: N, E.
Mycobacterium tuberculosis SUMu004: N, E.
Mycobacterium tuberculosis SUMu002: N, E.
Mycobacterium tuberculosis SUMu001: N, E.
Mycobacterium tuberculosis str. Haarlem: N, E.
Mycobacterium tuberculosis T92: N, E.
Mycobacterium tuberculosis str. Erdman = ATCC 35801: N, E.
Mycobacterium tuberculosis FJ05194: N, E.
Mycobacterium tuberculosis EAI5/NITR206: N, E.
Mycobacterium tuberculosis UT205: N, E.
Mycobacterium tuberculosis CCDC5180: N, E.
Mycobacterium tuberculosis H37Rv: N, E.
Mycobacterium tuberculosis CDC1551A: N, E.
Mycobacterium tuberculosis CCDC5079: N, E.
Mycobacterium tuberculosis BT2: N, E.
Mycobacterium tuberculosis EAI5: N, E.
Mycobacterium tuberculosis W-148: N, E.
Mycobacterium tuberculosis CTRI-2: N, E.
Mycobacterium tuberculosis RGTB327: N, E.
Mycobacterium tuberculosis str. Haarlem/NITR202: N, E.
Mycobacterium tuberculosis '98-R604 INH-RIF-EM': N, E.
Mycobacterium tuberculosis str. Beijing/NITR203: N, E.
Mycobacterium tuberculosis HKBS1: N, E.
Mycobacterium tuberculosis CAS/NITR204: N, E.
Mycobacterium tuberculosis 7199-99: N, E.
Mycobacterium tuberculosis KZN 605: N, E.
Mycobacterium tuberculosis NCGM2209: N, E.
Mycobacterium tuberculosis BT1: N, E.
Mycobacterium tuberculosis RGTB423: N, E.
Mycobacterium tuberculosis KZN 4207: N, E.
Mycobacterium tuberculosis GuangZ0019: N, E.
Mycobacterium tuberculosis 2092HD: N, E.
Mycobacterium tuberculosis variant caprae: N, E.
Mycobacterium tuberculosis variant africanum: N, E.
Mycobacterium tuberculosis variant microti OV254: N, E.
Mycobacterium africanum K85: N, E.
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MARNPSPALDRPWRRPGALRYALERVRGVAKPPITVTDPPADVVIERDVE VPTRDGTLLRINVFRSAEGGARPVIASIHPYGKDALPRRRGNRWTFSPQY RMLRQPKPLTFSALTGWEAPDPAWWTAQGFVVVNADSRGCGRSDGTGDLL SHQEAEDTYDLVGWLADQAWSDGRVVMLGVSYLAISQYAVAALQPPALRA ICPWEGFTDAYRDLAFPGGIRESGFTRLWSRGVRRRTRQTYDMEQMQEAH PLRDDFWRSRVPDLSAIKVPMLVCGSFSDNNLHSRGSIRAFTRSGCGHAR LYTHRGGKWETFYSATALSEQLKFLRDALAGSSGSRSVRLEVREDRDTIT AVREETQWPLAGTRWRPMYLAGPGLLATEPPPTAGSIRFQTRSRAAAFNW TIPEDIELTGPMAARLWVQLDGCDDANLFVGVEKWRDGQFVAFEGSYGWG RDRVTTGWQRVSLRELDPELSQPWEPVPACARPRPVTAGEVVAVDVALGP SATLFRAGEQLRLVVGGRWLSPRNPLTGQFPAAYPRPPRGRVTLHWGPRY DAHLLIPEVPG
Mycobacterium bovis is the causative agent of tuberculosis in a range of animal species and man, with worldwide annual losses to agriculture of $3 billion. The human burden of tuberculosis caused by the bovine tubercle bacillus is still largely unknown. M. bovis was also the progenitor for the M. bovis bacillus Calmette-Guerin vaccine strain, the most widely used human vaccine. Here we describe the 4,345,492-bp genome sequence of M. bovis AF2122/97 and its comparison with the genomes of Mycobacterium tuberculosis and Mycobacterium leprae. Strikingly, the genome sequence of M. bovis is >99.95% identical to that of M. tuberculosis, but deletion of genetic information has led to a reduced genome size. Comparison with M. leprae reveals a number of common gene losses, suggesting the removal of functional redundancy. Cell wall components and secreted proteins show the greatest variation, indicating their potential role in host-bacillus interactions or immune evasion. Furthermore, there are no genes unique to M. bovis, implying that differential gene expression may be the key to the host tropisms of human and bovine bacilli. The genome sequence therefore offers major insight on the evolution, host preference, and pathobiology of M. bovis.
Virulence and immunity are poorly understood in Mycobacterium tuberculosis. We sequenced the complete genome of the M. tuberculosis clinical strain CDC1551 and performed a whole-genome comparison with the laboratory strain H37Rv in order to identify polymorphic sequences with potential relevance to disease pathogenesis, immunity, and evolution. We found large-sequence and single-nucleotide polymorphisms in numerous genes. Polymorphic loci included a phospholipase C, a membrane lipoprotein, members of an adenylate cyclase gene family, and members of the PE/PPE gene family, some of which have been implicated in virulence or the host immune response. Several gene families, including the PE/PPE gene family, also had significantly higher synonymous and nonsynonymous substitution frequencies compared to the genome as a whole. We tested a large sample of M. tuberculosis clinical isolates for a subset of the large-sequence and single-nucleotide polymorphisms and found widespread genetic variability at many of these loci. We performed phylogenetic and epidemiological analysis to investigate the evolutionary relationships among isolates and the origins of specific polymorphic loci. A number of these polymorphisms appear to have occurred multiple times as independent events, suggesting that these changes may be under selective pressure. Together, these results demonstrate that polymorphisms among M. tuberculosis strains are more extensive than initially anticipated, and genetic variation may have an important role in disease pathogenesis and immunity.
Countless millions of people have died from tuberculosis, a chronic infectious disease caused by the tubercle bacillus. The complete genome sequence of the best-characterized strain of Mycobacterium tuberculosis, H37Rv, has been determined and analysed in order to improve our understanding of the biology of this slow-growing pathogen and to help the conception of new prophylactic and therapeutic interventions. The genome comprises 4,411,529 base pairs, contains around 4,000 genes, and has a very high guanine + cytosine content that is reflected in the biased amino-acid content of the proteins. M. tuberculosis differs radically from other bacteria in that a very large portion of its coding capacity is devoted to the production of enzymes involved in lipogenesis and lipolysis, and to two new families of glycine-rich proteins with a repetitive structure that may represent a source of antigenic variation.
        
24 lessTitle: The alpha/beta Hydrolase Fold Proteins of Mycobacterium tuberculosis, With Reference to their Contribution to Virulence Johnson G Ref: Curr Protein Pept Sci, 18:190, 2016 : PubMed
The alpha/beta hydrolase fold superfamily is an ancient and widely diversified group of primarily hydrolytic enzymes. In this review, the adaptations of these proteins to the pathogenic lifestyle of Mycobacterium tuberculosis (Mtb), the causative agent of tuberculosis, are examined. Of the 105 alpha/beta hydrolases identified in Mtb, many are associated with lipid metabolism, particularly in the biosynthesis and maintenance of the Mtb's unique cell envelope, as well in the large number of extracellular lipases that are likely responsible for degradation of host lipid material. alpha/beta hydrolase fold proteins are also involved in the evasion and modulation of the immune response, detoxification and metabolic adaptations, including growth, response to acidification of the intracellular environment and dormancy. A striking feature of Mtb's alpha/beta hydrolases is their diversification into virulence-associated niches. It is clear that the alpha/beta hydrolase fold family has made a significant contribution to Mtb's remarkable success as a pathogen.
        
Title: Whole-Genome Sequencing and Annotation of a Clinical Isolate of Mycobacterium tuberculosis from Mumbai, India Al Rashdi AS, Jadhav BL, Deshpande T, Deshpande U Ref: Genome Announc, 2:e00154, 2014 : PubMed
Mycobacterium bovis strain AN5 has been used to produce purified protein derivative (PPD) for the intradermal test for bovine tuberculosis since it was introduced in 1948. This work reports the draft genome sequence of M. bovis AN5, which is used for the production of bovine PPD in Brazil, as well as comparisons to other strains of M. bovis and Mycobacterium tuberculosis.
        
Title: Evidence of Clonal Expansion in the Genome of a Multidrug-Resistant Mycobacterium tuberculosis Clinical Isolate from Peru Galarza M, Tarazona D, Borda V, Agapito JC, Guio H Ref: Genome Announc, 2:, 2014 : PubMed
We report the genome sequence of Mycobacterium tuberculosis INS-MDR from Peru, a multidrug-resistant tuberculosis (MDR-TB) and Latin American-Mediterranean (LAM) lineage strain. Our analysis showed mutations related to drug resistance in the rpoB (D516V), katG (S315T), kasA (G269S), and pncA (Q10R) genes. Our evidence suggests that INS-MDR may be a clonal expansion related to the African strain KZN 1435.
        
Title: Functional Analysis Using Whole-Genome Sequencing of a Drug-Sensitive Mycobacterium tuberculosis Strain from Peru Tarazona D, Borda V, Galarza M, Agapito JC, Guio H Ref: Genome Announc, 2:, 2014 : PubMed
We report the whole-genome sequence of a Latin American-Mediterranean (LAM) lineage drug-sensitive Mycobacterium tuberculosis strain from Peru, INS-SEN. The functional analysis revealed more mutations in secondary metabolite biosynthesis, transport, and catabolism (clusters of orthologous groups [COG] category Q) than for other LAM-sensitive strains. This study contributes to the understanding of the genomic diversity of drug-sensitive M. tuberculosis.
Tuberculosis caused by multidrug-resistant (MDR) and extensively drug-resistant (XDR) Mycobacterium tuberculosis (MTB) strains is a growing problem in many countries. The availability of the complete nucleotide sequences of several MTB genomes allows to use the comparative genomics as a tool to study the relationships of strains and differences in their evolutionary history including acquisition of drug-resistance. In our work, we sequenced three genomes of Russian MTB strains of different phenotypes--drug susceptible, MDR and XDR. Of them, MDR and XDR strains were collected in Tomsk (Siberia, Russia) during the local TB outbreak in 1998-1999 and belonged to rare KQ and KY families in accordance with IS6110 typing, which are considered endemic for Russia. Based on phylogenetic analysis, our isolates belonged to different genetic families, Beijing, Ural and LAM, which made the direct comparison of their genomes impossible. For this reason we performed their comparison in the broader context of all M. tuberculosis genomes available in GenBank. The list of unique individual non-synonymous SNPs for each sequenced isolate was formed by comparison with all SNPs detected within the same phylogenetic group. For further functional analysis, all proteins with unique SNPs were ascribed to 20 different functional classes based on Clusters of Orthologous Groups (COG). We have confirmed drug resistant status of our isolates that harbored almost all known drug-resistance associated mutations. Unique SNPs of an XDR isolate CTRI-4(XDR), belonging to a Beijing family were compared in more detail with SNPs of additional 14 Russian XDR strains of the same family. Only type specific mutations in genes of repair, replication and recombination system (COG category L) were found common within this group. Probably the other unique SNPs discovered in CTRI-4(XDR) may have an important role in adaptation of this microorganism to its surrounding and in escape from antituberculosis drugs treatment.
Mycobacterium bovis bacillus Calmette-Guerin (BCG) is the only vaccine available against tuberculosis, and the strains used worldwide represent a family of daughter strains with distinct genotypic characteristics. Here, we report the complete genome sequence of M. bovis BCG Korea, the strain that will be actually used in Korea for vaccine production.
        
Title: Draft genome sequences of two super-extensively drug-resistant isolates of Mycobacterium tuberculosis from China Lin N, Liu Z, Zhou J, Wang S, Fleming J Ref: FEMS Microbiology Letters, 347:93, 2013 : PubMed
The prevalence of drug-resistance in Mycobacterium tuberculosis is already having a negative impact on the control of tuberculosis. We report the draft genome sequences of two super-extensively drug-resistant M. tuberculosis isolates from China, FJ05194 (lineage 2) and GuangZ0019 (lineage 4), and compare them with the H37Rv reference strain to identify possible sources of genetic variation associated with their extensive drug resistance. Our results suggest that their extensive drug resistance probably results from the stepwise accumulation of resistances to individual drugs.
        
Title: Whole-Genome Sequences of Four Clinical Isolates of Mycobacterium tuberculosis from Tamil Nadu, South India Narayanan S, Deshpande U Ref: Genome Announc, 1:, 2013 : PubMed
Mycobacterium bovis strain 04-303 was isolated from a wild boar living in a free-ranging field in Argentina. This work reports the draft genome sequence of this highly virulent strain and the genomic comparison of its major virulence-related genes with those of M. bovis strain AF2122/97 and Mycobacterium tuberculosis strain H37Rv.
BACKGROUND: Understanding Mycobacterium tuberculosis (Mtb) transmission is essential to guide efficient tuberculosis control strategies. Traditional strain typing lacks sufficient discriminatory power to resolve large outbreaks. Here, we tested the potential of using next generation genome sequencing for identification of outbreak-related transmission chains. METHODS AND FINDINGS: During long-term (1997 to 2010) prospective population-based molecular epidemiological surveillance comprising a total of 2,301 patients, we identified a large outbreak caused by an Mtb strain of the Haarlem lineage. The main performance outcome measure of whole genome sequencing (WGS) analyses was the degree of correlation of the WGS analyses with contact tracing data and the spatio-temporal distribution of the outbreak cases. WGS analyses of the 86 isolates revealed 85 single nucleotide polymorphisms (SNPs), subdividing the outbreak into seven genome clusters (two to 24 isolates each), plus 36 unique SNP profiles. WGS results showed that the first outbreak isolates detected in 1997 were falsely clustered by classical genotyping. In 1998, one clone (termed "Hamburg clone") started expanding, apparently independently from differences in the social environment of early cases. Genome-based clustering patterns were in better accordance with contact tracing data and the geographical distribution of the cases than clustering patterns based on classical genotyping. A maximum of three SNPs were identified in eight confirmed human-to-human transmission chains, involving 31 patients. We estimated the Mtb genome evolutionary rate at 0.4 mutations per genome per year. This rate suggests that Mtb grows in its natural host with a doubling time of approximately 22 h (400 generations per year). Based on the genome variation discovered, emergence of the Hamburg clone was dated back to a period between 1993 and 1997, hence shortly before the discovery of the outbreak through epidemiological surveillance. CONCLUSIONS: Our findings suggest that WGS is superior to conventional genotyping for Mtb pathogen tracing and investigating micro-epidemics. WGS provides a measure of Mtb genome evolution over time in its natural host context.
Global spread and limited genetic variation are hallmarks of M. tuberculosis, the agent of human tuberculosis. In contrast, Mycobacterium canettii and related tubercle bacilli that also cause human tuberculosis and exhibit unusual smooth colony morphology are restricted to East Africa. Here, we sequenced and analyzed the whole genomes of five representative strains of smooth tubercle bacilli (STB) using Sanger (4-5x coverage), 454/Roche (13-18x coverage) and/or Illumina DNA sequencing (45-105x coverage). We show that STB isolates are highly recombinogenic and evolutionarily early branching, with larger genome sizes, higher rates of genetic variation, fewer molecular scars and distinct CRISPR-Cas systems relative to M. tuberculosis. Despite the differences, all tuberculosis-causing mycobacteria share a highly conserved core genome. Mouse infection experiments showed that STB strains are less persistent and virulent than M. tuberculosis. We conclude that M. tuberculosis emerged from an ancestral STB-like pool of mycobacteria by gain of persistence and virulence mechanisms, and we provide insights into the molecular events involved.
BACKGROUND: With the emergence of next-generation sequencing, the availability of prokaryotic genome sequences is expanding rapidly. A total of 5,276 genomes have been released since 2008, yet only 1,692 genomes were complete. The final phase of microbial genome sequencing, particularly gap closing, is frequently the rate-limiting step either because of complex genomic structures that cause sequence bias even with high genomic coverage, or the presence of repeat sequences that may cause gaps in assembly. RESULTS: We have developed a Cytoscape plugin to facilitate gap closing for high-throughput sequencing data from microbial genomes. This plugin is capable of interactively displaying the relationships among genomic contigs derived from various sequencing formats. The sequence contigs of plasmids and special repeats (IS elements, ribosomal RNAs, terminal repeats, etc.) can be displayed as well. CONCLUSIONS: Displaying relationships between contigs using graphs in Cytoscape rather than tables provides a more straightforward visual representation. This will facilitate a faster and more precise determination of the linkages among contigs and greatly improve the efficiency of gap closing.
BACKGROUND: M. africanum West African 2 constitutes an ancient lineage of the M. tuberculosis complex that commonly causes human tuberculosis in West Africa and has an attenuated phenotype relative to M. tuberculosis. METHODOLOGY/PRINCIPAL FINDINGS: In search of candidate genes underlying these differences, the genome of M. africanum West African 2 was sequenced using classical capillary sequencing techniques. Our findings reveal a unique sequence, RD900, that was independently lost during the evolution of two important lineages within the complex: the "modern" M. tuberculosis group and the lineage leading to M. bovis. Closely related to M. bovis and other animal strains within the M. tuberculosis complex, M. africanum West African 2 shares an abundance of pseudogenes with M. bovis but also with M. africanum West African clade 1. Comparison with other strains of the M. tuberculosis complex revealed pseudogenes events in all the known lineages pointing toward ongoing genome erosion likely due to increased genetic drift and relaxed selection linked to serial transmission-bottlenecks and an intracellular lifestyle. CONCLUSIONS/SIGNIFICANCE: The genomic differences identified between M. africanum West African 2 and the other strains of the Mycobacterium tuberculosis complex may explain its attenuated phenotype, and pave the way for targeted experiments to elucidate the phenotypic characteristic of M. africanum. Moreover, availability of the whole genome data allows for verification of conservation of targets used for the next generation of diagnostics and vaccines, in order to ensure similar efficacy in West Africa.
Several genomes of different Mycobacterium tuberculosis isolates have been completely sequenced around the world. The genomic information obtained have shown higher diversity than originally thought and specific adaptations to different human populations. Within this work, we sequenced the genome of one Colombian M. tuberculosis virulent isolate. Genomic comparison against the reference genome of H37Rv and other strains showed multiple deletion and insertions that ranged between a few bases to thousands. Excluding PPE and PG-PGRS genes, 430 proteins present changes in at least 1 amino acid. Also, novel positions of the IS6110 mobile element were identified. This isolate is also characterized by a large genomic deletion of 3.6 kb, leading to the loss and modification of the dosR regulon genes, Rv1996 and Rv1997. To our knowledge, this is the first report of the genome sequence of a Latin American M. tuberculosis clinical isolate.
We report the completely annotated genome sequence of Mycobacterium tuberculosis Erdman (TMC 107; ATCC 35801), which is a well-known laboratory strain of M. tuberculosis.
Mycobacterium bovis bacillus Calmette-Guerin (BCG) is the only vaccine available against tuberculosis, and the strains used worldwide represent a family of daughter strains with distinct genotypic characteristics. Here we report the complete genome sequence of M. bovis BCG Moreau, the strain in continuous use in Brazil for vaccine production since the 1920s.
We report the annotated genome sequence of a clinical isolate, Mycobacterium tuberculosis strain NCGM2209, which belongs to the "Beijing family" and was isolated in Japan.
BACKGROUND: Studies of Mycobacterium bovis BCG strains used in different countries and vaccination programs show clear variations in the genomes and immune protective properties of BCG strains. The aim of this study was to characterise the genomic and immune proteomic profile of the BCG 1931 strain used in Mexico. RESULTS: BCG Mexico 1931 has a circular chromosome of 4,350,386 bp with a G+C content and numbers of genes and pseudogenes similar to those of BCG Tokyo and BCG Pasteur. BCG Mexico 1931 lacks Region of Difference 1 (RD1), RD2 and N-RD18 and one copy of IS6110, indicating that BCG Mexico 1931 belongs to DU2 group IV within the BCG vaccine genealogy. In addition, this strain contains three new RDs, which are 53 (RDMex01), 655 (RDMex02) and 2,847 bp (REDMex03) long, and 55 single-nucleotide polymorphisms representing non-synonymous mutations compared to BCG Pasteur and BCG Tokyo. In a comparative proteomic analysis, the BCG Mexico 1931, Danish, Phipps and Tokyo strains showed 812, 794, 791 and 701 protein spots, respectively. The same analysis showed that BCG Mexico 1931 shares 62% of its protein spots with the BCG Danish strain, 61% with the BCG Phipps strain and only 48% with the BCG Tokyo strain. Thirty-nine reactive spots were detected in BCG Mexico 1931 using sera from subjects with active tuberculosis infections and positive tuberculin skin tests. CONCLUSIONS: BCG Mexico 1931 has a smaller genome than the BCG Pasteur and BCG Tokyo strains. Two specific deletions in BCG Mexico 1931 are described (RDMex02 and RDMex03). The loss of RDMex02 (fadD23) is associated with enhanced macrophage binding and RDMex03 contains genes that may be involved in regulatory pathways. We also describe new antigenic proteins for the first time.
Mycobacterium bovis Bacille Calmette-Guerin (BCG) is the only vaccine available against tuberculosis (TB). A number of BCG strains are in use, and they exhibit biochemical and genetic differences. We report the genome sequences of four BCG strains representing different lineages, which will help to design more effective TB vaccines.
Mycobacterium tuberculosis is one of most prevalent pathogens in the world. Drug-resistant strains of this pathogen caused by the excessive use of antibiotics have long posed serious threats to public health worldwide. A broader picture of drug resistance mechanisms at the genomic level can be obtained only with large-scale comparative genomic methodology. Two closely related Beijing family isolates, one resistant to four first-line drugs (CCDC5180) and one sensitive to them (CCDC5079), were completely sequenced. These sequences will serve as valuable references for further drug resistance site identification studies and could be of great importance for developing drugs targeting these sites.
        
Title: Whole genome sequence analysis of Mycobacterium bovis bacillus Calmette-Guerin (BCG) Tokyo 172: a comparative study of BCG vaccine substrains Seki M, Honda I, Fujita I, Yano I, Yamamoto S, Koyama A Ref: Vaccine, 27:1710, 2009 : PubMed
To investigate the molecular characteristics of bacillus Calmette-Guerin (BCG) vaccines, the complete genomic sequence of Mycobacterium bovis BCG Tokyo 172 was determined, and the results were compared with those for BCG Pasteur and other M. tuberculosis complex. The genome of BCG Tokyo had a length of 4,371,711bp and contained 4033 genes, including 3950 genes coding for proteins (CDS). There were 18 regions of difference (showing differences of more than 20bp), 20 insertion or deletion (ins/del) mutations of less than 20bp, and 68 SNPs between the two BCG substrains. These findings are useful for better understanding of the genetic differences in BCG substrains due to in vitro evolution of BCG.
To understand the evolution, attenuation, and variable protective efficacy of bacillus Calmette-Guerin (BCG) vaccines, Mycobacterium bovis BCG Pasteur 1173P2 has been subjected to comparative genome and transcriptome analysis. The 4,374,522-bp genome contains 3,954 protein-coding genes, 58 of which are present in two copies as a result of two independent tandem duplications, DU1 and DU2. DU1 is restricted to BCG Pasteur, although four forms of DU2 exist; DU2-I is confined to early BCG vaccines, like BCG Japan, whereas DU2-III and DU2-IV occur in the late vaccines. The glycerol-3-phosphate dehydrogenase gene, glpD2, is one of only three genes common to all four DU2 variants, implying that BCG requires higher levels of this enzyme to grow on glycerol. Further amplification of the DU2 region is ongoing, even within vaccine preparations used to immunize humans. An evolutionary scheme for BCG vaccines was established by analyzing DU2 and other markers. Lesions in genes encoding sigma-factors and pleiotropic transcriptional regulators, like PhoR and Crp, were also uncovered in various BCG strains; together with gene amplification, these affect gene expression levels, immunogenicity, and, possibly, protection against tuberculosis. Furthermore, the combined findings suggest that early BCG vaccines may even be superior to the later ones that are more widely used.
Mycobacterium bovis is the causative agent of tuberculosis in a range of animal species and man, with worldwide annual losses to agriculture of $3 billion. The human burden of tuberculosis caused by the bovine tubercle bacillus is still largely unknown. M. bovis was also the progenitor for the M. bovis bacillus Calmette-Guerin vaccine strain, the most widely used human vaccine. Here we describe the 4,345,492-bp genome sequence of M. bovis AF2122/97 and its comparison with the genomes of Mycobacterium tuberculosis and Mycobacterium leprae. Strikingly, the genome sequence of M. bovis is >99.95% identical to that of M. tuberculosis, but deletion of genetic information has led to a reduced genome size. Comparison with M. leprae reveals a number of common gene losses, suggesting the removal of functional redundancy. Cell wall components and secreted proteins show the greatest variation, indicating their potential role in host-bacillus interactions or immune evasion. Furthermore, there are no genes unique to M. bovis, implying that differential gene expression may be the key to the host tropisms of human and bovine bacilli. The genome sequence therefore offers major insight on the evolution, host preference, and pathobiology of M. bovis.
Virulence and immunity are poorly understood in Mycobacterium tuberculosis. We sequenced the complete genome of the M. tuberculosis clinical strain CDC1551 and performed a whole-genome comparison with the laboratory strain H37Rv in order to identify polymorphic sequences with potential relevance to disease pathogenesis, immunity, and evolution. We found large-sequence and single-nucleotide polymorphisms in numerous genes. Polymorphic loci included a phospholipase C, a membrane lipoprotein, members of an adenylate cyclase gene family, and members of the PE/PPE gene family, some of which have been implicated in virulence or the host immune response. Several gene families, including the PE/PPE gene family, also had significantly higher synonymous and nonsynonymous substitution frequencies compared to the genome as a whole. We tested a large sample of M. tuberculosis clinical isolates for a subset of the large-sequence and single-nucleotide polymorphisms and found widespread genetic variability at many of these loci. We performed phylogenetic and epidemiological analysis to investigate the evolutionary relationships among isolates and the origins of specific polymorphic loci. A number of these polymorphisms appear to have occurred multiple times as independent events, suggesting that these changes may be under selective pressure. Together, these results demonstrate that polymorphisms among M. tuberculosis strains are more extensive than initially anticipated, and genetic variation may have an important role in disease pathogenesis and immunity.
Countless millions of people have died from tuberculosis, a chronic infectious disease caused by the tubercle bacillus. The complete genome sequence of the best-characterized strain of Mycobacterium tuberculosis, H37Rv, has been determined and analysed in order to improve our understanding of the biology of this slow-growing pathogen and to help the conception of new prophylactic and therapeutic interventions. The genome comprises 4,411,529 base pairs, contains around 4,000 genes, and has a very high guanine + cytosine content that is reflected in the biased amino-acid content of the proteins. M. tuberculosis differs radically from other bacteria in that a very large portion of its coding capacity is devoted to the production of enzymes involved in lipogenesis and lipolysis, and to two new families of glycine-rich proteins with a repetitive structure that may represent a source of antigenic variation.