(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Bacteria: NE > Terrabacteria group: NE > Firmicutes: NE > Bacilli: NE > Lactobacillales: NE > Streptococcaceae: NE > Streptococcus: NE > Streptococcus pneumoniae: NE
Warning: This entry is a compilation of different species or line or strain with more than 90% amino acide identity. You can retrieve all strain data
(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) Streptococcus pneumoniae R6: N, E.
Streptococcus pneumoniae SP14-BS69: N, E.
Streptococcus pneumoniae SP3-BS71: N, E.
Streptococcus pneumoniae SP19-BS75: N, E.
Streptococcus pneumoniae Taiwan19F-14: N, E.
Streptococcus pneumoniae MLV-016: N, E.
Streptococcus pneumoniae SP195: N, E.
Streptococcus pneumoniae SP11-BS70: N, E.
Streptococcus pneumoniae SP18-BS74: N, E.
Streptococcus pneumoniae D39: N, E.
Streptococcus pneumoniae CDC0288-04: N, E.
Streptococcus pneumoniae SP9-BS68: N, E.
Streptococcus pneumoniae CDC1873-00: N, E.
Streptococcus pneumoniae G54: N, E.
Streptococcus pneumoniae 70585: N, E.
Streptococcus pneumoniae CGSP14: N, E.
Streptococcus pneumoniae CDC3059-06: N, E.
Streptococcus pneumoniae ATCC 700669: N, E.
Streptococcus pneumoniae CDC1087-00: N, E.
Streptococcus pneumoniae P1031: N, E.
Streptococcus pneumoniae JJA: N, E.
Streptococcus pneumoniae SP6-BS73: N, E.
Streptococcus pneumoniae SP23-BS72: N, E.
Streptococcus pneumoniae Hungary19A-6: N, E.
Streptococcus pneumoniae INV104: N, E.
Streptococcus pneumoniae BS397: N, E.
Streptococcus pneumoniae INV200: N, E.
Streptococcus pneumoniae AP200: N, E.
Streptococcus pneumoniae GA04375: N, E.
Streptococcus pneumoniae SP14-BS292: N, E.
Streptococcus pneumoniae BS458: N, E.
Streptococcus pneumoniae BS455: N, E.
Streptococcus pneumoniae OXC141: N, E.
Streptococcus pneumoniae TCH8431/19A: N, E.
Streptococcus pneumoniae BS457: N, E.
Streptococcus pneumoniae SP-BS293: N, E.
Streptococcus pneumoniae TIGR4: N, E.
Streptococcus pneumoniae 670-6B: N, E.
Streptococcus pneumoniae GA47502: N, E.
Streptococcus pneumoniae GA44500: N, E.
Streptococcus pneumoniae 2070335: N, E.
Streptococcus pneumoniae PCS8203: N, E.
Streptococcus pneumoniae PCS8106: N, E.
Streptococcus mitis SK564: N, E.
Streptococcus mitis SK597: N, E.
Streptococcus oralis ATCC 35037: N, E.
Streptococcus oralis ATCC 49296: N, E.
Streptococcus sp. M143: N, E.
Streptococcus sp. C300: N, E.
Streptococcus oralis Uo5: N, E.
Streptococcus mitis SK321: N, E.
Streptococcus mitis B6: N, E.
Streptococcus mitis NCTC 12261: N, E.
Streptococcus mitis ATCC 6249: N, E.
Streptococcus sp. oral taxon 071 str. 73H25AP: N, E.
Streptococcus sp. M334: N, E.
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MKNPALLEEIKTYRGRDEVPEDFDAFWDGEVKNVSTLPSYHLEERDFHIP QVKCYELTFEGSKEGKVYARIVLPKSEEKVPLIFHFHGYMGRGWDWADML GFTVAGYGVVSMDVRGQSGYSQDGLRSPLGNTVKGHIIRGAVEGRDHLFY KDVYLDIYQLVEIVASLSQVDEKRLSSYGASQGGALALVAAALNPRIQKT VAIYPFLSDFRRVIEIGNTSEAYDELFRYFKFYDPFHETEEEIMATLAYI DVKNLAHRIQGEVKMITGLDDDVCYPITQFAIYNRLTCDKTYRIMPEYAH EAMNVFVNDQVYNWLCGSEIPFKYLK
The public availability of numerous microbial genomes is enabling the analysis of bacterial biology in great detail and with an unprecedented, organism-wide and taxon-wide, broad scope. Streptococcus pneumoniae is one of the most important bacterial pathogens throughout the world. We present here sequences and functional annotations for 2.1-Mbp of pneumococcal DNA, covering more than 90% of the total estimated size of the genome. The sequenced strain is a clinical isolate resistant to macrolides and tetracycline. It carries a type 19F capsular locus, but multilocus sequence typing for several conserved genetic loci suggests that the strain sequenced belongs to a pneumococcal lineage that most often expresses a serotype 15 capsular polysaccharide. A total of 2,046 putative open reading frames (ORFs) longer than 100 amino acids were identified (average of 1,009 bp per ORF), including all described two-component systems and aminoacyl tRNA synthetases. Comparisons to other complete, or nearly complete, bacterial genomes were made and are presented in a graphical form for all the predicted proteins.
Streptococcus pneumoniae is among the most significant causes of bacterial disease in humans. Here we report the 2,038,615-bp genomic sequence of the gram-positive bacterium S. pneumoniae R6. Because the R6 strain is avirulent and, more importantly, because it is readily transformed with DNA from homologous species and many heterologous species, it is the principal platform for investigation of the biology of this important pathogen. It is also used as a primary vehicle for genomics-based development of antibiotics for gram-positive bacteria. In our analysis of the genome, we identified a large number of new uncharacterized genes predicted to encode proteins that either reside on the surface of the cell or are secreted. Among those proteins there may be new targets for vaccine and antibiotic development.
The 2,160,837-base pair genome sequence of an isolate of Streptococcus pneumoniae, a Gram-positive pathogen that causes pneumonia, bacteremia, meningitis, and otitis media, contains 2236 predicted coding regions; of these, 1440 (64%) were assigned a biological role. Approximately 5% of the genome is composed of insertion sequences that may contribute to genome rearrangements through uptake of foreign DNA. Extracellular enzyme systems for the metabolism of polysaccharides and hexosamines provide a substantial source of carbon and nitrogen for S. pneumoniae and also damage host tissues and facilitate colonization. A motif identified within the signal peptide of proteins is potentially involved in targeting these proteins to the cell surface of low-guanine/cytosine (GC) Gram-positive species. Several surface-exposed proteins that may serve as potential vaccine candidates were identified. Comparative genome hybridization with DNA arrays revealed strain differences in S. pneumoniae that could contribute to differences in virulence and antigenicity.
Streptococcus oralis, a commensal species of the human oral cavity, belongs to the Mitis group of streptococci, which includes one of the major human pathogens as well, S. pneumoniae. We report here the first complete genome sequence of this species. S. oralis Uo5, a high-level penicillin- and multiple-antibiotic-resistant isolate from Hungary, is competent for genetic transformation under laboratory conditions. Comparative and functional genomics of Uo5 will be important in understanding the evolution of pathogenesis among Mitis streptococci and their potential to engage in interspecies gene transfer.
Streptococcus mitis is the closest relative of the major human pathogen S. pneumoniae. The 2,15 Mb sequence of the Streptococcus mitis B6 chromosome, an unusually high-level beta-lactam resistant and multiple antibiotic resistant strain, has now been determined to encode 2100 genes. The accessory genome is estimated to represent over 40%, including 75 mostly novel transposases and IS, the prophage phiB6 and another seven phage related regions. Tetracycline resistance mediated by Tn5801, and an unusual and large gene cluster containing three aminoglycoside resistance determinants have not been described in other Streptococcus spp. Comparative genomic analyses including hybridization experiments on a S. mitis B6 specific microarray reveal that individual S. mitis strains are almost as distantly related to the B6 strain as S. pneumoniae. Both species share a core of over 900 genes. Most proteins described as pneumococcal virulence factors are present in S. mitis B6, but the three choline binding proteins PcpA, PspA and PspC, and three gene clusters containing the hyaluronidase gene, ply and lytA, and the capsular genes are absent in S. mitis B6 and other S. mitis as well and confirm their importance for the pathogenetic potential of S. pneumoniae. Despite the close relatedness between the two species, the S. mitis B6 genome reveals a striking X-alignment when compared with S. pneumoniae.
The human microbiome refers to the community of microorganisms, including prokaryotes, viruses, and microbial eukaryotes, that populate the human body. The National Institutes of Health launched an initiative that focuses on describing the diversity of microbial species that are associated with health and disease. The first phase of this initiative includes the sequencing of hundreds of microbial reference genomes, coupled to metagenomic sequencing from multiple body sites. Here we present results from an initial reference genome sequencing of 178 microbial genomes. From 547,968 predicted polypeptides that correspond to the gene complement of these strains, previously unidentified ("novel") polypeptides that had both unmasked sequence length greater than 100 amino acids and no BLASTP match to any nonreference entry in the nonredundant subset were defined. This analysis resulted in a set of 30,867 polypeptides, of which 29,987 (approximately 97%) were unique. In addition, this set of microbial genomes allows for approximately 40% of random sequences from the microbiome of the gastrointestinal tract to be associated with organisms based on the match criteria used. Insights into pan-genome analysis suggest that we are still far from saturating microbial species genetic data sets. In addition, the associated metrics and standards used by our group for quality assurance are presented.
Streptococcus pneumoniae is a human commensal and pathogen able to cause a variety of diseases that annually result in over a million deaths worldwide. The S. pneumoniae(Spain23F) sequence type 81 lineage was among the first recognized pandemic clones and was responsible for almost 40% of penicillin-resistant pneumococcal infections in the United States in the late 1990s. Analysis of the chromosome sequence of a representative strain, and comparison with other available genomes, indicates roles for integrative and conjugative elements in the evolution of pneumococci and, more particularly, the emergence of the multidrug-resistant Spain 23F ST81 lineage. A number of recently acquired loci within the chromosome appear to encode proteins involved in the production of, or immunity to, antimicrobial compounds, which may contribute to the proficiency of this strain at nasopharyngeal colonization. However, further sequencing of other pandemic clones will be required to establish whether there are any general attributes shared by these strains that are responsible for their international success.
        
Title: Genome evolution driven by host adaptations results in a more virulent and antimicrobial-resistant Streptococcus pneumoniae serotype 14 Ding F, Tang P, Hsu MH, Cui P, Hu S, Yu J, Chiu CH Ref: BMC Genomics, 10:158, 2009 : PubMed
BACKGROUND: Streptococcus pneumoniae serotype 14 is one of the most common pneumococcal serotypes that cause invasive pneumococcal diseases worldwide. Serotype 14 often expresses resistance to a variety of antimicrobial agents, resulting in difficulties in treatment. To gain insight into the evolution of virulence and antimicrobial resistance traits in S. pneumoniae from the genome level, we sequenced the entire genome of a serotype 14 isolate (CGSP14), and carried out comprehensive comparison with other pneumococcal genomes. Multiple serotype 14 clinical isolates were also genotyped by multilocus sequence typing (MLST). RESULTS: Comparative genomic analysis revealed that the CGSP14 acquired a number of new genes by horizontal gene transfer (HGT), most of which were associated with virulence and antimicrobial resistance and clustered in mobile genetic elements. The most remarkable feature is the acquisition of two conjugative transposons and one resistance island encoding eight resistance genes. Results of MLST suggested that the major driving force for the genome evolution is the environmental drug pressure. CONCLUSION: The genome sequence of S. pneumoniae serotype 14 shows a bacterium with rapid adaptations to its lifecycle in human community. These include a versatile genome content, with a wide range of mobile elements, and chromosomal rearrangement; the latter re-balanced the genome after events of HGT.
The distributed-genome hypothesis (DGH) states that pathogenic bacteria possess a supragenome that is much larger than the genome of any single bacterium and that these pathogens utilize genetic recombination and a large, noncore set of genes as a means of diversity generation. We sequenced the genomes of eight nasopharyngeal strains of Streptococcus pneumoniae isolated from pediatric patients with upper respiratory symptoms and performed quantitative genomic analyses among these and nine publicly available pneumococcal strains. Coding sequences from all strains were grouped into 3,170 orthologous gene clusters, of which 1,454 (46%) were conserved among all 17 strains. The majority of the gene clusters, 1,716 (54%), were not found in all strains. Genic differences per strain pair ranged from 35 to 629 orthologous clusters, with each strain's genome containing between 21 and 32% noncore genes. The distribution of the orthologous clusters per genome for the 17 strains was entered into the finite-supragenome model, which predicted that (i) the S. pneumoniae supragenome contains more than 5,000 orthologous clusters and (ii) 99% of the orthologous clusters ( approximately 3,000) that are represented in the S. pneumoniae population at frequencies of >or=0.1 can be identified if 33 representative genomes are sequenced. These extensive genic diversity data support the DGH and provide a basis for understanding the great differences in clinical phenotype associated with various pneumococcal strains. When these findings are taken together with previous studies that demonstrated the presence of a supragenome for Streptococcus agalactiae and Haemophilus influenzae, it appears that the possession of a distributed genome is a common host interaction strategy.
Streptococcus pneumoniae (pneumococcus) is a leading human respiratory pathogen that causes a variety of serious mucosal and invasive diseases. D39 is an historically important serotype 2 strain that was used in experiments by Avery and coworkers to demonstrate that DNA is the genetic material. Although isolated nearly a century ago, D39 remains extremely virulent in murine infection models and is perhaps the strain used most frequently in current studies of pneumococcal pathogenesis. To date, the complete genome sequences have been reported for only two S. pneumoniae strains: TIGR4, a recent serotype 4 clinical isolate, and laboratory strain R6, an avirulent, unencapsulated derivative of strain D39. We report here the genome sequences and new annotation of two different isolates of strain D39 and the corrected sequence of strain R6. Comparisons of these three related sequences allowed deduction of the likely sequence of the D39 progenitor and mutations that arose in each isolate. Despite its numerous repeated sequences and IS elements, the serotype 2 genome has remained remarkably stable during cultivation, and one of the D39 isolates contains only five relatively minor mutations compared to the deduced D39 progenitor. In contrast, laboratory strain R6 contains 71 single-base-pair changes, six deletions, and four insertions and has lost the cryptic pDP1 plasmid compared to the D39 progenitor strain. Many of these mutations are in or affect the expression of genes that play important roles in regulation, metabolism, and virulence. The nature of the mutations that arose spontaneously in these three strains, the relative global transcription patterns determined by microarray analyses, and the implications of the D39 genome sequences to studies of pneumococcal physiology and pathogenesis are presented and discussed.
The public availability of numerous microbial genomes is enabling the analysis of bacterial biology in great detail and with an unprecedented, organism-wide and taxon-wide, broad scope. Streptococcus pneumoniae is one of the most important bacterial pathogens throughout the world. We present here sequences and functional annotations for 2.1-Mbp of pneumococcal DNA, covering more than 90% of the total estimated size of the genome. The sequenced strain is a clinical isolate resistant to macrolides and tetracycline. It carries a type 19F capsular locus, but multilocus sequence typing for several conserved genetic loci suggests that the strain sequenced belongs to a pneumococcal lineage that most often expresses a serotype 15 capsular polysaccharide. A total of 2,046 putative open reading frames (ORFs) longer than 100 amino acids were identified (average of 1,009 bp per ORF), including all described two-component systems and aminoacyl tRNA synthetases. Comparisons to other complete, or nearly complete, bacterial genomes were made and are presented in a graphical form for all the predicted proteins.
Streptococcus pneumoniae is among the most significant causes of bacterial disease in humans. Here we report the 2,038,615-bp genomic sequence of the gram-positive bacterium S. pneumoniae R6. Because the R6 strain is avirulent and, more importantly, because it is readily transformed with DNA from homologous species and many heterologous species, it is the principal platform for investigation of the biology of this important pathogen. It is also used as a primary vehicle for genomics-based development of antibiotics for gram-positive bacteria. In our analysis of the genome, we identified a large number of new uncharacterized genes predicted to encode proteins that either reside on the surface of the cell or are secreted. Among those proteins there may be new targets for vaccine and antibiotic development.
The 2,160,837-base pair genome sequence of an isolate of Streptococcus pneumoniae, a Gram-positive pathogen that causes pneumonia, bacteremia, meningitis, and otitis media, contains 2236 predicted coding regions; of these, 1440 (64%) were assigned a biological role. Approximately 5% of the genome is composed of insertion sequences that may contribute to genome rearrangements through uptake of foreign DNA. Extracellular enzyme systems for the metabolism of polysaccharides and hexosamines provide a substantial source of carbon and nitrogen for S. pneumoniae and also damage host tissues and facilitate colonization. A motif identified within the signal peptide of proteins is potentially involved in targeting these proteins to the cell surface of low-guanine/cytosine (GC) Gram-positive species. Several surface-exposed proteins that may serve as potential vaccine candidates were identified. Comparative genome hybridization with DNA arrays revealed strain differences in S. pneumoniae that could contribute to differences in virulence and antigenicity.