Deshayes_2007_Genome.Biol_8_R20

Reference

Title : Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors? - Deshayes_2007_Genome.Biol_8_R20
Author(s) : Deshayes C , Perrodou E , Gallien S , Euphrasie D , Schaeffer C , Van-Dorsselaer A , Poch O , Lecompte O , Reyrat JM
Ref : Genome Biol , 8 :R20 , 2007
Abstract :

BACKGROUND In silico analysis has shown that all bacterial genomes contain a low percentage of ORFs with undetected frameshifts and in-frame stop codons These interrupted coding sequences ICDSs may really be present in the organism or may result from misannotation based on sequencing errors The reality or otherwise of these sequences has major implications for all subsequent functional characterization steps including module prediction comparative genomics and high-throughput proteomic projects RESULTS We show here using Mycobacterium smegmatis as a model species that a significant proportion of these ICDSs result from sequencing errors We used a resequencing procedure and mass spectrometry analysis to determine the nature of a number of ICDSs in this organism We found that 28 of the 73 ICDSs investigated correspond to sequencing errors CONCLUSION The correction of these errors results in modification of the predicted amino acid sequences of the corresponding proteins and changes in annotation We suggest that each bacterial ICDS should be investigated individually to determine its true status and to ensure that the genome sequence is appropriate for comparative genomics analyses.

PubMedSearch : Deshayes_2007_Genome.Biol_8_R20
PubMedID: 17295914
Gene_locus related to this paper: mycs2-a0qpe0 , mycs2-a0qtm5 , mycs2-a0qwn1 , mycs2-a0qx73 , mycs2-a0qyi2 , mycs2-a0qzz2 , mycs2-a0r5t4 , mycs2-a0r606 , mycsm-Q938B4 , mycs2-a0qnx2 , mycs2-a0qsm2 , mycs2-a0qu66 , mycs2-a0r0q0 , mycs2-a0qsm1 , mycs2-a0qqp0 , mycs2-a0qwt7

Related information

Gene_locus mycs2-a0qpe0    mycs2-a0qtm5    mycs2-a0qwn1    mycs2-a0qx73    mycs2-a0qyi2    mycs2-a0qzz2    mycs2-a0r5t4    mycs2-a0r606    mycsm-Q938B4    mycs2-a0qnx2    mycs2-a0qsm2    mycs2-a0qu66    mycs2-a0r0q0    mycs2-a0qsm1    mycs2-a0qqp0    mycs2-a0qwt7

Citations formats

Deshayes C, Perrodou E, Gallien S, Euphrasie D, Schaeffer C, Van-Dorsselaer A, Poch O, Lecompte O, Reyrat JM (2007)
Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors?
Genome Biol 8 :R20

Deshayes C, Perrodou E, Gallien S, Euphrasie D, Schaeffer C, Van-Dorsselaer A, Poch O, Lecompte O, Reyrat JM (2007)
Genome Biol 8 :R20