(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Bacteria: NE > Proteobacteria: NE > Gammaproteobacteria: NE > Cellvibrionales: NE > Halieaceae: NE > unclassified Halieaceae: NE > marine gamma proteobacterium HTCC2080: NE
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MEFLRTPDDRFENLENFPYEPNYLSVPDGEGGQLRIHYVDTGPRDGRVVL LMHGQPAWCYLYRSMIPLLVAKGFRVVAPDLVGFGRSDKPTERANYTYAN HVAWMSDWLTQLGLEDITVFFQDWGSLIGLRLVTAFSERFNHVVLANGGL PAGIIPEAFAKPLQEAYETLPVPEAHELGDRFRDTTGIPGFLYWRKFAAE NPAIAQPGKSLMNDLMGAGLSQGEVDAYDAPFPDLSYAAGARQFPSLVPL FHDEPEVPENKAAWKLLEQFDKPFLLAFADNDPVTQGGDKAFKERVPGCQ GVEHRTISPAGHFLQQDQPEQCVQAILDVMAHGSS
https://www.researchsquare.com/article/rs-1027271/v1
Next-generation sequencing doubles genomic databases every 2.5 years. The accumulation of sequence data raises the need to speed up functional analysis. Herein, we present a pipeline integrating bioinformatics and microfluidics and its application for high-throughput mining of novel haloalkane dehalogenases. We employed bioinformatics to identify 2,905 putative dehalogenases and selected 45 representative enzymes, of which 24 were produced in soluble form. Droplet-based microfluidics accelerates subsequent experimental testing up to 20,000 reactions per day while achieving 1,000-fold lower protein consumption. This resulted in doubling the dehalogenation 'toolbox' characterized over three decades, yielding biocatalysts surpassing the efficiency of currently available enzymes. Combining microfluidics with modern global data analysis provided precious mechanistic information related to the high catalytic efficiency of new variants. This pipeline applied to other enzyme families can accelerate the identification of biocatalysts for industrial applications as well as the collection of high-quality data for machine learning.
Organisms in the OM60/NOR5 clade of the Gammaproteobacteria are ubiquitous in the world's oceans and can make up as much as 11% of bacterial cells in certain areas. Isolated from coastal Oregon water, Gammaproteobacteria HTCC2148 and HTCC2080 are two members of this important clade. Here we present the genome sequences of the OM60 Gammaproteobacteria HTCC2148 and HTCC2080.