(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > unclassified sequences: NE > metagenomes: NE > ecological metagenomes: NE > mine drainage metagenome: NE
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MGVKPEPVLVVPARGRATASLIFLHGLGADAHDFETLGDALDLKGCRYLF PNAPIRPVTLNGGYPMRAWFDIDSISRAALSEPSGLRNSFATVEWLLHHE TQAGIAMNRILVGGFSQGGAVALAWAGQYRAPLLGIAGLSTFLPVGVEPD AHAHAIFLAHGRNDPVVPLALAEKTRDDLLSAGHQVTWRIYPMQHEVVAD EIQALRAWILDRLAS
Acid mine drainage (AMD) systems are extremely acidic and are metal-rich formations inhabited by relatively low-complexity communities of acidophiles whose enzymes remain mostly uncharacterized. Indeed, enzymes from only a few AMD sites have been studied. The low number of available cultured representatives and genome sequences of acidophiles inhabiting AMDs makes it difficult to assess the potential of these environments for enzyme bioprospecting. In this study, using naive and in silico metagenomic approaches, we retrieved 16 esterases from the alpha/beta-hydrolase fold superfamily with the closest match from uncultured acidophilic Acidobacteria, Actinobacteria (Acidithrix, Acidimicrobium, and Ferrimicrobium), Acidiphilium, and other Proteobacteria inhabiting the Los Rueldos site, which is a unique AMD formation in northwestern Spain with a pH of -2. Within this set, only two polypeptides showed high homology (99.4%), while for the rest, the pairwise identities ranged between 4 and 44.9%, suggesting that the diversity of active polypeptides was dominated not by a particular type of protein or highly similar clusters of proteins, but by diverse non-redundant sequences. The enzymes exhibited amino acid sequence identities ranging from 39 to 99% relative to homologous proteins in public databases, including those from other AMDs, thus indicating the potential novelty of proteins associated with a specialized acidophilic community. Ten of the 16 hydrolases were successfully expressed in Escherichia coli. The pH for optimal activity ranged from 7.0 to 9.0, with the enzymes retaining 33-68% of their activities at pH 5.5, which was consistent with the relative frequencies of acid residues (from 54 to 67%). The enzymes were the most active at 30-65 degreesC, retaining 20-61% of their activity under the thermal conditions characterizing Los Rueldos (13.8 +/- 0.6 degreesC). The analysis of the substrate specificity revealed the capacity of six hydrolases to efficiently degrade (up to 1,652 +/- 75 U/g at pH 8.0 and 30 degreesC) acrylic- and terephthalic-like [including bis(2-hydroxyethyl)-terephthalate, BHET] esters, and these enzymes could potentially be of use for developing plastic degradation strategies yet to be explored. Our assessment uncovers the novelty and potential biotechnological interest of enzymes present in the microbial populations that inhibit the Los Rueldos AMD system.
Esterases receive special attention because their wide distribution in biological systems and environments and their importance for physiology and chemical synthesis. The prediction of esterases substrate promiscuity level from sequence data and the molecular reasons why certain such enzymes are more promiscuous than others, remain to be elucidated. This limits the surveillance of the sequence space for esterases potentially leading to new versatile biocatalysts and new insights into their role in cellular function. Here we performed an extensive analysis of the substrate spectra of 145 phylogenetically and environmentally diverse microbial esterases, when tested with 96 diverse esters. We determined the primary factors shaping their substrate range by analyzing substrate range patterns in combination with structural analysis and protein-ligand simulations. We found a structural parameter that helps ranking (classifying) promiscuity level of esterases from sequence data at 94% accuracy. This parameter, the active site effective volume, exemplifies the topology of the catalytic environment by measuring the active site cavity volume corrected by the relative solvent accessible surface area (SASA) of the catalytic triad. Sequences encoding esterases with active site effective volumes (cavity volume/SASA) above a threshold show greater substrate spectra, which can be further extended in combination with phylogenetic data. This measure provides also a valuable tool for interrogating substrates capable of being converted. This measure, found to be transferred to phosphatases of the haloalkanoic acid dehalogenase superfamily and possibly other enzymatic systems, represents a powerful tool for low-cost bioprospecting for esterases with broad substrate ranges, in large scale sequence datasets.