Esterases receive special attention because their wide distribution in biological systems and environments and their importance for physiology and chemical synthesis. The prediction of esterases substrate promiscuity level from sequence data and the molecular reasons why certain such enzymes are more promiscuous than others, remain to be elucidated. This limits the surveillance of the sequence space for esterases potentially leading to new versatile biocatalysts and new insights into their role in cellular function. Here we performed an extensive analysis of the substrate spectra of 145 phylogenetically and environmentally diverse microbial esterases, when tested with 96 diverse esters. We determined the primary factors shaping their substrate range by analyzing substrate range patterns in combination with structural analysis and protein-ligand simulations. We found a structural parameter that helps ranking (classifying) promiscuity level of esterases from sequence data at 94% accuracy. This parameter, the active site effective volume, exemplifies the topology of the catalytic environment by measuring the active site cavity volume corrected by the relative solvent accessible surface area (SASA) of the catalytic triad. Sequences encoding esterases with active site effective volumes (cavity volume/SASA) above a threshold show greater substrate spectra, which can be further extended in combination with phylogenetic data. This measure provides also a valuable tool for interrogating substrates capable of being converted. This measure, found to be transferred to phosphatases of the haloalkanoic acid dehalogenase superfamily and possibly other enzymatic systems, represents a powerful tool for low-cost bioprospecting for esterases with broad substrate ranges, in large scale sequence datasets.
BACKGROUND: A complete saccharification of plant polymers is the critical step in the efficient production of bio-alcohols. Beta-glucosidases acting in the degradation of intermediate gluco-oligosaccharides produced by cellulases limit the yield of the final product. RESULTS: In the present work, we have identified and then successfully cloned, expressed, purified and characterised 4 highly active beta-glucosidases from fibre-adherent microbial community from the cow rumen. The enzymes were most active at temperatures 45-55 degrees C and pH 4.0-7.0 and exhibited high affinity and activity towards synthetic substrates such as p-nitrophenyl-beta-D-glucopyranoside (pNPbetaG) and pNP-beta-cellobiose, as well as to natural cello-oligosaccharides ranging from cellobiose to cellopentaose. The apparent capability of the most active beta-glucosidase, herein named LAB25g2, was tested for its ability to improve, at low dosage (31.25 units g-1 dry biomass, using pNPbetaG as substrate), the hydrolysis of pre-treated corn stover (dry matter content of 20%; 350 g glucan kg-1 dry biomass) in combination with a beta-glucosidase-deficient commercial Trichoderma reseei cellulase cocktail (5 units g-1 dry biomass in the basis of pNPbetaG). LAB25g2 increased the final hydrolysis yield by a factor of 20% (44.5 +/- 1.7% vs. 34.5 +/- 1.5% in control conditions) after 96-120 h as compared to control reactions in its absence or in the presence of other commercial beta-glucosidase preparations. The high stability (half-life higher than 5 days at 50 degrees C and pH 5.2) and 2-38000 fold higher (as compared with reported beta-glucosidases) activity towards cello-oligosaccharides may account for its performance in supplementation assays. CONCLUSIONS: The results suggest that beta-glucosidases from yet uncultured bacteria from animal digestomes may be of a potential interest for biotechnological processes related to the effective bio-ethanol production in combination with low dosage of commercial cellulases.