Petroleum-based plastics are durable and accumulate in all ecological niches. Knowledge on enzymatic degradation is sparse. Today, less than 50 verified plastics-active enzymes are known. First examples of enzymes acting on the polymers polyethylene terephthalate (PET) and polyurethane (PUR) have been reported together with a detailed biochemical and structural description. Furthermore, very few polyamide (PA) oligomer active enzymes are known. In this article, the current known enzymes acting on the synthetic polymers PET and PUR are briefly summarized, their published activity data were collected and integrated into a comprehensive open access database. The Plastics-Active Enzymes Database (PAZy) represents an inventory of known and experimentally verified enzymes that act on synthetic fossil fuel-based polymers. Almost 3000 homologs of PET-active enzymes were identified by profile hidden Markov models. Over 2000 homologs of PUR-active enzymes were identified by BLAST. Based on multiple sequence alignments, conservation analysis identified the most conserved amino acids, and sequence motifs for PET- and PUR-active enzymes were derived.
Petroleum based plastics are durable and accumulate in all ecological niches. Knowledge on enzymatic degradation is sparse. Today, less than 50 verified plastics-active enzymes are known. First examples of enzymes acting on the polymers polyethylene terephthalate (PET) and polyurethane (PUR) have been reported together with a detailed biochemical and structural description. Further, very few polyamide (PA) oligomer active enzymes are known. In this paper, the current known enzymes acting on the synthetic polymers PET and PUR are briefly summarized, their published activity data were collected and integrated into a comprehensive open access database. The Plastics-Active Enzymes Database (PAZy) represents an inventory of known and experimentally verified plastics-active enzymes. Almost 3000 homologues of PET-active enzymes were identified by profile hidden Markov models. Over 2000 homologues of PUR-active enzymes were identified by BLAST. Based on multiple sequence alignments, conservation analysis identified the most conserved amino acids, and sequence motifs for PET- and PUR-active enzymes were derived.
https://www.researchsquare.com/article/rs-567691/v2
Polyethylene terephthalate (PET) is an important synthetic polymer accumulating in nature 2 and recent studies have identified microorganisms capable of degrading PET. While the majority of 3 known PET hydrolases originate from the Actinobacteria and Proteobacteria, here we describe the 4 first functional PET-active enzymes from the Bacteroidetes phylum. Using a PETase-specific 5 Hidden-Markov-Model (HMM)-based search algorithm we identified two promiscuous and cold6 active esterases derived from Aequorivita sp. (PET27) and Chryseobacterium jeonii (PET30) acting 7 on PET foil and powder. Notably, one of the enzymes (PET30) was able to hydrolyze PET at 8 temperatures between 4 - 30 C with a similar turnover rate compared to the well-known Ideonella 9 sakaiensis enzyme (IsPETase). 10 PET27 and PET30 homologues were detected in metagenomes encompassing a wide range 11 of different global climate zones. Additional transcript abundance mapping of marine samples imply 12 that these promiscuous enzymes and source organisms may play a role in the long-term 13 degradation of microplastic particles and fibers.
The alpha/beta-hydrolase fold family is highly diverse in sequence, structure and biochemical function. To investigate the sequence-structure-function relationships, the Lipase Engineering Database (https://led.biocatnet.de) was updated. Overall, 280 638 protein sequences and 1557 protein structures were analysed. All alpha/beta-hydrolases consist of the catalytically active core domain, but they might also contain additional structural modules, resulting in 12 different architectures: core domain only, additional lids at three different positions, three different caps, additional N- or C-terminal domains and combinations of N- and C-terminal domains with caps and lids respectively. In addition, the alpha/beta-hydrolases were distinguished by their oxyanion hole signature (GX-, GGGX- and Y-types). The N-terminal domains show two different folds, the Rossmann fold or the beta-propeller fold. The C-terminal domains show a beta-sandwich fold. The N-terminal beta-propeller domain and the C-terminal beta-sandwich domain are structurally similar to carbohydrate-binding proteins such as lectins. The classification was applied to the newly discovered polyethylene terephthalate (PET)-degrading PETases and MHETases, which are core domain alpha/beta-hydrolases of the GX- and the GGGX-type respectively. To investigate evolutionary relationships, sequence networks were analysed. The degree distribution followed a power law with a scaling exponent gamma = 1.4, indicating a highly inhomogeneous network which consists of a few hubs and a large number of less connected sequences. The hub sequences have many functional neighbours and therefore are expected to be robust toward possible deleterious effects of mutations. The cluster size distribution followed a power law with an extrapolated scaling exponent tau = 2.6, which strongly supports the connectedness of the sequence space of alpha/beta-hydrolases. DATABASE: Supporting data about domains from other proteins with structural similarity to the N- or C-terminal domains of alpha/beta-hydrolases are available in Data Repository of the University of Stuttgart (DaRUS) under doi: https://doi.org/10.18419/darus-458.
Esterases receive special attention because their wide distribution in biological systems and environments and their importance for physiology and chemical synthesis. The prediction of esterases substrate promiscuity level from sequence data and the molecular reasons why certain such enzymes are more promiscuous than others, remain to be elucidated. This limits the surveillance of the sequence space for esterases potentially leading to new versatile biocatalysts and new insights into their role in cellular function. Here we performed an extensive analysis of the substrate spectra of 145 phylogenetically and environmentally diverse microbial esterases, when tested with 96 diverse esters. We determined the primary factors shaping their substrate range by analyzing substrate range patterns in combination with structural analysis and protein-ligand simulations. We found a structural parameter that helps ranking (classifying) promiscuity level of esterases from sequence data at 94% accuracy. This parameter, the active site effective volume, exemplifies the topology of the catalytic environment by measuring the active site cavity volume corrected by the relative solvent accessible surface area (SASA) of the catalytic triad. Sequences encoding esterases with active site effective volumes (cavity volume/SASA) above a threshold show greater substrate spectra, which can be further extended in combination with phylogenetic data. This measure provides also a valuable tool for interrogating substrates capable of being converted. This measure, found to be transferred to phosphatases of the haloalkanoic acid dehalogenase superfamily and possibly other enzymatic systems, represents a powerful tool for low-cost bioprospecting for esterases with broad substrate ranges, in large scale sequence datasets.
The development of novel enzymes for biocatalytic processes requires knowledge on substrate profile and selectivity; this can be derived from databases and from publications. Often, these sources lack time-course data for the substrate or product, and an unambiguous link between experiment and enzyme sequence. The lack of integrated, original data hampers the comprehensive analysis of enzyme kinetics and the evaluation of sequence-function relationships. In order to accelerate enzyme engineering, BioCatNet integrates protein sequence, protein structure, and experimental data for a given enzyme family. BioCatNet explicitly assigns the enzyme sequence to the experimental data, which consists of information on reaction conditions and time-course data. BioCatNet facilitates the consistent documentation of reaction conditions, the archiving of time-course data, and the efficient exchange of experimental data among collaborators. Data integration is demonstrated for three case studies by using the TEED (Thiamine diphosphate-dependent Enzymes Engineering Database).