Glycidyl ethers and their vicinal diols are important building blocks in the organic synthesis of anti-cancer and anti-obesity drugs. Ylehd, an epoxide hydrolase from tropical marine yeast Yarrowia lipolytica, was explored for its enantioselective properties by kinetic, thermodynamic and in silico studies. Kinetic resolution of racemic phenyl glycidyl ether (PGE) yielded (S)-epoxide while for benzyl glycidyl ether (BGE) (R)-epoxide was obtained, with vicinal diols of the opposite configuration. Amongst the enantiomers of PGE and BGE, the (S)-selective conversion of benzyl glycidyl ether to its corresponding diol, (S)-3-benzyloxy-1,2-propanediol while retaining (R)-BGE was most favourable with 95% ee in 20 min. Enantioselective conversion of specific enantiomer of BGE to its corresponding diols was attributed to the favourable kinetic and thermodynamic parameters as well as to the number and proximity of water molecules near the base H325 in the active site pocket. The easily available and highly active Ylehd could be a potential biocatalyst for large scale preparation of pharmaceutically relevant chiral (R)-benzyl glycidyl ether and (S)-3-benzyloxy-1,2-propanediol.
During 11-12 August 2014, a Protein Bioinformatics and Community Resources Retreat was held at the Wellcome Trust Genome Campus in Hinxton, UK. This meeting brought together the principal investigators of several specialized protein resources (such as CAZy, TCDB and MEROPS) as well as those from protein databases from the large Bioinformatics centres (including UniProt and RefSeq). The retreat was divided into five sessions: (1) key challenges, (2) the databases represented, (3) best practices for maintenance and curation, (4) information flow to and from large data centers and (5) communication and funding. An important outcome of this meeting was the creation of a Specialist Protein Resource Network that we believe will improve coordination of the activities of its member resources. We invite further protein database resources to join the network and continue the dialogue.
As the volume of data relating to proteins increases, researchers rely more and more on the analysis of published data, thus increasing the importance of good access to these data that vary from the supplemental material of individual articles, all the way to major reference databases with professional staff and long-term funding. Specialist protein resources fill an important middle ground, providing interactive web interfaces to their databases for a focused topic or family of proteins, using specialized approaches that are not feasible in the major reference databases. Many are labors of love, run by a single lab with little or no dedicated funding and there are many challenges to building and maintaining them. This perspective arose from a meeting of several specialist protein resources and major reference databases held at the Wellcome Trust Genome Campus (Cambridge, UK) on August 11 and 12, 2014. During this meeting some common key challenges involved in creating and maintaining such resources were discussed, along with various approaches to address them. In laying out these challenges, we aim to inform users about how these issues impact our resources and illustrate ways in which our working together could enhance their accuracy, currency, and overall value. Proteins 2015; 83:1005-1013. (c) 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
        
Title: Distribution, classification, domain architectures and evolution of prolyl oligopeptidases in prokaryotic lineages Kaushik S, Sowdhamini R Ref: BMC Genomics, 15:985, 2014 : PubMed
BACKGROUND: Prolyl oligopeptidases (POPs) are proteolytic enzymes, widely distributed in all the kingdoms of life. Bacterial POPs are pharmaceutically important enzymes, yet their functional and evolutionary details are not fully explored. Therefore, current analysis is aimed at understanding the distribution, domain architecture, probable biological functions and gene family expansion of POPs in bacterial and archaeal lineages. RESULTS: Exhaustive sequence analysis of 1,202 bacterial and 91 archaeal genomes revealed ~3,000 POP homologs, with only 638 annotated POPs. We observed wide distribution of POPs in all the analysed bacterial lineages. Phylogenetic analysis and co-clustering of POPs of different phyla suggested their common functions in all the prokaryotic species. Further, on the basis of unique sequence motifs we could classify bacterial POPs into eight subtypes. Analysis of coexisting domains in POPs highlighted their involvement in protein-protein interactions and cellular signaling. We proposed significant extension of this gene family by characterizing 39 new POPs and 158 new alpha/beta hydrolase members. CONCLUSIONS: Our study reflects diversity and functional importance of POPs in bacterial species. Many genomes with multiple POPs were identified with high sequence variations and different cellular localizations. Such anomalous distribution of POP genes in different bacterial genomes shows differential expansion of POP gene family primarily by multiple horizontal gene transfer events.
        
Title: Decoding the structural events in substrate-gating mechanism of eukaryotic prolyl oligopeptidase using normal mode analysis and molecular dynamics simulations Kaushik S, Etchebest C, Sowdhamini R Ref: Proteins, 82:1428, 2014 : PubMed
Prolyl oligopeptidase (POP) is a serine protease, unique for its ability to cleave various small oligopeptides shorter than 30 amino acids. POP is an important drug target since it is implicated in various neurological disorders. Although there is structural evidence that bacterial POPs undergo huge interdomain movements and acquire an "open" state in the substrate-unbound form, hitherto, no crystal structure is available in the substrate-unbound domain-open form of eukaryotic POPs. Indeed, there is no difference between the substrate-unbound/bound states of eukaryotic POPs. This raises unanswered questions about whether difference in the substrate access pathway exists between bacterial and eukaryotic POPs. Here, we have used normal mode analysis and molecular dynamics to unravel the mechanism of substrate entry in mammalian POPs, which has been debated until now. Motions observed using normal modes of porcine and bacterial POPs were analyzed and compared, augmented by molecular dynamics of these proteins. Identical to bacterial POPs, interdomain opening was found to be the possible pathway for the substrate-gating in mammals as well. On the basis of our analyses and evidences, a mechanistic model of substrate entry in POPs has been proposed. Up-down movement of N-terminal hydrolase domain resulted in twisting motion of two domains, followed by the conformational changes of interdomain loop regions, which facilitate interdomain opening. Similar to bacterial POPs, an open form of porcine POP is also proposed with domain-closing motion. This work has direct implications for the development of novel inhibitors of mammalian POPs to understand the etiology of various neurological diseases.
        
Title: Structural analysis of prolyl oligopeptidases using molecular docking and dynamics: insights into conformational changes and ligand binding Kaushik S, Sowdhamini R Ref: PLoS ONE, 6:e26251, 2011 : PubMed
Prolyl oligopeptidase (POP) is considered as an important pharmaceutical target for the treatment of numerous diseases. Despite enormous studies on various aspects of POPs structure and function still some of the questions are intriguing like conformational dynamics of the protein and interplay between ligand entry/egress. Here, we have used molecular modeling and docking based approaches to unravel questions like differences in ligand binding affinities in three POP species (porcine, human and A. thaliana). Despite high sequence and structural similarity, they possess different affinities for the ligands. Interestingly, human POP was found to be more specific, selective and incapable of binding to a few planar ligands which showed extrapolation of porcine POP in human context is more complicated. Possible routes for substrate entry and product egress were also investigated by detailed analyses of molecular dynamics (MD) simulations for the three proteins. Trajectory analysis of bound and unbound forms of three species showed differences in conformational dynamics, especially variations in beta-propeller pore size, which was found to be hidden by five lysine residues present on blades one and seven. During simulation, beta-propeller pore size was increased by approximately 2 A in porcine ligand-bound form which might act as a passage for smaller product movement as free energy barrier was reduced, while there were no significant changes in human and A. thaliana POPs. We also suggest that these differences in pore size could lead to fundamental differences in mode of product egress among three species. This analysis also showed some functionally important residues which can be used further for in vitro mutagenesis and inhibitor design. This study can help us in better understanding of the etiology of POPs in several neurodegenerative diseases.
BACKGROUND: Disulphide bridges are well known to play key roles in stability, folding and functions of proteins. Introduction or deletion of disulphides by site-directed mutagenesis have produced varying effects on stability and folding depending upon the protein and location of disulphide in the 3-D structure. Given the lack of complete understanding it is worthwhile to learn from an analysis of extent of conservation of disulphides in homologous proteins. We have also addressed the question of what structural interactions replaces a disulphide in a homologue in another homologue. RESULTS: Using a dataset involving 34,752 pairwise comparisons of homologous protein domains corresponding to 300 protein domain families of known 3-D structures, we provide a comprehensive analysis of extent of conservation of disulphide bridges and their structural features. We report that only 54% of all the disulphide bonds compared between the homologous pairs are conserved, even if, a small fraction of the non-conserved disulphides do include cytoplasmic proteins. Also, only about one fourth of the distinct disulphides are conserved in all the members in protein families. We note that while conservation of disulphide is common in many families, disulphide bond mutations are quite prevalent. Interestingly, we note that there is no clear relationship between sequence identity between two homologous proteins and disulphide bond conservation. Our analysis on structural features at the sites where cysteines forming disulphide in one homologue are replaced by non-Cys residues show that the elimination of a disulphide in a homologue need not always result in stabilizing interactions between equivalent residues. CONCLUSION: We observe that in the homologous proteins, disulphide bonds are conserved only to a modest extent. Very interestingly, we note that extent of conservation of disulphide in homologous proteins is unrelated to the overall sequence identity between homologues. The non-conserved disulphides are often associated with variable structural features that were recruited to be associated with differentiation or specialisation of protein function.
The organophosphate degrading (opd) gene cluster of plasmid pPDL2 of Flavobacterium sp. ATCC27551 contains a novel open-reading frame, orf243. This was predicted to encode an alpha/beta hydrolase distantly related to the meta-fission product (MFP) hydrolases such as XylF, PhnD, and CumD. By homology modeling Orf243 has most of the structural features of MFP hydrolases including the characteristic active site catalytic triad. The purified protein (designated MfhA) is a homotetramer and shows similar affinity for 2-hydroxy-6-oxohepta-2,4-dienoate (HOHD), 2-hydroxymuconic semialdehyde (HMSA), and 2-hydroxy-5-methylmuconic semialdehyde (HMMSA), the meta-fission products of 3-methyl catechol, catechol, and 4-methyl catechol. The unique catalytic properties of MfhA and the presence near its structural gene of cis-elements required for transposition suggest that mfhA has evolved towards encoding a common hydrolase that can act on meta-fission products containing either aldehyde or ketone groups.
        
Title: Cross genome comparisons of serine proteases in Arabidopsis and rice Tripathi LP, Sowdhamini R Ref: BMC Genomics, 7:200, 2006 : PubMed
BACKGROUND: Serine proteases are one of the largest groups of proteolytic enzymes found across all kingdoms of life and are associated with several essential physiological pathways. The availability of Arabidopsis thaliana and rice (Oryza sativa) genome sequences has permitted the identification and comparison of the repertoire of serine protease-like proteins in the two plant species. RESULTS: Despite the differences in genome sizes between Arabidopsis and rice, we identified a very similar number of serine protease-like proteins in the two plant species (206 and 222, respectively). Nearly 40% of the above sequences were identified as potential orthologues. Atypical members could be identified in the plant genomes for Deg, Clp, Lon, rhomboid proteases and species-specific members were observed for the highly populated subtilisin and serine carboxypeptidase families suggesting multiple lateral gene transfers. DegP proteases, prolyl oligopeptidases, Clp proteases and rhomboids share a significantly higher percentage orthology between the two genomes indicating substantial evolutionary divergence was set prior to speciation. Single domain architectures and paralogues for several putative subtilisins, serine carboxypeptidases and rhomboids suggest they may have been recruited for additional roles in secondary metabolism with spatial and temporal regulation. The analysis reveals some domain architectures unique to either or both of the plant species and some inactive proteases, like in rhomboids and Clp proteases, which could be involved in chaperone function. CONCLUSION: The systematic analysis of the serine protease-like proteins in the two plant species has provided some insight into the possible functional associations of previously uncharacterised serine protease-like proteins. Further investigation of these aspects may prove beneficial in our understanding of similar processes in commercially significant crop plant species.
Hot water epilepsy (HWE) is a benign and rare form of reflex epilepsy that occurs most commonly in humans. Bdm1 is one of the proteins whose mRNA transcript is overexpressed during HWE in a rat model. We show, by sequence analysis and fold recognition methods, that Bdm1 has strong structural similarities to alpha/beta hydrolases like the thioesterases. A three-dimensional model derived by comparative modeling methods allowed the search for catalytic residues using a flexible functional template characteristic of these enzymes. We predict that Bdm1 might be regulated by homocysteine levels by means of direct participation in degradation pathways.