1. Enzyme Genomics:
    Understanding the function of every protein in the cell is one of the great challenges of the post-genomic era. Global genome sequencing efforts have revealed that in any genome 30-50% of genes encode proteins with unknown function (hypothetical proteins). These proteins need to be experimentally characterized, because their functions can’t be computationally predicted on the basis of sequence homology to known proteins. We are using enzymatic assays to directly test purified unknown proteins for catalytic activity. We have designed a series of general and specific screens to detect hydrolases, oxidoreductases, and transferases. The general screens use either general chromogenic substrates or pools of substrates. The positive hits with the model substrates are then tested with a set of potential natural substrates or the substrate pools are deconvoluted to identify the preferred in vitro substrate. The identification of a biochemical activity of a hypothetical protein helps to determine its cellular role. In a screen of over 2,000 purified proteins, over 300 unknown proteins had clear catalytic activity. Many of them were further characterized using secondary enzymatic screens with specific substrates. Enzymatic screens are also used to quickly characterize large enzyme families and to verify the hypotheses generated by structural studies of proteins with unknown function.


  1. Novel Industrial Enzymes:
    Presently, there is a global drive to promote “white” (or biocatalysis-based) biotechnology as a central element in the transformation of modern industrialized societies to sustainable economies. Despite the obvious benefits of biocatalysis, the major hurdles hindering the application of enzymatic processes are: the high production and enzyme costs, low or moderate yields obtained, the limited repertoire of the available enzymes and their low activity and stability. We are exploring the wealth of sequenced genomes and metagenomes to discover new industrial enzymes for potential applications in bioremediation and bioenergy. Global genome and metagenome sequencing efforts have produced millions of genes encoding unknown proteins. These uncharacterized proteins represent a potentially rich source of novel industrial enzymes. Our approach is based on the high-throughput biochemical and structural characterization of purified predicted hydrolases and oxidoreductases (dehalogenases, cellulases, esterases, lipases, and oxidases). The long-term aim of this project is to discover novel enzymes capable of degrading and transforming various xenobiotics, plant biomass, and oil.


  1. Molecular mechanisms of CRISPR:
    Most prokaryotic genomes contain structures known as Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). Associated with them are a large group of uncharacterized proteins known as CRISPR Associated (Cas). It has been shown that CRISPRs and Cas proteins represent a novel microbial genome defence system which functions to some extent analogously to the eukaryotic RNA silencing machinery. To date, the CRISPR system comprises several thousand uncharacterized proteins organized into approximately 45 protein families and nine types of cas operons (CRISPR-subtypes). Six Cas families (Cas1 to Cas6) represent the core group of CRISPR-associated proteins, and based on the sequence analysis they are predicted to have nuclease and/or helicase activity. The main goal of our work is to experimentally characterize the biochemical activity and catalytic mechanism of the core Cas proteins.