Methods in molecular biology
-
Phosphorylation is among the most important post-translational modifications of proteins and has numerous regulatory functions across all domains of life. However, phosphorylation is often substoichiometric, requiring selective and sensitive methods to enrich phosphorylated peptides from complex cellular digests. Various methods have been devised for this purpose and we have recently described a Fe-IMAC HPLC column chromatography setup which is capable of comprehensive, reproducible, and selective enrichment of phosphopeptides out of complex peptide mixtures. ⋯ Here, we provide a step-by-step protocol for the entire phosphopeptide enrichment procedure including sample preparation (lysis, digestion, desalting), Fe-IMAC column chromatography (column setup, operation, charging), measurement by LC-MS/MS (nHPLC gradient, MS parameters) and data analysis (MaxQuant). To increase throughput, we have optimized several key steps such as the gradient time of the Fe-IMAC separation (15 min per enrichment), the number of consecutive enrichments possible between two chargings (>20) and the column recharging itself (<1 h). We show that the application of this protocol enables the selective (>90 %) identification of more than 10,000 unique phosphopeptides from 1 mg of HeLa digest within 2 h of measurement time (Q Exactive Plus).
-
High-throughput techniques are indispensable for aiding basic and translational research. Among them, recent advances in proteomics techniques have allowed biomedical researchers to characterize the proteome of multiple organisms. ⋯ This chapter provides an overview of computational strategies, methods, and techniques reported in this book for bioinformatics analysis of protein data. An outline of many bioinformatics tools, databases, and proteomic techniques described in each of the chapters is given here.
-
Many publicly available data repositories and resources have been developed to support protein-related information management, data-driven hypothesis generation, and biological knowledge discovery. To help researchers quickly find the appropriate protein-related informatics resources, we present a comprehensive review (with categorization and description) of major protein bioinformatics databases in this chapter. We also discuss the challenges and opportunities for developing next-generation protein bioinformatics databases and resources to support data integration and data analytics in the Big Data era.
-
Multiplex assays that allow the simultaneous measurement of multiple analytes in small sample quantities have developed into a widely used technology. Their implementation spans across multiple assay systems and can provide readouts of similar quality as the respective single-plex measures, albeit at far higher throughput. Multiplex assay systems are therefore an important element for biomarker discovery and development strategies but analysis of the derived data can face substantial challenges that may limit the possibility of identifying meaningful biological markers. This chapter gives an overview of opportunities and challenges of multiplexed biomarker analysis, in particular from the perspective of machine learning aimed at identification of predictive biological signatures.
-
Advancements in MS-based phospho-proteomics techniques have helped uncover hundred thousands of protein phosphorylation sites in human and various model organisms. The majority of these sites are uncharacterized. ⋯ Analyzing the phosphorylation and sequence conservation of uncharacterized sites across species can help reveal a subset of the functionally important phosphorylation events. Here, we outline the workflow and provide an overview of publicly available computational resources for conservation analysis of novel phosphorylation sites.