Methods in molecular biology
-
Despite recent advances in mass spectrometric sequencing speed and sensitivity, in-depth proteome analysis still relies widely on off-line peptide separation and fractionation to cope with the enormous molecular complexity of shotgun-digested proteomes. Although a multitude of methods have been established for off-line peptide separation on HPLC columns, their use can be limited, particularly when sample quantities are scarce. In this protocol, we describe an approach based on high pH reversed-phase separation of peptides into a few fractions in StageTip micro-columns. ⋯ Here, we provide a step-by-step protocol for TMT6plex labeling of peptides, the construction of StageTips, sample fractionation and pooling schemes adjusted to different types of analytes, mass spectrometric sample measurement, and downstream data processing using MaxQuant. To illustrate the results that can be expected from this protocol, we present data from an unlabeled and a TMT6plex-labeled phosphopeptide sample, which led to the identification of >17,000 phosphopeptides in 8 h of measurement time (Q Exactive HF) and >23,000 TMT6plex-labeled phosphopeptides in 12 h (Q Exactive Plus). Importantly, this protocol is equally applicable to the fractionation of full proteome digests.
-
Post-translational modifications (PTMs) are covalent modifications that proteins may undergo after, or sometimes during, translation. Together with gene diversity, PTMs contribute to the overall variety of possible protein functions in a given organism. Single-nucleotide polymorphisms (SNPs) are the most common form of variation found in the human genome and have been associated with diseases such as Alzheimer's disease (AD) and Parkinson's disease (PD), among many others. ⋯ However, these data are unsystematically distributed across a number of diverse databases. Thus, there is a need for efforts toward data standardization and validation of bioinformatics algorithms that can fully leverage SNP and PTM information for biomedical research. In this chapter, we present some of the commonly used databases for both single-nucleotide variations (SNVs) and PTMs and describe a broad approach, applicable to many scenarios, for studying the impact of nonsynonymous SNVs (nsSNVs) on PTM sites of human proteins.
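As a rough illustration of the kind of analysis described above, the following minimal Python sketch flags nsSNVs that fall on or near annotated PTM sites. It is not the chapter's method: the record types `PtmSite` and `NsSnv`, the function `snvs_affecting_ptm_sites`, the `window` parameter, and the accession "P_EXAMPLE" are all hypothetical names introduced for illustration, and the sketch assumes that variant and PTM positions have already been mapped to the same protein sequence coordinates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PtmSite:
    protein: str    # protein accession (hypothetical IDs in the example)
    position: int   # 1-based residue position
    residue: str    # one-letter code of the modified amino acid
    ptm_type: str   # e.g. "phosphorylation"


@dataclass(frozen=True)
class NsSnv:
    protein: str    # protein accession the variant maps to
    position: int   # 1-based position of the substituted residue
    ref: str        # reference amino acid (one-letter code)
    alt: str        # variant amino acid (one-letter code)


def snvs_affecting_ptm_sites(snvs, ptm_sites, window=0):
    """Return (snv, site, kind) tuples for variants within `window`
    residues of a PTM site. kind is "direct" when the variant replaces
    the modified residue itself and "flanking" otherwise."""
    # Group PTM sites by protein so each variant is only compared
    # against sites on the same protein.
    sites_by_protein = {}
    for site in ptm_sites:
        sites_by_protein.setdefault(site.protein, []).append(site)

    hits = []
    for snv in snvs:
        for site in sites_by_protein.get(snv.protein, []):
            if abs(snv.position - site.position) > window:
                continue
            if snv.position == site.position:
                # Skip pairs whose reference residues disagree; this
                # usually signals mismatched sequence coordinates.
                if snv.ref != site.residue:
                    continue
                hits.append((snv, site, "direct"))
            else:
                hits.append((snv, site, "flanking"))
    return hits


if __name__ == "__main__":
    sites = [PtmSite("P_EXAMPLE", 10, "S", "phosphorylation")]
    variants = [NsSnv("P_EXAMPLE", 10, "S", "A"),
                NsSnv("P_EXAMPLE", 12, "L", "P")]
    for snv, site, kind in snvs_affecting_ptm_sites(variants, sites, window=2):
        print(snv.protein, f"{snv.ref}{snv.position}{snv.alt}", site.ptm_type, kind)
```

A window of 0 reports only variants that replace the modified residue itself; a larger window also reports flanking variants, which may matter when the modifying enzyme recognizes a sequence motif around the site.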
-
Soybean Knowledge Base (SoyKB) is a comprehensive web resource for bridging the gap between soybean translational genomics and molecular breeding. It provides information for six entities: genes/proteins, microRNAs (miRNAs)/small interfering RNAs (sRNAs), metabolites, single nucleotide polymorphisms (SNPs), plant introduction lines, and traits. Its user-friendly web interface is publicly available at http://soykb.org and integrates and presents data in an intuitive manner for soybean researchers, breeders, and consumers. It incorporates several informatics and analytical tools for integrating and merging various multi-omics datasets.
-
In the past decade, proteomics and mass spectrometry have taken tremendous strides forward, particularly in the life sciences, spurred on by rapid advances in technology that have generated and accumulated vast amounts of data. Although this has led to major advances in biology, interpreting the data poses serious challenges for many practitioners because of their immense size and complexity. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hidden within these data. ⋯ We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology, and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, for which no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools, and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.
-
DNA methylation is a major epigenetic modification that regulates gene expression, genomic imprinting, and development, and it has a role in diseases including cancer. Methods for whole-genome methylation profiling differ in cost and resolution. ⋯ In this chapter, we provide detailed protocols for whole-genome bisulfite sequencing (WGBS), which captures the complete methylome. Using WGBS, a reference DNA methylome can be generated for normal or malignant hematopoietic cells.
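To make the readout of bisulfite sequencing concrete, the sketch below computes a per-cytosine methylation level from read counts. It relies on the general principle that bisulfite treatment converts unmethylated cytosines so they are read as T, whereas methylated cytosines are still read as C. It is an illustrative calculation, not the chapter's analysis pipeline; the function name `methylation_level` and the `min_coverage` threshold are assumptions introduced here.

```python
def methylation_level(methylated_reads, unmethylated_reads, min_coverage=1):
    """Fraction of reads supporting methylation at one cytosine.

    methylated_reads:   reads showing C at the position (protected)
    unmethylated_reads: reads showing T at the position (converted)
    Returns None when coverage is zero or below min_coverage, because
    the level cannot be estimated reliably from so few reads.
    """
    if methylated_reads < 0 or unmethylated_reads < 0:
        raise ValueError("read counts must be non-negative")
    coverage = methylated_reads + unmethylated_reads
    if coverage == 0 or coverage < min_coverage:
        return None
    return methylated_reads / coverage


if __name__ == "__main__":
    # Hypothetical counts: 8 reads with C and 2 reads with T at one site.
    print(methylation_level(8, 2))
```

Returning `None` for low-coverage sites, rather than 0, keeps unmeasured positions distinguishable from unmethylated ones in downstream summaries.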