Methods in Molecular Biology
-
In the past decade, proteomics and mass spectrometry have advanced considerably, particularly in the life sciences, spurred by rapid technological progress that has generated and accumulated vast amounts of data. Although this has led to major advances in biology, interpreting the data poses serious challenges for many practitioners because of its immense size and complexity. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hidden within these data. ⋯ We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology, and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, where no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools, and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.
-
DNA methylation is a major epigenetic modification that regulates gene expression, genomic imprinting, and development, and it plays a role in diseases including cancer. Several methods are available for whole-genome methylation profiling, and they differ in cost and resolution. ⋯ In this chapter, we provide detailed protocols for whole-genome bisulfite sequencing (WGBS), which captures the complete methylome. Using WGBS, we are able to generate a reference DNA methylome for normal or malignant hematopoietic cells.
-
Multiplex assays, which allow the simultaneous measurement of multiple analytes in small sample quantities, have developed into a widely used technology. They have been implemented across multiple assay systems and can provide readouts of quality similar to that of the respective single-plex measurements, albeit at far higher throughput. Multiplex assay systems are therefore an important element of biomarker discovery and development strategies, but analysis of the resulting data can face substantial challenges that may limit the possibility of identifying meaningful biological markers. This chapter gives an overview of the opportunities and challenges of multiplexed biomarker analysis, in particular from the perspective of machine learning aimed at identifying predictive biological signatures.
-
Whole-genome bisulfite sequencing (WGBS) has become a powerful tool to dissect genome-wide methylation profiles at single-base resolution. In this chapter we describe in detail the bioinformatics pipeline used for the analysis of ARGONAUTE-dependent DNA methylation in Arabidopsis thaliana. We provide tools and command lines used for mapping bisulfite sequencing reads, for estimating methylation levels at individual cytosine sites, for identifying differentially methylated regions (DMRs), and for calculating methylation levels of DMRs.
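To illustrate two of the quantities mentioned above, the following is a minimal sketch and not the chapter's actual pipeline. It assumes per-cytosine counts of methylated and unmethylated reads are already available (for example, from a methylation caller), and it assumes a pooled, coverage-weighted definition of the regional methylation level; averaging the per-site levels is another convention in use. The function names, the `sites` dictionary layout, and the example coordinates are hypothetical.

```python
def methylation_level(methylated, unmethylated):
    """Fraction of reads supporting methylation at one cytosine.

    Returns None when the site has no coverage.
    """
    total = methylated + unmethylated
    return methylated / total if total else None


def region_methylation(sites, chrom, start, end):
    """Pooled methylation level over a region (1-based, inclusive).

    `sites` maps (chromosome, position) -> (methylated, unmethylated)
    read counts. Counts are summed across all covered cytosines in the
    region before dividing, so deeper sites carry more weight
    (an assumed convention for this sketch).
    """
    meth_sum = 0
    unmeth_sum = 0
    for (site_chrom, pos), (meth, unmeth) in sites.items():
        if site_chrom == chrom and start <= pos <= end:
            meth_sum += meth
            unmeth_sum += unmeth
    return methylation_level(meth_sum, unmeth_sum)


# Hypothetical read counts at three cytosines.
sites = {
    ("Chr1", 101): (8, 2),
    ("Chr1", 105): (3, 7),
    ("Chr1", 250): (0, 10),
}

site_level = methylation_level(*sites[("Chr1", 101)])
region_level = region_methylation(sites, "Chr1", 100, 110)
```

Here the region spanning positions 100 to 110 pools the first two sites only; comparing such regional levels between samples is the kind of input a DMR caller works from, although real callers add statistical testing and coverage filters that this sketch omits.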
-
microRNAs (miRNAs) are central regulators of gene expression. They are actively studied for their involvement in numerous physiological and pathological conditions, and also as diagnostic biomarkers or promising therapeutic targets. The increased complexity of miRNA interactomes hinders straightforward interpretation of miRNA expression differences between states and conditions. ⋯ The most commonly utilized databases and algorithms include DIANA-microT-CDS, DIANA-TarBase v7.0, DIANA-lncBase v2.0, DIANA-miRGen v3.0, DIANA-miRPath v3.0, and DIANA-mirExTra v2.0. In the presented protocol, we use different online tools to explore miRNA functions and to identify probable targets of interest for downstream analyses and wet lab experiments. The combined use of different applications from the DIANA suite can shed light on many aspects of miRNA regulation and regulatory function without requiring extensive bioinformatics expertise or computational infrastructure.