Bioinformatics

The most recent articles from: Bioinformatics

Bioinformatics · Feb 2006

Comparative Study

Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions.

In recent years, advances have been made in the ability of computational methods to discriminate between homologous and non-homologous proteins in the 'twilight zone' of sequence similarity, where the percent sequence identity is a poor indicator of homology. To make these predictions more valuable to the protein modeler, they must be accompanied by accurate alignments. Pairwise sequence alignments are inferences of orthologous relationships between sequence positions. Evolutionary distance is traditionally modeled using global amino acid substitution matrices. But real differences in the likelihood of substitutions may exist for different structural contexts within proteins, since structural context contributes to the selective pressure. ⋯ HMMSUM (HMMSTR-based substitution matrices) is a new model for structural context-based amino acid substitution probabilities consisting of a set of 281 matrices, each for a different sequence-structure context. HMMSUM does not require the structure of the protein to be known. Instead, predictions of local structure are made using HMMSTR, a hidden Markov model for local structure. Alignments using the HMMSUM matrices compare favorably to alignments carried out using the BLOSUM matrices or the structure-based substitution matrices SDM and HSDM when validated against remote homolog alignments from BAliBASE. HMMSUM has been implemented using local dynamic programming and with the Bayesian adaptive alignment method.
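
As a rough illustration of context-dependent scoring in local dynamic programming, the sketch below runs a Smith-Waterman recurrence in which the substitution matrix for each cell is chosen by a context label attached to the residue in the first sequence. It is not the HMMSUM implementation: the two context labels, the toy match/mismatch matrices, the linear gap penalty and the function names are all assumptions made for the example, and the labels are passed in by hand rather than predicted by HMMSTR.

```python
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def toy_matrix(match, mismatch):
    """Build a toy substitution matrix (assumption: not a real HMMSUM matrix)."""
    return {(a, b): (match if a == b else mismatch)
            for a in ALPHABET for b in ALPHABET}


# Hypothetical context labels; the abstract describes 281 real contexts.
MATRICES = {"helix": toy_matrix(2, -1), "loop": toy_matrix(4, -3)}


def local_align_score(seq_a, seq_b, contexts_a, matrices, gap=-4):
    """Best Smith-Waterman local score with one matrix per context label.

    contexts_a[i] names the matrix used for residue seq_a[i].
    The linear gap penalty is an assumption for this sketch.
    """
    rows, cols = len(seq_a) + 1, len(seq_b) + 1
    H = [[0] * cols for _ in range(rows)]
    best = 0
    for i in range(1, rows):
        matrix = matrices[contexts_a[i - 1]]  # context picks the matrix
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + matrix[(seq_a[i - 1], seq_b[j - 1])]
            H[i][j] = max(0, diag, H[i - 1][j] + gap, H[i][j - 1] + gap)
            best = max(best, H[i][j])
    return best


score = local_align_score("ACD", "ACD", ["helix", "loop", "helix"], MATRICES)
```

With these toy matrices the same three-residue match scores differently depending on the context labels, which is the only point the sketch is meant to make.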

Bioinformatics · Mar 2005

Comparative Study

A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis.

Cancer diagnosis is one of the most important emerging clinical applications of gene expression microarray technology. We are seeking to develop a computer system for powerful and reliable cancer diagnostic model creation based on microarray data. To keep a realistic perspective on clinical applications we focus on multicategory diagnosis. To equip the system with the optimum combination of classifier, gene selection and cross-validation methods, we performed a systematic and comprehensive evaluation of several major algorithms for multicategory classification, several gene selection methods, multiple ensemble classifier methods and two cross-validation designs using 11 datasets spanning 74 diagnostic categories, 41 cancer types and 12 normal tissue types. ⋯ alexander.statnikov@vanderbilt.edu.

Bioinformatics · Nov 2004

Letter

Bioinformatics leads charge by publishing more Internet addresses in abstracts than any other journal.

no abstract available

Bioinformatics · Nov 2004

Comparative Study Clinical Trial Controlled Clinical Trial

A mixture model for estimating the local false discovery rate in DNA microarray analysis.

Statistical methods based on controlling the false discovery rate (FDR) or positive false discovery rate (pFDR) are now well established for identifying differentially expressed genes in DNA microarray data. Several authors have recently raised the important issue that FDR or pFDR may give misleading inference when specific genes are of interest, because these measures average the genes under consideration with genes that show stronger evidence for differential expression. The paper proposes a flexible and robust mixture model for estimating the local FDR, which quantifies how plausible it is that each specific gene is differentially expressed. ⋯ An R function implementing the proposed model is available at http://www.geocities.com/jg_liao/software
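
To make the idea of a local FDR concrete, here is a minimal sketch of the standard two-component mixture definition, fdr(z) = pi0 * f0(z) / (pi0 * f0(z) + (1 - pi0) * f1(z)), evaluated for a single gene's test statistic z. It is not the model proposed in the paper: the normal null and alternative densities, the fixed parameter values and the function names are assumptions chosen only to show how the quantity differs from a tail-area FDR.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    return math.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * math.sqrt(2 * math.pi))


def local_fdr(z, pi0=0.9, alt_mean=3.0, alt_sd=1.0):
    """Local FDR at statistic z under a two-component mixture.

    Assumptions for this sketch: null genes follow N(0, 1), differentially
    expressed genes follow N(alt_mean, alt_sd^2), and pi0 is the known
    proportion of null genes. In practice these would be estimated.
    """
    null_part = pi0 * normal_pdf(z, 0.0, 1.0)
    alt_part = (1.0 - pi0) * normal_pdf(z, alt_mean, alt_sd)
    return null_part / (null_part + alt_part)


# A gene near z = 0 is almost certainly null; one near z = 4 almost certainly is not.
near_null = local_fdr(0.0)
near_alt = local_fdr(4.0)
```

Because the value is computed from the densities at z itself rather than over a tail region, it describes the specific gene and is not averaged with genes that show stronger evidence.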

Bioinformatics · Oct 2004

Data mining in bioinformatics using Weka.

The Weka machine learning workbench provides a general-purpose environment for automatic classification, regression, clustering and feature selection, which are common data mining problems in bioinformatics research. It contains an extensive collection of machine learning algorithms and data pre-processing methods complemented by graphical user interfaces for data exploration and the experimental comparison of different machine learning techniques on the same problem. Weka can process data given in the form of a single relational table. Its main objectives are to (a) assist users in extracting useful information from data and (b) enable them to easily identify a suitable algorithm for generating an accurate predictive model from it. ⋯ http://www.cs.waikato.ac.nz/ml/weka.
