Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data

Shafagh Fallah; David Tritchler; Joseph Beyene

doi:10.2202/1544-6115.1261

Startseite Lebenswissenschaften Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data

Artikel

Lizenziert

Nicht lizenziert Erfordert eine Authentifizierung

Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data

Shafagh Fallah , David Tritchler und Joseph Beyene

Veröffentlicht/Copyright: 2. August 2008

Veröffentlicht von

Veröffentlichen auch Sie bei De Gruyter Brill

Manuskript einreichen Informationen für Autor*innen Erkunden Sie dieses Fachgebiet

Aus der Zeitschrift Statistical Applications in Genetics and Molecular Biology Band 7 Heft 1

MLA
APA
Harvard
Chicago
Vancouver

MLA
APA
Harvard
Chicago
Vancouver

Fallah, Shafagh, Tritchler, David and Beyene, Joseph. "Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data" Statistical Applications in Genetics and Molecular Biology, vol. 7, no. 1. https://doi.org/10.2202/1544-6115.1261

Fallah, S., Tritchler, D. & Beyene, J. (). Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data. Statistical Applications in Genetics and Molecular Biology, 7(1). https://doi.org/10.2202/1544-6115.1261

Fallah, S., Tritchler, D. and Beyene, J. () Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data. Statistical Applications in Genetics and Molecular Biology, Vol. 7 (Issue 1). https://doi.org/10.2202/1544-6115.1261

Fallah, Shafagh, Tritchler, David and Beyene, Joseph. "Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data" Statistical Applications in Genetics and Molecular Biology 7, no. 1 (). https://doi.org/10.2202/1544-6115.1261

Fallah S, Tritchler D, Beyene J. Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data. Statistical Applications in Genetics and Molecular Biology. ;7(1). https://doi.org/10.2202/1544-6115.1261

Kopie

In die Zwischenablage kopiert

BibTeX EndNote RIS

Many clustering methods require that the number of clusters believed present in a given data set be specified a priori, and a number of methods for estimating the number of clusters have been developed. However, the selection of the number of clusters is well recognized as a difficult and open problem and there is a need for methods which can shed light on specific aspects of the data. This paper adopts a model for clustering based on a specific structure for a similarity matrix. Publicly available gene expression data sets are analyzed to illustrate the method and the performance of our method is assessed by simulation.

Keywords: cluster analysis; eigenanalysis; microarray; segmented regression; scree plot

Published Online: 2008-8-2

Sie haben derzeit keinen Zugang zu diesem Inhalt.

Artikel in diesem Heft

Article
Self-Organizing Maps with Statistical Phase Synchronization (SOMPS) for Analyzing Cell Cycle-Specific Gene Expression Data
Coalescent Time Distributions in Trees of Arbitrary Size
Quantifying the Association between Gene Expressions and DNA-Markers by Penalized Canonical Correlation Analysis
Nonparametric Functional Mapping of Quantitative Trait Loci Underlying Programmed Cell Death
Accommodating Uncertainty in a Tree Set for Function Estimation
Drifting Markov Models with Polynomial Drift and Applications to DNA Sequences
Comparing the Characteristics of Gene Expression Profiles Derived by Univariate and Multivariate Classification Methods
Calculating Confidence Intervals for Prediction Error in Microarray Classification Using Resampling
Structure Learning in Nested Effects Models
Correcting the Estimated Level of Differential Expression for Gene Selection Bias: Application to a Microarray Study
Adapting Prediction Error Estimates for Biased Complexity Selection in High-Dimensional Bootstrap Samples
Adaptive Choice of the Number of Bootstrap Samples in Large Scale Multiple Testing
Re-Cracking the Nucleosome Positioning Code
Semi-Parametric Differential Expression Analysis via Partial Mixture Estimation
A SNP Streak Model for the Identification of Genetic Regions Identical-by-descent
Detecting Two-Locus Gene-Gene Effects Using Monotonisation of the Penetrance Matrix
Modeling DNA Methylation in a Population of Cancer Cells
Phenotyping Genetic Diseases Using an Extension of µ-Scores for Multivariate Data
The Estimator of the Optimal Measure of Allelic Association: Mean, Variance and Probability Distribution When the Sample Size Tends to Infinity
Predicting Protein Concentrations with ELISA Microarray Assays, Monotonic Splines and Monte Carlo Simulation
A Comparison of Normalization Techniques for MicroRNA Microarray Data
Collapsing SNP Genotypes in Case-Control Genome-Wide Association Studies Increases the Type I Error Rate and Power
Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data
Data Distribution of Short Oligonucleotide Expression Arrays and Its Application to the Construction of a Generalized Intellectual Framework
Approximately Sufficient Statistics and Bayesian Computation
A Composite-Conditional-Likelihood Approach for Gene Mapping Based on Linkage Disequilibrium in Windows of Marker Loci
Statistical Methods in Integrative Analysis for Gene Regulatory Modules
Reducing Spatial Flaws in Oligonucleotide Arrays by Using Neighborhood Information
Pattern Classification of Phylogeny Signals
A Unification of Multivariate Methods for Meta-Analysis of Genetic Association Studies
Importance Sampling for the Infinite Sites Model
Supervised Distance Matrices
Addressing the Shortcomings of Three Recent Bayesian Methods for Detecting Interspecific Recombination in DNA Sequence Alignments
A Sparse PLS for Variable Selection when Integrating Omics Data
Software Communication
TRAB: Testing Whether Mutation Frequencies Are Above an Unknown Background

Zeitschrift durchsuchen In dieser Zeitschrift suchen

https://doi.org/10.2202/1544-6115.1261

Schlagwörter für diesen Artikel

cluster analysis; eigenanalysis; microarray; segmented regression; scree plot

Artikel in diesem Heft

Article
Self-Organizing Maps with Statistical Phase Synchronization (SOMPS) for Analyzing Cell Cycle-Specific Gene Expression Data
Coalescent Time Distributions in Trees of Arbitrary Size
Quantifying the Association between Gene Expressions and DNA-Markers by Penalized Canonical Correlation Analysis
Nonparametric Functional Mapping of Quantitative Trait Loci Underlying Programmed Cell Death
Accommodating Uncertainty in a Tree Set for Function Estimation
Drifting Markov Models with Polynomial Drift and Applications to DNA Sequences
Comparing the Characteristics of Gene Expression Profiles Derived by Univariate and Multivariate Classification Methods
Calculating Confidence Intervals for Prediction Error in Microarray Classification Using Resampling
Structure Learning in Nested Effects Models
Correcting the Estimated Level of Differential Expression for Gene Selection Bias: Application to a Microarray Study
Adapting Prediction Error Estimates for Biased Complexity Selection in High-Dimensional Bootstrap Samples
Adaptive Choice of the Number of Bootstrap Samples in Large Scale Multiple Testing
Re-Cracking the Nucleosome Positioning Code
Semi-Parametric Differential Expression Analysis via Partial Mixture Estimation
A SNP Streak Model for the Identification of Genetic Regions Identical-by-descent
Detecting Two-Locus Gene-Gene Effects Using Monotonisation of the Penetrance Matrix
Modeling DNA Methylation in a Population of Cancer Cells
Phenotyping Genetic Diseases Using an Extension of µ-Scores for Multivariate Data
The Estimator of the Optimal Measure of Allelic Association: Mean, Variance and Probability Distribution When the Sample Size Tends to Infinity
Predicting Protein Concentrations with ELISA Microarray Assays, Monotonic Splines and Monte Carlo Simulation
A Comparison of Normalization Techniques for MicroRNA Microarray Data
Collapsing SNP Genotypes in Case-Control Genome-Wide Association Studies Increases the Type I Error Rate and Power
Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data
Data Distribution of Short Oligonucleotide Expression Arrays and Its Application to the Construction of a Generalized Intellectual Framework
Approximately Sufficient Statistics and Bayesian Computation
A Composite-Conditional-Likelihood Approach for Gene Mapping Based on Linkage Disequilibrium in Windows of Marker Loci
Statistical Methods in Integrative Analysis for Gene Regulatory Modules
Reducing Spatial Flaws in Oligonucleotide Arrays by Using Neighborhood Information
Pattern Classification of Phylogeny Signals
A Unification of Multivariate Methods for Meta-Analysis of Genetic Association Studies
Importance Sampling for the Infinite Sites Model
Supervised Distance Matrices
Addressing the Shortcomings of Three Recent Bayesian Methods for Detecting Interspecific Recombination in DNA Sequence Alignments
A Sparse PLS for Variable Selection when Integrating Omics Data
Software Communication
TRAB: Testing Whether Mutation Frequencies Are Above an Unknown Background