Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?

Lev Klebanov; Andrei Yakovlev

doi:10.2202/1544-6115.1185

Home Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?

Article

Licensed

Unlicensed Requires Authentication

Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?

Lev Klebanov and Andrei Yakovlev

Published/Copyright: March 24, 2006

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal Statistical Applications in Genetics and Molecular Biology Volume 5 Issue 1

MLA
APA
Harvard
Chicago
Vancouver

MLA
APA
Harvard
Chicago
Vancouver

Klebanov, Lev and Yakovlev, Andrei. "Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?" Statistical Applications in Genetics and Molecular Biology, vol. 5, no. 1. https://doi.org/10.2202/1544-6115.1185

Klebanov, L. & Yakovlev, A. (). Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?. Statistical Applications in Genetics and Molecular Biology, 5(1). https://doi.org/10.2202/1544-6115.1185

Klebanov, L. and Yakovlev, A. () Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?. Statistical Applications in Genetics and Molecular Biology, Vol. 5 (Issue 1). https://doi.org/10.2202/1544-6115.1185

Klebanov, Lev and Yakovlev, Andrei. "Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?" Statistical Applications in Genetics and Molecular Biology 5, no. 1 (). https://doi.org/10.2202/1544-6115.1185

Klebanov L, Yakovlev A. Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?. Statistical Applications in Genetics and Molecular Biology. ;5(1). https://doi.org/10.2202/1544-6115.1185

Copy

Copied to clipboard

BibTeX EndNote RIS

One of the prevailing ideas in the literature on microarray data analysis is to pool the expression measures across genes and treat them as a sample drawn from some distribution. Several universal laws were proposed to analytically describe this distribution. This idea raises a number of concerns. The expression levels of genes are not identically distributed random variables so that treating them as a sample amounts to sampling from a mixture of equally weighted distributions, each being associated with a different gene. The expression levels of different genes are heavily dependent random variables so that the law of large numbers and statistical goodness-of-fit tests are normally inapplicable to this kind of data. This dependence represents a very serious pitfall in microarray data analysis.

Keywords: microarray analysis; gene expression; pooled data; stochastic dependence

Published Online: 2006-3-24

You are currently not able to access this content.

Articles in the same Issue

Article
Low-Order Conditional Independence Graphs for Inferring Genetic Networks
A Generalized Clustering Problem, with Application to DNA Microarrays
A Bayes Regression Approach to Array-CGH Data
Statistical Selection of Maintenance Genes for Normalization of Gene Expressions
Predicting the Strongest Domain-Domain Contact in Interacting Protein Pairs
Dimension Reduction for Classification with Gene Expression Microarray Data
A New Type of Stochastic Dependence Revealed in Gene Expression Data
A New Order Estimator for Fixed and Variable Length Markov Models with Applications to DNA Sequence Similarity
Quality Optimised Analysis of General Paired Microarray Experiments
Issues of Processing and Multiple Testing of SELDI-TOF MS Proteomic Data
Cross-Validated Bagged Prediction of Survival
Treatment of Uninformative Families in Mean Allele Sharing Tests for Linkage
Quantile-Function Based Null Distribution in Resampling Based Multiple Testing
Combining Results of Microarray Experiments: A Rank Aggregation Approach
Model Selection for Mixtures of Mutagenetic Trees
Pseudo-likelihood for Non-reversible Nucleotide Substitution Models with Neighbour Dependent Rates
A Method to Increase the Power of Multiple Testing Procedures Through Sample Splitting
Bayesian Hierarchical Model for Correcting Signal Saturation in Microarrays Using Pixel Intensities
Using Complexity for the Estimation of Bayesian Networks
Detecting Local High-Scoring Segments: a First-Stage Approach for Genome-Wide Association Studies
Examining Protein Structure and Similarities by Spectral Analysis Technique
Parameter Estimation for the Exponential-Normal Convolution Model for Background Correction of Affymetrix GeneChip Data
Approximate Sample Size Calculations with Microarray Data: An Illustration
Numerical Solutions for Patterns Statistics on Markov Chains
A Heuristic Bayesian Method for Segmenting DNA Sequence Alignments and Detecting Evidence for Recombination and Gene Conversion
A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments
Validation in Genomics: CpG Island Methylation Revisited
An Improved Nonparametric Approach for Detecting Differentially Expressed Genes with Replicated Microarray Data
Letter to the Editor
Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?
Reader's Reaction
Reader's Reaction to "Dimension Reduction for Classification with Gene Expression Microarray Data" by Dai et al (2006)

Search journal Search the content of this journal

https://doi.org/10.2202/1544-6115.1185

Keywords for this article

microarray analysis; gene expression; pooled data; stochastic dependence

Articles in the same Issue

Article
Low-Order Conditional Independence Graphs for Inferring Genetic Networks
A Generalized Clustering Problem, with Application to DNA Microarrays
A Bayes Regression Approach to Array-CGH Data
Statistical Selection of Maintenance Genes for Normalization of Gene Expressions
Predicting the Strongest Domain-Domain Contact in Interacting Protein Pairs
Dimension Reduction for Classification with Gene Expression Microarray Data
A New Type of Stochastic Dependence Revealed in Gene Expression Data
A New Order Estimator for Fixed and Variable Length Markov Models with Applications to DNA Sequence Similarity
Quality Optimised Analysis of General Paired Microarray Experiments
Issues of Processing and Multiple Testing of SELDI-TOF MS Proteomic Data
Cross-Validated Bagged Prediction of Survival
Treatment of Uninformative Families in Mean Allele Sharing Tests for Linkage
Quantile-Function Based Null Distribution in Resampling Based Multiple Testing
Combining Results of Microarray Experiments: A Rank Aggregation Approach
Model Selection for Mixtures of Mutagenetic Trees
Pseudo-likelihood for Non-reversible Nucleotide Substitution Models with Neighbour Dependent Rates
A Method to Increase the Power of Multiple Testing Procedures Through Sample Splitting
Bayesian Hierarchical Model for Correcting Signal Saturation in Microarrays Using Pixel Intensities
Using Complexity for the Estimation of Bayesian Networks
Detecting Local High-Scoring Segments: a First-Stage Approach for Genome-Wide Association Studies
Examining Protein Structure and Similarities by Spectral Analysis Technique
Parameter Estimation for the Exponential-Normal Convolution Model for Background Correction of Affymetrix GeneChip Data
Approximate Sample Size Calculations with Microarray Data: An Illustration
Numerical Solutions for Patterns Statistics on Markov Chains
A Heuristic Bayesian Method for Segmenting DNA Sequence Alignments and Detecting Evidence for Recombination and Gene Conversion
A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments
Validation in Genomics: CpG Island Methylation Revisited
An Improved Nonparametric Approach for Detecting Differentially Expressed Genes with Replicated Microarray Data
Letter to the Editor
Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?
Reader's Reaction
Reader's Reaction to "Dimension Reduction for Classification with Gene Expression Microarray Data" by Dai et al (2006)