Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks

Alex Lewin; Natalia Bochkina; Sylvia Richardson

doi:10.2202/1544-6115.1314

Home Life Sciences Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks

Article

Licensed

Unlicensed Requires Authentication

Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks

Alex Lewin , Natalia Bochkina and Sylvia Richardson

Published/Copyright: December 21, 2007

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Statistical Applications in Genetics and Molecular Biology Volume 6 Issue 1

MLA
APA
Harvard
Chicago
Vancouver

MLA
APA
Harvard
Chicago
Vancouver

Lewin, Alex, Bochkina, Natalia and Richardson, Sylvia. "Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks" Statistical Applications in Genetics and Molecular Biology, vol. 6, no. 1. https://doi.org/10.2202/1544-6115.1314

Lewin, A., Bochkina, N. & Richardson, S. (). Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks. Statistical Applications in Genetics and Molecular Biology, 6(1). https://doi.org/10.2202/1544-6115.1314

Lewin, A., Bochkina, N. and Richardson, S. () Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks. Statistical Applications in Genetics and Molecular Biology, Vol. 6 (Issue 1). https://doi.org/10.2202/1544-6115.1314

Lewin, Alex, Bochkina, Natalia and Richardson, Sylvia. "Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks" Statistical Applications in Genetics and Molecular Biology 6, no. 1 (). https://doi.org/10.2202/1544-6115.1314

Lewin A, Bochkina N, Richardson S. Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks. Statistical Applications in Genetics and Molecular Biology. ;6(1). https://doi.org/10.2202/1544-6115.1314

Copy

Copied to clipboard

BibTeX EndNote RIS

We present a Bayesian hierarchical model for detecting differentially expressed genes using a mixture prior on the parameters representing differential effects. We formulate an easily interpretable 3-component mixture to classify genes as over-expressed, under-expressed and non-differentially expressed, and model gene variances as exchangeable to allow for variability between genes. We show how the proportion of differentially expressed genes, and the mixture parameters, can be estimated in a fully Bayesian way, extending previous approaches where this proportion was fixed and empirically estimated. Good estimates of the false discovery rates are also obtained.Different parametric families for the mixture components can lead to quite different classifications of genes for a given data set. Using Affymetrix data from a knock out and wildtype mice experiment, we show how predictive model checks can be used to guide the choice between possible mixture priors. These checks show that extending the mixture model to allow extra variability around zero instead of the usual point mass null fits the data better.A software package for R is available.

Keywords: microarray; mixture model; predictive checks; Bayesian analysis; MCMC

Published Online: 2007-12-21

You are currently not able to access this content.

Articles in the same Issue

Article
Accounting for Dependence in Similarity Data from DNA Fingerprinting
Normalization of Dye Bias in Microarray Data Using the Mixture of Splines Model
A Generalized Sidak-Holm Procedure and Control of Generalized Error Rates under Independence
Using Duplicate Genotyped Data in Genetic Analyses: Testing Association and Estimating Error Rates
Likelihood-Based Inference for Multi-Color Optical Mapping
Sparse Logistic Regression with Lp Penalty for Biomarker Identification
Super Learning: An Application to the Prediction of HIV-1 Drug Resistance
Supervised Detection of Conserved Motifs in DNA Sequences with Cosmo
Accurate Ranking of Differentially Expressed Genes by a Distribution-Free Shrinkage Approach
Statistical Inference for Quantitative Polymerase Chain Reaction Using a Hidden Markov Model: A Bayesian Approach
A Bayesian Model of AFLP Marker Evolution and Phylogenetic Inference
Sequential Quantitative Trait Locus Mapping in Experimental Crosses
Case-Control Inference of Interaction between Genetic and Nongenetic Risk Factors under Assumptions on Their Distribution
Inference on the Limiting False Discovery Rate and the P-value Threshold Parameter Assuming Weak Dependence between Gene Expression Levels within Subject
Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge
Cox Survival Analysis of Microarray Gene Expression Data Using Correlation Principal Component Regression
A Method for Meta-Analysis of Case-Control Genetic Association Studies Using Logistic Regression
Approximating the Variance of the Conditional Probability of the State of a Hidden Markov Model
Using Linear Mixed Models for Normalization of cDNA Microarrays
Experimental Design for Two-Color Microarrays Applied in a Pre-Existing Split-Plot Experiment
The Cyclohedron Test for Finding Periodic Genes in Time Course Expression Studies
H-Tuple Approach to Evaluate Statistical Significance of Biological Sequence Comparison with Gaps
Multiple Testing Issues in Discriminating Compound-Related Peaks and Chromatograms from High Frequency Noise, Spikes and Solvent-Based Noise in LC - MS Data Sets
A Bayesian Approach to Estimation and Testing in Time-course Microarray Experiments
Super Learner
Testing for Trends in Dose-Response Microarray Experiments: A Comparison of Several Testing Procedures, Multiplicity and Resampling-Based Inference
On the Operational Characteristics of the Benjamini and Hochberg False Discovery Rate Procedure
A Comparison of Methods to Control Type I Errors in Microarray Studies
Selection of Biologically Relevant Genes with a Wrapper Stochastic Algorithm
T-BAPS: A Bayesian Statistical Tool for Comparison of Microbial Communities Using Terminal-restriction Fragment Length Polymorphism (T-RFLP) Data
Population Structure and Covariate Analysis Based on Pairwise Microsatellite Allele Matching Frequencies
Estimating the Arm-Wise False Discovery Rate in Array Comparative Genomic Hybridization Experiments
An Expectation Maximization Approach to Estimate Malaria Haplotype Frequencies in Multiply Infected Children
Estimation of Expression Levels in Spotted Microarrays with Saturated Pixels
Improving Divergence Time Estimation in Phylogenetics: More Taxa vs. Longer Sequences
Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks
Multiple Testing for SNP-SNP Interactions

Search journal Search the content of this journal

https://doi.org/10.2202/1544-6115.1314

Keywords for this article

microarray; mixture model; predictive checks; Bayesian analysis; MCMC

Articles in the same Issue

Article
Accounting for Dependence in Similarity Data from DNA Fingerprinting
Normalization of Dye Bias in Microarray Data Using the Mixture of Splines Model
A Generalized Sidak-Holm Procedure and Control of Generalized Error Rates under Independence
Using Duplicate Genotyped Data in Genetic Analyses: Testing Association and Estimating Error Rates
Likelihood-Based Inference for Multi-Color Optical Mapping
Sparse Logistic Regression with Lp Penalty for Biomarker Identification
Super Learning: An Application to the Prediction of HIV-1 Drug Resistance
Supervised Detection of Conserved Motifs in DNA Sequences with Cosmo
Accurate Ranking of Differentially Expressed Genes by a Distribution-Free Shrinkage Approach
Statistical Inference for Quantitative Polymerase Chain Reaction Using a Hidden Markov Model: A Bayesian Approach
A Bayesian Model of AFLP Marker Evolution and Phylogenetic Inference
Sequential Quantitative Trait Locus Mapping in Experimental Crosses
Case-Control Inference of Interaction between Genetic and Nongenetic Risk Factors under Assumptions on Their Distribution
Inference on the Limiting False Discovery Rate and the P-value Threshold Parameter Assuming Weak Dependence between Gene Expression Levels within Subject
Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge
Cox Survival Analysis of Microarray Gene Expression Data Using Correlation Principal Component Regression
A Method for Meta-Analysis of Case-Control Genetic Association Studies Using Logistic Regression
Approximating the Variance of the Conditional Probability of the State of a Hidden Markov Model
Using Linear Mixed Models for Normalization of cDNA Microarrays
Experimental Design for Two-Color Microarrays Applied in a Pre-Existing Split-Plot Experiment
The Cyclohedron Test for Finding Periodic Genes in Time Course Expression Studies
H-Tuple Approach to Evaluate Statistical Significance of Biological Sequence Comparison with Gaps
Multiple Testing Issues in Discriminating Compound-Related Peaks and Chromatograms from High Frequency Noise, Spikes and Solvent-Based Noise in LC - MS Data Sets
A Bayesian Approach to Estimation and Testing in Time-course Microarray Experiments
Super Learner
Testing for Trends in Dose-Response Microarray Experiments: A Comparison of Several Testing Procedures, Multiplicity and Resampling-Based Inference
On the Operational Characteristics of the Benjamini and Hochberg False Discovery Rate Procedure
A Comparison of Methods to Control Type I Errors in Microarray Studies
Selection of Biologically Relevant Genes with a Wrapper Stochastic Algorithm
T-BAPS: A Bayesian Statistical Tool for Comparison of Microbial Communities Using Terminal-restriction Fragment Length Polymorphism (T-RFLP) Data
Population Structure and Covariate Analysis Based on Pairwise Microsatellite Allele Matching Frequencies
Estimating the Arm-Wise False Discovery Rate in Array Comparative Genomic Hybridization Experiments
An Expectation Maximization Approach to Estimate Malaria Haplotype Frequencies in Multiply Infected Children
Estimation of Expression Levels in Spotted Microarrays with Saturated Pixels
Improving Divergence Time Estimation in Phylogenetics: More Taxa vs. Longer Sequences
Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks
Multiple Testing for SNP-SNP Interactions