Comparison of Targeted Maximum Likelihood and Shrinkage Estimators of Parameters in Gene Networks
-
Geert Geeven
Abstract
Gene regulatory networks, in which edges between nodes describe interactions between transcription factors (TFs) and their target genes, model regulatory interactions that determine the cell-type and condition-specific expression of genes. Regression methods can be used to identify TF-target gene interactions from gene expression and DNA sequence data. The response variable, i.e. observed gene expression, is modeled as a function of many predictor variables simultaneously. In practice, it is generally not possible to select a single model that clearly achieves the best fit to the observed experimental data and the selected models typically contain overlapping sets of predictor variables. Moreover, parameters that represent the marginal effect of the individual predictors are not always present. In this paper, we use the statistical framework of estimation of variable importance to define variable importance as a parameter of interest and study two different estimators of this parameter in the context of gene regulatory networks. On yeast data we show that the resulting parameter has a biologically appealing interpretation. We apply the proposed methodology on mammalian gene expression data to gain insight into the temporal activity of TFs that underly gene expression changes in F11 cells in response to Forskolin stimulation.
©2012 Walter de Gruyter GmbH & Co. KG, Berlin/Boston
Articles in the same Issue
- Article
- Large-scale Parentage Inference with SNPs: an Efficient Algorithm for Statistical Confidence of Parent Pair Allocations
- ExactDAS: An Exact Test Procedure for the Detection of Differential Alternative Splicing in Microarray Experiments
- Incorporating Genomic Annotation into a Hidden Markov Model for DNA Methylation Tiling Array Data
- Variational Bayes Procedure for Effective Classification of Tumor Type with Microarray Gene Expression Data
- Detecting Differential Expression in RNA-sequence Data Using Quasi-likelihood with Shrunken Dispersion Estimates
- Empirical Bayesian Selection of Hypothesis Testing Procedures for Analysis of Sequence Count Expression Data
- Analyzing Genetic Association Studies with an Extended Propensity Score Approach
- Genotype Copy Number Variations using Gaussian Mixture Models: Theory and Algorithms
- Estimators of the local false discovery rate designed for small numbers of tests
- A PAUC-based Estimation Technique for Disease Classification and Biomarker Selection
- Comparison of Targeted Maximum Likelihood and Shrinkage Estimators of Parameters in Gene Networks
- DNA Pooling and Statistical Tests for the Detection of Single Nucleotide Polymorphisms
Articles in the same Issue
- Article
- Large-scale Parentage Inference with SNPs: an Efficient Algorithm for Statistical Confidence of Parent Pair Allocations
- ExactDAS: An Exact Test Procedure for the Detection of Differential Alternative Splicing in Microarray Experiments
- Incorporating Genomic Annotation into a Hidden Markov Model for DNA Methylation Tiling Array Data
- Variational Bayes Procedure for Effective Classification of Tumor Type with Microarray Gene Expression Data
- Detecting Differential Expression in RNA-sequence Data Using Quasi-likelihood with Shrunken Dispersion Estimates
- Empirical Bayesian Selection of Hypothesis Testing Procedures for Analysis of Sequence Count Expression Data
- Analyzing Genetic Association Studies with an Extended Propensity Score Approach
- Genotype Copy Number Variations using Gaussian Mixture Models: Theory and Algorithms
- Estimators of the local false discovery rate designed for small numbers of tests
- A PAUC-based Estimation Technique for Disease Classification and Biomarker Selection
- Comparison of Targeted Maximum Likelihood and Shrinkage Estimators of Parameters in Gene Networks
- DNA Pooling and Statistical Tests for the Detection of Single Nucleotide Polymorphisms