
Quantile-Function Based Null Distribution in Resampling Based Multiple Testing

  • Mark J. van der Laan and Alan E. Hubbard
Published/Copyright: May 21, 2006

Simultaneously testing a collection of null hypotheses about a data-generating distribution, based on a sample of independent and identically distributed observations, is a fundamental statistical problem with many applications. Methods based on marginal null distributions (i.e., marginal p-values) are attractive because the marginal null distributions can be chosen by the user and the methods are computationally trivial, but they are necessarily either conservative or reliant on assumptions about the dependence structure between the test statistics. Resampling-based multiple testing (Westfall and Young, 1993) involves sampling from a joint null distribution of the test statistics and controlling the user-supplied Type-I error rate under this joint null distribution, possibly in a step-down fashion. A generally asymptotically valid null distribution that avoids the need for the subset pivotality condition on the vector of test statistics was proposed in Pollard and van der Laan (2003) for null hypotheses about general real-valued parameters. This null distribution was generalized in Dudoit, van der Laan, and Pollard (2004) to general null hypotheses and test statistics. In recent, ongoing work (van der Laan and Hubbard, 2005), we propose a new generally asymptotically valid null distribution for the test statistics, together with a corresponding bootstrap estimate, whose marginal distributions are user supplied and can thus be set equal to the (most powerful) marginal null distributions one would use in univariate testing to obtain a p-value. Previously proposed null distributions either relied on a restrictive subset pivotality condition (Westfall and Young, 1993) or did not guarantee this latter property (Dudoit, van der Laan, and Pollard, 2004).
We argue and illustrate that the resulting new resampling-based multiple testing methods control the desired Type-I error rate more accurately in finite samples and are more powerful. We establish formal results and investigate the practical performance of this methodology in a simulation study and a data analysis.
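
The idea of a joint null distribution with user-supplied marginals can be illustrated with a minimal sketch. It is not the authors' exact construction: it assumes a B x m matrix of resampled (for example, bootstrap) test statistics is already available, replaces each column by the quantiles of a chosen marginal null distribution according to within-column rank, and then reads a single-step cut-off off the row-wise maximum. The function names `quantile_transform` and `single_step_max_cutoff`, the standard normal marginal, and the simulated draws are all assumptions made for illustration.

```python
import random
from statistics import NormalDist


def quantile_transform(null_draws, marginal_inv_cdf):
    """Map each column of a B x m matrix of resampled statistics onto a
    user-supplied marginal null distribution. Within-column ranks are kept,
    so the rank dependence between columns is preserved."""
    B = len(null_draws)
    m = len(null_draws[0])
    out = [[0.0] * m for _ in range(B)]
    for j in range(m):
        column = [null_draws[b][j] for b in range(B)]
        order = sorted(range(B), key=lambda b: column[b])
        for rank, b in enumerate(order, start=1):
            # rank / (B + 1) keeps the probability strictly inside (0, 1)
            out[b][j] = marginal_inv_cdf(rank / (B + 1))
    return out


def single_step_max_cutoff(null_draws, alpha):
    """Empirical upper-alpha cut-off of the row-wise maximum statistic,
    a common common-cut-off rule for family-wise error control."""
    maxima = sorted(max(row) for row in null_draws)
    index = min(len(maxima) - 1, int((1 - alpha) * len(maxima)))
    return maxima[index]


# Hypothetical resampled statistics: correlated across hypotheses and with
# a marginal scale (2.0) that differs from the desired N(0, 1) marginal.
rng = random.Random(0)
B, m, alpha = 500, 4, 0.05
raw = []
for _ in range(B):
    shared = rng.gauss(0.0, 1.0)
    raw.append([2.0 * (0.7 * shared + 0.3 * rng.gauss(0.0, 1.0))
                for _ in range(m)])

transformed = quantile_transform(raw, NormalDist().inv_cdf)
cutoff = single_step_max_cutoff(transformed, alpha)
print(round(cutoff, 3))
```

In this sketch every transformed column has exactly the chosen marginal quantiles, while the joint ranks of the resampled draws are untouched, so the cut-off reflects the dependence between the test statistics rather than a worst-case bound.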

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston

Downloaded on 12.10.2025 from https://www.degruyterbrill.com/document/doi/10.2202/1544-6115.1199/html