Stopping-Time Resampling and Population Genetic Inference under Coalescent Models
-
Paul A. Jenkins
To extract full information from samples of DNA sequence data, it is necessary to use sophisticated model-based techniques such as importance sampling under the coalescent. However, these are limited in the size of datasets they can handle efficiently. Chen and Liu (2000) introduced the idea of stopping-time resampling and showed that it can dramatically improve the efficiency of importance sampling methods under a finite-alleles coalescent model. In this paper, a new framework is developed for designing stopping-time resampling schemes under more general models. It is implemented on data both from infinite sites and stepwise models of mutation, and extended to incorporate crossover recombination. A simulation study shows that this new framework offers a substantial improvement in the accuracy of likelihood estimation over a range of parameters, while a direct application of the scheme of Chen and Liu (2000) can actually diminish the estimate. The method imposes no additional computational burden and is robust to the choice of parameters.
©2012 Walter de Gruyter GmbH & Co. KG, Berlin/Boston
Artikel in diesem Heft
- Article
- The Inheritance Procedure: Multiple Testing of Tree-structured Hypotheses
- Optimality Criteria for the Design of 2-Color Microarray Studies
- Stopping-Time Resampling and Population Genetic Inference under Coalescent Models
- A Mixture-Model Approach for Parallel Testing for Unequal Variances
- Fast Identification of Biological Pathways Associated with a Quantitative Trait Using Group Lasso with Overlaps
- MicroRNA Transcription Start Site Prediction with Multi-objective Feature Selection
- A Context Dependent Pair Hidden Markov Model for Statistical Alignment
- Fast Wavelet Based Functional Models for Transcriptome Analysis with Tiling Arrays
- Alignment-free Sequence Comparison for Biologically Realistic Sequences of Moderate Length
- Transcriptional Network Inference from Functional Similarity and Expression Data: A Global Supervised Approach
- Improving Hidden Markov Models for Classification of Human Immunodeficiency Virus-1 Subtypes through Linear Classifier Learning
Artikel in diesem Heft
- Article
- The Inheritance Procedure: Multiple Testing of Tree-structured Hypotheses
- Optimality Criteria for the Design of 2-Color Microarray Studies
- Stopping-Time Resampling and Population Genetic Inference under Coalescent Models
- A Mixture-Model Approach for Parallel Testing for Unequal Variances
- Fast Identification of Biological Pathways Associated with a Quantitative Trait Using Group Lasso with Overlaps
- MicroRNA Transcription Start Site Prediction with Multi-objective Feature Selection
- A Context Dependent Pair Hidden Markov Model for Statistical Alignment
- Fast Wavelet Based Functional Models for Transcriptome Analysis with Tiling Arrays
- Alignment-free Sequence Comparison for Biologically Realistic Sequences of Moderate Length
- Transcriptional Network Inference from Functional Similarity and Expression Data: A Global Supervised Approach
- Improving Hidden Markov Models for Classification of Human Immunodeficiency Virus-1 Subtypes through Linear Classifier Learning