The measurement of biallelic pair-wise association called linkage disequilibrium (LD) is an important issue in order to understand the genomic architecture. A plethora of measures of association in two by two tables have been proposed in the literature. Beside the problem of choosing an appropriate measure, the problem of their estimation has been neglected in the literature. It needs to be emphasized that the definition of a measure and the choice of an estimator function for it are conceptually unrelated tasks.In this paper, we compare the performance of various estimators for the three popular LD measures D', r and Y in a simulation study for small to moderate samples sizes (N<=500). The usual frequency-plug-in estimators can lead to unreliable or undefined estimates. Estimators based on the computationally expensive volume measures have been proposed recently as a remedy to this well-known problem. We confirm that volume estimators have better expected mean square error than the naive plug-in estimators. But they are outperformed by estimators plugging-in easy to calculate non-informative Bayesian probability estimates into the theoretical formulae for the measures. Fully Bayesian estimators with non-informative Dirichlet priors have comparable accuracy but are computationally more expensive.We recommend the use of non-informative Bayesian plug-in estimators based on Jeffreys' prior, in particular when dealing with SNP array data where the occurrence of small table entries and table margins is likely.
Contents
- Article
-
Publicly AvailableComparison of Estimators for Measures of Linkage DisequilibriumJanuary 6, 2010
-
Publicly AvailableComparing Mortality in Renal Patients on Hemodialysis versus Peritoneal Dialysis Using a Marginal Structural ModelJanuary 6, 2010
-
Publicly AvailableEstimation of Modified Concordance Ratio in Sib-Pairs: Effect of Consanguinity on the Risk of Congenital Heart DiseasesJanuary 6, 2010
-
Publicly AvailableLack of Fit in Self Modeling Regression: Application to Pulse WaveformsJanuary 14, 2010
-
Publicly AvailableA Comparison of Variable Selection Approaches for Dynamic Treatment RegimesJanuary 26, 2010
-
Publicly AvailableA Pseudo-EM Algorithm for Clustering Incomplete Longitudinal DataMarch 15, 2010
-
March 29, 2010
-
Publicly AvailableA Note on the Effect on Power of Score Tests via Dimension Reduction by Penalized Regression under the NullMarch 29, 2010
-
Publicly AvailableOrdinal Regression Models for Continuous ScalesApril 6, 2010
-
Publicly AvailableEvaluation of Incidence Rates in Pre-Clinical Studies Using a Williams-Type ProcedureApril 22, 2010
-
Publicly AvailableCollaborative Double Robust Targeted Maximum Likelihood EstimationMay 17, 2010
-
Publicly AvailableAn Application of Collaborative Targeted Maximum Likelihood Estimation in Causal Inference and GenomicsMay 17, 2010
-
Publicly AvailableA Unified Approach for Nonparametric Evaluation of Agreement in Method Comparison StudiesJune 8, 2010
-
Publicly AvailableCollaborative Targeted Maximum Likelihood for Time to Event DataJune 22, 2010
-
Publicly AvailableA Stochastic EM Type Algorithm for Parameter Estimation in Models with Continuous Outcomes, under Complex AscertainmentJuly 7, 2010
-
Publicly AvailableSimple Nonparametric Confidence Regions for the Evaluation of Continuous-Scale Diagnostic TestsJuly 16, 2010
-
Publicly AvailableA Targeted Maximum Likelihood Estimator of a Causal Effect on a Bounded Continuous OutcomeAugust 1, 2010
-
Publicly AvailableBivariate Zero-Inflated Regression for Count Data: A Bayesian Approach with Application to Plant CountsAugust 13, 2010
-
Publicly AvailableFast Function-on-Scalar Regression with Penalized Basis ExpansionsAugust 19, 2010
-
Publicly AvailableStatistical Methods for Comparative Phenomics Using High-Throughput Phenotype MicroarraysAugust 24, 2010
-
Publicly AvailablePanel Count Data Regression with Informative Observation TimesSeptember 6, 2010
-
Publicly AvailableA Small Sample Correction for Estimating Attributable Risk in Case-Control StudiesOctober 13, 2010
-
October 25, 2010
-
Publicly AvailableSurvival Models in Health Economic Evaluations: Balancing Fit and Parsimony to Improve PredictionOctober 27, 2010
-
Publicly AvailableInterval Estimation of Some Epidemiological Measures of AssociationOctober 29, 2010
- Reader's Reaction
-
Publicly AvailableComment: Analyzing Propensity Score Matched Count DataJanuary 15, 2010
-
July 6, 2010
-
Publicly AvailableRejoinder to Nancy Cook's Comment on "Measures to Summarize and Compare the Predictive Capacity of Markers"July 29, 2010