Differentiating strains of a pathogen is often central to investigating its epidemiology. The genetic similarity of a group of strains can be assessed by calculating a matrix of dissimilarities from their DNA fingerprinting profiles. The mean dissimilarity between each strain and the other strains in its group is then used as an observation in a statistical analysis. These observations are not independent of each other, so standard analysis techniques such as the t-test are inappropriate: they underestimate the variance of the group means and hence overstate the statistical significance of any differences. By examining the correlation between elements of the dissimilarity matrix, it is shown that the variance is underestimated by a factor of between about 2 and 4. Permutation tests are proposed as a way of addressing the problem of dependence, and are applied to a study of fluconazole resistance in Candida albicans.
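As an illustration of the general idea, the following minimal sketch runs a label-permutation test on a dissimilarity matrix. It is not the authors' procedure: the abstract does not state the test statistic, so the statistic used here (the difference in mean within-group dissimilarity between two groups), the function names, and the toy data are all assumptions made for the example. The key point it tries to show is that whole strains are relabelled, rather than individual matrix entries being shuffled, so the dependence among entries that share a strain is preserved under the null.

```python
import random


def mean_within(d, members):
    """Mean dissimilarity over all unordered pairs of strains in `members`.

    This equals the average of the per-strain mean dissimilarities
    within the group. Each group needs at least two strains.
    """
    pairs = [(i, j) for k, i in enumerate(members) for j in members[k + 1:]]
    return sum(d[i][j] for i, j in pairs) / len(pairs)


def group_difference(d, labels):
    """Hypothetical statistic: within-group mean of group 0 minus group 1."""
    g0 = [i for i, g in enumerate(labels) if g == 0]
    g1 = [i for i, g in enumerate(labels) if g == 1]
    return mean_within(d, g0) - mean_within(d, g1)


def permutation_test(d, labels, n_perm=2000, seed=0):
    """Two-sided Monte Carlo permutation test that relabels whole strains."""
    rng = random.Random(seed)
    observed = group_difference(d, labels)
    shuffled = list(labels)
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # keeps group sizes, reassigns strains
        # Small tolerance guards against floating-point ties.
        if abs(group_difference(d, shuffled)) >= abs(observed) - 1e-12:
            extreme += 1
    # Add-one correction counts the observed labelling itself.
    return observed, (extreme + 1) / (n_perm + 1)


# Toy data (invented): group 0 is tightly clustered, group 1 is diverse.
n = 8
labels = [0, 0, 0, 0, 1, 1, 1, 1]
d = [[0.0] * n for _ in range(n)]
for i in range(n):
    for j in range(i + 1, n):
        if labels[i] == labels[j]:
            value = 0.1 if labels[i] == 0 else 0.6
        else:
            value = 0.5
        d[i][j] = d[j][i] = value

observed, p_value = permutation_test(d, labels)
print(f"observed difference = {observed:.3f}, permutation p-value = {p_value:.4f}")
```

Because the reference distribution is built from the same dependent matrix entries, no independence assumption about the per-strain means is needed. A real analysis would substitute the dissimilarity matrix computed from the fingerprinting profiles and a statistic suited to the study question.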
Contents
- Article
- Accounting for Dependence in Similarity Data from DNA Fingerprinting (January 15, 2007)
- Normalization of Dye Bias in Microarray Data Using the Mixture of Splines Model (January 25, 2007)
- A Generalized Sidak-Holm Procedure and Control of Generalized Error Rates under Independence (January 25, 2007)
- Using Duplicate Genotyped Data in Genetic Analyses: Testing Association and Estimating Error Rates (February 5, 2007)
- Likelihood-Based Inference for Multi-Color Optical Mapping (February 10, 2007)
- Sparse Logistic Regression with Lp Penalty for Biomarker Identification (February 10, 2007)
- Super Learning: An Application to the Prediction of HIV-1 Drug Resistance (February 23, 2007)
- Supervised Detection of Conserved Motifs in DNA Sequences with Cosmo (February 23, 2007)
- Accurate Ranking of Differentially Expressed Genes by a Distribution-Free Shrinkage Approach (February 23, 2007)
- Statistical Inference for Quantitative Polymerase Chain Reaction Using a Hidden Markov Model: A Bayesian Approach (March 19, 2007)
- A Bayesian Model of AFLP Marker Evolution and Phylogenetic Inference (April 17, 2007)
- Sequential Quantitative Trait Locus Mapping in Experimental Crosses (April 17, 2007)
- Case-Control Inference of Interaction between Genetic and Nongenetic Risk Factors under Assumptions on Their Distribution (April 22, 2007)
- Inference on the Limiting False Discovery Rate and the P-value Threshold Parameter Assuming Weak Dependence between Gene Expression Levels within Subject (May 21, 2007)
- Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge (May 29, 2007)
- Cox Survival Analysis of Microarray Gene Expression Data Using Correlation Principal Component Regression (May 29, 2007)
- A Method for Meta-Analysis of Case-Control Genetic Association Studies Using Logistic Regression (June 14, 2007)
- Approximating the Variance of the Conditional Probability of the State of a Hidden Markov Model (July 6, 2007)
- Using Linear Mixed Models for Normalization of cDNA Microarrays (July 26, 2007)
- Experimental Design for Two-Color Microarrays Applied in a Pre-Existing Split-Plot Experiment (July 26, 2007)
- The Cyclohedron Test for Finding Periodic Genes in Time Course Expression Studies (August 15, 2007)
- H-Tuple Approach to Evaluate Statistical Significance of Biological Sequence Comparison with Gaps (August 25, 2007)
- Multiple Testing Issues in Discriminating Compound-Related Peaks and Chromatograms from High Frequency Noise, Spikes and Solvent-Based Noise in LC - MS Data Sets (September 8, 2007)
- A Bayesian Approach to Estimation and Testing in Time-course Microarray Experiments (September 16, 2007)
- Super Learner (September 16, 2007)
- Testing for Trends in Dose-Response Microarray Experiments: A Comparison of Several Testing Procedures, Multiplicity and Resampling-Based Inference (October 11, 2007)
- On the Operational Characteristics of the Benjamini and Hochberg False Discovery Rate Procedure (October 11, 2007)
- A Comparison of Methods to Control Type I Errors in Microarray Studies (October 11, 2007)
- Selection of Biologically Relevant Genes with a Wrapper Stochastic Algorithm (November 6, 2007)
- T-BAPS: A Bayesian Statistical Tool for Comparison of Microbial Communities Using Terminal-restriction Fragment Length Polymorphism (T-RFLP) Data (November 6, 2007)
- Population Structure and Covariate Analysis Based on Pairwise Microsatellite Allele Matching Frequencies (November 6, 2007)
- Estimating the Arm-Wise False Discovery Rate in Array Comparative Genomic Hybridization Experiments (November 19, 2007)
- An Expectation Maximization Approach to Estimate Malaria Haplotype Frequencies in Multiply Infected Children (November 19, 2007)
- Estimation of Expression Levels in Spotted Microarrays with Saturated Pixels (December 8, 2007)
- Improving Divergence Time Estimation in Phylogenetics: More Taxa vs. Longer Sequences (December 21, 2007)
- Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks (December 21, 2007)
- Multiple Testing for SNP-SNP Interactions (December 26, 2007)