Abstract
Adaptive transmission disequilibrium test (aTDT) and MAX3 test are two robust-efficient association tests for case-parent family trio data. Both tests incorporate information of common genetic models including recessive, additive and dominant models and are efficient in power and robust to genetic model specifications. The aTDT uses information of departure from Hardy-Weinberg disequilibrium to identify the potential genetic model underlying the data and then applies the corresponding TDT-type test, and the MAX3 test is defined as the maximum of the absolute value of three TDT-type tests under the three common genetic models. In this article, we propose three robust Bayes procedures, the aTDT based Bayes factor, MAX3 based Bayes factor and Bayes model averaging (BMA), for association analysis with case-parent trio design. The asymptotic distributions of aTDT under the null and alternative hypothesis are derived in order to calculate its Bayes factor. Extensive simulations show that the Bayes factors and the p-values of the corresponding tests are generally consistent and these Bayes factors are robust to genetic model specifications, especially so when the priors on the genetic models are equal. When equal priors are used for the underlying genetic models, the Bayes factor method based on aTDT is more powerful than those based on MAX3 and Bayes model averaging. When the prior placed a small (large) probability on the true model, the Bayes factor based on aTDT (BMA) is more powerful. Analysis of a simulation data about RA from GAW15 is presented to illustrate applications of the proposed methods.
Acknowledgments
Dr. Min Yuan is supported by the National Science Foundation of China (NSFC), Grant No. 11201452, 11271346 and 11401558. Dr. Xiaoqing Pan is supported by the Fundamental Research Funds for the Central Universities (No. WK2040160010) and China Postdoctoral Science Foundation (No. 2014M561823).
Conflict of interest statement: We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, there is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled “Bayes Factors Based on robust TDT-type Tests for Family Trio Design.”
Appendix A
Using the facts that
the expectation of Zθ can be shown to be
and the variance of Zθ is
Therefore, Zθ is asymptotically distributed as
Appendix B
Let f(x)=n2x+n4x2. Making Taylor expansion of
The variance of
where
Therefore,
Appendix C
Denote
and
The covariance between Zθ and Zhwd can be shown to be
References
Falk, C. T. and P. Rubinstein (1987): “Haplotype relative risk: an easy reliable way to construct a proper control sample for risk calculation,” Ann. Hum. Genet., 51, 227–233.Suche in Google Scholar
Field, L. L., C. Fothergill-Payne, J. Bertrams, and M. P. Baur (1986): “HLA-DR effects in a large German IDDM dataset,” Genet. Epidemiol. Suppl., 1, 323–328.Suche in Google Scholar
Jeffreys, H. (1961): Theory of probability, 3rd edition. Oxford, UK: Oxford University Press.Suche in Google Scholar
Johnson, V. E. (2005): “Bayes factors based on test statistics,” J Roy. Stat. Soc. B, 67, 689–701.Suche in Google Scholar
Johnson, V. E. (2008): “Properties of bayes factors based on test statistics,” Scand. J. Stat., 35, 354–368.Suche in Google Scholar
Kass, R. and A. Raftery (1995): “Bayes factors,” J. Am. Stat. Assoc., 90, 773–795.Suche in Google Scholar
Miller, M. B., G. R. Lind, N. Li and S. Y. Jang (2007): “Genetic Analysis Workshop 15: simulation of a complex genetic model for rheumatoid arthritis in nuclear families including a dense SNP map with linkage disequilibrium between marker loci and trait loci,” BMC Proc., 1, S4.10.1186/1753-6561-1-S1-S4Suche in Google Scholar
Ott, J. (1989): “Statistical properties of the haplotype relative risk,” Genet. Epidemiol., 6, 127–130.Suche in Google Scholar
Rubinstein, P., M. Walker, C. Carpenter, C. Carrier, J. Krassner, C. Falk and F. Ginsberg (1981): “Genetics of HLA disease association: the use haplotype relative risk (HRR) and the ‘haplo-delta’ (Dh) estimates in juvenile diabetes from three racial groups,” Hum. Immunol., 3, 384.Suche in Google Scholar
Sawcer, S. (2010): “Bayes factors in complex genetics,” Eur. J. Hum. Genet., 18, 746–750.Suche in Google Scholar
Schaid, D. J. and S. S. Sommer (1993): “Genotype relative risks: methods for design and analysis of candidate-gene association studies,” Am. J. Hum. Genet., 53, 1114–1126.Suche in Google Scholar
Schaid, D. J. and S. S. Sommer (1994): “Comparison of statistics for candidate-gene association studies using cases and parents,” Am. J. Hum. Genet., 55, 402–409.Suche in Google Scholar
Serfling, R. J. (1980): Approximation theorems of mathematical statistics, New York: Wiley.10.1002/9780470316481Suche in Google Scholar
Spielman, R. S., R. E. McGinnis and W. J. Ewens (1993): “Transmission test for linkage diseqilibrium: the insulin gene region and inculin-dependent diabetes mellitus (IDDM),” Am. J. Hum. Genet., 52, 506–516.Suche in Google Scholar
Stephens, M. and D. J. Balding (2009): “Bayesian statistical methods for genetic association studies,” Nat. Rev. Genet., 10, 681–690.Suche in Google Scholar
Terwilliger, J. D. and J. Ott (1992): “A haplotype-based ‘haplotype relative risk ’ approach to detecting allelic association,” Hum. Hered., 42, 337–346.Suche in Google Scholar
The Wellcome Trust Case-Control Consortium (WTCCC) (2007): “Genome-wide association study of 14,000 cases of seven common diseases and 3000 shared controls,” Nature, 447, 661–683.10.1038/nature05911Suche in Google Scholar PubMed PubMed Central
Thomson, G., W. P. Robinson, M. K. Kuhner and S. Joe (1989): “HLA, insulin gene, and Gm associations with IDDM,” Genet. Epidemiol., 6, 155–160.Suche in Google Scholar
Wakefield, J. (2007): “A Bayesian measure of the probability of false discovery in genetic epidemiology studies,” Am. J. Hum. Genet., 81, 208–227.Suche in Google Scholar
Wakefield, J. (2009): “Bayes factors for genome-wide association studies: comparison with p-values,” Genet. Epidemiol., 33, 79–86.Suche in Google Scholar
Yuan, M., X. Tian, Y. Yang and G. Zheng (2009): “Adaptive transmission disequilibrium test for family trio design,” Stat. Appl. Genet. Mol. Biol., 8(1), 1–20.Suche in Google Scholar
Zheng, G., J. Joo and Y. Yang (2009): “Pearsons test, trend test, and MAX are all trend tests with different types of scores,” Ann. Hum. Genet., 73, 133–140.Suche in Google Scholar
Zheng, G., Q. Li and A. Yuan (2014): “Some statistical properties of efficiency robust tests with applications to genetic association studies,” Scand. J. Stat., 41(3), 762–774.Suche in Google Scholar
©2015 by De Gruyter
Artikel in diesem Heft
- Frontmatter
- Research Articles
- A novel method to prioritize RNAseq data for post-hoc analysis based on absolute changes in transcript abundance
- A mutual information estimator with exponentially decaying bias
- Bayes factors based on robust TDT-type tests for family trio design
- Modeling gene-covariate interactions in sparse regression with group structure for genome-wide association studies
- Weighted Kolmogorov Smirnov testing: an alternative for Gene Set Enrichment Analysis
- Application of the fractional-stable distributions for approximation of the gene expression profiles
- Software and Application Notes
- CSI: a nonparametric Bayesian approach to network inference from multiple perturbed time series gene expression data
- TopKLists: a comprehensive R package for statistical inference, stochastic aggregation, and visualization of multiple omics ranked lists
Artikel in diesem Heft
- Frontmatter
- Research Articles
- A novel method to prioritize RNAseq data for post-hoc analysis based on absolute changes in transcript abundance
- A mutual information estimator with exponentially decaying bias
- Bayes factors based on robust TDT-type tests for family trio design
- Modeling gene-covariate interactions in sparse regression with group structure for genome-wide association studies
- Weighted Kolmogorov Smirnov testing: an alternative for Gene Set Enrichment Analysis
- Application of the fractional-stable distributions for approximation of the gene expression profiles
- Software and Application Notes
- CSI: a nonparametric Bayesian approach to network inference from multiple perturbed time series gene expression data
- TopKLists: a comprehensive R package for statistical inference, stochastic aggregation, and visualization of multiple omics ranked lists