Abstract
Integrative analysis of copy number and gene expression data can help in understanding the cis and trans effect of copy number aberrations on transcription levels of genes involved in a pathway. To analyse how these copy number mediated gene-gene interactions differ between groups of samples we propose a new method, named dNET. Our method uses ridge regression to model the network topology involving one gene’s expression level, its gene dosage and the expression levels of other genes in the network. The interaction parameters are estimated by fitting the model per gene for all samples together. However, instead of testing for differential network topology per gene, dNET tests for an overall difference in estimated parameters between two groups of samples and produces a single p-value. With the help of several simulation studies, we show that dNET can detect differential network nodes with high accuracy and low rate of false positives even in the presence of differential cis effects. We also apply dNET to publicly available TCGA cancer datasets and identify pathways where copy number mediated gene-gene interactions differ between samples with cancer stage lower than stage 3 and samples with cancer stage 3 or above.
Acknowledgement
This work has been supported and funded by Netherlands Bioinformatics Centre.
References
Chaturvedi, N., J. J. Goeman, J. M. Boer, W. N. van Wieringen and R. X. d. Menezes (2014): “A test for comparing two groups of samples when analyzing multiple omics profiles,” BMC Bioinformatics, 15, 1–14.10.1186/1471-2105-15-236Search in Google Scholar PubMed PubMed Central
Chaturvedi, N., R. X. d. Menezes and J. J. Goeman (2017): “A global x global test for testing associations between two large sets of variables,” Biometrical J., 59, 145–158.10.1002/bimj.201500106Search in Google Scholar PubMed
Flutre, T., X. Wen, J. Pritchard and M. Stephens (2013): “A statistical framework for joint eqtl analysis in multiple tissues,” PLoS Genet., 9, e1003486.10.1371/journal.pgen.1003486Search in Google Scholar PubMed PubMed Central
Goeman, J. J., S. van De Geer and H. van Houwelingen (2006): “Testing against a high dimensional alternative,” J. R. Stat. Soc. Series B Stat. Methodol. 68, 477–493.10.1111/j.1467-9868.2006.00551.xSearch in Google Scholar
Hoerl, A. and R. Kennard (1970): “Ridge regression: biased estimation for nonorthogonal problems,” Technometrics, 12, 55–67.10.1080/00401706.1970.10488634Search in Google Scholar
Horlings, H., C. Lai, D. Nuyten, H. Halfwerk, P. Kristel, E. van Beers, S. Joosse, C. Klijn, P. Nederlof, M. Reinders, L. Wessels and M. van de Vijver (2010): “Integration of dna copy number alterations and prognostic gene expression signatures in breast cancer patients,” Clin. Cancer Res., 16, 651–663.10.1158/1078-0432.CCR-09-0709Search in Google Scholar PubMed
Jornsten, R., T. Abenius, T. Kling, L. Schmidt, E. Johansson, T. E. M. Nordling, B. Nordlander, C. Sander, P. Gennemark, K. Funa, B. Nilsson, L. Lindahl and S. Nelander (2011): “Network modeling of the transcriptional effects of copy number aberrations in glioblastoma,” Mol. Syst. Biol., 7, 486.10.1038/msb.2011.17Search in Google Scholar PubMed PubMed Central
Kendziorski, C. M., M. Chen, M. Yuan, H. Lan and A. D. Attie (2006): “Statistical methods for expression quantitative trait loci (eqtl) mapping,” Biometrics, 62, 19–27.10.1111/j.1541-0420.2005.00437.xSearch in Google Scholar PubMed
Kheirelseid, E., N. Miller, K. Chang, M. Nugent and M. J. Kerin (2013): “Clinical applications of gene expression in colorectal cancer,” J. Gastrointest. Oncol., 4, 144–157.Search in Google Scholar PubMed
Listgarten, J., C. Kadie and D. Heckerman (2010): “Correction for hidden confounders in the genetic analysis of gene expression,” Proc. Natl. Acad. Sci. USA, 107, 16465–16470.10.1073/pnas.1002425107Search in Google Scholar PubMed PubMed Central
Menezes, R. d., M. Boetzer, M. Sieswerda, G. van Ommen and J. M. Boer (2009): “Integrated analysis of dna copy number and gene expression microarray data using gene sets,” BMC Bioinformatics, 10, 203.10.1186/1471-2105-10-203Search in Google Scholar PubMed PubMed Central
Phipson, B. and G. K. Smyth (2010): “Permutation p-values should never be zero: calculating exact p-values when permutations are randomly drawn,” Stat. Appl. Genet. Mol. Biol., 9, 1544–6115.10.2202/1544-6115.1585Search in Google Scholar PubMed
van Iterson, M., S. Bervoets, E. de Meijer, H. Buermans, P. ’t Hoen, R. d. Menezes and J. Boer (2013): “Integrated analysis of microrna and mrna expression: adding biological significance to microrna target predictions,” Nucleic Acids Res., 41, 1–10.10.1093/nar/gkt525Search in Google Scholar PubMed PubMed Central
van Wieringen, W., J. Berkhof and M. van de Wiel (2010): “A random coefficients model for regional co-expression associated with DNA copy number,” Stat. Appl. Genet. Mol. Biol., 9, 1–28.10.2202/1544-6115.1531Search in Google Scholar PubMed
van Wieringen, W. N. and M. A. van de Wiel (2014): “Penalized differential pathway analysis of integrative oncogenomics studies,” Stat. Appl. Genet. Mol. Biol., 13, 141–158.10.1515/sagmb-2013-0020Search in Google Scholar PubMed
Yao, F., C. Zhang, W. Du, C. Liu and Y. Xu (2015): “Identification of gene-expression signatures and protein markers for breast cancer grading and staging,” PLoS One, 10, 1–17.10.1371/journal.pone.0138213Search in Google Scholar PubMed PubMed Central
©2018 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Research Articles
- Assessing genome-wide significance for the detection of differentially methylated regions
- A test for detecting differential indirect trans effects between two groups of samples
- A variable selection approach in the multivariate linear model: an application to LC-MS metabolomics data
Articles in the same Issue
- Research Articles
- Assessing genome-wide significance for the detection of differentially methylated regions
- A test for detecting differential indirect trans effects between two groups of samples
- A variable selection approach in the multivariate linear model: an application to LC-MS metabolomics data