Abstract
With the increasing complexity of data, researchers in various fields have become increasingly interested in estimating the causal effect of a matrix exposure, which involves complex multivariate treatments, on an outcome. Balancing covariates for the matrix exposure is essential to achieve this goal. While exact balancing and approximate balancing methods have been proposed for multiple balancing constraints, dealing with a matrix treatment introduces a large number of constraints, making it challenging to achieve exact balance or select suitable threshold parameters for approximate balancing methods. To address this challenge, the weighted Euclidean balancing method is proposed, which offers an approximate balance of covariates from an overall perspective. In this study, both parametric and nonparametric methods for estimating the causal effect of a matrix treatment is proposed, along with providing theoretical properties of the two estimations. To validate the effectiveness of our approach, extensive simulation results demonstrate that the proposed method outperforms alternative approaches across various scenarios. Finally, we apply the method to analyze the causal impact of the omics variables on the drug sensitivity of Vandetanib. The results indicate that EGFR CNV has a significant positive causal effect on Vandetanib efficacy, whereas EGFR methylation exerts a significant negative causal effect.
Funding source: Shanghai “Science and technology Innovation Action Plan” Computational Biology Key Project
Award Identifier / Grant number: 23JS1400500
Award Identifier / Grant number: 23JS1400800
Funding source: Natural Science Foundation of Shanghai Municipality
Award Identifier / Grant number: 24ZR1420400
-
Research ethics: The local Institutional Review Board deemed the study exempt from review.
-
Informed consent: Not applicable.
-
Author contributions: The authors have accepted responsibility for the entire content of this manuscript and approved its submission.
-
Use of Large Language Models, AI and Machine Learning Tools: We used ChatGPT to improvethe language of the manuscript.
-
Conflict of interest: The authors state no conflict of interest.
-
Research funding: The research was supported by Natural Science Foundation of Shanghai Municipality (No. 24ZR1420400) and Shanghai “Science and technology Innovation Action Plan” Computational Biology Key Project (No. 23JS1400500, No. 23JS1400800).
-
Data availability: The data can be obtained upon request from the corresponding author.
References
1. Chan, KCG, Yam, SCP, Zhang, Z. Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting. J Roy Stat Soc B Stat Methodol 2016;78:673. https://doi.org/10.1111/rssb.12129.Suche in Google Scholar PubMed PubMed Central
2. Dong, Y, Lee, Y-Y, Gou, M. Regression discontinuity designs with a continuous treatment. J Am Stat Assoc 2021:1–14. https://doi.org/10.1080/01621459.2021.1923509.Suche in Google Scholar
3. Fong, C, Hazlett, C, Imai, K. Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements. Ann Appl Stat 2018;12:156–77. https://doi.org/10.1214/17-aoas1101.Suche in Google Scholar
4. Hsu, Y-C, Lai, T-C, Lieli, RP. Counterfactual treatment effects: estimation and inference. J Bus Econ Stat 2022;40:240–55. https://doi.org/10.1080/07350015.2020.1800479.Suche in Google Scholar
5. Imai, K, Ratkovic, M. Covariate balancing propensity score. J Roy Stat Soc B 2014;76:243–63. https://doi.org/10.1111/rssb.12027.Suche in Google Scholar
6. Xiong, C, Yu, M, Shao, J. Treatment recommendation and parameter estimation under single-index contrast function. Stat Theor Relat Fields 2017;1:171–81. https://doi.org/10.1080/24754269.2017.1341012.Suche in Google Scholar
7. Yiu, S, Su, L. Covariate association eliminating weights: a unified weighting framework for causal effect estimation. Biometrika 2018;105:709–22. https://doi.org/10.1093/biomet/asy015.Suche in Google Scholar PubMed PubMed Central
8. Zhu, Y, Coffman, DL, Ghosh, D. A boosting algorithm for estimating generalized propensity scores with continuous treatments. J Causal Inference 2015;3:25–40. https://doi.org/10.1515/jci-2014-0022.Suche in Google Scholar PubMed PubMed Central
9. Zubizarreta, JR. Stable weights that balance covariates for estimation with incomplete outcome data. J Am Stat Assoc 2015;110:910–22. https://doi.org/10.1080/01621459.2015.1023805.Suche in Google Scholar
10. Assié, G, Letouzé, E, Fassnacht, M, Jouinot, A, Luscap, W, Barreau, O, et al.. Integrated genomic characterization of adrenocortical carcinoma. Nat Genet 2014;46:607–12. https://doi.org/10.1038/ng.2953.Suche in Google Scholar PubMed
11. Chow, ML, Pramparo, T, Winn, ME, Barnes, CC, Li, H-R, Weiss, L, et al.. Age-dependent brain gene expression and copy number anomalies in autism suggest distinct pathological processes at young versus mature ages. PLoS Genet 2012;8:e1002592. https://doi.org/10.1371/journal.pgen.1002592.Suche in Google Scholar PubMed PubMed Central
12. Seoane, JA, Day, INM, Gaunt, TR, Campbell, C. A pathway-based data integration framework for prediction of disease progression. Bioinformatics 2014;30:838–45. https://doi.org/10.1093/bioinformatics/btt610.Suche in Google Scholar PubMed PubMed Central
13. Imai, K, Van Dyk, DA. Causal inference with general treatment regimes: generalizing the propensity score. J Am Stat Assoc 2004;99:854–66. https://doi.org/10.1198/016214504000001187.Suche in Google Scholar
14. Guido, WI. The role of the propensity score in estimating dose-response functions. Biometrika 2000;87:706–10. https://doi.org/10.1093/biomet/87.3.706.Suche in Google Scholar
15. Paul, RR, Rubin Donald, B. The central role of the propensity score in observational studies for causal effects. Biometrika 1983;70:41–55. https://doi.org/10.1093/biomet/70.1.41.Suche in Google Scholar
16. Lunceford, JK, Davidian, M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med 2004;23:2937–60. https://doi.org/10.1002/sim.1903.Suche in Google Scholar PubMed
17. Rubin, DB. Matched sampling for causal effects. Cambridge: Cambridge University Press; 2006.10.1017/CBO9780511810725Suche in Google Scholar
18. Hainmueller, J. Entropy balancing for causal effects: a multivariate reweighting method to produce balanced samples in observational studies. Polit Anal 2012;20:25–46. https://doi.org/10.1093/pan/mpr025.Suche in Google Scholar
19. Vegetabile, BG, Gillen, DL, Stern, HS. Optimally balanced Gaussian process propensity scores for estimating treatment effects. J Roy Stat Soc: Series A 2020;183:355–77. https://doi.org/10.1111/rssa.12502.Suche in Google Scholar PubMed PubMed Central
20. Robbins, MW, Ann Griffin, B, Shih, RA, Slaughter, ME. Robust estimation of the causal effect of time-varying neighborhood factors on health outcomes. Stat Med 2020;39:544–61. https://doi.org/10.1002/sim.8423.Suche in Google Scholar PubMed PubMed Central
21. DeYoreo, M, Kottas, A. Bayesian nonparametric modeling for multivariate ordinal regression. J Comput Graph Stat 2018;27:71–84. https://doi.org/10.1080/10618600.2017.1316280.Suche in Google Scholar
22. Hu, W, Pan, T, Kong, D, Shen, W. Nonparametric matrix response regression with application to brain imaging data analysis. Biometrics 2021;77:1227–40. https://doi.org/10.1111/biom.13362.Suche in Google Scholar PubMed
23. Zhao, Q, Percival, D. Entropy balancing is doubly robust. J Causal Inference 2017;5. https://doi.org/10.1515/jci-2016-0010.Suche in Google Scholar
24. Athey, S, Imbens, GW, Wager, S. Efficient inference of average treatment effects in high dimensions via approximate residual balancing. Technical report 2016.Suche in Google Scholar
25. Wang, Y, Zubizarreta, JR. Minimal dispersion approximately balancing weights: asymptotic properties and practical considerations. Biometrika 2020;107:93–105.10.1093/biomet/asz050Suche in Google Scholar
26. Siegel, RL, Miller, KD, Jemal, A. Ca: a cancer journal for clinicians. Cancer Stat 2019;69:7–34. https://doi.org/10.3322/caac.21551.Suche in Google Scholar PubMed
27. Murtuza, A, Bulbul, A, Shen, JP, Keshavarzian, P, Woodward, BD, Lopez-Diaz, FJ, et al.. Novel third-generation EGFR tyrosine kinase inhibitors and strategies to overcome therapeutic resistance in lung cancer. Cancer Res 2019;79:689–98. https://doi.org/10.1158/0008-5472.can-18-1281.Suche in Google Scholar
28. Rolfo, C, Passiglia, F, Ostrowski, M, Farracho, L, Ondøichova, T, Dolcan, A, et al.. Improvement in lung cancer outcomes with targeted therapies: an update for family physicians. J Am Board Fam Med 2015;28:124–33. https://doi.org/10.3122/jabfm.2015.01.140072.Suche in Google Scholar PubMed
29. Lu, T-P, Chuang, EY, Chen, JJ. Identification of reproducible gene expression signatures in lung adenocarcinoma. BMC Bioinf 2013;14:1–10. https://doi.org/10.1186/1471-2105-14-371.Suche in Google Scholar PubMed PubMed Central
30. Chang, S-M, Yang, M, Lu, W, Huang, Y-J, Huang, Y, Hung, H, et al.. Gene-set integrative analysis of multi-omics data using tensor-based association test. Bioinformatics 2021;37:2259–65. https://doi.org/10.1093/bioinformatics/btab125.Suche in Google Scholar PubMed PubMed Central
31. Barretina, J, Caponigro, G, Stransky, N, Venkatesan, K, Margolin, AA, Kim, S, et al.. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 2012;483:603–7. https://doi.org/10.1038/nature11003.Suche in Google Scholar PubMed PubMed Central
32. Hirano, K, Imbens, GW. The propensity score with continuous treatments. Appl Bayesian Model Causal Inference Incomplete-data Perspect 2004;226164:73–84. https://doi.org/10.1002/0470090456.ch7.Suche in Google Scholar
33. Li, X, Xu, D, Zhou, H, Li, L. Tucker tensor regression and neuroimaging analysis. Stat Biosci 2018;10:520–45. https://doi.org/10.1007/s12561-018-9215-6.Suche in Google Scholar PubMed PubMed Central
34. Zhou, Y, Wong, RKW, He, K. Broadcasted nonparametric tensor regression. arXiv:2008.12927, 2020.Suche in Google Scholar
35. Imbens, G, Johnson, PM, Spady, RH. Information theoretic approaches to inference in moment condition models. NBER Technical Working Papers 0186. Cambridge, MA: National Bureau of Economic Research; 1995.10.3386/t0186Suche in Google Scholar
36. Kitamura, Y, Stutzer, M. An information-theoretic alternative to generalized method of moments estimation. Econometrica: J Econom Soc 1997:861–74. https://doi.org/10.2307/2171942.Suche in Google Scholar
37. Ai, C, Oliver, L, Motegi, K, Zhang, Z. A unified framework for efficient estimation of general treatment models. Quant Econ 2021;12:779–816. https://doi.org/10.3982/qe1494.Suche in Google Scholar
38. Ruppert, D, Wand, MP, Carroll, RJ. Semiparametric regression. Cambridge: Cambridge University Press; 2003, vol XII.10.1017/CBO9780511755453Suche in Google Scholar
39. Newey, WK. Convergence rates and asymptotic normality for series estimators. J Econom 1997;79:147–68. https://doi.org/10.1016/s0304-4076(97)00011-0.Suche in Google Scholar
40. Chen, J, Zhou, Y. Causal effect estimation for multivariate continuous treatments. arXiv:2205.08730 2022.10.1002/bimj.202200122Suche in Google Scholar PubMed
41. Morabito, A, Piccirillo, MC, Falasconi, F, De Feo, G, Del Giudice, A, Bryce, J, et al.. Vandetanib (zd6474), a dual inhibitor of vascular endothelial growth factor receptor (VEGFR) and epidermal growth factor receptor (EGFR) tyrosine kinases: current status and future directions. Oncologist 2009;14:378–90. https://doi.org/10.1634/theoncologist.2008-0261.Suche in Google Scholar PubMed
42. Zhou, Y, Li, N, Ma, G, Su, N, Liu, J, Yue, W, et al.. Mining and analysis of adverse event signals of vandetanib based on the FAERS database. Expet Opin Drug Saf 2024. (just-accepted). https://doi.org/10.1080/14740338.2024.2418326.Suche in Google Scholar PubMed
43. Schütte, M, Risch, T, Abdavi-Azar, N, Boehnke, K, Schumacher, D, Keil, M, et al.. Molecular dissection of colorectal cancer in pre-clinical models identifies biomarkers predicting sensitivity to EGFR inhibitors. Nat Commun 2017;8:14262. https://doi.org/10.1038/ncomms14262.Suche in Google Scholar PubMed PubMed Central
44. Pan, ZY, Jiang, ZS, Ouyang, HQ. Study of the methylation patterns of the EGFR gene promoter in non-small cell lung cancer. Genet Mol Res 2015;14:9813–20. https://doi.org/10.4238/2015.august.19.14.Suche in Google Scholar
45. Kris, MG, Natale, RB, Herbst, RS, Lynch, TJJr, Prager, D, Belani, CP, et al.. Efficacy of gefitinib, an inhibitor of the epidermal growth factor receptor tyrosine kinase, in symptomatic patients with non–small cell lung cancer: a randomized trial. JAMA 2003;290:2149–58. https://doi.org/10.1001/jama.290.16.2149.Suche in Google Scholar PubMed
Supplementary Material
This article contains supplementary material (https://doi.org/10.1515/ijb-2024-0021).
© 2025 Walter de Gruyter GmbH, Berlin/Boston
Artikel in diesem Heft
- Frontmatter
- Research Articles
- Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials
- Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design
- A review of survival stacking: a method to cast survival regression analysis as a classification problem
- DsubCox: a fast subsampling algorithm for Cox model with distributed and massive survival data
- A hybrid hazard-based model using two-piece distributions
- Regression analysis of clustered current status data with informative cluster size under a transformed survival model
- Bayesian covariance regression in functional data analysis with applications to functional brain imaging
- Risk estimation and boundary detection in Bayesian disease mapping
- An improved estimator of the logarithmic odds ratio for small sample sizes using a Bayesian approach
- Short Communication
- A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles
- Research Articles
- Guidance on individualized treatment rule estimation in high dimensions
- Weighted Euclidean balancing for a matrix exposure in estimating causal effect
- Penalized regression splines in Mixture Density Networks
Artikel in diesem Heft
- Frontmatter
- Research Articles
- Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials
- Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design
- A review of survival stacking: a method to cast survival regression analysis as a classification problem
- DsubCox: a fast subsampling algorithm for Cox model with distributed and massive survival data
- A hybrid hazard-based model using two-piece distributions
- Regression analysis of clustered current status data with informative cluster size under a transformed survival model
- Bayesian covariance regression in functional data analysis with applications to functional brain imaging
- Risk estimation and boundary detection in Bayesian disease mapping
- An improved estimator of the logarithmic odds ratio for small sample sizes using a Bayesian approach
- Short Communication
- A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles
- Research Articles
- Guidance on individualized treatment rule estimation in high dimensions
- Weighted Euclidean balancing for a matrix exposure in estimating causal effect
- Penalized regression splines in Mixture Density Networks