Startseite Mathematik Weighted Euclidean balancing for a matrix exposure in estimating causal effect
Artikel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Weighted Euclidean balancing for a matrix exposure in estimating causal effect

  • Juan Chen und Yingchun Zhou ORCID logo EMAIL logo
Veröffentlicht/Copyright: 23. Mai 2025

Abstract

With the increasing complexity of data, researchers in various fields have become increasingly interested in estimating the causal effect of a matrix exposure, which involves complex multivariate treatments, on an outcome. Balancing covariates for the matrix exposure is essential to achieve this goal. While exact balancing and approximate balancing methods have been proposed for multiple balancing constraints, dealing with a matrix treatment introduces a large number of constraints, making it challenging to achieve exact balance or select suitable threshold parameters for approximate balancing methods. To address this challenge, the weighted Euclidean balancing method is proposed, which offers an approximate balance of covariates from an overall perspective. In this study, both parametric and nonparametric methods for estimating the causal effect of a matrix treatment is proposed, along with providing theoretical properties of the two estimations. To validate the effectiveness of our approach, extensive simulation results demonstrate that the proposed method outperforms alternative approaches across various scenarios. Finally, we apply the method to analyze the causal impact of the omics variables on the drug sensitivity of Vandetanib. The results indicate that EGFR CNV has a significant positive causal effect on Vandetanib efficacy, whereas EGFR methylation exerts a significant negative causal effect.


Corresponding author: Yingchun Zhou, KLATASDS-MOE, School of Statistics, East China Normal University, 3663 North Zhongshan Road, Shanghai, 200062, P.R. China; and Institute of Brain and Education Innovation, East China Normal University, Shanghai, P.R. China, E-mail: 

Funding source: Shanghai “Science and technology Innovation Action Plan” Computational Biology Key Project

Award Identifier / Grant number: 23JS1400500

Award Identifier / Grant number: 23JS1400800

Award Identifier / Grant number: 24ZR1420400

  1. Research ethics: The local Institutional Review Board deemed the study exempt from review.

  2. Informed consent: Not applicable.

  3. Author contributions: The authors have accepted responsibility for the entire content of this manuscript and approved its submission.

  4. Use of Large Language Models, AI and Machine Learning Tools: We used ChatGPT to improvethe language of the manuscript.

  5. Conflict of interest: The authors state no conflict of interest.

  6. Research funding: The research was supported by Natural Science Foundation of Shanghai Municipality (No. 24ZR1420400) and Shanghai “Science and technology Innovation Action Plan” Computational Biology Key Project (No. 23JS1400500, No. 23JS1400800).

  7. Data availability: The data can be obtained upon request from the corresponding author.

References

1. Chan, KCG, Yam, SCP, Zhang, Z. Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting. J Roy Stat Soc B Stat Methodol 2016;78:673. https://doi.org/10.1111/rssb.12129.Suche in Google Scholar PubMed PubMed Central

2. Dong, Y, Lee, Y-Y, Gou, M. Regression discontinuity designs with a continuous treatment. J Am Stat Assoc 2021:1–14. https://doi.org/10.1080/01621459.2021.1923509.Suche in Google Scholar

3. Fong, C, Hazlett, C, Imai, K. Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements. Ann Appl Stat 2018;12:156–77. https://doi.org/10.1214/17-aoas1101.Suche in Google Scholar

4. Hsu, Y-C, Lai, T-C, Lieli, RP. Counterfactual treatment effects: estimation and inference. J Bus Econ Stat 2022;40:240–55. https://doi.org/10.1080/07350015.2020.1800479.Suche in Google Scholar

5. Imai, K, Ratkovic, M. Covariate balancing propensity score. J Roy Stat Soc B 2014;76:243–63. https://doi.org/10.1111/rssb.12027.Suche in Google Scholar

6. Xiong, C, Yu, M, Shao, J. Treatment recommendation and parameter estimation under single-index contrast function. Stat Theor Relat Fields 2017;1:171–81. https://doi.org/10.1080/24754269.2017.1341012.Suche in Google Scholar

7. Yiu, S, Su, L. Covariate association eliminating weights: a unified weighting framework for causal effect estimation. Biometrika 2018;105:709–22. https://doi.org/10.1093/biomet/asy015.Suche in Google Scholar PubMed PubMed Central

8. Zhu, Y, Coffman, DL, Ghosh, D. A boosting algorithm for estimating generalized propensity scores with continuous treatments. J Causal Inference 2015;3:25–40. https://doi.org/10.1515/jci-2014-0022.Suche in Google Scholar PubMed PubMed Central

9. Zubizarreta, JR. Stable weights that balance covariates for estimation with incomplete outcome data. J Am Stat Assoc 2015;110:910–22. https://doi.org/10.1080/01621459.2015.1023805.Suche in Google Scholar

10. Assié, G, Letouzé, E, Fassnacht, M, Jouinot, A, Luscap, W, Barreau, O, et al.. Integrated genomic characterization of adrenocortical carcinoma. Nat Genet 2014;46:607–12. https://doi.org/10.1038/ng.2953.Suche in Google Scholar PubMed

11. Chow, ML, Pramparo, T, Winn, ME, Barnes, CC, Li, H-R, Weiss, L, et al.. Age-dependent brain gene expression and copy number anomalies in autism suggest distinct pathological processes at young versus mature ages. PLoS Genet 2012;8:e1002592. https://doi.org/10.1371/journal.pgen.1002592.Suche in Google Scholar PubMed PubMed Central

12. Seoane, JA, Day, INM, Gaunt, TR, Campbell, C. A pathway-based data integration framework for prediction of disease progression. Bioinformatics 2014;30:838–45. https://doi.org/10.1093/bioinformatics/btt610.Suche in Google Scholar PubMed PubMed Central

13. Imai, K, Van Dyk, DA. Causal inference with general treatment regimes: generalizing the propensity score. J Am Stat Assoc 2004;99:854–66. https://doi.org/10.1198/016214504000001187.Suche in Google Scholar

14. Guido, WI. The role of the propensity score in estimating dose-response functions. Biometrika 2000;87:706–10. https://doi.org/10.1093/biomet/87.3.706.Suche in Google Scholar

15. Paul, RR, Rubin Donald, B. The central role of the propensity score in observational studies for causal effects. Biometrika 1983;70:41–55. https://doi.org/10.1093/biomet/70.1.41.Suche in Google Scholar

16. Lunceford, JK, Davidian, M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med 2004;23:2937–60. https://doi.org/10.1002/sim.1903.Suche in Google Scholar PubMed

17. Rubin, DB. Matched sampling for causal effects. Cambridge: Cambridge University Press; 2006.10.1017/CBO9780511810725Suche in Google Scholar

18. Hainmueller, J. Entropy balancing for causal effects: a multivariate reweighting method to produce balanced samples in observational studies. Polit Anal 2012;20:25–46. https://doi.org/10.1093/pan/mpr025.Suche in Google Scholar

19. Vegetabile, BG, Gillen, DL, Stern, HS. Optimally balanced Gaussian process propensity scores for estimating treatment effects. J Roy Stat Soc: Series A 2020;183:355–77. https://doi.org/10.1111/rssa.12502.Suche in Google Scholar PubMed PubMed Central

20. Robbins, MW, Ann Griffin, B, Shih, RA, Slaughter, ME. Robust estimation of the causal effect of time-varying neighborhood factors on health outcomes. Stat Med 2020;39:544–61. https://doi.org/10.1002/sim.8423.Suche in Google Scholar PubMed PubMed Central

21. DeYoreo, M, Kottas, A. Bayesian nonparametric modeling for multivariate ordinal regression. J Comput Graph Stat 2018;27:71–84. https://doi.org/10.1080/10618600.2017.1316280.Suche in Google Scholar

22. Hu, W, Pan, T, Kong, D, Shen, W. Nonparametric matrix response regression with application to brain imaging data analysis. Biometrics 2021;77:1227–40. https://doi.org/10.1111/biom.13362.Suche in Google Scholar PubMed

23. Zhao, Q, Percival, D. Entropy balancing is doubly robust. J Causal Inference 2017;5. https://doi.org/10.1515/jci-2016-0010.Suche in Google Scholar

24. Athey, S, Imbens, GW, Wager, S. Efficient inference of average treatment effects in high dimensions via approximate residual balancing. Technical report 2016.Suche in Google Scholar

25. Wang, Y, Zubizarreta, JR. Minimal dispersion approximately balancing weights: asymptotic properties and practical considerations. Biometrika 2020;107:93–105.10.1093/biomet/asz050Suche in Google Scholar

26. Siegel, RL, Miller, KD, Jemal, A. Ca: a cancer journal for clinicians. Cancer Stat 2019;69:7–34. https://doi.org/10.3322/caac.21551.Suche in Google Scholar PubMed

27. Murtuza, A, Bulbul, A, Shen, JP, Keshavarzian, P, Woodward, BD, Lopez-Diaz, FJ, et al.. Novel third-generation EGFR tyrosine kinase inhibitors and strategies to overcome therapeutic resistance in lung cancer. Cancer Res 2019;79:689–98. https://doi.org/10.1158/0008-5472.can-18-1281.Suche in Google Scholar

28. Rolfo, C, Passiglia, F, Ostrowski, M, Farracho, L, Ondøichova, T, Dolcan, A, et al.. Improvement in lung cancer outcomes with targeted therapies: an update for family physicians. J Am Board Fam Med 2015;28:124–33. https://doi.org/10.3122/jabfm.2015.01.140072.Suche in Google Scholar PubMed

29. Lu, T-P, Chuang, EY, Chen, JJ. Identification of reproducible gene expression signatures in lung adenocarcinoma. BMC Bioinf 2013;14:1–10. https://doi.org/10.1186/1471-2105-14-371.Suche in Google Scholar PubMed PubMed Central

30. Chang, S-M, Yang, M, Lu, W, Huang, Y-J, Huang, Y, Hung, H, et al.. Gene-set integrative analysis of multi-omics data using tensor-based association test. Bioinformatics 2021;37:2259–65. https://doi.org/10.1093/bioinformatics/btab125.Suche in Google Scholar PubMed PubMed Central

31. Barretina, J, Caponigro, G, Stransky, N, Venkatesan, K, Margolin, AA, Kim, S, et al.. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 2012;483:603–7. https://doi.org/10.1038/nature11003.Suche in Google Scholar PubMed PubMed Central

32. Hirano, K, Imbens, GW. The propensity score with continuous treatments. Appl Bayesian Model Causal Inference Incomplete-data Perspect 2004;226164:73–84. https://doi.org/10.1002/0470090456.ch7.Suche in Google Scholar

33. Li, X, Xu, D, Zhou, H, Li, L. Tucker tensor regression and neuroimaging analysis. Stat Biosci 2018;10:520–45. https://doi.org/10.1007/s12561-018-9215-6.Suche in Google Scholar PubMed PubMed Central

34. Zhou, Y, Wong, RKW, He, K. Broadcasted nonparametric tensor regression. arXiv:2008.12927, 2020.Suche in Google Scholar

35. Imbens, G, Johnson, PM, Spady, RH. Information theoretic approaches to inference in moment condition models. NBER Technical Working Papers 0186. Cambridge, MA: National Bureau of Economic Research; 1995.10.3386/t0186Suche in Google Scholar

36. Kitamura, Y, Stutzer, M. An information-theoretic alternative to generalized method of moments estimation. Econometrica: J Econom Soc 1997:861–74. https://doi.org/10.2307/2171942.Suche in Google Scholar

37. Ai, C, Oliver, L, Motegi, K, Zhang, Z. A unified framework for efficient estimation of general treatment models. Quant Econ 2021;12:779–816. https://doi.org/10.3982/qe1494.Suche in Google Scholar

38. Ruppert, D, Wand, MP, Carroll, RJ. Semiparametric regression. Cambridge: Cambridge University Press; 2003, vol XII.10.1017/CBO9780511755453Suche in Google Scholar

39. Newey, WK. Convergence rates and asymptotic normality for series estimators. J Econom 1997;79:147–68. https://doi.org/10.1016/s0304-4076(97)00011-0.Suche in Google Scholar

40. Chen, J, Zhou, Y. Causal effect estimation for multivariate continuous treatments. arXiv:2205.08730 2022.10.1002/bimj.202200122Suche in Google Scholar PubMed

41. Morabito, A, Piccirillo, MC, Falasconi, F, De Feo, G, Del Giudice, A, Bryce, J, et al.. Vandetanib (zd6474), a dual inhibitor of vascular endothelial growth factor receptor (VEGFR) and epidermal growth factor receptor (EGFR) tyrosine kinases: current status and future directions. Oncologist 2009;14:378–90. https://doi.org/10.1634/theoncologist.2008-0261.Suche in Google Scholar PubMed

42. Zhou, Y, Li, N, Ma, G, Su, N, Liu, J, Yue, W, et al.. Mining and analysis of adverse event signals of vandetanib based on the FAERS database. Expet Opin Drug Saf 2024. (just-accepted). https://doi.org/10.1080/14740338.2024.2418326.Suche in Google Scholar PubMed

43. Schütte, M, Risch, T, Abdavi-Azar, N, Boehnke, K, Schumacher, D, Keil, M, et al.. Molecular dissection of colorectal cancer in pre-clinical models identifies biomarkers predicting sensitivity to EGFR inhibitors. Nat Commun 2017;8:14262. https://doi.org/10.1038/ncomms14262.Suche in Google Scholar PubMed PubMed Central

44. Pan, ZY, Jiang, ZS, Ouyang, HQ. Study of the methylation patterns of the EGFR gene promoter in non-small cell lung cancer. Genet Mol Res 2015;14:9813–20. https://doi.org/10.4238/2015.august.19.14.Suche in Google Scholar

45. Kris, MG, Natale, RB, Herbst, RS, Lynch, TJJr, Prager, D, Belani, CP, et al.. Efficacy of gefitinib, an inhibitor of the epidermal growth factor receptor tyrosine kinase, in symptomatic patients with non–small cell lung cancer: a randomized trial. JAMA 2003;290:2149–58. https://doi.org/10.1001/jama.290.16.2149.Suche in Google Scholar PubMed


Supplementary Material

This article contains supplementary material (https://doi.org/10.1515/ijb-2024-0021).


Received: 2024-02-26
Accepted: 2025-04-07
Published Online: 2025-05-23

© 2025 Walter de Gruyter GmbH, Berlin/Boston

Heruntergeladen am 29.12.2025 von https://www.degruyterbrill.com/document/doi/10.1515/ijb-2024-0021/pdf
Button zum nach oben scrollen