Estimating population-averaged hazard ratios in the presence of unmeasured confounding

Pablo Martínez-Camblor; Todd A. MacKenzie; A. James O’Malley

doi:10.1515/ijb-2021-0096

Enjoy 40% off

academic books on De Gruyter Brill *

Article

Estimating population-averaged hazard ratios in the presence of unmeasured confounding

Pablo Martínez-Camblor , Todd A. MacKenzie and A. James O’Malley

Published/Copyright: March 23, 2022

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal The International Journal of Biostatistics Volume 19 Issue 1

Abstract

The Cox regression model and its associated hazard ratio (HR) are frequently used for summarizing the effect of treatments on time to event outcomes. However, the HR’s interpretation strongly depends on the assumed underlying survival model. The challenge of interpreting the HR has been the focus of a number of recent papers. Several alternative measures have been proposed in order to deal with these concerns. The marginal Cox regression models include an identifiable hazard ratio without individual but populational causal interpretation. In this work, we study the properties of one particular marginal Cox regression model and consider its estimation in the presence of omitted confounder from an instrumental variable-based procedure. We prove the large sample consistency of an estimation score which allows non-binary treatments. Our Monte Carlo simulations suggest that finite sample behavior of the procedure is adequate. The studied estimator is more robust than its competitor (Wang et al.) for weak instruments although it is slightly more biased for large effects of the treatment. The practical use of the presented techniques is illustrated through a real practical example using data from the vascular quality initiative registry. The used R code is provided as Supplementary material.

Keywords: causal effect; Cox regression model; instrumental variable; mis-specified models; omitted covariates; population-averaged hazard ratio

Corresponding author: Pablo Martínez-Camblor, Department of Anesthesiology, Dartmouth-Hitchcock Medical Center, 7 Lebanon Street, Suite 309, Hinman Box 7261, Lebanon, NH 03751, USA; and Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH, USA, E-mail: Pablo.Martinez-Camblor@hitchcock.org

Funding source: Asturies Government

Award Identifier / Grant number: GRUPIN AYUD/2021/50897

Acknowledgements

The authors are grateful with Prof. Linbo Wang for sharing his code with us and with Dr. Jesse Columbo and Phillip Goodney for providing the data for real-world example.

Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
Research funding: First author is partially supported by the Grant GRUPIN AYUD/2021/50897 from the Asturies Goverment.
Conflict of interest statement: The authors have no conflicts of interest to report.

Appendix A

A.1 Results’ proof

Proof of Theorem 1

Under the stated assumptions, we know (Truthers and Kalbfleisch [36]) that the solution to U n X ( ⋅ ) is a consistent estimator for the solution to

U T X ( β ) = ∫ E X , U x ⋅ λ x ( t , u ) ⋅ S x ( t , u ) ⋅ G x ( t ) ⋅ E X , U e β ⋅ x ⋅ S x ( t , u ) ⋅ G x ( t ) d t − ∫ E X , U λ x ( t , u ) ⋅ S x ( t , u ) ⋅ G x ( t ) ⋅ E X , U x ⋅ e β ⋅ x ⋅ S x ( t , u ) ⋅ G x ( t ) d t ,

where G x ( ⋅ ) = P { C > t | X = x } . From Eq. (4) (and the Fubini’s theorem) we have

∂ ∂ t log E U e − Λ x ( t ; u ) = exp { β X ⋅ x } ⋅ log E U e − Λ 0 ( t ; u ) ,

and therefore

E U { λ x ( t ; u ) ⋅ S x ( t , u ) } = exp { β X ⋅ x } ⋅ E U { λ 0 ( t ; u ) ⋅ S 0 ( t , u ) } E U { S 0 ( t , u ) } ⋅ E U { S x ( t , u ) } = κ U ( t ) ⋅ e β X ⋅ x ⋅ E U { S x ( t , u ) } .

Then the independence between X and U implies

U T X ( β ) = ∫ κ U ( t ) ⋅ E X x ⋅ e β X ⋅ x ⋅ G x ( t ) ⋅ E U { S x ( t , u ) } ⋅ E X e β ⋅ x ⋅ G x ( t ) ⋅ E U { S x ( t , u ) } d t − ∫ κ U ( t ) ⋅ E X e β X ⋅ x ⋅ G x ( t ) ⋅ E U { S x ( t , u ) } ⋅ E X x ⋅ e β ⋅ x ⋅ G x ( t ) ⋅ E U { S x ( t , u ) } d t ,

which has a unique solution at β = β _X.□

Proof of Theorem 2

From Assumption 1 (W ╨ T|X) we have that, for β _W = 0, the true survival function satisfies

(11) S x , w ( t ) = E U P { T > t | X = x , W = w , U = u } = E U P { T > t | X = 0 , W = 0 , U = u } exp { β X ⋅ x + β W ⋅ w } = S 0,0 ( t ) exp { β X ⋅ x + β W ⋅ w } .

The maximum partial-likelihood estimator of the parameter β _X = (β _X, β _W) is based on the maximization of the function,

ℓ ( β ) = ∑ i = 1 n ∫ log { λ x i ( t , u i , w i ; β ) ⋅ Y i ( t ) } d N i ( t ) − ∫ log ∑ i = 1 n E U { λ x i ( t , u , w ; β ) ⋅ Y i ( t ) } d ∑ i = 1 n N i ( t ) ,

where β = (β ₁, β ₂). Then, β _X is a solution to the partial derivative equation of E X , W { ℓ ( β ) } . From Eq. (11) and the Assumption 2 (W ╨ U|X), we have that β _X is a solution for

0 = E X , W ∂ ℓ ( β ) ∂ β 2 = E X , W ∑ i = 1 n ∫ 0 ∞ w i − ∑ i = 1 n w i ⋅ Y i ( s ) ⋅ exp { β 1 ⋅ x i + β 2 ⋅ w i } ∑ i = 1 n Y i ( s ) ⋅ exp { β ⋅ x i + β 2 ⋅ w i } d N i ( s )

Assumption 1 (W ╨ T|X) guarantees that β _W = 0 and therefore E W , X U n W ( β X ) = 0 . In addition, we have that

E X , W ∂ U n W ( β ) ∂ β = E X , W ∫ 0 1 ∑ i = 1 n x i ⋅ Y i ( s ) ⋅ exp { β ⋅ x i } ⋅ ∑ i = 1 n w i ⋅ Y i ( s ) ⋅ exp { β ⋅ x i } ∑ i = 1 n Y i ( s ) ⋅ exp { β ⋅ x i } 2 d ∑ i = 1 n N i ( s ) − E X , W ∫ 0 1 ∑ i = 1 n w i ⋅ x i ⋅ Y i ( s ) ⋅ exp { β ⋅ x i } ⋅ ∑ i = 1 n Y i ( s ) ⋅ exp { β ⋅ x i } ∑ i = 1 n Y i ( s ) ⋅ exp { β ⋅ x i } 2 d ∑ i = 1 n N i ( s ) = E X , W ∫ 0 1 ∑ j = 1 n ∑ i = 1 n ( x i ⋅ w j − x i ⋅ w i ) ⋅ Y i ( s ) Y j ( s ) ⋅ exp { β ⋅ ( x i + x j ) } ∑ i = 1 n Y i ( s ) ⋅ exp { β ⋅ x i } 2 d ∑ i = 1 n N i ( s )

The Cauchy–Schwartz inequality and Assumption 3 W / ╨ X guarantee that this is a non-zero function with constant sign and hence, E W , X U n W ( ⋅ ) has one unique zero reached at β _X.□

Proof of Theorem 3

Asymptotic normality of β _X is directly derived from M-statistics theory (see, for instance, van der Vaart [37]). From Theorem 2 and the Taylor expansion, we have that

n ⋅ β n * − β X = − n ⋅ U n W ( β X ) ∂ U n W ( β X ) ∂ β + 1 2 ∂ 2 U n W ( β ̄ n ) ∂ β 2 β n * − β X ,

where β ̄ n is a point between β _X and β n * . From Theorem 2, the central limit theorem and the Slutsky lemma, we have that n ⋅ U n W ( β X ) is asymptotically normal with mean zero and variance

V n ⋅ U n W ( β X ) = ∑ i = 1 n ∫ 0 ∞ w i − S n ( 1 ) ( W , β X , s ) S n ( 0 ) ( W , β X , s ) 2 d N i ( s ) .

Theorem 2 also implies that β n * − β X = o P ( 1 ) . Therefore, the variance of n ⋅ β n * − β X is

V n ⋅ β n * − β X = ∂ U n W ( β X ) ∂ β − 2 ⋅ V n ⋅ U n W ( β X ) ,

and the proof is concluded.□

A.2 Monte Carlo simulations scenario

Now, we will prove that the scenario considered in the Monte Carlo simulations section satisfies the IIC model. That is, we will prove that it fullflls Eq. (4). We have that, for each s ≥ 0,

S 0 ( s ) = P { − log { 1 − γ 4,1 ( u + t ) } > s } = P 1 − γ 4,1 ( u + t ) ≤ e − s ,

where u and t are independent random variables following an exponential (with mean 1) and a gamma (with parameters 3 and 1) distributions, respectively. That is, u + t follows a gamma distribution with parameters 4 and 1. Therefore ξ = 1 − γ _4,1(u + t) is an uniformly distributed variable in [0, 1] and

S 0 ( s ) = P { ξ ≤ e − s } = e − s .

Besides,

S 1 ( s ) = P { − log { 1 − γ 4,1 ( u + t ) } > s ⋅ H R X } = P ξ ≤ e − s ⋅ H R X = S 0 ( s ) H R X .

□

A.3 Wang et al. estimator

Let { ( x i , z i , δ i , q i , w i ) } i = 1 n be an iid random sample containing the treatment, the observed event time, the observed status (failure versus censoring), the measured covariates and the IV (now assumed to be binary), respectively. Wang et al. [15] propose to estimate β by solving for β the equation

∑ i = 1 n δ i ⋅ ω ̂ ( w i , q i ) ⋅ x i − ∑ j = 1 n x j ⋅ e β ⋅ x j I ( z j ≥ z i ) ⋅ ω ̂ ( w i , q i ) ∑ j = 1 n e β ⋅ x j I ( z j ≥ z i ) ⋅ ω ̂ ( w i , q i ) ,

where ω ̂ ( w i , q i ) = h ( x i ) ⋅ ( 2 w i − 1 ) / f ( w i | q i : η ̂ ) ⋅ δ X ( q i ; γ ̂ ) , with h(⋅) any function of X such that the above equation is well-defined. There are different procedures for the estimation of the parameters of the density, f(W|Q), and the conditional risk difference, δ X ( Q ; γ ̂ ) , functions. We refer to Wang et al. [15] for specific details about the procedure.

References

1. Cox, DR. Regression models and life-tables. J Roy Stat Soc B 1972;34:187–220. https://doi.org/10.1111/j.2517-6161.1972.tb00899.x.Search in Google Scholar

2. Aalen, OO, Cook, RJ, Røysland, K. Does Cox analysis of a randomized survival study yield a causal treatment effect? Lifetime Data Anal 2015;21:579–93. https://doi.org/10.1007/s10985-015-9335-y.Search in Google Scholar PubMed

3. Martinussen, T, Vansteelandt, S. On collapsibility and confounding bias in Cox and Aalen regression models. Lifetime Data Anal 2013;19:279–96. https://doi.org/10.1007/s10985-013-9242-z.Search in Google Scholar PubMed

4. Martinussen, T, Vansteelandt, S, Andersen, PK. Subtleties in the interpretation of hazard contrasts. Lifetime Data Anal 2020;26:833–55. https://doi.org/10.1007/s10985-020-09501-5.Search in Google Scholar PubMed

5. Hernán, MA, Robins, JM. Instruments for causal inference: an epidemioligist’s dream? Epidemiology 2006;17:360–72. https://doi.org/10.1097/01.ede.0000222409.00878.37.Search in Google Scholar PubMed

6. Angrist, JD, Imbens, GW, Rubin, DB. Identification of causal effects using instrumental variables. J Am Stat Assoc 1996;91:444–55. https://doi.org/10.1080/01621459.1996.10476902.Search in Google Scholar

7. Pearl, J. Causality: models, reasoning, and inference. New York, NY: Cambridge University Press; 2000.Search in Google Scholar

8. Tan, Z. Regression and weighting methods for causal inference using instrumental variables. J Am Stat Assoc 2006;101:1607–18. https://doi.org/10.1198/016214505000001366.Search in Google Scholar

9. Wang, L, Tchetgen Tchetgen, E. Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables. J Roy Stat Soc B 2018;80:531–50. https://doi.org/10.1111/rssb.12262.Search in Google Scholar PubMed PubMed Central

10. Robins, JM, Tsiatis, AA. Correcting for non-compliance in randomized trials using rank preserving structural failure time models. Commun Stat Theor Methods 1991;20:2609–31. https://doi.org/10.1080/03610929108830654.Search in Google Scholar

11. Martínez-Camblor, P, MacKenzie, TA, Staiger, DO, Goodney, P P, O’Malley, AJ. Adjusting for bias introduced by instrumental variable estimation in the Cox proportional hazards model. Biostatistics 2019;20:80–96. https://doi.org/10.1093/biostatistics/kxx062.Search in Google Scholar PubMed

12. Wienke, A. Frailty models in survival analysis. Florida: Chapman & Hall/CRC Biostatistics Series; 2010.10.1201/9781420073911Search in Google Scholar

13. Martínez-Camblor, P, MacKenzie, TA, Staiger, DO, Goodney, PP, O’Malley, AJ. An instrumental variable procedure for estimating Cox models with non-proportional hazards in the presence of unmeasured confounding. J Roy Stat Soc C 2019;68:985–1005. https://doi.org/10.1111/rssc.12341.Search in Google Scholar

14. MacKenzie, TA, Tosteson, TD, Morden, NE, Stukel, TA, O’Malley, AJ. Using instrumental variables to estimate a Cox’s proportional hazards regression subject to additive confounding. Health Serv Outcome Res Methodol 2014;14:54–68. https://doi.org/10.1007/s10742-014-0117-x.Search in Google Scholar PubMed PubMed Central

15. Wang, L, Tchetgen Tchetgen, E, Martinussen, T, Vansteelandt, S. Learning causal hazard ratio with endogeneity. arXiv, (1807.05313), 2018.Search in Google Scholar

16. Andersen, PK, Gill, RD. Cox’s regression model for counting processes: a large sample study. Ann Stat 1982;10:1100–20. https://doi.org/10.1214/aos/1176345976.Search in Google Scholar

17. Cox, DR. Partial likelihood. Biometrika 1975;62:269–76. https://doi.org/10.1093/biomet/62.2.269.Search in Google Scholar

18. Rubin, DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol 1974;65:688. https://doi.org/10.1037/h0037350.Search in Google Scholar

19. Martínez-Camblor, P, MacKenzie, TA, O’Malley, AJ. A robust hazard ratio for general modeling of survival-times. Int J Biostat 2021:20210003. https://doi.org/10.1515/ijb-2021-0003.Search in Google Scholar PubMed

20. Hernán, MA, Brumback, B, Robins, JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. J Am Stat Assoc 2001;96:440–8. https://doi.org/10.1198/016214501753168154.Search in Google Scholar

21. MacKenzie, TA, Martínez-Camblor, P, O’Malley, AJ. Time dependent hazard ratio estimation using instrumental variables without conditioning on an omitted covariate. BMC Med Res Methodol 2021;21:1–21. https://doi.org/10.1186/s12874-021-01245-6.Search in Google Scholar PubMed PubMed Central

22. Breslow, NE. Discussion of the paper by D. R. Cox. J Roy Stat Soc B 1972;34:216–7.Search in Google Scholar

23. MacKenzie, TA, Brown, JR, Likosky, DS, Wu, Y, Grunkemeier, GL. Review of case-mix corrected survival curves. Ann Thorac Surg 2012;93:1416–25. https://doi.org/10.1016/j.athoracsur.2011.12.094.Search in Google Scholar PubMed

24. Lin, DY, Ying, Z. Semiparametric analysis of general additive-multiplicative hazard models for counting processes. Ann Stat 1995;23:1712–34. https://doi.org/10.1214/aos/1176324320.Search in Google Scholar

25. Martinussen, T, Scheike, TH. A flexible additive multiplicative hazard model. Biometrika 2002;89:283–98. https://doi.org/10.1093/biomet/89.2.283.Search in Google Scholar

26. Madadizadeh, F, Ghanbarnejad, A, Ghavami, V, Zare Bandamiri, M, Mohammadianpanah, M. Applying additive hazards models for analyzing survival in patients with colorectal cancer in fars province, Southern Iran. Asian Pac J Cancer Prev APJCP 2017;18:1077–83. https://doi.org/10.22034/APJCP.2017.18.4.1077.Search in Google Scholar PubMed PubMed Central

27. Berg, A, Xie, X, Strickler, HD, Xue, X. Additive hazard regression models: an application to the natural history of human Papillomavirus. Comput Math Methods Med 2013;2:1–7. https://doi.org/10.1155/2013/796270.Search in Google Scholar PubMed PubMed Central

28. Abadi, A, Saadat, S, Yavari, P, Bajdik, C, Jalili, P. Comparison of Aalen’s additive and Cox proportional hazards models for breast cancer survival: analysis of population– based data from British Columbia, Canada. Asian Pac J Cancer Prev APJCP 2011;12:3113–6.Search in Google Scholar

29. Thanassoulis, G, O’Donnell, CJ, randomization, M. Nature’s randomized trial in the post-genome era. J Am Med Assoc 2009;301:2386–8. https://doi.org/10.1001/jama.2009.812.Search in Google Scholar PubMed PubMed Central

30. Martínez-Camblor, P, MacKenzie, TA, Staiger, DO, Goodney, PP, O’Malley, AJ. Summarizing causal differences in survival curves in the presence of unmeasured confounding. Int J Biostat 2020;17:223–40. https://doi.org/10.1515/ijb-2019-0146.Search in Google Scholar PubMed

31. Efron, B, Tibshirani, RJ. An Introduction to the Bootstrap Monographs on Statistics and Applied Probability 57. Boca Raton, Florida: Chapman & Hall/CRC; 1993.Search in Google Scholar

32. Schermerhorn, ML, Liang, P, Eldrup-Jorgensen, J, Cronenwett, JL, Nolan, BW, Kashyap, VS, et al.. Association of transcarotid artery revascularization vs transfemoral carotid artery stenting with stroke or death among patients with carotid artery stenosis. J Am Med Assoc 2019;322:2313–22. https://doi.org/10.1001/jama.2019.18441.Search in Google Scholar PubMed PubMed Central

33. Martínez-Camblor, P, Pardo-Fernández, JC. The Youden index in the generalized receiver operating characteristic curve context. Int J Biostat 2019;15:1–28. https://doi.org/10.1515/ijb-2018-0060.Search in Google Scholar PubMed

34. Hernán, MA. The hazards of hazard ratios. Epidemiology 2010;21:13–5. https://doi.org/10.1097/ede.0b013e3181c1ea43.Search in Google Scholar PubMed PubMed Central

35. Tchetgen-Tchetgen, EJ, Walter, S, Vansteelandt, S, Martinussen, T, Glymour, M. Instrumental variable estimation in a survival context. Epidemiology 2015;26:402–10. https://doi.org/10.1097/ede.0000000000000262.Search in Google Scholar

36. Truthers, CA, Kalbfleisch, JD. Misspecified proportional hazard models. Biometrika 1986;73:363–9. https://doi.org/10.1093/biomet/73.2.363.Search in Google Scholar

37. van der Vaart, AW. Asymptotic Statistics. Cambridge: Cambridge University Press; 2000.Search in Google Scholar

Supplementary Material

The online version of this article offers supplementary material (https://doi.org/10.1515/ijb-2021-0096).

Received: 2021-09-06

Revised: 2022-01-24

Accepted: 2022-03-02

Published Online: 2022-03-23

You are currently not able to access this content.

Supplementary Material Details

Articles in the same Issue

https://doi.org/10.1515/ijb-2021-0096

Keywords for this article

causal effect; Cox regression model; instrumental variable; mis-specified models; omitted covariates; population-averaged hazard ratio