A compound representation of the multiple treatment propensity score with applications to marginal structural modeling

David Stein; Lauren D’Arinzo; Fraser Gaspar; Max Oliver; Kristin Fitzgerald; Di Lu; Steven Piantadosi; Alpesh Amin; Brandon Webb

doi:10.1515/em-2023-0005

Article

A compound representation of the multiple treatment propensity score with applications to marginal structural modeling

David Stein , Lauren D’Arinzo , Fraser Gaspar , Max Oliver , Kristin Fitzgerald , Di Lu , Steven Piantadosi , Alpesh Amin and Brandon Webb

Published/Copyright: November 6, 2024

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal Epidemiologic Methods Volume 13 Issue s2

Abstract

Objectives

Methods of causal inference are used to estimate treatment effectiveness for non-randomized study designs. The propensity score (i.e., the probability that a subject receives the study treatment conditioned on a set of variables related to treatment and/or outcome) is often used with matching or sample weighting techniques to, ideally, eliminate bias in the estimates of treatment effect due to treatment decisions. If multiple treatments are available, the propensity score is a function of the adjustment set and the set of possible treatments. This paper develops a compound representation that separates the treatment decision into a binary decision: treat or don't treat, and a potential treatment decision: choose the treatment that would be given if the subject is treated.

Methods

The compound representation was derived from Robin's definition of the propensity score, and a second proof is derived from importance sampling. A simulation study illustrates the use of the method.

Results

Multiple treatment stabilized marginal structural weights were calculated with this approach, and the method was applied to an observational study to evaluate the effectiveness of different neutralizing monoclonal antibodies to treat infection with various severe acute respiratory syndrome coronavirus 2 variants.

Conclusions

The method can greatly simplify the computation of multiple treatment propensity scores and reduce bias in comparison with improperly used logistic regression.

Keywords: multiple treatment propensity score; marginal structural models; COVID-19

Corresponding author: David Stein, Center for Clinical Investigations, Brigham and Women's Hospital, Boston, MA 02115, USA; and Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA, E-mail: dwstein@mgb.org

Funding source: Administration for Strategic Preparedness and Response, Biomedical Advanced Research and Development Authority

Award Identifier / Grant number: Contract Number 75FCMC18D0047, Task Order 75A50121

Acknowledgments

This work was performed by the mAb Real – World Evidence Collaborative. The views expressed are solely those of the authors and do not necessarily represent those of the U.S. Department of Health and Human Services. This study was supported wholly or in part with federal funds from the Administration for Strategic Preparedness and Response, Biomedical Advanced Research and Development Authority, under Contract Number 75FCMC18D0047, Task Order 75A50121F80012, awarded to The MITRE Corporation.

Research ethics: The study underwent IRB review at the MITRE Corporation and participating healthcare centers.
Informed consent: The need for informed consent was waived because this study used deidentified data.
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
Competing interests: Dr. Piantadosi reports personal fees from MITRE Corp, outside the submitted work. Dr. Webb reports receiving a grant from Regeneron. Dr. Amin reports grants from MITRE during the conduct of the study; grants from PI or Co-PI of clinical trials sponsored by NIH/NAID, NeuroRx Pharma, Pulmotect, Blade Therapeutics, Novartis, Takeda, Humanigen, Eli Lilly, PTC Therapeutics, OctaPharma, Fulcrum Therapeutics, Alexion-not related to this manuscript, personal fees from speaker and/or consultant for BMS, Pfizer, BI, Portola, Sunovion, Mylan, Salix, Alexion, AstraZeneca, Novartis, Nabriva, Partatek, Bayer, Tetraphase, Achogen LaJolla, Ferring, Seres, Spero, Eli Lilly, Millennium, HeartRite, Aseptiscope, Sprightly-not related to this manuscript, and outside the submitted work.
Research funding: This study was supported wholly or in part with federal funds from the Administration for Strategic Preparedness and Response, Biomedical Advanced Research and Development Authority, under Contract Number 75FCMC18D0047, Task Order 75A50121F80012, awarded to The MITRE Corporation.
Data availability: Data is available from the National COVID Cohort Collaborative. Available at https://ncats.nih.gov/n3c.

Demonstration of confounding if p(t|x)=0

The following example shows how estimates of treatment effect for different treatment types can be biased if the positivity constraint is violated. Assume two time periods and two treatments, t ₁ and t ₂. Untreated is denoted by t ₀. Assume that the probability that a subject appears in interval 2 is twice the probability that a subject appears in interval 1. The probabilities of a subject receiving no treatment or the treatments and the probabilities of the adverse outcome, p _j(Y=1), for the two time intervals, j=1, 2, and for the combined interval, j=C, are given in Table 6. Treatment 1 is given in interval 1 but not in interval 2, whereas treatment 2 is given in both intervals. The disease becomes more contagious and more virulent in interval 2 as compared with interval 1. This table also presents the odds ratios of the adverse outcomes, comparing treated vs. non-treated patients, for each of the time periods and for the combined time period. One sees that, in time interval 1, treatments 1 and 2 are of equal effectiveness, with odds ratios of 0.11. The odds ratio for treatment 2 in interval 2 is 0.08 – note that the probability of the adverse outcome increases for both untreated and those given treatment 2 in interval 2 as compared with interval 1. The combined odds ratio is also shown in the table. From the combined odds ratios, treatment 1 is more effective than treatment 2, whereas they are of equal effectiveness in interval 1, and treatment 1 is not used against the more virulent strain in interval 2. The combined odds ratio for treatment 1 is confounded by comparing treatment effectiveness of the treated population in interval 1 with untreated patients from interval 2, a period of time during which treatment 1 was not available.

Table 6:

Illustration of confounding when p(t|x)=0.

Treat	p ₁(t)	p ₂(t)	p ₁(Y=1)	p ₂(Y=1)	p _C(Y=1)	OR ₁	OR ₂	OR _C
t ₀	0.34	0.34	0.5	0.75	0.67
t ₁	0.33	0	0.1	NA	0.1	0.11	NA	0.06
t ₂	0.33	0.66	0.1	0.2	0.17	0.11	0.08	0.1

Table 7:

Log odds, probabilities, and odds ratios of the outcome model conditioned on the input values.

A	X ₁	X ₂	T ₁	T ₂	Log odds	Probability	Odds ratio
0	0	0	0	0	−2.9444	0.05	NA
0	1	0	0	0	−2.1972	0.1	NA
0	0	1	0	0	−1.7346	0.15	NA
0	1	1	0	0	−1.3863	0.2	NA
1	0	0	1	0	−4.5951	0.01	0.1919
1	1	0	1	0	−2.9444	0.05	0.4737
1	0	1	1	0	−3.8918	0.02	0.1156
1	1	1	1	0	−1.5163	0.18	0.8780
1	0	0	0	1	−4.5951	0.01	0.1919
1	1	0	0	1	−3.8918	0.02	0.1837
1	0	1	0	1	−4.1846	0.015	0.0863
1	1	1	0	1	−2.9444	0.05	0.2105

Table 8:

Unweighted population counts of OS data with X ₁=0. The p-value of the χ ² test of independence is less than 2, 2 × 10⁻¹⁶.

	Unweighted counts: A=0	Unweighted counts: T ₁=1	Unweighted counts: T ₂=1
X ₂=0	61,375	10,230	10,160
X ₂=1	5,174	5,173	18,235

Table 9:

Weighted population counts of the observational data with X ₁=0. The p-value of the χ ² test of independence is 0.271.

	Weighted counts: A=0	Weighted counts: T ₁=1	Weighted counts: T ₂=1
X ₂=0	54,440	12,576	14,735
X ₂=1	11,912	2,830	3,316

Table 10:

X ₂ modification of treatment effects estimated using MSM weighted logistic regression and an RCT. p _RCT-MSM is the p-value of the t-test for the equality of the means.

	μ _RCT	μ _MSM	σ _RCT	σ _MSM	p _RCT-MSM	Δσ, %
X1_0_X2_0_T1	−1.653	−1.657	0.068	0.102	0.488	50
X1_0_X2_0_T2	−1.651	−1.657	0.067	0.102	0.481	52.24
X1_1_X2_0_T2	−1.698	−1.694	0.041	0.041	0.475	0
X1_0_X2_1_T1	−2.161	−2.164	0.102	0.11	0.494	7.84
X1_0_X2_1_T2	−2.456	−2.459	0.116	0.103	0.493	−11.21
X1_1_X2_1_T2	−1.559	−1.557	0.043	0.051	0.486	18.6

An alternate derivation of SW ¹

Robins [11] defines marginal structural models (MSM) to predict outcomes conditioned on baseline covariates of interest from observational data applicable to both failure time studies and outcomes measured at the end of follow up. The MSM incorporates time varying treatments subject to conditions [11], Estimators are derived using influence functions, and The models are applicable to of outcomes measured after a pre-set duration or time-to-event observations. time-to-event and non-linear least squares, semiparametric regression, models to predict outcomes from observational data general treatment and response time dependent processes, where the response process includes the outcome process of interest and the process of other recorded variables. Examples include treatment processes such that the treatment is given at multiple discrete time points and outcome process that are measured at a fixed time or are failure time process. He shows, using influence functions, that weighting observation by the inverse of a subject’s probability of having had his observed treatment history allows for the estimation of causal effects from non-randomized observations. A simpler proof using importance sampling that is applicable to the case of one of several possible treatments, including the null treatment, given at a single time point is presented.

Assume an independent set of N samples {(y _j, t _j, x _j)∣1≤j≤N}, where y _j is the outcome, and t j = A ( x j ) ∈ T . The samples can be expressed as {(y _j, a _j, t _j, x _j)∣1≤j≤N} where a _j ∈ {0, 1} and t j ∈ T 1 . If a _j=0, t _j is interpreted as the potential treatment. Assume that there is a discrete set L and a mapping ϕ: X → L such that p(t _j∣x _j)=p(t _j∣ϕ(x _j)) – that is, t j ∼ ψ ϕ ( x j ) . Let Y ⁱ denote the outcome random variable if all patients have exposure A=i, for i=0, 1.

The observations can be used to estimate causal effects if the following assumptions hold:

Y i ∐ A | X , T .
p(A|X)p(T|X)>0.
The outcome of one individual is independent of the treatment assignment of any other.

From assumption A.1, within strata of (X, T) treated and untreated patients are exchangeable [14], and from assumption A.3., outcomes of different patients are independent. Thus, the expected value of the causal variables can be computed from observations, according to

(12) E ( Y 1 | T ) = ∑ x E ( Y | A = 1 , T , x ) f ( x ) ,

and

(13) E ( Y 0 | T ) = E ( Y 0 ) = ∑ x E ( Y | A = 0 , x ) f ( x ) .

Importance sampling [31] states that if {z _j} is a sample drawn from probability density function f, and g is a probability density function such that g(x) = 0 if f(x) = 0, then { g ( z j ) f ( z j ) z j } is a sample drawn from g. This result is used to transform samples drawn from f(a=1, t _j, x) and f(a=0, x) to samples drawn from f(x) so that A.1 holds for the weighted samples.

Let S W j 1 ⊙ ( a j , t j , x j ) denote the sample (a _j, t _j, x _j) counted with multiplicity S W j 1 . Let a _o ∈ {0, 1} and t _o ∈ {t ₁, …, t _M}. If a _o=1, define S _o = {(a _j, t _j, x _j)| a _j=a _o, t _j=t _o}, and if a _o=0, define S _o = {(a _j, t _j, x _j)| a _j=a _o}. Let N _o = |S _o|. Let I _o be the indicator function for S _o defined by I _o(j) = 1 if (a _j, t _j, x _j) ∈ S _o, and I _o(j) = 0, otherwise.

Define

(14) w ⊙ S o = S W j 1 ⊙ ( a j , t j , x j ) | I o ( j ) = 1 .

Lemma 1.

w ⊙ S _o is a sample from p(x).

Proof.

Case 1: a _o=1. {(a _o, t _o, x _j)} is a sample from

p ( x | a o , t o ) = p ( a o , t o , x ) p ( a o , t o ) = p ( a o | t o , x ) p ( t o , x ) p ( a o , t o ) = p ( a o | x ) p ( t o | x ) p ( x ) p ( a o , t o ) .

Importance sampling implies that each sample from the pseudo population, (a _o, t _o, x _j), is a sample from

Case 2: a _o=0. {(a _o, x _j)} is a sample from

p ( x | a o ) = p ( a o , x ) p ( a o ) = p ( a o | x ) p ( x ) p ( a o )

Importance sampling implies that each sample from the pseudo population, (a _o, x _j), is a sample from

w j p ( x | a o ) = p ( a o ) p ( a o | x ) × p ( a o | x ) p ( x ) p ( a o ) = p ( x ) .

Theorem 3.

Under the assumptions A.1–A.3

(15) lim N → ∞ 1 N o ∑ j = 0 N I o ( j ) w j Y j = E Y a o | t o .

Proof.

This follows from Equations (12) and (13) and Lemma 1.

Additional simulation results

Table 7 lists the log odds, probabilities, and odds ratios of Y=1 for all combinations of the input variables. The outcome probabilities were selected and the coefficients were obtained by solving a system of equations.

Equalization of the distribution of the population covariates

Tables 8 and 9 show the counts of the (X ₂, T _j) subgroups for the subpopulation with X ₁=0 before and after weighting, respectively. The χ ² test applied to Table 8 shows a strong dependence (p-value <2.2 × 10⁻¹⁶) between X ₂ and treatment, whereas the χ ² test applied to Table 9 shows that the weighting, as expected, has largely balanced X ₂ across the treatment groups (p-value 0.271).

RCT and MSM estimates of the log odds of the outcome Y=1 are shown in Table 10 which considers X ₂ as a treatment effect modifier. Results are segregated by the value of X ₁, since if X ₁=1, T ₁ is not available. The MSM estimates come from the Sandwich estimator applied to the logistic regression equation.

(16) g ( E ( Y ) ) = α 0 + α 1 X 2 + α 2 A T 1 + α 3 A T 2 + α 4 A X 2 T 1 + α 5 A X 2 T 2 .

Monte Carlo simulation using 1,000 iterations was used to estimate mean values and standard deviations. For each iteration 100,000 samples were drawn for the RCT and a separate 100,000 samples were drawn for the OS. The MSM and RCT estimates are consistent, and the MSM estimator generally has greater variance than RCT estimation.

References

1. Abernethy, A, Adams, L, Barrett, M, Bechtel, C, Brennan, P, Butte, A, et al.. The promise of digital health: then, now, and the future. Washington D. C.: NAM Perspectives; 2022.Search in Google Scholar

2. Concato, J, Corrigan-Curay, J. Real-world evidence — where are we now? N Engl J Med 2022;386:1680–2. https://doi.org/10.1056/nejmp2200089.Search in Google Scholar

3. Austin, PC, Stuart, EA. Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Stat Med 2015;34:3661–79. https://doi.org/10.1002/sim.6607.Search in Google Scholar PubMed PubMed Central

4. Hernán, M, James, R. Causal inference: what if. Boca Raton: Chapman & Hall/CRC Press; 2020.Search in Google Scholar

5. Stuart, EA. Matching methods for causal inference: a review and a look forward. Stat Sci 2010;25:1–21. https://doi.org/10.1214/09-sts313.Search in Google Scholar PubMed PubMed Central

6. Rosenbaum, PR. Design of observational studies. Springer series in statistics. New York: Springer; 2010. OCLC: ocn444428720.10.1007/978-1-4419-1213-8Search in Google Scholar

7. Ranganathan, P, Pramesh, CS, Aggarwai, R. Common pitfalls in statistical analysis: logistic regression. Perspect Clin Res 2017;8:148–51. https://doi.org/10.4103/picr.picr_87_17.Search in Google Scholar

8. Cole, SR, Hernán, MA. Constructing inverse probability weights for marginal structural models. Am J Epidemiol 2008;168:656–64. https://doi.org/10.1093/aje/kwn164.Search in Google Scholar PubMed PubMed Central

9. Lunceford, J, Davidian, M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med 2004;23:2937–60. https://doi.org/10.1002/sim.1903.Search in Google Scholar PubMed

10. Ma, X, Wang, J. Robust inference using inverse probability weighting. J Am Stat Assoc 2020;115:1851–60. https://doi.org/10.1080/01621459.2019.1660173.Search in Google Scholar

11. Robins, JM. Marginal structural models. In: 1997 proceedings of the american statistical association, section on Bayesian statistical science; 1998:1–10 pp.Search in Google Scholar

12. Joffe, MM, Ten Have, TR, Feldman, HI, Kimmel, SE. Model selection, confounder control, and marginal structural models: review and new applications. Am Stat 2004;58:272–9. https://doi.org/10.1198/000313004x5824.Search in Google Scholar

13. Robins, JM, Hernán, MA, Brumback, BA. Marginal structural models and causal inference in epidemiology. Epidemiology 2000;11:550–60. https://doi.org/10.1097/00001648-200009000-00011.Search in Google Scholar PubMed

14. VanderWeele, TJ. Confounding and effect modification: distribution and measure. Epidemiol Methods 2012;1:55–82. https://doi.org/10.1515/2161-962x.1004.Search in Google Scholar

15. Hernán, M, Brumback, BA, Robins, JM. Estimating the causal effect of zidovudine on CD4 count with a marginal structural model for repeated measures. Stat Med 2002;21:1689–709. https://doi.org/10.1002/sim.1144.Search in Google Scholar PubMed

16. Banack, HR, Kaufman, JS. Estimating the time-varying joint effects of obesity and smoking on all-cause mortality using marginal models. Am J Epidemiol 2015;183:122–9. https://doi.org/10.1093/aje/kwv168.Search in Google Scholar PubMed

17. Hernán, M, Brumback, BA, Robins, JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. J Am Stat Assoc 2001;96:440–8. https://doi.org/10.1198/016214501753168154.Search in Google Scholar

18. Howe, CJ, Cole, SR, Mehta, SH, Kirk, GD. Estimating the effects of multiple time-varying exposures using joint marginal structural models: alcohol consumption, injection drug use, and hiv acquisition. Epidemiology 2012;23:574–82. https://doi.org/10.1097/ede.0b013e31824d1ccb.Search in Google Scholar PubMed PubMed Central

19. Imbens, G. The role of the propensity score in estimating dose-response functions. Biometrika 2000;87:706–10. https://doi.org/10.1093/biomet/87.3.706.Search in Google Scholar

20. Imai, K, van Dyk, DA. Causal inference with general treatment regimes: generalizing the propensity score. J Am Stat Assoc 2004;99:854–66. https://doi.org/10.1198/016214504000001187.Search in Google Scholar

21. Lopez, MJ, Gutman, R. Estimation of causal effects with multiple treatments: a review and new ideas. Stat Sci 2017;32:432–54. https://doi.org/10.1214/17-sts612.Search in Google Scholar

22. Ambrose, N, Amin, A, Anderson, B, Barrera-Oro, J, Bertagnolli, M, Campion, F, et al.. Neutralizing monoclonal antibody use and COVID-19 infection outcomes. JAMA Netw Open 2023;6:e239694–4. https://doi.org/10.1001/jamanetworkopen.2023.9694.Search in Google Scholar PubMed PubMed Central

23. Ambrose, N, Amin, A, Anderson, B, Bertagnolli, M, Campion, F, Chow, D, et al.. The influence of social determinants on receiving outpatient treatment with monoclonal antibodies, disease risk, and effectiveness for covid-19. J Gen Intern Med 2023;38. https://doi.org/10.1007/s11606-023-08324-y.Search in Google Scholar PubMed PubMed Central

24. van Buuren, S, Groothuis-Oudshoorn, K. mice: multivariate imputation by chained equations in r. J Stat Software 2011;45:1–67. https://doi.org/10.18637/jss.v045.i03.Search in Google Scholar

25. Zeileis, A. Object-oriented computation of sandwich estimators. J Stat Software 2006;16:1–16. https://doi.org/10.18637/jss.v016.i09.Search in Google Scholar

26. Zeileis, A, Köll, S, Graham, N. Various versatile variances: an object-oriented implementation of clustered covariances in R. J Stat Software 2020;95:1–36. https://doi.org/10.18637/jss.v095.i01.Search in Google Scholar

27. Leyrat, C, Seaman, SR, White, IR, Douglas, I, Smeeth, L, Kim, J, et al.. Propensity score analysis with partially observed covariates: how should multiple imputation be used? Stat Methods Med Res 2017;28:3–19. https://doi.org/10.1177/0962280217713032.Search in Google Scholar PubMed PubMed Central

28. VanderWeele, TJ, Ding, P. Sensitivity analysis in observational research: introducing the e-value. Ann Intern Med 2017;167:268–74. https://doi.org/10.7326/m16-2607.Search in Google Scholar

29. Mathur, MB, Ding, P, Riddell, CA, VanderWeele, TJ. Web site and r package for computing e-values. Epidemiology 2018;29:e45–7. https://doi.org/10.1097/ede.0000000000000864.Search in Google Scholar

30. MATLAB. Version 9.11.0 (R2021b). Natick, Massachusetts: The MathWorks Inc; 2021.Search in Google Scholar

31. Bishop, CM. Pattern recognition and machine learning. New York: Springer; 2006.Search in Google Scholar

Received: 2023-02-09

Accepted: 2024-08-05

Published Online: 2024-11-06

You are currently not able to access this content.

Articles in the same Issue

https://doi.org/10.1515/em-2023-0005

Keywords for this article

multiple treatment propensity score; marginal structural models; COVID-19

A compound representation of the multiple treatment propensity score with applications to marginal structural modeling

Article

Abstract

Objectives

Methods

Results

Conclusions

Acknowledgments

Demonstration of confounding if p(t|x)=0

An alternate derivation of SW 1

Lemma 1.

Proof.

Theorem 3.

Proof.

Additional simulation results

Equalization of the distribution of the population covariates

References

Articles in the same Issue

Articles in the same Issue

Articles in the same Issue

An alternate derivation of SW ¹