Selection Bias When Using Instrumental Variable Methods to Compare Two Treatments But More Than Two Treatments Are Available

Ashkan Ertefaie; Dylan Small; James Flory; Sean Hennessy

doi:10.1515/ijb-2015-0006

Article Publicly Available

Selection Bias When Using Instrumental Variable Methods to Compare Two Treatments But More Than Two Treatments Are Available

Ashkan Ertefaie , Dylan Small , James Flory and Sean Hennessy

Published/Copyright: May 26, 2016

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information

From the journal The International Journal of Biostatistics Volume 12 Issue 1

Abstract

Instrumental variable (IV) methods are widely used to adjust for the bias in estimating treatment effects caused by unmeasured confounders in observational studies. It is common that a comparison between two treatments is focused on and that only subjects receiving one of these two treatments are considered in the analysis even though more than two treatments are available. In this paper, we provide empirical and theoretical evidence that the IV methods may result in biased treatment effects if applied on a data set in which subjects are preselected based on their received treatments. We frame this as a selection bias problem and propose a procedure that identifies the treatment effect of interest as a function of a vector of sensitivity parameters. We also list assumptions under which analyzing the preselected data does not lead to a biased treatment effect estimate. The performance of the proposed method is examined using simulation studies. We applied our method on The Health Improvement Network (THIN) database to estimate the comparative effect of metformin and sulfonylureas on weight gain among diabetic patients.

Keywords: selection bias; instrumental variables; sensitivity analysis

1 Introduction

Confounder adjustment is critical in the estimation of treatment effect in observational studies. In practice, however, there is no guarantee that all the confounders (i.e., baseline variables that affect both the treatment and outcome) have been measured. These unmeasured confounders can bias the treatment effect estimation.

Instrumental variable (IV) methods are designed to adjust for the bias caused by unmeasured confounders in estimating a difference measure of effect such as the risk difference. IVs are predictors of treatment allocation that are independent of unmeasured confounders and do not have a direct effect on the outcome (i.e., only affect the outcome through affecting the treatment). Intuitively, the IV method seeks to extract variation in treatment that is free of unmeasured confounders and uses this variation to estimate the treatment effect. For more information on IV methods, see Angrist et al.[1], Newhouse and McClellan [2], Greenland [3], Hernán and Robins [4], Cheng et al. [5], Baiocchi et al.[6] and Imbens [7]. This paper is motivated by provider preference IV (PP IV) which is commonly used as an IV in health studies. PP IV utilizes the natural variation in medical practices to construct an IV. For example, in studying the effect of cyclooxygenase-2 (COX-2) inhibitors vs. nonselective nonsteroidal anti-inflammatory drugs (nonselective NSAIDs) on gastrointestinal complications Brookhart et al. [8] used whether a patient’s physician last prescription for a patient prescribed a nonsteroidal anti-inflammatory drug (NSAID) was for a Cox-2 inhibitor or a nonselective NSAID as an IV for whether the patient was prescribed a Cox-2 inhibitor or a nonselective NSAID. Other examples of PP IV can be found in Brooks et al. [9] and Brookhart and Schneeweiss [10].

For a binary IV and binary treatment, Angrist et al. [1] proposed a nonparametric estimator which identifies the treatment effect among subjects who would take the treatment if encouraged to do so by the IV and not take the treatment if not encouraged to do so by the IV (compliers). This estimator is equivalent to a 2 stage least square (2SLS) estimator where at the first stage the treatment is regressed on IV and at the second stage the outcome is regressed on fitted values of the first stage regression (i.e., predicted value of the treatment variable given the IV). The coefficient of the independent variable of the second stage regression is equivalent to the estimator proposed by Angrist et al. [1]. The 2SLS estimator is also called the Wald estimator after Wald [11]. More efficient versions of the 2SLS estimator has been developed by Imbens and Rubin [12, 13] and Cheng et al. [14].

Although treatment effect estimation using IV approaches for binary treatments is straightforward, it is challenging when there are more than two treatment arms. In fact, Cheng and Small [15] shows that the treatment effects are not point identifiable and proposes bounds on treatment effects using method of moments. Long et al. [16] studies the same problem and proposes a Bayesian approach for finding credibility intervals for the identification region.

Sometimes investigators are interested in comparing the effect of two treatments while the data consist of more than two treatment options for patients. In this situation a common approach is to select patients who are assigned to the treatments of interest and perform the standard IV approach for binary treatment and IV to estimate the treatment effect [17–21].

The objective of this manuscript is to provide empirical and theoretical evidence that, in general, excluding patients based on their assigned treatment can result in selection bias. This problem has also been discussed in an independent work by Swanson et al. [22] and Swanson [23]. In the current manuscript, we formalize the mechanism of this selection bias within a principal stratification framework and provide a procedure that identifies the treatment effect of interest as a function of a vector of sensitivity parameters. Specifically, the sensitivity parameters are used to estimate the probability of being assigned to one of the treatments of interest as a function of the observed outcome and the IV value [24]. We also list assumptions under which selecting patients based on their received treatment does not lead to a biased treatment effect estimate.

The manuscript is organized as follows. Section 2 provides the setup and defines the causal estimand. It also lists the required assumptions for our procedure. In Section 3, we discuss the selection bias issue and present the proposed sensitivity analysis method. We report on the results of a simulation study in Section 4, where we examine the performance of our proposed method. In Section 5, we study the comparative effect of metformin and sulfonylureas on weight gain using The Health Improvement Network (THIN) database.

2 Settings and notations

For simplicity we focus on observational studies with three treatment options A, B, and C. We assume that we are only interested in comparing the effect of treatment A with B. We use small letters to refer to the possible values of the corresponding capital letter random variable. Let Z be a binary IV where Z∈{A,B} such as provider preference. Z=A means that the IV predisposes the patient to receive treatment level A among the options A and B, and Z=B means that the IV predisposes the patient to receive treatment level B among the options A and B; e.g., in Brookhart et al. [8] study, if A denotes Cox-2 inhibitor and B denotes nonselective NSAID, Z=A if the patient’s physician last prescription to a patient prescribed an NSAID was a Cox-2 inhibotor and Z=B if it was a nonselective NSAID. Let D(z) be the potential treatment assigned given Z=z, where D(z)∈{A,B,C}, i.e., D(A) is the treatment that would be received if the IV was A. The potential outcomes are Y(z,d) for Z=z and D=d, i.e., Y(A,B) is the outcome that would be observed if the IV was A and the treatment level was B. The observed treatment and outcome are D=D(Z) and Y=Y(Z,D(Z)), respectively.

Frangakis and Rubin [25] developed a basic principal strata (PS) framework which stratifies subjects based on their potential assigned treatments. In our setting, we can classify subjects into the following 9 principal strata: S1) always B taker, S2) always A taker, S3) always C taker, S4) subjects who would comply with their value of IV, i.e., D(A)=A, D(B)=B, S5) subjects who would take B if Z=B and would take C if Z=A, S6) subjects who would take C if Z=B and would take A if Z=A, S7) subjects who would take A if Z=B and would take B if Z=A, S8) subjects who would take B if Z=A and would take C if Z=B, and S9) subjects who would take A if Z=B and would take C if Z=A. We denote the proportions in the basic principal strata as πS..

We assume that the IV satisfies the following standard assumptions [1]: (1) stable unit treatment value assumption, (2) The IV is independent of unmeasured confounders, potential outcomes and potential treatment received, (3) the IV affects the treatment but has no direct effect on the outcome. This assumption is known to as the exclusion restriction (ER) assumption. Under the exclusion restriction, we let Y(d) be the potential outcome if treatment d is received which equal Y(z=0,d)=Y(z=1,d), and (4) the IV is independent of the pretreatment covariates, potential outcome and potential treatments. Furthermore, we assume two types of monotonicity assumptions: (5) monotonicity I: there is no defiers (i.e., p(D(A)≠A,D(B)≠B)=0) and (6) monotonicity II: there is no one who would take B if Z=A but take C if Z=B and no one who would take C if Z=A but take A if Z=B. Note that monotonicity assumptions 5 & 6 reduce the number of principal strata to 6. The parameter of interest is defined as

θ=E[Y(A)−Y(B)|PS=S4].

3 IV selection bias

We show in this manuscript that IV analysis methods may not result in an unbiased treatment effect if applied on a data set in which units are preselected based on their received treatments. A standard IV analysis of patients on the subset of patients for whom D=A or B is essentially a comparison of the Z=A vs. Z=B patients on this subset. If Z=A has a different effect on whether D=C than Z=B, then there will be selection bias. For example, patients who are in a more severe stages of the disease may be treated with treatment D=C and therefore have less chance to be observed in the sample; if doctors who prefer drug A(Z=A) are more likely to use treatment C with severe patients than doctors who prefer drug B(Z=B), then there will be selection bias. Another scenario where such selection bias might arise is that treatment C is more likely to be offered if some complications are present and doctors who prefer drug A are more likely to treat a patient with complications with drug C than doctors who prefer drug B.

The challenge is that θ is not identifiable using the observed data when there is selection bias. This is due to the fact that the observed outcomes of Y|Z,D is a mixture of potential outcomes from principal strata. Let fA(y) be the density of outcome for a patient who received D=A when assigned to Z=A (i.e., fA(y)=f(y|D=A,Z=A). Then

fA(y)=γAS2fAS2(y)+γAS4fAS4(y)+γAS6fAS6(y),

where γAS.=πS.πS2+πS4+πS6 and fAS.(y) is the density function of the outcome for subjects in principal stratum S. with D=A. The density fAS2(y) and fAS6(y) can be written as

fAS2(y)=f(Y(A)=y|PS=S2)=p(D(B)=A|D(A)=A,Y(A)=y)fA(y)p(D(B)=A|D(A)=A)

fAS6(y)=f(Y(A)=y|PS=S6)=p(D(B)=C|D(A)=A,Y(A)=y)fA(y)p(D(B)=C|D(A)=A)

Thus,

(1)fAS4(y)=f(Y(A)=y|PS=S4)=[1−p(D(B)=C|D(A)=A,Y(A)=y)−p(D(B)=A|D(A)=A,Y(A)=y)]fA(y)γAS4

Similarly, let fB(y) be the density of outcome for a patient who received D=B when assigned to Z=B. Then

fB(y)=γBS1fBS1(y)+γBS4fAS4(y)+γBS5fBS5(y),

where γBS.=πS.πS1+πS4+πS5 and fBS.(y) is the density function of the outcome for subjects in principal stratum S. with D=B. Also,

fBS1(y)=f(Y(B)=y|PS=S1)=p(D(A)=B|D(B)=B,Y(A)=y)fA(y)p(D(A)=B|D(B)=B)

fBS5(y)=f(Y(B)=y|PS=S5)=p(D(A)=C|D(B)=B,Y(B)=y)fB(y)p(D(A)=C|D(B)=B)

Thus,

(2)fBS4(y)=[1−p(D(A)=C|D(B)=B,Y(B)=y)−p(D(A)=B|D(B)=B,Y(B)=y)]fB(y)γBS4.

The mixture proportions γAS. and γBS. are unknown and need to be estimated using the observed data. However, the strata proportions πS. are not identifiable using the proportions in the observable (D,Z) strata, because

p(D=A|Z=A)=πS2+πS4+πS6

p(D=A|Z=B)=πS2

p(D=B|Z=A)=πS1

p(D=B|Z=B)=πS1+πS4+πS5

p(D=C|Z=B)=πS3+πS6

(3)p(D=C|Z=A)=πS3+πS5,

does not have a unique solution. Also, the density functions fAS4(y) and fBS4(y) are not identifiable using the observed data because p(D(B)=A|D(A)=A,Y(A)=y), p(D(B)=C|D(A)=A,Y(A)=y), p(D(A)=B|D(B)=B,Y(B)=y) and p(D(A)=C|D(B)=B,Y(B)=y) are unknown. So we assume logit models for these probabilities and perform the sensitivity analysis based on these models;

(4)p(D(B)=B|D(A)=A,Y(A)=y)=wA(y; α0A,α1A)=eα0A+α1Ay1+eα0A+α1Ay

(5)p(D(A)=A|D(B)=B,Y(B)=y)=wB(y; α0B,α1B)=eα0B+α1By1+eα0B+α1By.

Note that for any fixed (α1A,α1B), (α0A,α0B) can be determined since ∫fAS4(y)dy=1 and ∫fBS4(y)dy=1. Therefore, the parameter of interest θ can be estimated using the four dimensional sensitivity parameter (γAS4,γBS4,α1A,α1B).

Fixing α1A<0 makes the conditional probability monotone decreasing in y, which means the odds of being a complier among patients who have taken D=A if Z=A (e.g., seeing a physician with A preference) is lower for a larger Y(A)=y. Similarly, α1A>0 makes the conditional probability monotone increasing in y, which means the odds of being a complier among patients who have taken D=A if Z=A is higher for a larger Y(A)=y. The parameter α1B has a similar interpretation. The parameter γAS4 is the probability of being a complier (i.e., take D=A if Z=A and D=B if Z=B) for patients who have taken D=A when Z=A (i.e., γAS4=p(D(B)=B|D(A)=A). The parameter γBS4 is defined similarly.

Remark.

If it is plausible to assume that at least one of the principal strata πS3, πS5 or πS6 is empty, we can reduce the dimension of the sensitivity parameters to two by estimating γAS4 and γBS4 using the observed data.

3.1 No selection bias

Let R be a binary variable which is 1 if a patient is assigned to one of the treatments of interest and 0 otherwise. Under either of the following two conditions, the treatment effect is identifiable when we have access to the entire data (i.e., data includes patients who have received treatment C as well as A and B):

p(R=0|Z,U,X)=p(R=0|U,X) where X and U are measured and unmeasured confounders, respectively.
p(R=0|Z,U,X)=p(R=0|Z,X).

Assumption A.1 implies that the IV is unrelated to the patient’s selection. In other words, a patient would have the same chance of being selected for both values of the IV. A.2 means that the selection probability is independent of the unmeasured confounders given the IV and measured confounders. Figure 1 depicts the association between different variables. The left plot presents a scenario in which analyzing the preselected data will cause bias because conditioning on R will open the path Z→R→U→Y which violates one of the IV assumptions. This path is closed under either of assumptions A.1 or A.2 as shown in the center and right plots, respectively.

Figure 1:

Causal diagram. Conditioning on subjects with R=1 leads to a bias treatment effect estimate in the left figure. Assumptions A.1 and A.2 hold for the center and right figure, respectively.

Under either of assumptions A.1 or A.2, the treatment effect can be identified. Specifically, under assumption A.1, the principal strata S5 and S6 does not exist. Thus

E[Y|Z=A,D=A]=γS2E[Y(A)|PS=S2]+γS4E[Y(A)|PS=S4]

E[Y|Z=B,D=B]=γS1E[Y(B)|PS=S1]+γS4E[Y(B)|PS=S4]

E[Y|Z=A,D=B]=γS1E[Y(B)|PS=S1]

E[Y|Z=B,D=A]=γS2E[Y(A)|PS=S2].

These equations can be used to identify θ=E[Y(A)−Y(B)|PS=S4]. Also, under assumption A.2, θ can be estimated using the following three-step procedure:

Using the entire data, estimate ϖ=p(R=1|X,Z) by postulating a logit model on R.
Include patients who have been assigned to either treatments A or B.
Estimate the treatment effect θ using 2SLS IV approach. At each stage fit a weighted least square where weights are ϖˆ.

Fitting weighted least squares accounts for the selection procedure and hence avoids the bias by creating a pseudo-population in which there is no association between (X,Z) and R.

3.2 Inference procedure

Let YA=(YA1,YA2,...,YAnA) and YB=(YB1,YB2,...,YBnB) denote the outcome of patients whose received treatment matches their IV value for Z=A and Z=B, respectively. We assume that samples are independent within and between each IV group and identically distributed within each group.

For any fixed (γAS4,γBS4,α1A,α1B), (αˆ0A,αˆ0B) is defined as a solution to

∫−∞−∞eα0A+α1Ay1+eα0A+α1AydFˆA(y)=γAS4

∫−∞−∞eα0B+α1By1+eα0B+α1BydFˆB(y)=γBS4

where FˆA(y) and FˆB(y) are empirical distributions calculated from the observed samples YA and YB. These equations can be solved using a one dimensional grid search on real line.

We estimate the treatment effect θ by

θˆ=1γAS4nA∑j=1nAyAjwA(yAj; αˆ0A,α1A)−1γBS4nB∑j=1nByBjwB(yBj; αˆ0B,α1B).

where wA(.) and wB(.) are defined in eqs (4) and (5). By bootstrap sampling from YA and YB, we can estimate the standard error of θˆ as the standard error of the estimated treatment effect from each bootstrapped sample.

4 Simulation studies

Through simulations we examine the performance of our proposed sensitivity analysis assuming known and unknown (γAS4,γBS4). We assume two scenarios with the following principal strata proportions: Scenario 1:

Scenario 1: πS1=πS2=πS3=πS5=0.1, πS4=0.3 and πS6=0.3 which gives us (γAS4,γBS4)=(0.30.7,0.30.5).
Scenario 2: πS1=πS2=πS3=0.1, πS4=0.3, πS5=0.4 and πS6=0.0 which gives us (γAS4,γBS4)=(0.20.3,0.20.7).

The true amount of selection bias is determined by the parameters α1A and α1B in eqs (4) and (5), respectively. The outcome of individuals with D=A and Z=A are generated from a normal distribution with mean 2.5 and standard error 2 (i.e., fA(y)=N(2.5,2)). Also, the outcome of individuals with D=B and Z=B are generated from a normal distribution with mean μ and standard error 2 (i.e., fB(y)=N(μ,2)). For each set of (α1A,α1B), the parameter μ is determined such that the true treatment effect among compliers is θ=0.00, 0.50 and 0.80. We generate samples using rejection sampling from densities fBS4(y) and fBS4(y) according to eqs (1) and (2), respectively.

Table 1 presents the statistical power of detecting the true treatment effect θ=0.0, 0.50, and 0.80 based on 500 datasets of sizes n=500 with (α1A,α1B)=(1,2), (1,1) and (0,0). The nominal type-I error rate is 5%. For both scenarios when the true amount of selection bias is presumed, the empirical and nominal sizes are close. In general, the empirical sizes are inflated when the selection bias parameters are not set correctly. For some of the combinations of the true and presumed sensitivity parameter values the power and type-I error rate are much higher than the nominal one which is caused by the overestimation of the treatment effect. Also, in scenario 2, when there is no selection bias (i.e., true (α1A,α1B)=(0,0)) and we presumed (α1A,α1B)=(1,1) the power is very low because of the underestimation of the treatment effect. Table 2 shows the bias (0.80−θˆ), standard error and mean squared error of the treatment effect estimation for different combinations of the true and presumed sensitivity parameters when the true treatment effect θ=0.80. This table highlights the over- and underestimation phenomenon discussed earlier. Specifically, when the true sensitivity parameters are (α1A,α1B)=(1,2), presuming (α1A,α1B)=(0,2) will result in underestimating the counterfactual mean outcome E[Y(A)|PS=S4]. This is because when we set the presumed α1A less than the true value, the larger outcomes are less likely to be a complier among patients with D=Z=A. In other words it will shift the distribution of this compliers to left. Similarly when the presumed α1A more than the true value, E[Y(A)|PS=S4] will be overestimated (See rows 3 and 4 in Table 2).

Table 1:

Simulation study: power analysis of θ=0.00,0.50 and 0.80 when γAS4 and γBS4 are known.

True (α1A,α1B)	Presumed (α1A,α1B)	Scenario 1			Scenario 2
		0.00	0.50	0.80	0.00	0.50	0.80
(1,2)	(1,2)	0.04	0.60	0.91	0.04	0.52	0.86
(1,2)	(1,1)	0.12	0.81	0.98	0.22	0.88	0.97
(1,2)	(0,0)	0.16	0.36	0.78	0.99	1.00	1.00
(1,1)	(1,1)	0.05	0.58	0.91	0.05	0.48	0.85
(1,1)	(2,2)	0.08	0.71	0.95	0.10	0.24	0.61
(1,1)	(0,0)	0.35	0.05	0.40	0.90	1.00	1.00
(0,0)	(0,0)	0.04	0.61	0.92	0.05	0.60	0.94
(0,0)	(1,0)	1.00	1.00	1.00	0.80	1.00	1.00
(0,0)	(1,1)	0.34	0.96	1.00	0.81	0.00	0.03

Table 2:

Simulation study: Sensitivity analysis of θ=0.80 when γAS4 and γBS4 are known. Bias=0.80−θˆ.

True (α1A,α1B)	Presumed (α1A,α1B)	Scenario 1			Scenario 2
		Bias	S.D.	MSE	Bias	S.D.	MSE
(1,2)	(1,2)	–0.01	0.28	0.08	0.01	0.30	0.09
(1,2)	(1,1)	–0.22	0.29	0.14	–0.38	0.32	0.25
(1,2)	(0,2)	1.38	0.25	1.97	0.81	0.29	0.74
(1,2)	(1,0)	–1.18	0.25	1.43	–2.17	0.28	4.82
(1,2)	(0,0)	0.21	0.23	0.10	–1.34	0.25	1.86
(1,1)	(1,1)	0.00	0.28	0.08	–0.01	0.30	0.10
(1,1)	(1,2)	0.19	0.28	0.11	0.39	0.31	0.26
(1,1)	(0,2)	1.58	0.27	2.58	1.17	0.30	1.49
(1,1)	(1,0)	–0.98	0.26	1.02	–1.77	0.28	3.20
(1,1)	(0,0)	0.41	0.23	0.23	–0.95	0.27	0.98

Figure 2 shows how the treatment effect estimate changes by the sensitivity parameters when the true (α1A,α1B)=(1,2) and θ=0.80. The darker area in left panel of Figure 2 indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5% (i.e., the confidence interval for (0.80−θˆ) contains zero). See also Table 2.

$Figure 2: Simulation study: sensitivity analysis of θ=0.80$${\theta} = 0.80$$ when γAS4$$\gamma _A^{S4}$$ and γBS4$$\gamma _B^{S4}$$ are known. The true sensitivity parameter values are (α1A,α1B)=(1,2)$$({\alpha _{1A}}, {\alpha _{1B}}) = (1, 2)$$. The darker area in left panel indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5%$$5 \% $$. The first and second rows represent Scenarios 1 and 2, respectively.$

Figure 2:

Simulation study: sensitivity analysis of θ=0.80 when γAS4 and γBS4 are known. The true sensitivity parameter values are (α1A,α1B)=(1,2). The darker area in left panel indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5%. The first and second rows represent Scenarios 1 and 2, respectively.

4.1 Unknown (γAS4,γBS4)

In this case our sensitivity parameter space is (γAS4,γBS4,α1A,α1B). Figures 3 and 4 present the bias in the treatment effect estimation for different values of the presumed sensitivity parameters based on scenarios 1 and 2, respectively. In these figures the true (γAS4,γBS4,α1A,α1B)=37,35,1,2 and (γAS4,γBS4,α1A,α1B)=23,27,1,2 in scenarios 1 and 2, respectively. Also the true treatment effect is θ=0.80. The darker area in the left panel shows the region in the sensitivity parameter space in which the bias is smaller and right panel shows the area in which the bias is not significant based on 500 samples (i.e., the confidence interval for (0.80−θˆ) contains zero). The shape of the area which leads to an unbiased estimate changes by the value of the presumed (γAS4,γBS4) and, in general, when the presumed values of (γAS4,γBS4) are far from the true values, the unbiased area is small which means it would be less likely to find an unbiased treatment effect estimate using sensitivity analysis (see also Tables 4 and 5 in the Appendix). We have tabulated the power and type-I error rate for different values of the presumed sensitivity parameters in Table 3. This Table summarizes the result for three different values of the true treatment effect θ=0.00,0.50, and 0.80. The nominal type-I error rate is 5%.

$Figure 3: Simulation study (Scenario 1): Sensitivity analysis of θ=0.80$${\bf \it\theta} = 0.80$$. Sensitivity parameters are (γAS4,γBS4,α1A,α1B)$$(\gamma _A^{S4}, \gamma _B^{S4}, {\alpha _{1A}}, {\alpha _{1B}})$$. The darker area in left panel indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5%$$5 \% $$. The true vector of sensitivity parameters are (γAS4,γBS4,α1A,α1B)=37,35,1,2$$(\gamma _A^{S4}, \gamma _B^{S4}, {\alpha _{1A}}, {\alpha _{1B}}) = \left({{3 \over 7}, {3 \over 5}, 1, 2} \right)$$.$

Figure 3:

Simulation study (Scenario 1): Sensitivity analysis of θ=0.80. Sensitivity parameters are (γAS4,γBS4,α1A,α1B). The darker area in left panel indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5%. The true vector of sensitivity parameters are (γAS4,γBS4,α1A,α1B)=37,35,1,2.

$Figure 4: Simulation study (Scenario 2): Sensitivity analysis of θ=0.80$${\bf \it\theta} = 0.80$$. Sensitivity parameters are (γAS4,γBS4,α1A,α1B)$$(\gamma _A^{S4}, \gamma _B^{S4}, {\alpha _{1A}}, {\alpha _{1B}})$$. The darker area in left panel indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5%$$5 \% $$. The true vector of sensitivity parameters are (γAS4,γBS4,α1A,α1B)=23,27,1,2$$(\gamma _A^{S4}, \gamma _B^{S4}, {\alpha _{1A}}, {\alpha _{1B}}) = \left({{2 \over 3}, {2 \over 7}, 1, 2} \right)$$.$

Figure 4:

Simulation study (Scenario 2): Sensitivity analysis of θ=0.80. Sensitivity parameters are (γAS4,γBS4,α1A,α1B). The darker area in left panel indicates the smaller bias and the darker area in the right panel shows the region in which the bias is not significant at 5%. The true vector of sensitivity parameters are (γAS4,γBS4,α1A,α1B)=23,27,1,2.

Table 3:

Simulation study: Power analysis of θ=0.00,0.50 and 0.80. The true vector of sensitivity parameters in scenarios 1 and 2 are (γAS4,γBS4,α1A,α1B)=(37,35,1,2), and (23,27,1,2), respectively.

Presumed	Scenario 1			Scenario 2
(γAS4,γBS4,α1A,α1B)	0.00	0.50	0.80	0.00	0.50	0.80
(0.3,0.6,1,2)	0.38	0.90	0.98	1.00	1.00	1.00
(0.3,0.6,0,2)	1.00	0.00	0.00	0.07	0.73	0.96
(0.3,0.6,1,0)	1.00	1.00	1.00	1.00	1.00	1.00
(0.3,0.6,0,0)	0.12	0.35	0.80	1.00	1.00	1.00
(0.5,0.5,1,2)	0.40	0.08	0.38	0.93	1.00	1.00
(0.5,0.5,0,2)	1.00	0.00	0.00	0.04	0.40	0.75
(0.5,0.5,1,0)	0.96	1.00	1.00	1.00	1.00	1.00
(0.5,0.5,0,0)	0.10	0.29	0.76	1.00	1.00	1.00
(0.2,0.8,1,2)	0.98	1.00	1.00	1.00	1.00	1.00
(0.2,0.8,0,2)	0.94	0.00	0.05	0.61	1.00	1.00
(0.2,0.8,1,0)	1.00	1.00	1.00	1.00	1.00	1.00
(0.2,0.8,0,0)	0.06	0.24	0.65	1.00	1.00	1.00
(0.7,0.2,1,2)	1.00	0.00	0.00	0.30	0.08	0.32
(0.7,0.2,0,2)	1.00	0.00	0.00	0.98	0.00	0.00
(0.7,0.2,1,0)	0.65	0.99	1.00	1.00	1.00	1.00
(0.7,0.2,0,0)	0.12	0.39	0.87	1.00	1.00	1.00

5 Application to real data

The Health Improvement Network (THIN) is a large database, which contains the electronic medical record of more than 11 million patients. The data contains longitudinal measurements of diagnostic and prescription data and baseline characteristics collected from over 500 general practices in the UK. We are interested in assessing the effect of metformin and sulfonylureas on body mass index (BMI, calculated as mass in kilograms divided by the square of height in meters) among diabetic patients. Our focus is on patients who are taking these treatments as an initial treatment and the BMI is measured two years after treatment initiation. In our analysis we coded metformin and sulfonylureas as A and B, respectively.

In the modern era, metformin is universally acknowledged as the appropriate first-line medication for diabetes type 2 except where contraindicated [26]. While this dominance was being established in the late 1990s and early 2000s, sulfonylureas were the competing first line agent. During that time the prevalence of metformin use in clinical practice rose very quickly, at the expense of sulfonylureas.

The dynamic aspect of the preference justifies the use of provider preference as an IV. We define the preference as a time-varying quantity such that for each practice ID, the IV is defined as an average of metformin use during each two year timeframe from 1998 to 2012. We assign Z=A if the average is more than 50% and B otherwise. Thus, a particular practice ID may have Z=A for one period of time and B for the other.

Our data includes patients who have been assigned to medications other than metformin or sulfonylureas as an initial therapy. Figure 5 presents the sensitivity analysis for the estimation of the treatment effect of interest based on a four dimensional sensitivity parameter (γAS4,γBS4,α1A,α1B).

Figure 5:

THIN data: Sensitivity analysis for comparing the effects on BMI of sulfonylureas vs. metformin. Numbers on the plot present treatment effect estimates. A: metformin; B: sulfonylureas.

The parameter α1A is the log odds coefficient of BMI in predicting the probability that a patient would take D=B (sulfonylureas) if if she saw a physician with sulfonylurea preference (Z=B) given that a patient would take D=A if she saw a physician with metformin preference (Z=A). The parameter γAS4 is the probability of being a complier (i.e., always taking the same medication as the physician preference) for a patient who would have taken D=A if she saw a physician with metformin preference (Z=A). The parameters α1B and γBS4 are defined similarly.

Since there is a clinical sense that sulfonylurea is associated with weight gain [27, 28], physicians may be less likely to prescribe sulfonylurea to high weight patients. This would imply that among patients with D(A)=A, the odds of being a complier should decrease with y. Analogous reasoning suggests that among patients with D(B)=B, the odds of being a complier will increase with y. If this reasoning holds, then we can restrict our sensitivity parameter domain to α1A<0 and α1B>0. Also, on average patients who have taken a sulfonylurea if seeing a physician with a sulfonylurea preference are more likely to be compliers compared with those who have taken metformin given seeing a physician with metformin preference. This means that it is very likely that γAS4<γBS4. In the Appendix, we summarize the results based on the extreme values of the sensitivity parameters (Table 6).

The range of the treatment effect estimate varies by different choices of γAS4 and γBS4. Specifically, for γAS4=0.2 and γBS4=0.6, the treatment effect lies in [−0.5,1]; while for γAS4=0.5 and γBS4=0.3, it lies in [−0.7,0.5]. Based on our sensitivity analysis, it is unlikely that metformin be associated with increase in BMI and a large negative effect (i.e., E[Y(A)−Y(B)|PS=S4]) is also unlikely because it requires very small and large values of α1A and α1B, respectively. Hence, if anything, the treatment effect of interest should lie in [−0.6,−0.1].

6 Discussion

This manuscript shows that IV analysis methods may fail to provide an unbiased estimate of treatment effects when analyzing data in which subjects are selected based on their received treatment. Specifically, this selection bias happens if the chance of including patients in the preselected data varies based on their IV value and/or some unmeasured confounders. For example, suppose we are interested in studying the comparative effect of treatment A and B while there is a third choice of treatment C. Suppose patients with more severe stages of the disease are more likely to be treated with C if seeing a doctor with A preference. Then if we analyzed a data that only includes patients who have received treatment A or B, severe cases will be under represented among patients who have seen doctors with A preference. This means that in the preselected data, the IV is associated with an unmeasured confounder which makes the IV invalid and results in a biased treatment effect estimate.

Within the principal strata framework, we develop a sensitivity analysis that can be used to estimate the treatment effect among compliers as a function of a vector of sensitivity parameters. Specifically, the sensitivity parameters are used to identify the probability of being a complier given the IV value, received treatment and the observed outcome. The dimension of the sensitivity parameter can be reduced if it is plausible to assume that there is no always C takers. This is because the proportions in the principal strata are identifiable which means γAS4 and γBS4 can be estimated using the observed data.

Funding statement: Funding: This work was partly supported by the National Science Foundation (grant SES-1260782).

Appendix

Appendix: Supplementary Tables

In this Appendix, we present the supplementary tables.

Table 4 shows the bias (0.80−θˆ) and mean squared error of the treatment effect estimate for different combinations of the presumed sensitivity parameters when the true treatment effect θ=0.80.
Table 5 shows the estimate and the standard error of the treatment effect when the sensitivity parameters are set to their extreme values (the true treatment effect is θ=0.80). Because γAS4 and γBS4 are probabilities of being a complier and we do not expect them to be very small, we set the minimum values of these parameters to be 0.1. This table approximates the analytical bounds and direction of the bias in our simulation scenarios.
Table 6 shows the estimate and the standard error of the treatment effect when the sensitivity parameters are set to their extreme values. Similar to Table 5, we set the minimum values of γAS4 and γBS4 to 0.1. This table shows that the effects on BMI of sulfonylureas vs. metformin varies between [–2.91, 4.27]. However, when we restrict the sensitivity parameters to the plausible range, i.e., γAS4≤γBS4, α1A≤0 and α1B≥0, the effect estimate lies in [–0.90,0.87].

Table 4:

Simulation study: Sensitivity analysis of θ=0.80. The true vector of sensitivity parameters in scenarios 1 and 2 are (γAS4,γBS4,α1A,α1B)=(37,35,1,2), and (23,27,1,2), respectively.

Presumed (γAS4,γBS4,α1A,α1B)	Scenario 1		Scenario 2
	Bias	MSE	Bias	MSE
(0.3,0.6,1,2)	–0.37	0.22	–1.92	3.79
(0.3,0.6,0,2)	1.37	1.95	–0.18	0.12
(0.3,0.6,1,0)	–1.54	2.45	–3.07	9.55
(0.3,0.6,0,0)	0.23	0.11	–1.32	1.83
(0.5,0.5,1,2)	0.46	0.28	–1.09	1.28
(0.5,0.5,0,2)	1.65	2.80	0.11	0.08
(0.5,0.5,1,0)	–0.95	1.04	–2.57	6.72
(0.5,0.5,0,0)	0.20	0.09	–1.35	1.89
(0.2,0.8,1,2)	–1.22	1.59	–2.76	7.80
(0.2,0.8,0,2)	0.87	0.81	–0.69	0.55
(0.2,0.8,1,0)	–1.86	3.54	–3.42	11.88
(0.2,0.8,0,0)	0.23	0.11	–1.32	1.79
(0.7,0.2,1,2)	1.98	4.03	0.44	0.30
(0.7,0.2,0,2)	2.72	7.54	1.20	1.53
(0.7,0.2,1,0)	–0.55	0.36	–2.09	4.45
(0.7,0.2,0,0)	0.18	0.10	–1.34	1.88

Table 5:

Simulation study: Sensitivity analysis of θ=0.80 for the extreme values of the sensitivity parameters. The true vector of sensitivity parameters in scenarios 1 and 2 are (γAS4,γBS4,α1A,α1B)=(37,35,1,2), and 23,27,1,2, respectively.

Presumed (γAS4,γBS4,α1A,α1B)	Scenario 1		Scenario 2
	Estimate	S.D.	Estimate	S.D.
(1,1,+∞,+∞)	0.60	0.23	1.98	0.23
(1,1,+∞,−∞)	1.83	0.20	2.53	0.22
(1,1,−∞,+∞)	–1.19	0.20	0.20	0.17
(1,1,−∞,−∞)	0.07	0.14	0.74	0.16
(0.1,0.1,+∞,+∞)	5.66	2.50	13.93	2.15
(0.1,0.1,+∞,−∞)	19.45	1.72	21.05	2.11
(0.1,0.1,−∞,+∞)	–13.35	1.71	–4.85	0.64
(0.1,0.1,−∞,−∞)	0.61	0.43	2.27	0.45
(0.1,1,+∞,+∞)	15.84	1.88	17.33	2.36
(0.1,1,+∞,−∞)	16.98	1.78	17.88	2.36
(0.1,1,−∞,+∞)	–2.92	0.36	–1.48	0.41
(0.1,1,−∞,−∞)	–1.64	0.32	–0.94	0.40
(1,0.1,+∞,+∞)	–9.82	1.82	–1.44	0.53
(1,0.1,+∞,−∞)	4.08	0.38	5.67	0.35
(1,0.1,−∞,+∞)	–11.74	1.78	–3.23	0.50
(1,0.1,−∞,−∞)	2.31	0.37	3.87	0.31

Table 6:

THIN data: Sensitivity analysis for comparing the effects on BMI of sulfonylureas vs. metformin. The treatment effect is defined as θ=E[Y(A)−Y(B)|PI=S4]. A: metformin; B: sulfonylureas.

Presumed (γAS4,γBS4,α1A,α1B)	Estimate	S.D.
(1,1,+∞,+∞)	0.09	0.03
(1,1,+∞,−∞)	0.52	0.08
(1,1,−∞,+∞)	–0.15	0.02
(1,1,−∞,−∞)	0.28	0.06
(0.1,0.1,+∞,+∞)	1.15	0.30
(0.1,0.1,+∞,−∞)	4.27	0.52
(0.1,0.1,−∞,+∞)	–2.91	0.60
(0.1,0.1,−∞,−∞)	1.52	0.54
(0.1,1,+∞,+∞)	1.45	0.18
(0.1,1,+∞,−∞)	1.88	0.20
(0.1,1,−∞,+∞)	–1.29	0.16
(0.1,1,−∞,−∞)	–0.86	0.19
(1,0.1,+∞,+∞)	–0.21	0.17
(1,0.1,+∞,−∞)	2.90	0.47
(1,0.1,−∞,+∞)	–0.45	0.16
(1,0.1,−∞,−∞)	2.66	0.46
(0.1,1,−∞,+∞)	–0.90	0.14
(0.1,1,−∞,0)	–0.49	0.12
(0.1,1,0,+∞)	0.26	0.07
(0.1,1,0,0)	0.87	0.09
(0.5,0.5,−∞,+∞)	–0.38	0.14
(0.5,0.5,−∞,0)	0.13	0.10
(0.5,0.5,0,+∞)	0.30	0.12
(0.5,0.5,0,0)	0.82	0.08

References

1. Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. J Am Stat Assoc 1996;91:444–55.10.3386/t0136Search in Google Scholar

2. Newhouse JP, McClellan M. Econometrics in outcomes research: the use of instrumental variables. Ann Rev Public Health 1998;19:17–34.10.1146/annurev.publhealth.19.1.17Search in Google Scholar PubMed

3. Greenland S. An introduction to instrumental variables for epidemiologists. Int J Epidemiol 2000;29:722–9.10.1093/ije/29.4.722Search in Google Scholar PubMed

4. Hernán MA, Robins JM. Instruments for causal inference: an epidemiologist’s dream? Epidemiology 2006;17:360–72.10.1097/01.ede.0000222409.00878.37Search in Google Scholar PubMed

5. Cheng J, Qin J, Zhang B. Semiparametric estimation and inference for distributional and general treatment effects. J R Stat Soc Ser B Stat Method 2009a;71:881–904.10.1111/j.1467-9868.2009.00715.xSearch in Google Scholar

6. Baiocchi M, Cheng J, Small DS. Tutorial in biostatistics: instrumental variable methods for causal inference. 2014.10.1002/sim.6128Search in Google Scholar PubMed PubMed Central

7. Imbens GW. Instrumental variables: an econometrician’s perspective. Technical report, National Bureau of Economic Research, 2014.10.3386/w19983Search in Google Scholar

8. Brookhart MA, Wang P, Solomon DH, Schneeweiss S. Evaluating short-term drug effects using a physician-specific prescribing preference as an instrumental variable. Epidemiology 2006;17:268.10.1097/01.ede.0000193606.58671.c5Search in Google Scholar PubMed PubMed Central

9. Brooks JM, Chrischilles EA, Scott SD, Chen-Hardee SS. Was breast conserving surgery underutilized for early stage breast cancer? Instrumental variables evidence for stage II patients from Iowa. Health Serv Res 2003;38:1385–402.10.1111/j.1475-6773.2003.00184.xSearch in Google Scholar PubMed PubMed Central

10. Brookhart MA, Schneeweiss S. Preference-based instrumental variable methods for the estimation of treatment effects: assessing validity and interpreting results. Int J Biostat 2007;3:1–25.10.2202/1557-4679.1072Search in Google Scholar PubMed PubMed Central

11. Wald A. The fitting of straight lines if both variables are subject to error. Ann Math Stat 1940;11:284–300.10.1214/aoms/1177731868Search in Google Scholar

12. Imbens GW, Rubin DB. Estimating outcome distributions for compliers in instrumental variables models. Rev Econ Stud 1997b;64:555–74.10.2307/2971731Search in Google Scholar

13. Imbens GW, Rubin DB. Bayesian inference for causal effects in randomized experiments with noncompliance. Ann Stat 1997a;25:305–27.10.1214/aos/1034276631Search in Google Scholar

14. Cheng J, Small DS, Tan Z, Ten Have TR. Efficient nonparametric estimation of causal effects in randomized trials with noncompliance. Biometrika 2009b;96:19–36.10.1093/biomet/asn056Search in Google Scholar

15. Cheng J, Small DS. Bounds on causal effects in three-arm trials with non-compliance. J R Stat Soc Ser B Stat Method 2006;68:815–36.10.1111/j.1467-9868.2006.00568.xSearch in Google Scholar

16. Long Q, Little RJ, Lin X. Estimating causal effects in trials involving multitreatment arms subject to non-compliance: a Bayesian framework. J R Stat Soc Ser C Appl Stat 2010;59:513–31.10.1111/j.1467-9876.2009.00709.xSearch in Google Scholar PubMed PubMed Central

17. Basu A, Heckman JJ, Navarro-Lozano S, Urzua S. Use of instrumental variables in the presence of heterogeneity and self-selection: an application to treatments of breast cancer patients. Health Econ 2007;16:1133–57.10.1002/hec.1291Search in Google Scholar PubMed

18. Chen H, Mehta S, Aparasu R, Patel A, Ochoa-Perez M. Comparative effectiveness of monotherapy with mood stabilizers versus second generation (atypical) antipsychotics for the treatment of bipolar disorder in children and adolescents. Pharmacoepidemiol Drug Saf 2014;23:299–308.10.1002/pds.3568Search in Google Scholar PubMed

19. Hadley J, Polsky D, Mandelblatt JS, Mitchell JM, Weeks JC, Wang Q, et al. An exploratory instrumental variable analysis of the outcomes of localized breast cancer treatments in a medicare population. Health Econ 2003;12:171–86.10.1002/hec.710Search in Google Scholar PubMed

20. Hadley J, Yabroff KR, Barrett MJ, Penson DF, Saigal CS, Potosky AL. Comparative effectiveness of prostate cancer treatments: evaluating statistical adjustments for confounding in observational data. J National Cancer Inst 2010;102:1780–93.10.1093/jnci/djq393Search in Google Scholar PubMed PubMed Central

21. Suh HS, Hay JW, Johnson KA, Doctor JN. Comparative effectiveness of statin plus fibrate combination therapy and statin monotherapy in patients with type 2 diabetes: use of propensity-score and instrumental variable methods to adjust for treatment-selection bias. Pharmacoepidemiol Drug Saf 2012;21:470–84.10.1002/pds.3261Search in Google Scholar PubMed

22. Swanson SA, Robins JM, Miller M, Hernán MA. Selecting on treatment: a pervasive form of bias in instrumental variable analyses. Am J Epidemiol 2015;181:191–7.10.1093/aje/kwu284Search in Google Scholar PubMed PubMed Central

23. Swanson S. Contributions to instrumental variable methods in epidemiology. Boston, MA: Ph.D. thesis, Harvard School of Public Health, 2014.Search in Google Scholar

24. Gilbert PB, Bosch RJ, Hudgens MG. Sensitivity analysis for the assessment of causal vaccine effects on viral load in HIV vaccine trials. Biometrics 2003;59:531–41.10.1111/1541-0420.00063Search in Google Scholar PubMed

25. Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics 2002;58:21–9.10.1111/j.0006-341X.2002.00021.xSearch in Google Scholar

26. Nathan D, Buse J, Davidson M, Ferrannini E, Holman R, Sherwin R, et al. American diabetes association; European association for study of diabetes. Medical management of hyperglycemia in type 2 diabetes: a consensus algorithm for the initiation and adjustment of therapy: a consensus statement of the American diabetes association and the European association for the study of diabetes. Diabetes Care 2009;32:193–203.10.2337/dc08-9025Search in Google Scholar PubMed PubMed Central

27. Flory JH, Small DS, Cassano PA, Brillon DJ, Mushlin AI, Hennessy S. Comparative effectiveness of oral diabetes drug combinations in reducing glycosylated hemoglobin. J Comp Eff Res 2014;3:29–39.10.2217/cer.13.87Search in Google Scholar PubMed PubMed Central

28. Forst T, Hanefeld M, Jacob S, Moeser G, Schwenk G, Pfützner A, et al. Association of sulphonylurea treatment with all-cause and cardiovascular mortality: a systematic review and meta-analysis of observational studies. Diabetes Vasc Dis Res 2013;10:302–14.10.1177/1479164112465442Search in Google Scholar PubMed

Published Online: 2016-5-26

Published in Print: 2016-5-1

Articles in the same Issue

https://doi.org/10.1515/ijb-2015-0006

Keywords for this article

selection bias; instrumental variables; sensitivity analysis