Role of placebo samples in observational studies

Ting Ye; Qijia He; Shuxiao Chen; Bo Zhang

doi:10.1515/jci-2023-0020

Article Open Access

Role of placebo samples in observational studies

Ting Ye , Qijia He , Shuxiao Chen and Bo Zhang

Published/Copyright: March 5, 2025

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Journal of Causal Inference Volume 13 Issue 1

Abstract

In an observational study, it is common to leverage known null effects to detect bias. One such strategy is to set aside a placebo sample – a subset of data immune from the hypothesized cause-and-effect relationship. Existence of an effect in the placebo sample raises concerns about unmeasured confounding bias while the absence of it helps corroborate the causal conclusion. This article describes a framework for using a placebo sample to detect and remove bias. We state the identification assumptions and develop estimation and inference methods based on outcome regression, inverse probability weighting, and doubly robust approaches. Simulation studies investigate the finite-sample performance of the proposed methods. We illustrate the methods using an empirical study of the effect of the earned income tax credit on infant health.

Keywords: causal inference; placebo test; treatment effect heterogeneity; unmeasured confounding

MSC 2010: 62D20

1 Introduction

A common task of observational studies is to infer a cause-and-effect relationship. Unlike well-controlled randomized experiments where physical randomization ensures the validity of causal conclusions, observational studies suffer from various challenges that could compromise their causal conclusions. One major assumption researchers make is the treatment ignorability assumption [1], also known as the no unmeasured confounders assumption (NUCA) [2], which states that the exposure and control groups are comparable after adjusting for observed pre-exposure covariates. Unfortunately, concerns of bias from confounders that are not measured (e.g., laboratory results in the medical claims data) and cannot be measured (e.g., motivation/personality in social science research) almost always persist.

How to address the unmeasured confounding bias? Broadly speaking, there are three types of strategies. First, one can conduct a sensitivity analysis that relaxes the NUCA and assesses the effect under a posited sensitivity analysis model (see, e.g., [3–5], among many others). Methods in the second category utilize natural experiments, with instrumental variable (IV) methods being a prime example among these approaches [6]. Methods in the third category utilize the auxiliary information about the causal mechanism and the nature of the suspected unmeasured confounder, and these methods aim to detect, quantify, and remove the unmeasured confounding bias. One prominent example is the methodology leveraging negative control outcomes, i.e., outcome variables not affected by the treatment [7–10]. The popular difference-in-differences (DID) estimation approach [11] can also be recast as an NCO method [12]. The utility of negative controls is also recognized outside the causal inference literature. In the context of statistical genomics and computational biology, control genes have been used to effectively differentiate between biological signals and confounding noise in high-throughput data [13–17].

This article studies a method that utilizes the auxiliary information in a different way. The idea is to set aside a subset of data, which we refer to as a placebo sample and is immune from the hypothesized cause-and-effect relationship, analyze the exposure–outcome association in the placebo sample, and integrate this knowledge into estimating the causal effect. Intuitively, the presence of an exposure–outcome association in the placebo sample would raise serious concerns of the NUCA, while a null association helps corroborate the causal conclusion.

A placebo sample has been used as a bias detection device in empirical studies (see, e.g., Hoynes et al. [18]) and mentioned in general discussions [19]. One prominent application scenario is policy evaluation, where individuals not eligible or minimally affected by the policy constitute a placebo sample. In a study of the effect of minimum wages and the earned income tax credit (EITC) on deaths of despair – deaths due to drug overdose, suicide, and alcohol-related causes, Dow et al. [20] used college graduates as the placebo sample because they are unlikely “to be exposed to minimum wage jobs or to be eligible for the EITC.” Placebo sample is a useful bias detection device; however, several limitations are evident. If we fail to detect bias, this cannot be equated with the absence of bias, as it could also be due to a lack of power, especially when the placebo sample size is small. Conversely, finding evidence of bias does not necessarily nullify the causal conclusion. This article builds upon the identification assumptions and estimation methods in the NCO literature and proposes some parallel strategies that incorporate a placebo sample in causal parameter identification, estimation, and statistical inference.

2 Methods

2.1 Nonparametric identification

Consider a binary exposure A ∈ { 0 , 1 } and potential outcomes { Y ( 0 ) , Y ( 1 ) } . Throughout the article, we assume the consistency assumption and stable unit treatment value assumption (SUTVA) so that the observed outcome Y satisfies Y = A Y ( 1 ) + ( 1 − A ) Y ( 0 ) [21]. Suppose that we have a binary baseline covariate S such that S = 0 represents the placebo sample that is unaffected by the exposure. We call subjects with S = 1 the primary sample.

Assumption 1

(Placebo sample) For S = 0 , Y ( 1 ) = Y ( 0 ) = Y almost surely.

Denote the other baseline covariates as X . Suppose that a random sample of n subjects is obtained, which is written as { ( Y i , A i , X i , S i ) : i = 1 , … , n } and assumed to be independent and identically distributed according to the joint law of ( Y ( A ) , A , X , S ) . Our target parameter is the average treatment effect of the treated (ATT) in the primary sample

(1) θ 0 = E { Y ( 1 ) − Y ( 0 ) ∣ S = 1 , A = 1 } .

Assumption 2 is our key identification assumption that links the subjects with S = 0 and S = 1 . Figure 1 illustrates the idea.

Figure 1

Illustration of Assumption 2.

Assumption 2

(Additive equi-confounding) E { Y ( 0 ) ∣ S = 1 , A = 1 , X } − E { Y ( 0 ) ∣ S = 1 , A = 0 , X } = E { Y ( 0 ) ∣ S = 0 , A = 1 , X } − E { Y ( 0 ) ∣ S = 0 , A = 0 , X } almost surely.

The left-hand side of Assumption 2 encodes the level of confounding bias. Because the observed covariates, X may fail to render the exposure and control groups in the S = 1 stratum comparable, E { Y ( 0 ) ∣ S = 1 , A = 1 , X } ≠ E { Y ( 0 ) ∣ S = 1 , A = 0 , X } in general. However, Assumption 2 states that the extent of residual confounding bias in the S = 1 stratum is precisely equal to that in S = 0 , making it possible to debias using the placebo sample.

A similar assumption also appears in the DID and NCO literature. With repeated cross-sectional data, DID assumes a parallel trends assumption, which can be thought of as a special case of Assumption 2 that uses the pre- and post-exposure time indicator as S . As such, DID with repeated cross-sectional data can be recast as a placebo sample approach that uses a sample collected before the onset of the treatment as a placebo sample. For the NCO literature, the additive equi-confouding assumption says that the residual confounding bias for an NCO and the outcome is the same [12], compared to which Assumption 2 may be more reasonable as it is about the same outcome and is invariant to scaling of the outcome variable. Additional discussion of the assumptions can be found in Supplement §S1.2.

Consistent with the literature on DID, our focus is on the ATT, which can be identified without imposing restrictions on treatment effects for the primary sample. The average treatment effect is not identifiable without additional assumptions; therefore, it is not considered in this article.

Finally, we make a standard positivity assumption.

Assumption 3

(Positivity) For some ε > 0 , P ( A = 1 , S = 1 ) > ε , P ( S = 1 ∣ X ) < 1 − ε , ε < P ( A = 1 ∣ S = 0 , X ) < 1 − ε , and P ( A = 1 ∣ S = 1 , X ) < 1 − ε , with probability 1.

We give some intuition before formally stating the identification results. For s = 0 , 1 , a = 0 , 1 , and every x , let μ Y ( s , a , x ) = E { Y ∣ S = s , A = a , X = x } , and Δ s ( x ) = μ Y ( s , 1 , x ) − μ Y ( s , 0 , x ) be observed data functions. The target causal parameter of interest can be decomposed into a contrast term C ( X ) and a bias term B ( X ) as follows:

E { Y ( 1 ) − Y ( 0 ) ∣ S = 1 , A = 1 , X } = E { Y ( 1 ) ∣ S = 1 , A = 1 , X } − E { Y ( 0 ) ∣ S = 1 , A = 0 , X } ︸ contrast C ( X ) − [ E { Y ( 0 ) ∣ S = 1 , A = 1 , X } − E { Y ( 0 ) ∣ S = 1 , A = 0 , X } ] ︸ bias B ( X ) .

By the consistency assumption, the contrast function C ( X ) equals Δ 1 ( X ) and is identified from observed data. By Assumption 2, the bias function B ( X ) equals E { Y ( 0 ) ∣ S = 0 , A = 1 , X } − E { Y ( 0 ) ∣ S = 0 , A = 0 , X } , which then equals Δ 0 ( X ) by Assumption 1. Finally, the target parameter E { Y ( 1 ) − Y ( 0 ) ∣ S = 1 , A = 1 , X } equals Δ 1 ( X ) − Δ 0 ( X ) and is identified from observed data. Alternatively, identification can be obtained from inverse probability weighting (IPW) that avoids modeling the outcome distribution. These two identification results are stated in Proposition 1.

Proposition 1

Under Assumptions 1–2, θ 0 defined in (1) is identified by
θ 0 = E { Δ 1 ( X ) − Δ 0 ( X ) ∣ S = 1 , A = 1 } .
Suppose that Assumption 3 also holds, then
θ 0 = 1 E { S A } E S − π S ( X ) 1 − π S ( X ) π A ( X , 1 ) { A − π A ( X , S ) } π A ( X , S ) { 1 − π A ( X , S ) } Y ,
where π S ( X ) = P ( S = 1 ∣ X ) and π A ( X , S ) = P ( A = 1 ∣ X , S ) .

All proofs in this article can be found in Supplement §S2. There are interesting connections among the DID, NCO, and placebo sample methods. Both NCO and placebo sample methods exploit the known null effect: the former leverages a placebo outcome known not to be affected by the treatment, whereas the latter utilizes a placebo population known not to be affected by the treatment. We view these two methods as complementary devices that can even be used in the same study. For instance, in the study of EITC on deaths of despair, Dow et al. [20] used both devices – a cancer outcome as an NCO and college graduates as a placebo sample. In another example, to study the causal effect of ivacaftor on lung functions in cystic fibrosis patients, Newsome et al. [22] also utilized both devices – outcomes measured in the pre-ivacaftor period as NCOs and patients ineligible for ivacaftor due to their genotype as a placebo sample. In addition, the DID with longitudinal data can be interpreted as an NCO method that uses the pre-exposure outcome as an NCO, and the DID with repeated cross-sectional data can be interpreted as a placebo sample method that uses a sample drawn before the exposure is in place as the placebo sample.

We note that Proposition 1 (b) slightly generalizes the IPW method proposed by Abadie [23], which is applicable only when S is independent of all the other variables.

2.2 Estimation and semiparametric inference

Proposition 1 suggests two estimators. The first one is a regression-based estimator

θ ˆ reg = 1 n 11 ∑ i = 1 n S i A i { Δ 1 ( X i ; β ˆ ) − Δ 0 ( X i ; β ˆ ) } ,

where n 11 = ∑ i = 1 n I ( S i = 1 , A i = 1 ) , Δ s ( x ; β ˆ ) = μ Y ( s , 1 , x ; β ˆ ) − μ Y ( s , 0 , x ; β ˆ ) , μ Y ( s , a , x ; β ) is a model of μ Y ( s , a , x ) parameterized by the finite-dimensional parameter β , and β ˆ is an estimator of β . The regression-based estimator θ ˆ reg is consistent for the target parameter when outcome regression models μ Y ( s , a , x ) are correctly specified by μ Y ( s , a , x ; β ) . In practice, researchers often specify generalized linear regression models for these outcome models.

A second estimator is an IPW estimator

θ ˆ ipw = 1 n 11 ∑ i = 1 n S i − π S ( X i ; ψ ˆ ) 1 − π S ( X i ; ψ ˆ ) π A ( X i , 1 ; α ˆ ) { A i − π A ( X i , S i ; α ˆ ) } π A ( X i , S i ; α ˆ ) { 1 − π A ( X i , S i ; α ˆ ) } Y i ,

where π S ( x ; ψ ) is a model of π S ( x ) parametrized by finite-dimensional parameter ψ , π A ( x , s ; α ) a model of π A ( x ) parametrized by α , and ψ ˆ and α ˆ estimators of ψ and α . In practice, researchers often fit logistic regression models for π S ( x ; ψ ) and π A ( x ; ψ ) . IPW-type estimators could be unstable when the (estimated) π A ( ⋅ ) and 1 − π S ( ⋅ ) are close to zero, and a common way to stabilize the weights is by normalization [24].

The reliability of the regression-based estimator and the IPW estimator depends on correct specification of different parts of the likelihood. When we are uncertain about which models are correctly specified, it is of interest to develop a doubly robust estimator that is guaranteed to be consistent and deliver valid inference about θ 0 , provided that either { μ Y } or { π S , π A } , but not necessarily both, are correctly specified. The next theorem derives the efficient influence function for θ [25,26] in the nonparametric model, where Assumptions 1–3 are assumed for identification, but the model is otherwise unrestricted. Similar to the discussion in Park and Tchetgen Tchetgen [27] regarding DID, our Assumptions 1 and 2 do not have any testable implications on the observed data (except for a falsification condition if the outcome is known to satisfy certain support conditions). Theorem 1 also provides the basis for constructing a doubly robust estimator.

Theorem 1

Under Assumptions 1–3 and the nonparametric model, denote the observed data as O = ( Y , A , S , X ) , the efficient influence function for θ is

EIF ( O ; θ ) = S A E { S A } { Y − μ Y ( 1 , 0 , X ) − μ Y ( 0 , 1 , X ) + μ Y ( 0 , 0 , X ) − θ } − S ( 1 − A ) π A ( X , 1 ) 1 − π A ( X , 1 ) E { S A } { Y − μ Y ( 1 , 0 , X ) } − ( 1 − S ) A π A ( X , 1 ) π A ( X , 0 ) π S ( X ) 1 − π S ( X ) E { S A } { Y − μ Y ( 0 , 1 , X ) } + ( 1 − S ) ( 1 − A ) π A ( X , 1 ) 1 − π A ( X , 0 ) π S ( X ) 1 − π S ( X ) E { S A } { Y − μ Y ( 0 , 0 , X ) } .

The efficient influence function gives an estimator θ ˆ dr defined as the solution to ∑ i = 1 n EIF ( O i ; θ , η ˆ ) = 0 , where η ˆ = ( μ ˆ Y , π ˆ A , π ˆ S , λ ˆ ) denotes the collection of nuisance parameters, and λ ˆ = n 11 ⁄ n estimates E { S A } . We prove in Supplement §S2 that θ ˆ dr is doubly robust.

Next, we derive the asymptotic properties of θ ˆ dr . Let ‖ f ‖ 2 = { ∫ f 2 ( o ) d P ( o ) } 1 ⁄ 2 denote the L 2 ( P ) norm of any real-valued function f , and ‖ f ‖ 2 = ∑ j = 1 ℓ ‖ f j ‖ 2 for any collection of real-valued functions f = ( f 1 , … , f ℓ ) , where P denotes the distribution of O . Moreover, let η 0 = ( μ Y 0 , π A 0 , π S 0 , λ 0 ) denote the true values of the nuisance parameters.

Assumption 4

( θ ˆ dr , η ˆ ) → P ( θ 0 , η ¯ ) , where η ¯ = ( μ ¯ Y , π ¯ A , π ¯ S , λ 0 ) with either μ ¯ Y = μ Y 0 or ( π ¯ A , π ¯ S ) = ( π A 0 , π S 0 ) .
For some ε > 0 , π ˆ S ( X ) < 1 − ε , ε < π ˆ A ( 0 , X ) < 1 − ε , and π ˆ A ( 1 , X ) < 1 − ε with probability 1.
For each θ in an open subset of the real line and each η in a metric space, let EIF ( o ; θ , η ) be a measurable function such that the class of functions { EIF ( o ; θ , η ) : ∣ θ − θ 0 ∣ < ε , ‖ η − η ¯ ‖ 2 < ε } is Donsker for some ε > 0 , and such that E { EIF ( O ; θ , η ) − EIF ( O ; θ 0 , η ¯ ) } 2 → 0 as ( θ , η ) → ( θ 0 , η ¯ ) .

Assumption 4 (a) describes the double robustness of our proposed estimator. Assumption 4 (b) is standard for M-estimators [26, Chapter 5.4].

Theorem 2 summarizes the doubly robust and locally efficient property of θ ˆ dr .

Theorem 2

Under Assumptions 1–4, θ ˆ dr satisfies

θ ˆ dr − θ 0 = O P { n − 1 ⁄ 2 + ( ‖ π ˆ A − π A 0 ‖ 2 + ‖ π ˆ S − π S 0 ‖ 2 ) ‖ μ ˆ Y − μ Y 0 ‖ 2 } .

Suppose further that ( ‖ π ˆ A − π A 0 ‖ 2 + ‖ π ˆ S − π S 0 ‖ 2 ) ‖ μ ˆ Y − μ Y 0 ‖ 2 = o P ( n − 1 ⁄ 2 ) , then

(2) n ( θ ˆ dr − θ 0 ) → d N ( 0 , E { EIF ( O ; θ 0 , η 0 ) 2 } ) ,

and θ ˆ dr achieves the semiparametric efficiency bound.

The first part of Theorem 2 characterizes the convergence rate of θ ˆ dr ; it also implies that θ ˆ dr is doubly robust in the sense that it is consistent when either { μ ˆ Y } or { π ˆ S , π ˆ A } is consistent. The second part of Theorem 2 says that if the nuisance parameters are consistently estimated with fast rate, e.g., if they are estimated using parametric methods, then their variance contributions are negligible, and θ ˆ dr achieves the semiparametric efficiency bound. The results in Theorems 1 and 2 generalize the results in the study of Sant-Anna and Zhao [28] to allow for S being correlated with A and X .

When (2) holds, a plug-in variance estimator for n θ ˆ dr can be easily constructed as n − 1 ∑ i = 1 n EIF ( O i ; θ ˆ dr , η ˆ ) 2 . Even if (2) does not hold, e.g., when only the model for { μ Y } or the models for { π S , π A } are correctly specified, but all the nuisance parameters are finite-dimensional and in the form of M-estimators, n θ ˆ dr is still consistent and asymptotically normal from standard M-estimation theory [29, Chapter 6]. Thus, a consistent variance estimator for n θ ˆ dr can be constructed under the M-estimation framework, allowing for doubly robust inference (see details in Supplement §S3). Alternatively, the nonparametric bootstrap is commonly used in practice. However, when nonparametric, data-adaptive estimators of these nuisance parameters are used, conducting doubly robust inference becomes much more challenging if one of the nuisance parameters is inconsistently estimated. In such cases, θ ˆ dr may become irregular and exhibit a slower convergence rate than the typical root- n . Recent advancements in addressing these challenges can be found in previous works [30,31].

Finally, we note that the Donsker condition in Assumption 4 can be relaxed by using sample splitting [32], which enables estimating the nuisance parameters using complex machine learning methods that do not satisfy the Donsker condition. For example, even without the Donsker condition in Assumption 4, the result in (2) holds when the nuisance parameters are estimated at faster than n − 1 ⁄ 4 -rates.

3 Sensitivity analyses relaxing identification assumptions

3.1 Sensitivity analysis under linear sensitivity models

We propose a sensitivity analysis method of Assumptions 1–2 under linear sensitivity models. Recall in Section 2.1, we decompose the target parameter as θ 0 = E { C ( X ) − B ( X ) ∣ S = 1 , A = 1 } , where C ( X ) = Δ 1 ( X ) is identified from the observed data. The key is then to bound B ( X ) when Assumptions 1–2 are violated. Consider the following sensitivity model.

Sensitivity Model 1.A E { Y ( 1 ) − Y ( 0 ) ∣ S = 0 , A = 1 , X } = λ 0 + λ X T X almost surely.

Sensitivity Model 1.B E { Y ( 0 ) ∣ S = 1 , A = 1 , X } − E { Y ( 0 ) ∣ S = 1 , A = 0 , X } − [ E { Y ( 0 ) ∣ S = 0 , A = 1 , X } − E { Y ( 0 ) ∣ S = 0 , A = 0 , X } ] = δ 0 + δ X T X almost surely.

Here, H ≔ { λ 0 , λ X , δ 0 , δ X } are sensitivity parameters. When all these parameters equal zero, Sensitivity Models 1.A and 1.B degenerate to Assumptions 1–2 in the primary analysis. Under Sensitivity Models 1.A and 1.B, we have that θ 0 = θ 0 ( H ) , where

θ 0 ( H ) = E { Δ 1 ( X ) − Δ 0 ( X ) ∣ S = 1 , A = 1 } − ( δ 0 − λ 0 ) − ( δ X − λ X ) T E { X ∣ S = 1 , A = 1 } .

Fix parameters Γ L , Γ U , Λ L , and Λ U and consider the set of sensitivity parameters

ℋ = { H : δ 0 , δ X 1 , … , δ X p ∈ [ Γ L , Γ U ] and λ 0 , λ X 1 , … , λ X p ∈ [ Λ L , Λ U ] } ,

where p = dim ( X ) . Then, θ 0 ( H ) is in the set

Θ 0 ≔ [ inf H ∈ ℋ θ 0 ( H ) , sup H ∈ ℋ θ 0 ( H ) ] .

The set Θ 0 is often referred to as the partially identified set under Sensitivity Models 1.A and 1.B and parameters Γ L , Γ U , Λ L , and Λ U .

A general method to construct confidence intervals for the set Θ 0 is the union method. Taking an estimator θ ˆ ∈ { θ ˆ reg , θ ˆ ipw , θ ˆ dr } of E { Δ 1 ( X ) − Δ 0 ( X ) ∣ S = 1 , A = 1 } , an estimator of θ 0 ( H ) is

θ ˆ ( H ) = θ ˆ − ( δ 0 − λ 0 ) − ( δ X − λ X ) T 1 n 11 ∑ i : S i = A i = 1 X i = θ ˆ − ( δ 0 − λ 0 ) − ∑ j = 1 p ( δ X j − λ X j ) 1 n 11 ∑ i : S i = A i = 1 X i j .

Suppose that for any fixed H , we construct a confidence interval [ L ( H ) , U ( H ) ] for θ 0 ( H ) , e.g., by nonparametric bootstrap, the interval [ inf H ∈ ℋ L ( H ) , sup H ∈ ℋ U ( H ) ] is an asymptotic confidence interval of θ 0 with at least ( 1 − α ) -coverage under the collection of sensitivity models ℋ . However, as discussed in the study of Zhao et al. [33], L ( H ) and U ( H ) may be complicated functions of H , and optimization over H may be complex. Therefore, we use

(3) [ Q α ⁄ 2 ( inf H ∈ ℋ θ ˆ * ( H ) ) , Q 1 − α ⁄ 2 ( sup H ∈ ℋ θ ˆ * ( H ) ) ]

as our confidence interval, where inf H ∈ ℋ θ ˆ * ( H ) is the analog of inf H ∈ ℋ θ ˆ ( H ) computed using each bootstrap sample, and Q α ⁄ 2 ( inf H ∈ ℋ θ ˆ * ( H ) ) is the α ⁄ 2 -percentile of inf H ∈ ℋ θ ˆ * ( H ) in the bootstrap distribution; the quantities in the upper bound are defined similarly. The interval in (3) is shown by Qingyuan Zhao et al. [33] to cover θ 0 with probability at least 1 − α asymptotically. Note that as θ ˆ * ( H ) is monotone in every element in H , the confidence interval (3) is computationally efficient.

3.2 Sensitivity analysis under marginal sensitivity models

We consider a second sensitivity model. With a slight abuse of notation, we still denote the sensitivity parameters as Λ and Γ , although they have distinct meanings compared to those in Section 3.1:

Sensitivity Model 2.A − Λ ≤ E { Y ( 1 ) − Y ( 0 ) ∣ S = 0 , A = 1 , X } ≤ Λ almost surely.

Sensitivity Model 2.B Let U denote the set of unmeasured confounders such that A ⊥ ( Y ( 0 ) , Y ( 1 ) ) ∣ U , X , S . Suppose that (i)

(4) 1 Γ ≤ OR { π S ( U , A , X ) , π S ( A , X ) } ≤ Γ ,

where π S ( U , A , X ) = P ( S = 1 ∣ U , A , X ) , π S ( A , X ) = P ( S = 1 ∣ A , X ) , and OR ( p 1 , p 2 ) = p 1 ⁄ ( 1 − p 1 ) p 2 ⁄ ( 1 − p 2 ) ; and (ii)

E { Y ( 0 ) ∣ U , X , S = 1 } − E { Y ( 0 ) ∣ U , X , S = 0 }

is a function of X alone.

Sensitivity models 2.A and 2.B allow the treatment to have a small effect on the placebo sample and that the distribution of U being dependent on S after conditioning on A and X . In particular, the model in (4) is termed the marginal sensitivity model in the literature [33]. When Γ = 1 , Sensitivity Model 2.B reduces to the case where U ⊥ S ∣ A , X . Finally, we suppose that E { Y ( 0 ) ∣ S = 0 , A , X } ≥ 0 almost surely, which is not a restriction because we can always add a constant to every Y i to make Y i ≥ 0 without affecting the parameter of interest.

Let r ( A , U , X ) = P ( U ∣ S = 1 , A , X ) ⁄ P ( U ∣ S = 0 , A , X ) , V ( X ) = E { Y ( 0 ) ∣ S = 1 , A = 1 , X } − E { Y ( 0 ) ∣ S = 1 , A = 0 , X } − [ E { Y ( 0 ) ∣ S = 0 , A = 1 , X } − E { Y ( 0 ) ∣ S = 0 , A = 0 , X } ] , and T ( X ) = E { Y ( 1 ) − Y ( 0 ) ∣ S = 0 , A = 1 , X } . Under the Sensitivity Model 2.A, we have T ( X ) ∈ [ − Λ , Λ ] . From the Bayes formula, we have r ( A , U , X ) = OR { π S ( U , A , X ) , π S ( A , X ) } , and thus, Γ − 1 ≤ r ( A , U , X ) ≤ Γ under the Sensitivity Model 2.B. We show in Supplement §2 that

(5) E [ Y ∣ S = 0 , A = 1 , X ] ( Γ − 1 − 1 ) − E [ Y ∣ S = 0 , A = 0 , X ] ( Γ − 1 ) ≤ V ( X ) ≤ E [ Y ∣ S = 0 , A = 1 , X ] ( Γ − 1 ) − E [ Y ∣ S = 0 , A = 0 , X ] ( Γ − 1 − 1 ) .

Finally, with θ 0 written as θ 0 = E [ Δ 1 ( X ) − Δ 0 ( X ) ∣ S = 1 , A = 1 ] + E [ T ( X ) − V ( X ) ∣ S = 1 , A = 1 ] , we can construct the bounds for θ 0 as [ θ L , θ U ] , where θ U = E [ Δ 1 ( X ) − Δ 0 ( X ) ∣ S = 1 , A = 1 ] + Λ − E [ E { Y ∣ S = 0 , A = 1 , X } ∣ S = 1 , A = 1 ] ( Γ − 1 − 1 ) + E [ E { Y ∣ S = 0 , A = 0 , X } ∣ S = 1 , A = 1 ] ( Γ − 1 ) and θ L = E [ Δ 1 ( X ) − Δ 0 ( X ) ∣ S = 1 , A = 1 ] − Λ − E [ E { Y ∣ S = 0 , A = 1 , X } ∣ S = 1 , A = 1 ] ( Γ − 1 ) + E [ E { Y ∣ S = 0 , A = 0 , X } ∣ S = 1 , A = 1 ] ( Γ − 1 − 1 ) . In correspondence to the three estimation approaches, the bounds in the sensitivity analysis can also be estimated in three ways (see Supplement §1 for their expressions).

4 Simulation study

We compare the placebo sample approach to naive methods based on NUCA, and investigate the operating characteristics of various estimators proposed in Section 2.2. We simulate the full data according to the following data-generating process with sample size n = 1,000 , 2,000, and 5,000:

X = ( X 1 , X 2 , X 3 ) , where X j ∼ N ( 0 , 1 ) for j = 1 , 2 , 3 . Each X j is truncated at − 3 and 3.
S is Bernoulli with P ( S = 1 ∣ X ) = expit { − 0.5 X 1 − X 2 + 0.3 X 3 − 0.5 X 2 X 3 + b } with (b1) b = 0.5 ; (b2) b = 0 ; and (b3) b = − 0.5 . The corresponding proportions of the placebo sample are approximately 38, 50, and 62% of the total sample size n , respectively.
A is Bernoulli with P ( A = 1 ∣ X , S ) = expit { − X 1 − X 2 + X 3 + X 2 X 3 + 0.2 S + 0.5 } .
U is Bernoulli with P ( U = 1 ∣ X , S , A ) satisfying (d1) P ( U = 1 ∣ X , S , A ) = 0.6 A + 0.2 and (d2) P ( U = 1 ∣ X , S , A ) = 0.6 A + 0.2 × sign { X 1 + X 2 } + 0.2 .
Y ( 0 ) ∣ X , U , S is Normal with unit variance and mean satisfying E { Y ( 0 ) ∣ X , U , S } = − X 1 − X 2 − ( X 3 + 0.5 X 2 X 3 ) S + 2 U + 2 .
For S = 0 , Y ( 1 ) = Y ( 0 ) . For S = 1 , (f1) Y ( 1 ) − Y ( 0 ) = 1 and (f2) Y ( 1 ) − Y ( 0 ) ∼ N ( 1 , 0.5 ) . The observed outcome Y = A Y ( 1 ) + ( 1 − A ) Y ( 0 ) . The target parameter θ 0 = 1 .

The observed data are { ( X i , S i , A i , Y i ) , i = 1 , … , n } . Four combinations of (d) and (e) all satisfy Assumption 2. Our simulation study can be summarized by the factorial design below:

Factor 1: Data-generating process of S = 1 ∣ X : (b1), (b2), and (b3);

Factor 2: Data-generating process of U ∣ X , S , A : (d1) and (d2);

Factor 3: Data-generating process of Y ( 0 ) ∣ X , U , S : (e);

Factor 4: Treatment effect: (f1) and (f2);

Factor 5: Estimator of θ 0 : (I) a naive regression estimator θ ˆ reg , naive that regresses Y on A and X in the S = 1 stratum, (II) a naive augmented inverse probability weighted estimator θ ˆ dr , naive based on subjects in S = 1 , and three placebo sample estimators proposed in Section 2.2, (III) regression-based estimator θ ˆ reg , (IV) stabilized IPW estimator θ ˆ ipw , and (V) doubly robust estimator θ ˆ dr .

Factors 1–4 specify 3 × 2 × 1 × 2 = 12 scenarios, and Factor 5 specifies estimators of θ 0 . We further consider three placebo sample estimators under different model misspecification. All model misspecification refers to omitting an interaction term involving X 2 X 3 when fitting μ Y , π S , and π A . Table 1 summarizes the simulation results for nine estimators in four scenarios and under different sample sizes. The remaining eight scenarios can be found in Supplement §S4. Estimators are evaluated in terms of their bias, median estimated standard error, and coverage of the 95 % confidence interval based on the nonparametric bootstrap using 2,000 resamples. Both naive estimators ( θ ˆ reg , naive and θ ˆ dr , naive ) are largely biased due to the unmeasured confounding. Among the three proposed estimators that leverage the placebo sample, the regression-based estimator θ ˆ reg has the smallest variance when π Y is correctly specified but becomes biased when π Y is misspecified. When { π S , π A } are correctly specified, the IPW estimator θ ˆ ipw ’s bias shrinks and its coverage appears to approach its nominal level as sample size increases; however, the finite sample performance of the IPW estimator is inferior to that of the regression-based estimator and that of the doubly robust estimator when models are correctly specified. The doubly robust estimator θ ˆ dr is approximately unbiased in all three cases and has smaller variance when all models are correctly specified compared to when only a subset of the models are correctly specified. We recommend using the doubly robust estimator θ ˆ dr based on its robustness property and simulation results. In Supplement §S3, we also discuss how to construct an empirical sandwich variance estimator, and provide R code implementing it.

Table 1

Simulation results for four scenarios

Sample size	Estimator	Model Spec.	Bias	Median	95% CI	Bias	Median	95% CI	Bias	Median	95% CI	Bias	Median	95% CI
				Est. SE	Cov. (%)		Est. SE	Cov. (%)		Est. SE	Cov. (%)		Est. SE	Cov. (%)
			Scenario 1			Scenario 2			Scenario 3			Scenario 4
1,000	θ ˆ reg , naive		1.19	0.16	0.00	1.20	0.17	0.00	1.18	0.17	0.00	1.20	0.17	0.00
2,000	θ ˆ reg , naive		1.19	0.12	0.00	1.19	0.12	0.00	1.19	0.12	0.00	1.18	0.12	0.00
5,000	θ ˆ reg , naive		1.19	0.07	0.00	1.19	0.07	0.00	1.18	0.07	0.00	1.18	0.08	0.00
1,000	θ ˆ dr , naive		1.19	0.17	0.40	1.17	0.17	0.20	1.19	0.16	0.40	1.21	0.16	0.20
2,000	θ ˆ dr , naive		1.19	0.13	0.00	1.19	0.13	0.10	1.20	0.11	0.10	1.19	0.11	0.10
5,000	θ ˆ dr , naive		1.19	0.08	0.00	1.19	0.08	0.00	1.19	0.07	0.21	1.20	0.07	0.11
1,000	θ ˆ reg	μ Y correct	‒ 0.01	0.20	94.46	0.00	0.20	94.76	‒ 0.01	0.19	93.55	0.00	0.19	94.96
2,000	θ ˆ reg	μ Y correct	‒0.01	0.14	94.08	‒ 0.01	0.14	93.68	0.00	0.14	94.78	‒ 0.01	0.14	94.98
5,000	θ ˆ reg	μ Y correct	‒ 0.01	0.09	95.92	‒ 0.01	0.09	94.31	‒ 0.01	0.09	94.95	0.00	0.09	95.06
1,000	θ ˆ reg	μ Y incorrect	‒ 0.74	0.20	2.92	‒ 0.73	0.20	3.12	‒ 0.74	0.19	3.73	‒ 0.74	0.20	3.23
2,000	θ ˆ reg	μ Y incorrect	‒ 0.74	0.14	0.10	‒ 0.74	0.14	0.00	‒ 0.74	0.14	0.10	‒ 0.74	0.14	0.00
5,000	θ ˆ reg	μ Y incorrect	‒ 0.74	0.09	0.00	‒ 0.74	0.09	0.00	‒ 0.74	0.09	0.00	‒ 0.73	0.09	0.00
1,000	θ ˆ ipw	( π S , π A ) correct	‒ 0.11	0.48	90.73	‒ 0.09	0.48	91.33	‒ 0.08	0.43	91.53	‒ 0.07	0.43	91.63
2,000	θ ˆ ipw	( π S , π A ) correct	‒ 0.07	0.37	90.07	‒ 0.07	0.37	90.87	‒ 0.06	0.33	90.47	‒ 0.05	0.33	91.78
5,000	θ ˆ ipw	( π S , π A ) correct	‒ 0.04	0.25	91.94	‒ 0.03	0.26	93.02	‒ 0.03	0.23	91.51	‒ 0.02	0.23	91.94
1,000	θ ˆ ipw	( π S , π A ) incorrect	‒0.18	0.50	89.52	‒ 0.15	0.51	91.83	‒ 0.14	0.45	89.92	‒ 0.12	0.44	90.42
2,000	θ ˆ ipw	( π S , π A ) incorrect	‒0.15	0.39	88.37	‒ 0.13	0.40	90.47	‒ 0.12	0.34	88.87	‒ 0.11	0.34	91.07
5,000	θ ˆ ipw	( π S , π A ) incorrect	‒0.12	0.27	89.26	‒ 0.10	0.27	89.04	‒ 0.09	0.24	88.18	‒ 0.06	0.24	90.98
1,000	θ ˆ dr	All correct	‒0.03	0.33	93.95	‒ 0.01	0.33	93.75	‒ 0.02	0.30	93.04	0.00	0.30	95.06
2,000	θ ˆ dr	All correct	‒ 0.02	0.25	93.78	‒ 0.02	0.24	92.28	‒ 0.01	0.22	94.48	‒ 0.01	0.22	94.18
5,000	θ ˆ dr	All correct	‒ 0.01	0.17	93.98	‒ 0.01	0.17	95.70	‒ 0.01	0.15	94.09	0.00	0.15	94.31
1,000	θ ˆ dr	μ Y correct	‒0.03	0.39	94.15	‒ 0.02	0.38	93.55	‒ 0.03	0.35	92.94	‒ 0.01	0.34	94.46
2,000	θ ˆ dr	μ Y correct	‒0.01	0.30	92.78	0.00	0.30	93.08	‒ 0.02	0.26	94.78	‒ 0.03	0.26	92.88
5,000	θ ˆ dr	μ Y correct	‒ 0.01	0.20	94.09	‒ 0.01	0.20	94.41	‒ 0.02	0.17	93.66	0.00	0.17	93.66
1,000	θ ˆ dr	( π S , π A ) correct	‒ 0.03	0.35	92.54	‒ 0.02	0.36	92.94	‒ 0.03	0.33	92.54	‒ 0.01	0.33	94.35
2,000	θ ˆ dr	( π S , π A ) correct	‒0.02	0.27	91.68	‒ 0.02	0.27	92.78	‒ 0.01	0.24	93.48	0.01	0.25	92.78
5,000	θ ˆ dr	( π S , π A ) correct	‒ 0.01	0.18	92.59	‒ 0.01	0.18	93.98	‒ 0.01	0.17	92.27	‒ 0.01	0.17	93.23

Scenario 1: b = ( b 3 ) , d = ( d 1 ) , e = ( e ) , f = ( f 1 ) . Scenario 2: b = ( b 3 ) , d = ( d 1 ) , e = ( e ) , f = ( f 2 ) . Scenario 3: b = ( b 3 ) , d = ( d 2 ) , e = ( e ) , f = ( f 1 ) . Scenario 4: b = ( b 3 ) , d = ( d 2 ) , e = ( e ) , f = ( f 2 ) . θ 0 = 1 in all four scenarios.

5 Application: The effect of EITC on infant health

Following Hoynes et al. [18], we apply our methods to studying the effect of 1993 EITC reform on low birth weight among a “high-impact sample” consisting of single mothers aged 18 and older with a high school education or less. The treated group is second- and higher-order births and the control group is the first births, because first births were exposed to relatively small EITC credit. Our placebo sample consists of the single mothers who are college graduates.

The accompanying data in the study by Hoynes et al. [18] are collapsed to cells defined by state, year, parity of birth (first, second, third, fourth, or greater birth to a mother), education of mother ( < 12 , 12, 13–15, > 16 ), race of the mother (white, black, other), ethnicity of the mother (Hispanic, non-Hispanic, missing), and age of mother (18–24, 25–35, 35+). Since the low birth weight is a binary outcome, we recovered individual-level data from the given size and fraction of low birth weight of each cell and ended up with 811,424 subjects in the high-impact sample and 53,131 in the placebo sample. As discussed in Section 1, the high education subgroup was unlikely to be eligible for the EITC, thus making it an appealing placebo sample. In this study, there is a concern of unmeasured confounding due to pre-exposure behaviors such as smoking and drinking, which may differ between the exposure and control groups and lead to lower birth weight. A sufficient condition for Assumption 2 to hold in this case study is that (i) within each socioeconomic cell defined above, the smoking and drinking rates are similar for single mothers in the placebo sample and the primary sample; and (ii) the effect of smoking and drinking on birth weight is not modified by the mother’s education. Our primary analysis will operate under this identification assumption although a sensitivity analysis will relax it. Figure S1 in Supplement §S1.1 illustrates the role of placebo sample in the EITC study using directed acyclic graphs (DAGs).

For illustration, we calculate two naive estimators based on the primary sample and effective tax year 1994 (in percentage points): the naive regression estimator θ ˆ reg , naive = 1.95 ( SE = 0.13 ) and naive doubly robust estimator θ ˆ dr , naive = 1.74 ( SE = 0.12 ) . Both estimators adjust for the aforementioned parity and demographic variables, and two-way interactions between the demographic variables. The standard errors are based on the nonparametric bootstrap with 100 resamples. Both naive estimators indicate a harmful effect of EITC on birth weight, which is likely biased due to unmeasured confounding. In comparison, the three proposed placebo-sample estimators are: the regression-based estimator θ ˆ reg = − 0.57 (SE = 0.28), the IPW estimator θ ˆ ipw = − 0.38 (SE = 1.10), and the doubly robust estimator θ ˆ dr = − 0.67 (SE = 0.30). The relative performance of these three estimators is similar to that in simulation. For illustration, we calculate two naive estimators based on the primary sample and effective tax year 1994 (in percentage points): the naive regression estimator θ ˆ reg , naive = 1.95 ( SE = 0.13 ) and naive doubly robust estimator θ ˆ dr , n a i v e = 1.74 ( SE = 0.12 ) . Both estimators adjust for the aforementioned parity and demographic variables, and two-way interactions between the demographic variables. The standard errors are based on the nonparametric bootstrap with 100 resamples. Both naive estimators indicate a harmful effect of EITC on birth weight, which is likely biased due to unmeasured confounding. In comparison, the three proposed placebo-sample estimators are: the regression-based estimator θ ˆ reg = − 0.57 (SE = 0.28), the IPW estimator θ ˆ ipw = − 0.38 (SE = 1.10), and the doubly robust estimator θ ˆ dr = − 0.67 (SE = 0.30). The relative performance of these three estimators is similar to that in simulation. Three placebo-sample estimators indicate that, for second parity or higher births among the high-impact sample, the EITC leads to reduced risk of low birth weight. Based on θ ˆ reg , the risk would be 0.57 (95% CI: 0.03 to 1.10) percentage points lower (relative to the overall mean of 10.03%). A sensitivity analysis based on Sensitivity Model 1 (applicable to binary outcomes when all X ’s are discrete) finds that the confidence interval becomes [ − 1.208 , 0.069] with Λ = 0 and Γ = 1.01 , indicating that the observed treatment effect is no longer significant even when a small violation of Assumption 2 is present. Therefore, the conclusion that EITC reduces the risk of low birth weight is sensitive to potential violations of Assumption 2.

6 Summary

In an observational study, it is common to leverage known null effect to detect bias. One such strategy is to set aside a placebo sample – a subset of data immune from the hypothesized cause-and-effect relationship. Existence of an effect in the placebo sample raises concern of unmeasured confounding bias, while, absence of it corroborates the causal conclusion. One prominent application scenario is policy evaluation, where individuals not eligible or minimally affected by the policy constitute a placebo sample.

In this article, we described framework for using a placebo sample to detect and remove bias. We stated identification assumption, and developed various semiparametric estimation and inference methods and their accompanying sensitivity analysis methods. The proposed methods were evaluated in an extensive simulation study. We applied the proposed methods to study the effect of 1993 EITC reform on infant health where our placebo sample is the single mothers who are college graduates as they are unlikely to be eligible for EITC. Our work serves as a theoretical basis for future research.

Acknowledgements

The authors would like to thank the anonymous referees for their helpful and constructive feedback.

Funding information: Dr. Ye’s research was partially supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number R35GM155070.
Author contributions: Ting Ye conceived the project, developed the methods and theory, and drafted the manuscript. Bo Zhang, Shuxiao Chen, Qijia He did simulation studies and real data analysis, and contributed to the writing.
Conflict of interest: The authors state no conflict of interest.

References

[1] Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. 10.1093/biomet/70.1.41Search in Google Scholar

[2] Robins JM. Estimation of the time-dependent accelerated failure time model in the presence of confounding factors. Biometrika. 1992;79:321–34. 10.1093/biomet/79.2.321Search in Google Scholar

[3] Rosenbaum PR. Observational studies. New York: Springer; 2002. 10.1007/978-1-4757-3692-2Search in Google Scholar

[4] VanderWeele TJ, Ding P. Sensitivity analysis in observational research: introducing the E-value. Ann Int Med. 2017;167(4):268–74. 10.7326/M16-2607Search in Google Scholar PubMed

[5] Zhang B, Tchetgen Tchetgen EJ. A semi-parametric approach to model-based sensitivity analysis in observational studies. J R Stat Soc Ser A Stat Soc. 2022;185(Supplement_2):S668–91. 10.1111/rssa.12946Search in Google Scholar PubMed PubMed Central

[6] Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. J Amer Stat Assoc. 1996;91(434):444–55. 10.1080/01621459.1996.10476902Search in Google Scholar

[7] Rosenbaum PR. Detecting bias with confidence in observational studies. Biometrika. 1992;79(2):367–74. 10.1093/biomet/79.2.367Search in Google Scholar

[8] Lipsitch M, Tchetgen Tchetgen E, Cohen T. Negative controls: a tool for detecting confounding and bias in observational studies. Epidemiology (Cambridge Mass.). 2010;21(3):383. 10.1097/EDE.0b013e3181d61eebSearch in Google Scholar PubMed PubMed Central

[9] Shi X, Miao W, Nelson JC, Tchetgen Tchetgen EJ. Multiply robust causal inference with double-negative control adjustment for categorical unmeasured confounding. J R Stat Soc Ser B (Stat Methodol). 2020;82(2):521–40, 2021/09/09. 10.1111/rssb.12361Search in Google Scholar PubMed PubMed Central

[10] Rosenbaum PR. Sensitivity analyses informed by tests for bias in observational studies. Biometrics. 2023;79(1):475–87. 10.1111/biom.13558Search in Google Scholar PubMed

[11] Card D. The impact of the mariel boatlift on the Miami labor market. ILR Rev. 1990;43(2):245–57. 10.1177/001979399004300205Search in Google Scholar

[12] Sofer T, Richardson DB, Colicino E, Schwartz J, Tchetgen Tchetgen EJ. On negative outcome control of unobserved confounding as a generalization of difference-in-differences. Stat Sci. 2016;31(3):348. 10.1214/16-STS558Search in Google Scholar PubMed PubMed Central

[13] Gagnon-Bartsch JA, Speed TP. Using control genes to correct for unwanted variation in microarray data. Biostatistics. 2012;13(3):539–52. 10.1093/biostatistics/kxr034Search in Google Scholar PubMed PubMed Central

[14] Gagnon-Bartsch JA, Jacob L, Speed TP. Removing unwanted variation from high dimensional data with negative controls. Berkeley: Tech Reports from Dep Stat Univ California. 2013. p. 1–112. Search in Google Scholar

[15] Leek JT, Scharpf RB, Corrada Bravo H, Simcha D, Langmead B, Johnson WE, et al. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genetics. 2010;11(10):733–9. 10.1038/nrg2825Search in Google Scholar PubMed PubMed Central

[16] Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–3. 10.1093/bioinformatics/bts034Search in Google Scholar PubMed PubMed Central

[17] Abid A, Zhang MJ, Bagaria VK, Zou J. Exploring patterns enriched in a dataset with contrastive principal component analysis. Nat Commun. 2018;9(1):2134. 10.1038/s41467-018-04608-8Search in Google Scholar PubMed PubMed Central

[18] Hoynes H, Miller D, Simon D. Income, the earned income tax credit, and infant health. Amer Econ J Econ Policy. 2015;7(1):172–211. 10.1257/pol.20120179Search in Google Scholar

[19] Eggers AC, Tunón G, Dafoe A. Placebo tests for causal inference. Working Paper. 2021. Search in Google Scholar

[20] Dow WH, Godøy A, Lowenstein C, Reich M. Can labor market policies reduce deaths of despair? J Health Econ. 2020;74:102372. 10.1016/j.jhealeco.2020.102372Search in Google Scholar PubMed PubMed Central

[21] Rubin DB. Randomization analysis of experimental data: the Fisher randomization test comment. J Amer Stat Assoc. 1980;75:591–3. 10.2307/2287653Search in Google Scholar

[22] Newsome SJ, Daniel RM, Carr SB, Bilton D, Keogh RH. Using negative control outcomes and difference-in-differences analysis to estimate treatment effects in an entirely treated cohort: the effect of ivacaftor in cystic fibrosis. Amer J Epidemiol. 2022;191(3):505–15. 10.1093/aje/kwab263Search in Google Scholar PubMed PubMed Central

[23] Abadie A. Semiparametric difference-in-difference estimators. Rev Econ Stud. 2005;75(1):1–19. 10.1111/0034-6527.00321Search in Google Scholar

[24] Robins J, Sued M, Lei-Gomez Q, Rotnitzky A. Comment: Performance of double-robust estimators when inverse probability weights are highly variable. Stat Sci. 2007;22(4):544–59. 10.1214/07-STS227DSearch in Google Scholar

[25] Bickel PJ, Klaassen CAJ, Ritov Y, Wellner JA. Efficient and adaptive estimation for semiparametric models. New York: Springer; 1993. Search in Google Scholar

[26] van der Vaart AW. Asymptotic statistics. Cambridge, UK: Cambridge University Press; 2000. Search in Google Scholar

[27] Park C, Tchetgen Tchetgen E. A universal difference-in-differences approach for causal inference. 2022. arXiv: http://arXiv.org/abs/arXiv:2212.13641. 10.1097/EDE.0000000000001676Search in Google Scholar PubMed PubMed Central

[28] Sant-Anna PHC, Zhao J. Doubly robust difference-in-differences estimators. J Econ. 2020;219(1):101–22. 10.1016/j.jeconom.2020.06.003Search in Google Scholar

[29] Newey WK, McFadden D. Chapter 36 large sample estimation and hypothesis testing. Handbook Econ. 1994;4:2111–245. 10.1016/S1573-4412(05)80005-4Search in Google Scholar

[30] Benkeser D, Carone M, Van Der Laan MJ, Gilbert PB. Doubly robust nonparametric inference on the average treatment effect. Biometrika. 2017;104(4):863–80. 10.1093/biomet/asx053Search in Google Scholar PubMed PubMed Central

[31] Dukes O, Vansteelandt S, Whitney D. On doubly robust inference for double machine learning. 2021. arXiv: http://arXiv.org/abs/arXiv:2107.06124. Search in Google Scholar

[32] Chernozhukov V, Chetverikov D, Demirer M, Duflo E, Hansen C, Newey W. Double/debiased/Neyman machine learning of treatment effects. Amer Econ Rev. 2017;107(5):261–65. 10.1257/aer.p20171038Search in Google Scholar

[33] Zhao Q, Small DS, Bhattacharya BB. Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap. J R Stat Soc Ser B (Stat Meth). 2019;81(4):735–61. 10.1111/rssb.12327Search in Google Scholar

Received: 2023-04-11

Revised: 2024-10-14

Accepted: 2025-01-24

Published Online: 2025-03-05

This work is licensed under the Creative Commons Attribution 4.0 International License.

Supplementary material

Articles in the same Issue

https://doi.org/10.1515/jci-2023-0020

Keywords for this article

causal inference; placebo test; treatment effect heterogeneity; unmeasured confounding

Creative Commons

BY 4.0

Role of placebo samples in observational studies

Article

Abstract

1 Introduction

2 Methods

2.1 Nonparametric identification

Assumption 1

Assumption 2

Assumption 3

Proposition 1

2.2 Estimation and semiparametric inference

Theorem 1

Assumption 4

Theorem 2

3 Sensitivity analyses relaxing identification assumptions

3.1 Sensitivity analysis under linear sensitivity models

3.2 Sensitivity analysis under marginal sensitivity models

4 Simulation study

5 Application: The effect of EITC on infant health

6 Summary

Acknowledgements

References

Supplementary Material

Articles in the same Issue

Articles in the same Issue

Articles in the same Issue