Abstract
When a binary treatment
1 Introduction
A typical treatment effect analysis [1–6] involves a binary treatment
The instrumental variable estimator (IVE) for
Treatment effect heterogeneity is an important problem that has been addressed by several researchers such as Imai and Ratkovic [13] and Künzel et al. [14]. Athey and Imbens [15] noted the following three problems: (i) estimating heterogeneous effects, (ii) finding an optimal policy allocating subjects to the treatment or control based on
We find the
A potential candidate for the one-dimensional heterogeneity function of
The main motivation to condition on
which is proven later. In other words,
Another attractive property of
To see how informative
For us, this is the primary advantage (along with the weak requisite assumptions) of the CP effect. The disadvantages are that it is specific to each IV, it is not a generally interesting effect, the “monotonicity condition” (
Let
In the aforementioned example, although
To accomplish the goal of this research, we use two nonparametric reduced forms (RFs):
where
We can estimate
The first estimator is based on the single index assumption
Hence, we can identify
In our second estimator, we assume that
The two ratio estimators suffer from the “excessively small denominator” problem: A near-zero denominator can blow up the ratio. To avoid this, our third estimator applies a power approximation to
If desired,
where AT indicates “always takers” with
In the remainder of this article, Sections 2 and 3 explain the two ratio estimators and IVE, respectively; Sections 4 and 5 present simulation and empirical studies, respectively; and Section 6 concludes this article. Most proofs are presented in the appendix. As usual, we assume independent and identically distributed observations
2 Ratio approaches
2.1 Identification
Since (2) motivates the ratio estimators, we first prove (2) in Theorem 1, and (1) is proven in the next section. As preliminaries, we present our assumptions on IV:
The “IV exclusion restriction” is implicit in the notation
The appendix proves the main heterogeneity-dimension reduction idea:
Although (4)(i) is enough for our estimators below, bear in mind that “
The proof for (5) also establishes the “balancing score” property of IS: The distribution of
Theorem 1
Under (4)(i), (4)(ii), and the support-overlap condition
The qualifier “for all
2.2 Ratio estimator under single index assumption
Our single index assumption for dimension reduction with an unknown
To estimate
One approach to overcome the identification problems is to assume a continuous regressor with a non-zero slope, e.g., the last regressor
Since we further assume a strictly increasing
Let
The numerator of
There are six estimators in
The probability limits of
That
Theorem 2
In Theorem 2, (i) is assumed to identify
Although not mentioned in Theorem 2, Assumption 4.2 of Ichimura [23, p. 82] must be introduced if several components of
Theorem 2 and
For our third estimator using (1) under
2.3 Ratio estimator under the probit IS
The requisite conditions for the preceding ratio estimator and its asymptotic variance are not simple. Our second ratio estimator is simpler in terms of the requisite conditions and asymptotic variance, although it requires imposing the probit assumption
Under
Then, with
the notation
Theorem 3
under the following assumptions, where
The aforementioned assumptions are weaker than those in Theorem 2 because single index estimation is not needed. Moreover, (ii) is not essential, because no continuous covariate means that nonparametric estimation is not needed. If
3 Power approximation approach
3.1 Identification
We apply a power approximation to
This yields a moment condition
With
The main identification finding in this section is Theorem 4.
Theorem 4
Under (4)(i) and (ii), (1) holds for any Y as long as
The proof for Theorem 4 in the appendix reveals
Hence, even if
3.2 Power approximation estimator
Before estimation, we introduce a modification to
Take
We use (13) instead of (1) to implement our IVE, where
We present two versions of the power approximation estimator: one conditioned on
First, we condition on
Using the same
Let
Then, we obtain the IVE of
The appendix proves the next two theorems.
Theorem 5
The IVE in (15) is asymptotically normal with variance
and
Now, we condition on
Replace
Obtain the IVE of
Theorem 6
The IVE in (16) is asymptotically normal with variance
4 Simulation study
With
Since
For the first ratio estimator
Let
Overall, six estimators are compared at the three evaluation points.
For each entry in each following table, four numbers appear at a given evaluation point: the (i) absolute bias (∣Bias∣); (ii) SD; (iii) averaged SD (across 10,000 repetitions) based on the asymptotic variance to be compared with (ii); and (iv) proportion of the 95% point-wise confidence intervals (CI) capturing the true value. We do not present the root mean-squared error (RMSE) to save space: in most cases, the absolute bias is much smaller than the SD, and thus, the RMSE is similar to the SD. The entries with the subscript “avg” indicate the simple averages across the three evaluation points, which are used as summary measures.
To make
Table 1 shows the results for continuous
Continuous
| Effect | Linear | Quadratic | Effect | Linear | Quadratic |
|---|---|---|---|---|---|
|
|
0.056 0.698 0.488 0.83 | 0.074 0.588 0.447 0.81 |
|
0.018 0.735 0.730 0.89 | 0.006 0.781 0.765 0.89 |
|
|
0.044 0.364 0.369 0.96 | 0.067 0.370 0.368 0.96 |
|
0.121 0.899 0.966 0.94 | 0.111 0.897 0.940 0.94 |
|
|
0.114 1.14 0.782 0.85 | 0.155 1.11 0.771 0.87 |
|
0.019 1.93 2.88 0.84 | 0.077 1.86 2.76 0.84 |
|
|
0.071 0.73 0.55 0.88 | 0.099 0.69 0.53 0.88 |
|
0.053 1.2 1.5 0.89 | 0.065 1.2 1.5 0.89 |
|
|
0.041 0.486 0.503 0.93 | 0.015 0.486 0.496 0.93 |
|
0.143 1.70 5.69 0.96 | 0.180 1.53 5.37 0.96 |
|
|
0.034 0.345 0.353 0.96 | 0.118 0.353 0.353 0.96 |
|
0.034 1.16 5.50 0.98 | 0.011 1.28 6.60 0.98 |
|
|
0.027 0.670 0.685 0.98 | 0.029 0.660 0.680 0.98 |
|
0.019 3.08 17.0 0.99 | 0.008 3.49 18.6 1.0 |
|
|
0.034 0.50 0.51 0.97 | 0.054 0.50 0.51 0.95 |
|
0.065 2.0 9.4 0.98 | 0.066 2.1 10 0.98 |
|
|
0.053 0.488 0.502 0.95 | 0.028 0.490 0.497 0.93 |
|
0.113 0.694 0.847 0.96 | 0.133 0.755 0.911 0.96 |
|
|
0.032 0.346 0.354 0.96 | 0.116 0.354 0.353 0.96 |
|
0.030 0.537 0.746 0.98 | 0.007 0.545 0.764 0.98 |
|
|
0.010 0.676 0.689 0.98 | 0.045 0.668 0.682 0.97 |
|
0.023 1.45 2.46 1.0 | 0.092 1.49 2.46 1.0 |
|
|
0.032 0.50 0.52 0.97 | 0.063 0.50 0.51 0.95 |
|
0.055 0.90 1.4 0.98 | 0.077 0.93 1.4 0.98 |
Avg.Asy.SD: average across 10,000 reps of the asymptotic SD formula in theorems;
When
Continuous
| Effect | Linear | Quadratic | Effect | Linear | Quadratic |
|---|---|---|---|---|---|
|
|
0.007 0.165 0.152 0.91 | 0.019 0.165 0.154 0.90 |
|
0.031 0.231 0.233 0.94 | 0.033 0.230 0.234 0.92 |
|
|
0.015 0.126 0.129 0.96 | 0.030 0.127 0.130 0.95 |
|
0.019 0.246 0.257 0.95 | 0.011 0.244 0.259 0.95 |
|
|
0.026 0.265 0.244 0.91 | 0.037 0.259 0.240 0.91 |
|
0.027 0.621 0.682 0.91 | 0.054 0.631 0.689 0.92 |
|
|
0.016 0.19 0.18 0.93 | 0.029 0.18 0.18 0.92 |
|
0.026 0.37 0.39 0.93 | 0.032 0.37 0.39 0.93 |
|
|
0.007 0.142 0.142 0.95 | 0.013 0.138 0.139 0.85 |
|
0.011 0.145 0.147 0.95 | 0.010 0.145 0.149 0.95 |
|
|
0.003 0.101 0.101 0.95 | 0.081 0.101 0.102 0.86 |
|
0.005 0.141 0.150 0.95 | 0.003 0.132 0.142 0.95 |
|
|
0.001 0.201 0.199 0.96 | 0.075 0.197 0.198 0.83 |
|
0.018 0.361 0.362 0.97 | 0.001 0.312 0.329 0.98 |
|
|
0.004 0.15 0.15 0.95 | 0.056 0.15 0.15 0.85 |
|
0.011 0.22 0.22 0.96 | 0.005 0.20 0.21 0.96 |
|
|
0.018 0.142 0.142 0.95 | 0.002 0.138 0.140 0.84 |
|
0.022 0.141 0.142 0.96 | 0.033 0.145 0.146 0.95 |
|
|
0.003 0.101 0.101 0.95 | 0.081 0.101 0.102 0.85 |
|
0.001 0.129 0.133 0.95 | 0.005 0.128 0.134 0.96 |
|
|
0.012 0.201 0.199 0.95 | 0.085 0.196 0.197 0.83 |
|
0.012 0.283 0.281 0.97 | 0.009 0.278 0.276 0.98 |
|
|
0.011 0.15 0.15 0.95 | 0.056 0.15 0.15 0.84 |
|
0.012 0.18 0.19 0.96 | 0.015 0.18 0.19 0.96 |
The findings in Table 3 with binary
Binary
| Effect | Linear | Quadratic | Effect | Linear | Quadratic |
|---|---|---|---|---|---|
|
|
0.029 0.373 0.274 0.83 | 0.020 0.359 0.269 0.82 |
|
0.051 0.578 0.627 0.92 | 0.019 0.552 0.602 0.92 |
|
|
0.024 0.149 0.145 0.95 | 0.034 0.149 0.147 0.94 |
|
0.133 0.564 0.626 0.97 | 0.158 0.589 0.649 0.97 |
|
|
0.026 0.214 0.176 0.68 | 0.027 0.215 0.179 0.71 |
|
0.098 0.869 1.37 0.86 | 0.108 0.856 1.40 0.86 |
|
|
0.026 0.25 0.20 0.82 | 0.027 0.24 0.20 0.82 |
|
0.094 0.67 0.87 0.92 | 0.095 0.67 0.89 0.91 |
|
|
0.013 0.225 0.227 0.93 | 0.028 0.226 0.226 0.90 |
|
0.091 1.50 3.96 0.96 | 0.052 0.621 2.54 0.94 |
|
|
0.059 0.121 0.124 0.94 | 0.083 0.123 0.126 0.91 |
|
0.001 0.481 2.63 0.97 | 0.013 0.450 2.98 0.97 |
|
|
0.030 0.184 0.195 0.98 | 0.049 0.182 0.198 0.97 |
|
0.007 1.17 8.37 0.99 | 0.027 1.11 8.90 0.99 |
|
|
0.034 0.18 0.18 0.95 | 0.053 0.18 0.18 0.93 |
|
0.033 1.1 5.0 0.97 | 0.031 0.73 4.8 0.97 |
|
|
0.008 0.228 0.229 0.93 | 0.023 0.229 0.229 0.90 |
|
0.047 0.377 0.382 0.96 | 0.038 0.350 0.369 0.95 |
|
|
0.058 0.121 0.123 0.94 | 0.082 0.123 0.126 0.91 |
|
0.004 0.192 0.250 0.98 | 0.010 0.190 0.242 0.98 |
|
|
0.036 0.185 0.194 0.98 | 0.056 0.182 0.196 0.97 |
|
0.005 0.314 0.542 0.99 | 0.003 0.300 0.526 0.99 |
|
|
0.034 0.18 0.18 0.95 | 0.054 0.18 0.18 0.92 |
|
0.019 0.30 0.39 0.97 | 0.017 0.28 0.38 0.97 |
Binary
| Effect | Linear | Quadratic | Effect | Linear | Quadratic |
|---|---|---|---|---|---|
|
|
0.002 0.096 0.092 0.90 | 0.005 0.099 0.094 0.90 |
|
0.004 0.170 0.190 0.96 | 0.001 0.176 0.193 0.96 |
|
|
0.009 0.053 0.053 0.94 | 0.011 0.053 0.053 0.93 |
|
0.010 0.152 0.168 0.97 | 0.012 0.152 0.169 0.97 |
|
|
0.008 0.061 0.058 0.89 | 0.003 0.062 0.060 0.89 |
|
0.056 0.313 0.351 0.95 | 0.051 0.310 0.351 0.95 |
|
|
0.006 0.070 0.068 0.91 | 0.006 0.072 0.069 0.91 |
|
0.023 0.21 0.24 0.96 | 0.022 0.21 0.24 0.96 |
|
|
0.025 0.066 0.066 0.85 | 0.035 0.066 0.066 0.68 |
|
0.014 0.072 0.072 0.95 | 0.019 0.071 0.073 0.92 |
|
|
0.046 0.036 0.036 0.71 | 0.068 0.037 0.037 0.45 |
|
0.009 0.048 0.049 0.95 | 0.014 0.048 0.048 0.93 |
|
|
0.044 0.058 0.057 0.61 | 0.073 0.059 0.059 0.32 |
|
0.008 0.059 0.061 0.98 | 0.015 0.055 0.057 0.98 |
|
|
0.038 0.054 0.053 0.72 | 0.058 0.054 0.054 0.48 |
|
0.010 0.060 0.061 0.96 | 0.016 0.058 0.060 0.94 |
|
|
0.021 0.067 0.067 0.84 | 0.031 0.066 0.067 0.66 |
|
0.004 0.074 0.073 0.95 | 0.006 0.073 0.074 0.93 |
|
|
0.046 0.036 0.036 0.70 | 0.068 0.037 0.037 0.44 |
|
0.005 0.048 0.049 0.95 | 0.007 0.049 0.050 0.95 |
|
|
0.047 0.057 0.056 0.59 | 0.077 0.057 0.058 0.30 |
|
0.003 0.056 0.057 0.98 | 0.010 0.052 0.058 0.98 |
|
|
0.038 0.053 0.053 0.71 | 0.058 0.053 0.054 0.47 |
|
0.004 0.059 0.060 0.96 | 0.008 0.058 0.061 0.95 |
Thus far, we examined the CI coverage at each evaluation point separately. For joint coverage across

True effect curve and 50 CBs for continuous
Overall, there exist trade-offs among the bias, SD, CI coverage, ease in implementation, and closeness of the asymptotic variance formula to the actual variance. We note that
5 Empirical analysis
Our empirical analysis is for the effects of 401(k) retirement programs
Because the eligibility for the programs is exogenously set,
We use the same data as those used by Poterba et al. [29] and Abadie [10], derived from the Survey of Income and Program Participation for 1991. The observation unit is a household, and the sample is restricted to households with at least one member employed. Table 5 presents the sample mean (SD) of the variables, where
Mean (SD) of variables (
| Pooled |
|
|
Pooled |
|
|
||
|---|---|---|---|---|---|---|---|
|
|
19.1 (64) | 38.5 (79.3) | 11.7 (055.3) | Inc | 39.3 (24.1) | 49.8 (26.8) | 35.2 (21.6) |
| Age | 41.1 (10.3) | 41.5 (9.65) | 41.9 (10.0) | ||||
|
|
0.28 | 1 | 0 | Mar | 0.63 | 0.69 | 0.60 |
|
|
0.39 | 1 | 0.16 | Hsize | 2.89 (1.53) | 2.92 (1.47) | 2.87 (1.55) |
Before we examine the effect estimates in Table 6, we explain the steps implemented for facilitating the comparison of the estimators. Recall that
Complier effects for financial assets in $1,000 (
|
|
0.2 (SE) | 0.3 (SE) | 0.4 (SE) | 0.5 (SE) | 0.6 (SE) |
|---|---|---|---|---|---|
|
|
4.37 (4.16) | 4.56 (4.94) | 9.11 (6.61) |
|
|
|
|
5.77 (4.12) |
|
|
|
|
|
|
6.38 (11.3) |
|
|
|
12.8 (11.6) |
|
|
|
|
|
|
|
|
|
5.76 (8.51) |
|
|
|
13.5 (10.5) |
|
|
|
|
|
|
|
For the estimation, first, both age and age
For
| Inc | Age | Age
|
Mar | Hsize | |
|---|---|---|---|---|---|
|
|
|
|
|
|
−0.034 (0.028) |
|
|
|
|
|
0.019 (0.037) |
|
In the results of
In Table 6, all estimators show increasing effects of
Table 6 shows no significant difference between
Figure 2 shows

It is puzzling why
The benefits of the global nonparametric approach versus the local approach can be further highlighted. The inverted matrix for
To examine how informative

E(
The effect of a covariate can be found using the graph for
Specifically, when the income increases by one unit (
6 Conclusion
For a binary treatment
When
The dimension reduction achieved by PS for exogenous
We proposed three estimators for
The first estimator is a kernel nonparametric estimator based on a single index estimator for
We presented an empirical illustration for the effects of 401(k) retirement programs (
Acknowledgment
The authors are grateful to two anonymous reviewers for their helpful comments.
-
Funding information: Jin-young Choi’s research has been supported by the Hankuk University of Foreign Studies Research Fund of 2023. Myoung-jae Lee’s research has been supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (2020R1A2C1A01007786).
-
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
-
Conflict of interest: The authors state no conflict of interest.
-
Ethical approval: The conducted research is not related to either human or animals use.
-
Data availability statement: The dataset analyzed during this study is from “Introductory Econometrics: A Modern Approach, 7e” by Jeffrey M. Wooldridge. The dataset named “401ksubs” is available in the textbook e-learning resource repository, https://www.cengage.com/cgi-wadsworth/course_products_wp.pl?fid=M20b&product_isbn_issn=9781111531041.
Appendix A Proofs
A.1 Proof for (5) and balancing score property of IS
Proof
Under
The first and last expressions prove that
IS
take
A.2 Proof for Theorem 1 regarding (2)
Proof
With
The first and last expressions reveal that
Take
Define
A.3 Proof for Theorem 2 (first ratio estimator)
Proof
Under the assumptions in Theorem 2,
(1) Linearization
The linearization of
suppress
where
Substitute the linearizations for
Rewrite the preceding display by collecting terms, and then multiply by
The right side has six terms, which gives six asymptotic variances. Also, the three terms sharing the same subscript 1 are correlated with each other to give three covariances, and the three terms sharing the same subscript 0 also give three covariances. Hence, the asymptotic variance of
(2) Preliminaries with
With
Analogous expressions hold when
Hence, invoking the Lindeberg CLT for triangular arrays,
An analogous result holds with
(3) Variances
Because of
the variance of the first and second terms in (A4) is, respectively,
Because of
the variance of the third and fourth terms in (A4) is, respectively,
Because of
(4) Covariances
For the covariance between the first and third terms, we need the expected value of the product between
For the covariance between the first and fifth terms, we need the expected value of the product between
Analogously, for the covariance between the third and fifth terms, we need the expected value of the product between
As for the three covariance terms involving the three terms with the subscript 0, the terms analogous to
A.4 Proof for (9)
Proof
Take
A.5 Proof for Theorem 3 (second ratio estimator)
Proof
Define
Also, define their estimands:
The asymptotic variance from the two terms on the right side is
A.6 Proof for Theorem 4
Proof
With
Define
Define
Using the first and last expressions gives
A.7 Proof for Theorem 5
Proof
Let
The IV for
where
Solve this for
where
and
As
A.8 Proof for Theorem 6
Proof
The proof for Theorem 6 is almost the same as that for Theorem 5. Define
The IV for
(A9) and (A10) hold with
and
With
References
[1] Rosenbaum PR. Observational studies. 2nd ed. Springer; 2002. https://doi.org/10.1007/978-1-4757-3692-2. Search in Google Scholar
[2] Lee MJ. Micro-econometrics for policy, program, and treatment effects. Oxford University Press; 2005. https://doi.org/10.1093/0199267693.001.0001. Search in Google Scholar
[3] Lee MJ. Matching, regression discontinuity, difference in differences, and beyond. Oxford University Press; 2016. https://doi.org/10.1093/acprof:oso/9780190258733.001.0001. Search in Google Scholar
[4] Pearl J. Causality. 2nd ed. Cambridge University Press; 2009. https://doi.org/10.1017/CBO9780511803161. Search in Google Scholar
[5] Imbens GW, Rubin DB. Causal inference for statistics, social, and biomedical sciences: an introduction. Cambridge University Press; 2015. https://doi.org/10.1017/CBO9781139025751. Search in Google Scholar
[6] Abadie A, Cattaneo MD. Econometric methods for program evaluation. Annu Rev Econ. 2018;10:465–503. https://doi.org/10.1146/annurev-economics-080217-053402. Search in Google Scholar
[7] Imbens GW, Angrist J. Identification and estimation of local average treatment effects. Econometrica. 1994;62(2):467–76. https://doi.org/10.2307/2951620. Search in Google Scholar
[8] Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. J Amer Stat Assoc. 1996;91(434):444–55. https://doi.org/10.2307/2291629. Search in Google Scholar
[9] Frölich M. Nonparametric IV estimation of local average treatment effects with covariates. J Econom. 2007;139(1):35–75. https://doi.org/10.1016/j.jeconom.2006.06.004. Search in Google Scholar
[10] Abadie A. Semiparametric instrumental variable estimation of treatment response models. J Econom. 2003;113(2):231–63. https://doi.org/10.1016/S0304-4076(02)00201-4. Search in Google Scholar
[11] Tan Z. Regression and weighting methods for causal inference using instrumental variables. J Amer Stat Assoc. 2006;101(476):1607–18. https://doi.org/10.1198/016214505000001366. Search in Google Scholar
[12] Ogburn EL, Rotnitzky A, Robins JM. Doubly robust estimation of the local average treatment effect curve. J R Stat Soc (Ser B). 2015;77(2):373–96. https://doi.org/10.1111/rssb.12078. Search in Google Scholar PubMed PubMed Central
[13] Imai K, Ratkovic M. Estimating treatment effect heterogeneity in randomized program evaluation. Ann Appl Stat. 2013;7(1):443–70. https://doi.org/10.1214/12-AOAS593. Search in Google Scholar
[14] Künzel SR, Sekhon JS, Bickel PJ, Yu B. Metalearners for estimating heterogeneous treatment effects using machine learning. Proc Nat Acad Sci. 2019;116(10):4156–65. https://doi.org/10.1073/pnas.1804597116. Search in Google Scholar PubMed PubMed Central
[15] Athey S, Imbens GW. Machine learning methods that economists should know about. Ann Rev Econom. 2019;11:685–725. https://doi.org/10.1146/annurev-economics-080217-053433. Search in Google Scholar
[16] Hirano K, Porter JR. Asymptotics for statistical treatment rules. Econometrica. 2009;77(5):1683–701. https://doi.org/10.3982/ECTA6630. Search in Google Scholar
[17] Dudik M, Erhan D, Langford J, Li L. Doubly robust policy evaluation and optimization. Stat Sci. 2014;29(4):485–511. https://doi.org/10.1214/14-STS500. Search in Google Scholar
[18] Athey S, Imbens GW. Recursive partitioning for heterogeneous causal effects. Proc Nat Acad Sci. 2016;113(27):7353–60. https://doi.org/10.1073/pnas.1510489113. Search in Google Scholar PubMed PubMed Central
[19] Choi JY, Lee G, Lee MJ. Endogenous treatment effect for any response conditional on control propensity score. Stat Probability Lett. 2023;196:109747. https://doi.org/10.1016/j.spl.2022.109747. Search in Google Scholar
[20] Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. https://doi.org/10.1093/biomet/70.1.41. Search in Google Scholar
[21] Swanson SA, Hernán MA. Think globally, act globally: an epidemiologist’s perspective on instrumental variable estimation. Stat Sci. 2014;29(3):371–4. https://doi.org/10.1214/14-STS491. Search in Google Scholar
[22] Mogstad M, Torgovitsky A. Identification and extrapolation of causal effects with instrumental variables. Ann Rev Econ. 2018;10:577–613. https://doi.org/10.1146/annurev-economics-101617-041813. Search in Google Scholar
[23] Ichimura H. Semiparametric least squares (SLS) and weighted SLS estimation of single index models. J Econom. 1993;58(1–2):71–120. https://doi.org/10.1016/0304-4076(93)90114-K. Search in Google Scholar
[24] Hastie T, Tibshirani R, Friedman J. The elements of statistical learning. 2nd ed. Springer; 2009. https://doi.org/10.1007/978-0-387-84858-7. Search in Google Scholar
[25] Chernozhukov V, Chetverikov D, Demirer M, Duflo E, Hansen C, Newey W, et al. Double/debiased machine learning for treatment and structural parameters. Econom J. 2018;21(1):C1–68. https://doi.org/10.1111/ectj.12097. Search in Google Scholar
[26] Lee MJ. Simple least squares estimator for treatment effects using propensity score residuals. Biometrika. 2018;105(1):149–64. https://doi.org/10.1093/biomet/asx062. Search in Google Scholar
[27] Lee MJ. Instrument residual estimator for any response variable with endogenous binary treatment. J R Stat Soc (Ser B). 2021;83(3):612–35. https://doi.org/10.1111/rssb.12442. Search in Google Scholar
[28] Gale WG, Scholz JK. IRAs and household saving. Amer Econ Rev. 1994;84(5):1233–60. Search in Google Scholar
[29] Poterba JM, Venti SF, Wise DA. Do 401(k) contributions crowd out other personal saving?. J Public Econ. 1995;58(1):1–32. https://doi.org/10.1016/0047-2727(94)01462-W. Search in Google Scholar
[30] Madrian BC, Shea DF. The Power of suggestion: inertia in 401(k) participation and savings behavior. Quarter J Econ. 2001;116(4):1149–87. https://doi.org/10.1162/003355301753265543. Search in Google Scholar
[31] Benjamin DJ. Does 401(k) eligibility increase saving? Evidence from propensity score subclassification. J Public Econ. 2003;87(5–6):1259–90. https://doi.org/10.1016/S0047-2727(01)00167-0. Search in Google Scholar
[32] Chetty R, Friedman JN, Leth-Petersen S, Nielsen TH, Olsen T. Active vs. passive decisions and crowd-out in retirement savings accounts: evidence from Denmark. Quarter J Econ. 2014;129(3):1141–219. https://doi.org/10.1093/qje/qju013. Search in Google Scholar
© 2023 the author(s), published by De Gruyter
This work is licensed under the Creative Commons Attribution 4.0 International License.
Articles in the same Issue
- Research Articles
- Adaptive normalization for IPW estimation
- Matched design for marginal causal effect on restricted mean survival time in observational studies
- Robust inference for matching under rolling enrollment
- Attributable fraction and related measures: Conceptual relations in the counterfactual framework
- Causality and independence in perfectly adapted dynamical systems
- Sensitivity analysis for causal decomposition analysis: Assessing robustness toward omitted variable bias
- Instrumental variable regression via kernel maximum moment loss
- Randomization-based, Bayesian inference of causal effects
- On the pitfalls of Gaussian likelihood scoring for causal discovery
- Double machine learning and automated confounder selection: A cautionary tale
- Randomized graph cluster randomization
- Efficient and flexible mediation analysis with time-varying mediators, treatments, and confounders
- Minimally capturing heterogeneous complier effect of endogenous treatment for any outcome variable
- Quantitative probing: Validating causal models with quantitative domain knowledge
- On the dimensional indeterminacy of one-wave factor analysis under causal effects
- Heterogeneous interventional effects with multiple mediators: Semiparametric and nonparametric approaches
- Exploiting neighborhood interference with low-order interactions under unit randomized design
- Robust variance estimation and inference for causal effect estimation
- Bounding the probabilities of benefit and harm through sensitivity parameters and proxies
- Potential outcome and decision theoretic foundations for statistical causality
- 2D score-based estimation of heterogeneous treatment effects
- Identification of in-sample positivity violations using regression trees: The PoRT algorithm
- Model-based regression adjustment with model-free covariates for network interference
- All models are wrong, but which are useful? Comparing parametric and nonparametric estimation of causal effects in finite samples
- Confidence in causal inference under structure uncertainty in linear causal models with equal variances
- Special Issue on Integration of observational studies with randomized trials - Part II
- Personalized decision making – A conceptual introduction
- Precise unbiased estimation in randomized experiments using auxiliary observational data
- Conditional average treatment effect estimation with marginally constrained models
- Testing for treatment effect twice using internal and external controls in clinical trials
Articles in the same Issue
- Research Articles
- Adaptive normalization for IPW estimation
- Matched design for marginal causal effect on restricted mean survival time in observational studies
- Robust inference for matching under rolling enrollment
- Attributable fraction and related measures: Conceptual relations in the counterfactual framework
- Causality and independence in perfectly adapted dynamical systems
- Sensitivity analysis for causal decomposition analysis: Assessing robustness toward omitted variable bias
- Instrumental variable regression via kernel maximum moment loss
- Randomization-based, Bayesian inference of causal effects
- On the pitfalls of Gaussian likelihood scoring for causal discovery
- Double machine learning and automated confounder selection: A cautionary tale
- Randomized graph cluster randomization
- Efficient and flexible mediation analysis with time-varying mediators, treatments, and confounders
- Minimally capturing heterogeneous complier effect of endogenous treatment for any outcome variable
- Quantitative probing: Validating causal models with quantitative domain knowledge
- On the dimensional indeterminacy of one-wave factor analysis under causal effects
- Heterogeneous interventional effects with multiple mediators: Semiparametric and nonparametric approaches
- Exploiting neighborhood interference with low-order interactions under unit randomized design
- Robust variance estimation and inference for causal effect estimation
- Bounding the probabilities of benefit and harm through sensitivity parameters and proxies
- Potential outcome and decision theoretic foundations for statistical causality
- 2D score-based estimation of heterogeneous treatment effects
- Identification of in-sample positivity violations using regression trees: The PoRT algorithm
- Model-based regression adjustment with model-free covariates for network interference
- All models are wrong, but which are useful? Comparing parametric and nonparametric estimation of causal effects in finite samples
- Confidence in causal inference under structure uncertainty in linear causal models with equal variances
- Special Issue on Integration of observational studies with randomized trials - Part II
- Personalized decision making – A conceptual introduction
- Precise unbiased estimation in randomized experiments using auxiliary observational data
- Conditional average treatment effect estimation with marginally constrained models
- Testing for treatment effect twice using internal and external controls in clinical trials