Abstract
In a recent article, VanderWeele and Vansteelandt (American Journal of Epidemiology, 2010, 172:1339–1348) (hereafter VWV) build on results due to Judea Pearl on causal mediation analysis and derive simple closed-form expressions for so-called natural direct and indirect effects in an odds ratio context for a binary outcome and a continuous mediator. The expressions obtained by VWV make two key simplifying assumptions:
The mediator is normally distributed with constant variance.
The binary outcome is rare.
Recent advances in causal inference have provided a mathematical formalization of mediation analysis (Robins and Greenland 1992; Pearl 2001, 2011). Specifically, the counterfactual language of causal inference has allowed for new definitions of causal effects in the mediation context, accompanied by formal identification conditions, and corresponding nonparametric formulae for computing these new types of causal effects (Robins and Greenland 1992; Pearl 2001, 2011; van der Laan and Petersen 2005; Imai et al. 2010; VanderWeele and Vansteelandt 2010; Tchetgen Tchetgen and Shiptser 2012; Tchetgen Tchetgen 2011). In a recent manuscript, VanderWeele and Vansteelandt (2010) (hereafter VWV) build on results due to Judea Pearl (2001, 2011) on causal mediation analysis and derive simple closed-form expressions for so-called natural direct and indirect effects in an odds ratio context for a binary outcome and a continuous mediator.
As in VWV, we let A denote an exposure of interest, Y a dichotomous outcome, and M a potential mediator. We let C denote a set of baseline covariates not affected by the exposure. The relations among these variables are depicted in Figure 1.

Example of mediation with exposure A, mediator M, outcome Y, and confounder C
We assume that for each level ,
there exist a potential outcome
corresponding to the outcome had possibly contrary to fact the exposure and mediator variables taken the value
and for
, there exist a counterfactual variable
corresponding to the mediator variable had possibly contrary to fact the exposure variable taken the value
VWV then define the odds ratio natural direct and indirect


As described by VWV, the conditional natural direct effect can be interpreted as comparing the odds, conditional on
, of the outcome Y if exposure had been a, but if the mediator had been fixed to
, to the odds, conditional on
, of the outcome Y if exposure had been
but if the mediator had been fixed at the same level
. The conditional natural indirect effect
can be interpreted as comparing the odds, conditional on
, of the outcome Y if exposure had been a but if the mediator had been fixed to
to the odds, conditional on
, of the outcome Y if exposure had been a but if the mediator had been fixed to
. Recall that the odds ratio total effect of A on Y conditional on
is givenby

Then, it is straightforward to verify the odds ratio total effect decomposition

Identification of natural direct and indirect effects requires additional assumptions. Throughout, we follow VWV and assume:
Consistency


In addition, we assume:
Ignorability

The above ignorability assumption would generally hold under Pearl’s nonparametric structural equations model (Pearl 2011), in which case, the assumption states that there is no unmeasured common cause of A and of A and
and of M and
In order to obtain closed-form expressions for natural direct and indirect effects, VWV also make two simplifying assumptions which are reproduced as follows:
A. The mediator is normally distributed with constant variance
B. The binary outcome is rare.
Assumption A may not always be appropriate in epidemiologic applications, for instance, if the distribution of the mediator is highly skew. In this note, the author shows that under an assumption of “no mediator–exposure odds-ratio interaction”, the simple formulae of VWV continue to hold even when the normality assumption of the mediator is dropped. The author further shows that when the “no interaction” assumption is relaxed, the formula of VWV for the natural indirect effect is still correct if assumption A is also dropped. However, an alternative formula to that of VWV for the natural direct effect is needed for this setting. When the outcome is not rare, the author presents some simple expressions for the odds ratio natural direct and indirect effects under an alternative assumption C to A and B, that the mediator follows a so-called bridge distribution (Wang and Louis 2003). Although the bridge distribution leads to simple expressions of direct and indirect effects even when the outcome is not rare, similar to the normality assumption A, the bridge distribution may not always be appropriate in epidemiologic applications. Thus, the author presents a more viable alternative, which entails using a recently proposed technique for direct risk ratio estimation for a binary outcome that may not be rare, for risk ratio mediation analysis free of assumptions A and B.
1 Relaxing the normality assumption
To proceed, consider the statistical models studied by VWV:
![[1]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq8.png)
and
![[2]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq9.png)
where, under eq. [2] the residual error for the linear regression of M on
is normally distributed with constant variance. VWV show that under assumptions A and B, and models [1] and [2], the odds ratio natural direct and indirect effects are approximately:
![[3]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq10.png)
![[4]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq11.png)
where the approximation holds to the extent the rare outcome assumption is valid. For a fixed value , the total causal effect of A on Y within levels of
comparing the odds of Y when
versus when

can be decomposed on the odds ratio scale into natural direct and indirect causal effects according to:
![[5]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq13.png)
In the appendix, we show that formulae [3] and [4] and therefore, formula [5] continues to hold even if the normality assumption is replaced by the weaker assumption:
A’. The residual error is independent of
but its distribution is otherwise unrestricted.
Thus, by eliminating the requirement that the mediator is normally distributed, the result considerably broadens the range of settings where the formulae of VWV apply. In fact, the result states that eqs [3] and [4] continue to hold even when the mediator is not normally distributed, provided that the regression model [2] completely captures the association between exposure and confounders, and the mediator, i.e.
does not further depend on
The above result relies on the absence of an exposure–mediator interaction in the logistic regression [1]. VWV also considered mediation analyses under a slightly more general regression:
![[6]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq14.png)
where now encodes a possible interaction between the exposure and the mediator. Under assumptions A and B, and models [2] and [6], VWV show that

In the appendix, we show that the formula in the above display continues to hold if assumption A is replaced by the weaker assumption A’. However, the formula for given in VWV under model [6] no longer applies if assumption A does not hold, even if assumption A’ holds. The correct expression for
is given under Assumption A’ in the appendix. For inference, the standard error of estimators of
and
under the various modeling assumptions considered above can be obtained as in VWV by a straightforward application of the delta method, details are relegated to the appendix.
2 Relaxing the rare disease assumption
2.1 Odds ratio mediation analysis
In this section, simple expressions are obtained for the natural direct and indirect odds ratios and
without assumption B of a rare outcome. The formulae are obtained upon replacing both assumptions A (or equivalently assumptions A’) and B with a single alternative distributional assumption for the mediator density:
C. The conditional density of follows a bridge distribution (more specifically the bridge distribution for the logit link) introduced by Wang and Louis (2003). The bridge distribution has the following density function:


The bridge density is denoted with the first argument indicating that it has mean zero,
is a scaling parameter and the subscript l stands for logistic. The variance of
can be expressed in terms of
:

so that the variance of approaches zero as
approaches one.
is symmetric and unimodal similar to the Gaussian density (Wang and Louis 2003). However, when standardized to have unit variance, the bridge density can be shown to have slightly heavier tails than the standard normal and lighter tails than the standard logistic. Wang and Louis (2003) provide a detailed study of
and we refer the reader to their manuscript for additional information about this density. For the purposes of this note, the bridge distribution
is mainly of interest, because it produces simple formulae of natural direct and indirect effects on the odds ratios scale even if the outcome is not rare. As shown in the appendix, this follows from the bridge distribution being the unique covariate distribution for which logistic regression is collapsible. Specifically, marginalization of logistic regression with respect to a single covariate with a bridge distribution is again a logistic regression. For instance, consider the standard logistic regression model [1] of Y given
then under model [2] paired with assumption C, marginalizing over M gives a regression model of Y given
which is again a standard logistic regression:

where

and

Similar expressions relating and
to
, and
are provided in the appendix. A more general formulation of the above result is used in the appendix to establish that under models [1] and [2], and assumption C:
![[7]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq22.png)
![[8]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq23.png)
Note the similarity between formulae [3] and [4], and formulae [7] and [8] where the factor k in the latter two expressions accounts for the outcome not being rare. Note however that, whereas eqs [3] and [4] are only approximate, formulae [7] and [8] are exact. Analogous expressions are derived in the appendix for model [6] which allows for a possible interaction between the mediator and the exposure, and details for obtaining inference are also provided in the appendix.
2.2 Risk ratio mediation analysis
Although assumption C of a bridge distribution for the mediator leads to the simple expressions of direct and indirect effects of the previous section, similar to the normality assumption, the bridge distribution may not always be appropriate in epidemiologic applications. As an alternative, one may wish to conduct mediation analyses on a risk ratio scale even when the outcome is not rare. It is straightforward to establish that the formulae previously obtained under assumption B that the disease is rare, essentially continue to hold when the disease is not rare, upon replacing the logit link function of eqs [1] and [6], with the log-link function. For estimation, a variety of methods exist to compute the risk ratio parameters of the log-linear regression of Y on (Breslow 1974; Wacholder 1986; Lee 1994; Skov et al. 1998; Greenland 2004; Zou 2004; Spiegelman and Hertzmark 2005; Chu and Cole 2010; Tchetgen Tchetgen 2012). The recent two-stage estimator of risk ratio regression of Tchetgen Tchetgen (2012) is of particular interest because of its computational stability, ease of implementation, and asymptotic efficiency properties. For instance, the estimator of Tchetgen Tchetgen (2012) delivers asymptotically efficient estimates of the regression coefficients
in eq. [6] with the logit link replaced by the log link, without requiring an estimate of the intercept
and therefore, it avoids the convergence issues of some other methods such as the log-binomial approach of Wacholder (1986). Estimation details may be found in Tchetgen Tchetgen (2012), and after obtaining these risk ratio estimates, one can subsequently compute the risk ratio natural direct and indirect effects using the formulae previously derived for
and
, respectively, for the rare outcome case. However, we note that the expressions derived for the direct and indirect effects under a rare outcome approximation are now exact, irrespective of the prevalence of the outcome. An important advantage of conducting mediation analyses on the risk ratio scale as we have described is that when model mis-specification is absent, the resulting inferences are generally valid provided assumption A’ holds, even if assumptions A–C do not hold.
3 Concluding remarks
In this note, the author has extended the results of VWV in a number of interesting directions, by providing weaker conditions under which their proposed estimators of natural direct and indirect effects remain valid, and by providing alternative distributional assumptions under which the assumption of a rare outcome can be dropped and yet simple formulae are still available for easy use in epidemiologic practice. However, it is important to note that as in VWV, the methods described herein rely on fairly strong modeling assumptions and can deliver possibly biased inferences in the presence of modeling error of either model [2] or model [6]. As a possible remedy, alternative so-called multiply robust estimators have recently been proposed, that deliver valid inferences about natural direct and indirect effects even when, a model for the joint conditional density of given C is partially mis-specified (Tchetgen Tchetgen and Shpitser 2011, 2012; Zheng and van der Laan 2012).
Appendix
Closed-form expressions for
and 
Under the identifying consistency and ignorability assumptions, assumptions A’ and B given in this article, and the parametric modeling assumptions [2] and [7], we have that
![[9]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq24.png)
where is the moment generating function of
evaluated at
Note that under our assumptions,
![[10]](/document/doi/10.1515/em-2012-0005/asset/graphic/em-2012-0005_eq25.png)
where is the moment generating function of
evaluated at
We conclude that by a result due to Pearl (2001, 2011) (also see VWV)

and

which reduces to the formulae provided in the text for the special case where For inference when
estimation of
requires an estimator of
and
To motivate a simple estimator of the latter quantity, note that under model [2] and assumption C:

since the numerator is equal to

and thus, we similarly have that

which gives

We conclude that and
and therefore,
is consistently estimated upon substituting empirical averages for unknown marginal expectations and consistentestimates for unknown parameters in the equation in the above display. Note that consistentestimation of
and
are readily obtained by standard logistic regression maximum likelihood which gives
and ordinary least-squares which gives
respectively.
The variance–covariance matrix of the resulting estimator of
is obtained using a straightforward application of the delta method and details can be found in VWV. The variance–covariance matrix of
is similarly obtained under the “no interaction” assumption. However, more generally when
requires derivations not included in VWV. To proceed, let
denote the influence function of
Let

Then one can show that the influence function of is givenby

where:

with

with

and

with

and

and

with

Thus, the large sample variance of is approximately given by

A consistent estimator of the above quantity is obtained by substituting empirical expectations for all unknown expectations and consistent estimators of unknown parameters. The above construction requires the influence function for standard logistic regression and ordinary least squares estimation, which is

with , and
Closed-form expressions for
and
under a Bridge distribution
Consider the logistic regression model

where

and

Note that

where is a bridge density with rescaling parameter

Then, by Wang and Louis (2003):

is a standard logistic regression, and therefore,



under “no interaction”, i.e. , we have

A consistent estimator of
is obtained by the method of moment upon noting that
solves the population equation:

where It can then be shown that the influence function of
is given by

Let and
denote the estimators of
and
, respectively, obtained upon substituting
for
The large sample variances of
and
are then obtained by a straightforward application of the delta method, mainly:

where

and

where

References
Breslow, N. E. (1974). Covariance analysis of censored survival data. Biometrics, 30:89–99.Search in Google Scholar
Chu, H., and Cole,S. (2010). Estimation of risk ratios in cohort studies with common outcomes: a Bayesian approach. Epidemiology, 21(6):855–62.Search in Google Scholar
Greenland, S. (2004). Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case-control studies. American Journal of Epidemiology, 160:301–305.Search in Google Scholar
Imai, K., Keele,L., and Yamamoto,T. (2010). Identification, inference and sensitivity analysis for causal mediation effects. Statistical Science, 25:51–71.Search in Google Scholar
van der Laan, M., and Petersen,M. (2005). Direct effect models. U.C. Berkeley Division of Biostatistics Working Paper Series. Paper 187. Available at http://www.bepress.com/ucbbiostat/paper187.Search in Google Scholar
Lee, J. (1994). Odds ratio or relative risk for cross-sectional data?International Journal of Epidemiology, 23(1):201–203. doi:10.1093/ije/23.1.20.Search in Google Scholar
Pearl, J. (2001). Direct and indirect effects. In: Proceedings of the seventeenth conference on uncertainty and artificial intelligence. San Francisco, CA: Morgan Kaufmann, 411–420.Search in Google Scholar
Pearl, J. (2011). The mediation formula: a guide to the assessment of causal pathways in nonlinear models. To appear in Causality: Statistical Perspectives and Applications, C.Berzuini, P.Dawid, and L.Bernardinelli (Eds.), 151–179. Chichester, UK: John Wiley and Sons, Ltd. Forthcoming, 2011.Search in Google Scholar
Robins, J. M., and Greenland,S. (1992). Identifiability and exchangeability for direct and indirect effects. Epidemiology, 3:143–155.Search in Google Scholar
Skov, T., Deddens,J., Petersen,M. R., and Endahl,L. (1998). Prevalence proportion ratios: estimation and hypothesis testing. International Journal of Epidemiology, 27:91–95.Search in Google Scholar
Spiegelman, D., and Hertzmark,E. (2005). Easy SAS calculations for risk or prevalence ratios and differences. American Journal of Epidemiology, 162:199–200.Search in Google Scholar
Tchetgen Tchetgen, E. J. (2011). On causal mediation analysis with a survival outcome. The International Journal of Biostatistics, 7(1): Article 33, 1–38.10.2202/1557-4679.1351Search in Google Scholar PubMed PubMed Central
Tchetgen Tchetgen, E. J. (2012). Estimation of risk ratios in cohort studies with a common outcome: a simple and efficient two-stage approach. The International Journal of Biostatistics, 1–13.Search in Google Scholar
Tchetgen Tchetgen, E. J., and Shpitser,I. (2011). Semiparametric estimation of models for natural direct and indirect effects. Submitted. Available at http://www.bepress.com/harvardbiostat/paper129.Search in Google Scholar
Tchetgen Tchetgen, E. J., and Shpitser,I. (2012). Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness and sensitivity analysis. Annals of Statistics, 40(3):1816–1845.Search in Google Scholar
VanderWeele, T. J., and Vansteelandt,S. (2010).Odds ratios for mediation analysis with a dichotomous outcome. American Journal of Epidemiology, 172:1339–1348.Search in Google Scholar
Wacholder, S. (1986).Binomial regression in GLIM: estimating risk ratios and risk differences. American Journal of Epidemiology, 123:174–184.Search in Google Scholar
Wang, Z., and Louis,T. (2003). Matching conditional and marginal shapes in binary random intercept models using a bridge distribution function. Biometrika, 90(4):765–775.Search in Google Scholar
Zeng, W., and van der Laan,M. (2012). Targeted maximum likelihood estimation of natural direct effects. The International Journal of Biostatistics, 8(1):1–40.Search in Google Scholar
Zou, G. Y. (2004). A modified poisson regression approach to prospective studies with binary data. American Journal of Epidemiology, 159:702–706.Search in Google Scholar
©2014 by Walter de Gruyter Berlin / Boston
Articles in the same Issue
- Masthead
- Masthead
- Comparison of Approaches to Weight Truncation for Marginal Structural Cox Models
- A Note on Formulae for Causal Mediation Analysis in an Odds Ratio Context
- Reducing Mean Squared Error in the Analysis of Binary Paired Data
- Extended Matrix and Inverse Matrix Methods Utilizing Internal Validation Data When Both Disease and Exposure Status Are Misclassified
- Scaling Oversmoothing Factors for Kernel Estimation of Spatial Relative Risk
- A Simulation Study of Relative Efficiency and Bias in the Nested Case–Control Study Design
- Mediation Analysis with Multiple Mediators
Articles in the same Issue
- Masthead
- Masthead
- Comparison of Approaches to Weight Truncation for Marginal Structural Cox Models
- A Note on Formulae for Causal Mediation Analysis in an Odds Ratio Context
- Reducing Mean Squared Error in the Analysis of Binary Paired Data
- Extended Matrix and Inverse Matrix Methods Utilizing Internal Validation Data When Both Disease and Exposure Status Are Misclassified
- Scaling Oversmoothing Factors for Kernel Estimation of Spatial Relative Risk
- A Simulation Study of Relative Efficiency and Bias in the Nested Case–Control Study Design
- Mediation Analysis with Multiple Mediators