Abstract
Adjusting for covariates is a well-established method to estimate the total causal effect of an exposure variable on an outcome of interest. Depending on the causal structure of the mechanism under study, there may be different adjustment sets, equally valid from a theoretical perspective, leading to identical causal effects. However, in practice, with finite data, estimators built on different sets may display different precisions. To investigate the extent of this variability, we consider the simplest non-trivial non-linear model of a v-structure on three nodes for binary data. We explicitly compute and compare the variance of the two possible different causal estimators. Further, by going beyond leading-order asymptotics, we show that there are parameter regimes where the set with the asymptotically optimal variance does depend on the edge coefficients, a result that is not captured by the recent leading-order developments for general causal models. As a practical consequence, the adjustment set selection needs to account for the relative magnitude of the relationships between variables with respect to the sample size and cannot rely on purely graphical criteria.
1 Introduction
As graphical representations of multivariate probability distributions, Bayesian networks are widely used statistical models with an underlying directed acyclic graph (DAG) structure. When taking DAGs to represent causal diagrams [1,2, 3,4], we may use a machinery based on the “do” calculus [5] to estimate potential intervention effects of any variable on any other. Different graphical criteria exist to identify valid adjustment sets, among which the back-door criterion [6] is probably the most well known, and with more generalised strategies developed more recently [7,8].
A valid adjustment set
For linear Gaussian models, the marginalisation can be simply estimated by regressing
The set of parents of
In evaluating the variance of different estimators, the remarkable result that the asymptotically optimal adjustment set can be determined solely based on graphical criteria regardless of the edge coefficients has recently been obtained [10]. Even more recently, this has been extended to non-parametric estimators [11], and the asymptotically optimal set has been further characterised [12].
To explore the precision of causal estimators for non-linear models, we consider the simplest such case: a DAG with three nodes of binary variables organised in a v-structure with the outcome
and estimate causal effects from them.

A v-structure on three nodes.
In the simple case of a v-structure (as in Figure 1), there is no confounding of the effect of
A valid expression for computing the total causal effect of
where we used the subscript
However, by definition, the conditional distribution of
Therefore, another valid expression for the total causal effect of
where we used the subscript
A more general way of understanding equation (6) is by observing that in the case of the v-structure
with the latter equality justified by structural and invariance properties and also in agreement with the standardisation formula in equation (1).
Since we see that adjustment by
It is instructive to also consider the DAG with the edge from
The questions of whether adjusting for baseline covariates is justified or even desirable is indeed a recurrent one with a long history in the context of randomised controlled trials [15,16, 17,18,19, 20,21]. An extensive body of literature exists with several proposals to exploit covariate adjustment in order to build more efficient estimators [22,23,24, 25,26,27]. Health authority guidelines also typically recommend adjustment for the sake of improving precision [28,29,30]; however, in the case of non-linear models, special care must be taken to account for the potential non-collapsibility of effect measures [31,32,33] and that the estimand may change depending on the method used for adjustment [34]. Recent work [26] examines the performance of covariate-adjusted estimators in clinical trials with binary, ordinal, or time-to-event analysis. For binary outcomes, they present simulation results where the baseline covariate represents categories of age and the estimand is the risk difference, for a magnitude of the prognostic value of the covariate mimicking the association found in observational data. Our study considers the question from a theoretical and causal diagram perspective, in a slightly simplified scenario where the treatment, outcome, and prognostic factors are all binary variables, but where we evaluate how things change with the strength of the prognostic value of the covariate.
Therefore, we consider the v-structure since it provides the simplest example of a causal diagram where there is a choice between different adjustment sets. If we add an edge in the graph of Figure 1 connecting
2 Causal estimates for a binary v-structure
For both causal estimators, we will use the maximum likelihood estimates of probabilities from the observed data. We consider the DAG in Figure 1 with the following probability tables:
When we generate data, as a collection of
(9) If we represent with
By using the marginalisation, we would have the following estimator:
with the terms separated for later ease
These estimators, as they rely on observed data frequencies, are non-parametric and fit in the recent general framework for arbitrary graphs [11]. The key advance of our derivation with respect to their result is that we consider terms beyond the leading-order asymptotics and compute the variance of the estimators for arbitrary sample sizes, which further enables us to perform more detailed asymptotic analyses.
2.1 Raw conditionals
To compute
for which we use that fact that
with the introduction of four auxiliary generating variables
Setting all the generating variables to 1 removes them from consideration and the generating function simplifies to the value 1:
The advantage of using generating functions [35] is that we can express expectations in terms of differential and integral operators. For example, the operator
Removing the generating variables after applying the operator leads to
which is an expectation over the multinomial probability distribution of a binary v-structure. To actually perform the differentiation, we employ the compact form of
The integral operator
which follows from applying the operators to equation (14). When we substitute for the generating variables (which sets
2.1.1 The variance
To compute the variance
we first show that the covariance is 0
The last equality follows by comparing the values of
The more tricky terms are as follows:
The remaining integral can be expressed in terms of hypergeometric functions:
where the first step follows from performing a binomial expansion and integrating term by term, while the second step follows from the definition of the hypergeometric function where we use the notation
We discuss bounds on this variance in Appendix A.
2.2 Marginalisation
To compute the expected value
where we include extra generating variables for all terms in our estimators for which we need the ten generating variables
and similarly for the other terms, leading to
To compute the variance, we reapply the operators of equation (27), as detailed in Appendix B.
2.3 Numerical checks
Code to evaluate the variance of the two estimators through simulation, as well as to evaluate the analytical results, is hosted at https://github.com/jackkuipers/Vcausal. As an example, for
By having the exact analytical result for any parameter combinations, we can avoid expensive Monte Carlo simulations. This allows us to plot the variances over ranges of parameter values and examine their asymptotic behaviour.
2.4 Relative difference in variances
To explore the difference in variances, we focus mainly on the case where the effect of
where
We plot the relative difference in variances of the two estimators,

The relative difference in variance of the two estimators for two sample sizes. (a)
With our general result, we can further allow for an interaction between
so that the causal effect of

The relative difference in variance of the two estimators for two interaction strengths. (a)
3 Asymptotic behaviour
To examine the asymptotic behaviour of the causal effect estimators in more detail, we return here to the setting of equation (29) with the same effect of
and for the variance of
To extract the asymptotic behaviour of the difference in variances of the two estimators, we consider
with root
so that
Note that although we used the scaling
The result in equation (35) therefore shows that the optimal adjustment set, for the effect of
Since for many practical purposes the sample size may be determined by other considerations and we can never take the limit
4 Implications for causal discovery
With a larger sample size, we may be able to detect and quantify smaller causal effects. Therefore, we wish to get a feeling for the strength of the edge
where the
The change is AIC is then
There is therefore an asymptotic regime where the edge is strong enough to detect on average using the AIC, but the estimator from raw conditionals that does not use the edge has lower variance:
which follows from the Cauchy–Schwarz inequality. The regime only vanishes when
5 Discussion
To evaluate the precision of different estimators targeting the same causal effect in causal diagrams, we considered the simple case of a v-structure for binary data and explicitly computed the variance of the two different estimators for the effect of
The results involve combinations of hypergeometric functions, suggesting that exact results for larger DAGs may be rather complex. Which estimator has the lower variance depends, among other parameters, on the relative strength of the edge from
In light of our results, it is instructive to go back to the parallel with clinical trials. Even if the simple case of three nodes with only binary variables does not cover the more general and realistic scenarios encountered in randomised clinical trials, the example is still enlightening to show that taking a gain in precision for granted under any circumstances may not always be justified. Asymptotic results suggest that the optimal adjustment set in the sense of efficiency should include the prognostic factor
By examining the asymptotic regime of large sample sizes, we could confirm the intuition that for edge strengths statistically detectable by the AIC, accounting for the edge in the estimation should generally lead to lower variance. Conversely, that the presence of statistically non-detectable edges should be ignored to achieve a lower variance.
Most importantly, we could also discover an asymptotic regime where raw conditional estimates, ignoring the edge, were more precise in the presence of statistically detectable edges. One way to appreciate the practical relevance of these findings is by observing that we can expect ranges of causal strengths, which become statistically detectable from data before we can gain precision by accounting for them in the estimation. Our detailed asymptotic analysis for the v-structure goes beyond the leading-order asymptotic result where the optimal estimator does not depend on the edge coefficients [10,11].
Outside the asymptotic regime, for finite sample sizes, the gain in precision when using marginalisation and, thus explicitly accounting for the edge presence, appears to be linked to its strength. Although the example considered here is the simplest non-trivial DAG, this finding further supports the idea that learning the full structure of the graph, beyond simply identifying a valid adjustment set, may benefit the precision of causal inference. The practical limitation with observational data is that we can only learn structures up to an equivalence class, so that we need to consider the possible range of causal effects across the whole class [37] or implement Bayesian model averaging across DAGs [13].
If we use a more stringent criterion to decide about the presence of edges, such as the BIC, for example, which implements a stronger penalisation with respect to the AIC, we may end up missing edges too weak to detect on average, but whose presence would improve the precision of the causal estimation through marginalisation. In other words, for moderately weak direct effects, the selection of suitable adjustment sets may be relatively sensitive to the choice of the score. Analogously, we may expect that optimal causal estimation may also be sensitive to the choice of learning algorithm, whether constraint-based search [38,39], score-based search [40] or Bayesian sampling [41,42,43]. Quantifying the extent by which the structure learning affects causal estimation constitutes an interesting line of further investigation.
Finally, a limitation of the current study is its relatively reduced scope, treating a very simple situation with only three binary variables. It would be interesting to see if and how results generalise in the presence of multiple and possibly non-binary covariates as well as to non-binary outcomes. Unfortunately, extensions of our approach to higher dimensions are technically challenging, especially because the number of parameters grows exponentially for larger graphs with more edges. Similarly, extending to non-binary variables requires re-framing the problem in a relatively different setting. Nevertheless, we feel that even this simple example is of practical value to highlight that paying attention to the relative strength of the prognostic value with respect to the sample size is critical to determining the extent of gain in precision. This is echoed in simulations [26], with the discussion of ref. [44] further elaborating on the relative importance of the strength of the covariate associations with the outcomes and the sample size. The conclusions we draw from our exact calculations for a non-parametric estimator of the risk difference appear to match finite sample effects in simulation analyses [44] very closely. Furthermore, it seems natural to expect similar considerations applying to different estimands, such as the risk ratio and the odds ratio, though technically more challenging to treat. Notably, simulation studies provide for a valuable alternative to gain insights about more realistic scenarios, which are technically intractable, as nicely illustrated in ref. [26].
-
Funding information: None declared.
-
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
-
Conflict of interest: The authors state no conflict of interest.
-
Data availability statement: Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.
Appendix A Bounds on the variance of the
R
estimator
This hypergeometric function in equation (24) has a maximum value at around
so that by considering the early terms in the sum, we can bound
which we can loosen to
To obtain a simple upper bound, we can compute
so that the variance vanishes in the large
B The variance of the
M
estimator
For computing the variance of
B.1 A variance
For example:
For the linear term in
while for the rest of
For the part with the factor of
on the rest we apply the operator for
The linear terms in
while the integrals lead to
Combining all the terms, subtracting the mean part squared and simplifying slightly, we obtain
B.2 The covariances
For the covariances where separate generating variables are used
it is easy to see that the operators act on
The more complicated cases are where the generating variables reoccur
For the term linear in
and
B.3 The variance
Since the terms from the covariances simplify, the complete variance is
We note that the hypergeometric functions can be written solely in terms of
C Asymptotics of the hypergeometric functions
We utilise the following asymptotic expansions of our hypergeometric functions:
and
D Asymptotic behaviour with interactions
When there is an interaction term and
to replace equation (32). In the scaling limit
where the only difference to equation (33) for
References
[1] Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology. 1999;10:37–48. 10.1097/00001648-199901000-00008Search in Google Scholar
[2] Pearl J. Causality: models, reasoning and inference. Cambridge, UK: Cambridge University Press; 2000. Search in Google Scholar
[3] Hernán MA, Robins JM. Instruments for causal inference: An epidemiologistas dream? Epidemiology. 2006;17:360–72. 10.1097/01.ede.0000222409.00878.37Search in Google Scholar PubMed
[4] VanderWeele TJ, Robins JM. Four types of effect modification: A classification based on directed acyclic graphs. Epidemiology. 2007;18:561–8. 10.1097/EDE.0b013e318127181bSearch in Google Scholar PubMed
[5] Pearl J. Causal diagrams for empirical research. Biometrika. 1995;82:669–88. 10.1093/biomet/82.4.669Search in Google Scholar
[6] Pearl J. [Bayesian analysis in expert systems]: Comment: graphical models, causality and intervention. Statist Sci. 1993;8:266–9. 10.1214/ss/1177010894Search in Google Scholar
[7] Shpitser I, VanderWeele T, Robins JM. On the validity of covariate adjustment for estimating causal effects. In: Twenty-Sixth Conference on Uncertainty in Artificial Intelligence; 2010. p. 527–36. Search in Google Scholar
[8] Perković E, Textor J, Kalisch M, Maathuis MH. Complete graphical characterization and construction of adjustment sets in Markov equivalence classes of ancestral graphs. J Machine Learn Res. 2017;18:8132–93. Search in Google Scholar
[9] Maathuis MH, Colombo D. A generalized back-door criterion. Ann Statist. 2015;43:1060–88. 10.1214/14-AOS1295Search in Google Scholar
[10] Henckel L, Perković E, Maathuis MH. Graphical criteria for efficient total effect estimation via adjustment in causal linear models; J R Stat Soc B. 2022;84:579–99.10.1111/rssb.12451Search in Google Scholar
[11] Rotnitzky A, Smucler E. Efficient adjustment sets for population average causal treatment effect estimation in graphical models. J Machine Learn Res. 2020;21:1–86. Search in Google Scholar
[12] Witte J, Henckel L, Maathuis MH, Didelez V. On efficient adjustment in causal graphs. Machine Learn Res. 2020;21:1–45. Search in Google Scholar
[13] Moffa G, Catone G, Kuipers J, Kuipers E, Freeman D, Marwaha S, et al. Using directed acyclic graphs in epidemiological research in psychosis: An analysis of the role of bullying in psychosis. Schizophrenia Bulletin. 2017;43:1273–9. 10.1093/schbul/sbx013Search in Google Scholar PubMed PubMed Central
[14] Senn S. Modelling in drug development. In Christie M, Cliffe A, Dawid P, Senn S., editors. Simplicity, complexity and modelling. Hoboken, New Jersey, U.S: Wiley; 2011. 10.1002/9781119951445.ch3.Search in Google Scholar
[15] Altman DG. Comparability of randomised groups. Statistician. 1985;34:125–36. 10.2307/2987510Search in Google Scholar
[16] Senn SJ. Covariate imbalance and random allocation in clinical trials. Statist Med. 1989;8:467–75. 10.1002/sim.4780080410Search in Google Scholar PubMed
[17] Pocock SJ, Assmann SE, Enos LE, Kasten LE. Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: Current practice and problems. Statist Med. 2002;21:2917–30. 10.1002/sim.1296Search in Google Scholar PubMed
[18] Rosenberger WF, Sverdlov O. Handling covariates in the design of clinical trials. Statist Sci. 2008;23:404–19. 10.1214/08-STS269Search in Google Scholar
[19] Austin PC, Manca A, Zwarenstein M, Juurlink DN, Stanbrook MB. A substantial and confusing variation exists in handling of baseline covariates in randomized controlled trials: A review of trials published in leading medical journals. J Clin Epidemiol. 2010;63:142–53. 10.1016/j.jclinepi.2009.06.002Search in Google Scholar PubMed
[20] Senn S. Seven myths of randomisation in clinical trials. Statist Med. 2013;32:1439–50. 10.1002/sim.5713Search in Google Scholar PubMed
[21] Wang J. Covariate adjustment for randomized controlled trials revisited. Pharmaceut Statist. 2020;19:255–61. 10.1002/pst.1988Search in Google Scholar PubMed
[22] Tsiatis AA, Davidian M, Zhang M, Lu X. Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: A principled yet flexible approach. Statist Med. 2008;27:4658–77. 10.1002/sim.3113Search in Google Scholar PubMed PubMed Central
[23] Zhang M, Tsiatis AA, Davidian M. Improving efficiency of inferences in randomized clinical trials using auxiliary covariates. Biometrics. 2008;64:707–15. 10.1111/j.1541-0420.2007.00976.xSearch in Google Scholar PubMed PubMed Central
[24] Rosenblum M, van der Laan MJ. Using regression models to analyze randomized trials: Asymptotically valid hypothesis tests despite incorrectly specified models. Biometrics. 2009;65:937–45. 10.1111/j.1541-0420.2008.01177.xSearch in Google Scholar PubMed PubMed Central
[25] Ge M, Durham LK, Meyer RD, Xie W, Thomas N. Covariate-adjusted difference in proportions from clinical trials using logistic regression and weighted risk differences. Therapeutic Innovat Regulat Sci. 2011;45:481–93. 10.1177/009286151104500409Search in Google Scholar
[26] Benkeser D, Díaz I, Luedtke A, Segal J, Scharfstein D, Rosenblum M. Improving precision and power in randomized trials for COVID-19 treatments using covariate adjustment, for binary, ordinal, and time-to-event outcomes. Biometrics. 2021;77:1467–81. 10.1111/biom.13377Search in Google Scholar PubMed PubMed Central
[27] Morris TP, Walker AS, Williamson EJ, White IR. Planning a method for covariate adjustment in individually - randomised trials: A practical guide; Trials. 2022;23:328. 10.1186/s13063-022-06097-zSearch in Google Scholar PubMed PubMed Central
[28] EMA. Guideline on adjustment for baseline covariates in clinical trials; 2015. Search in Google Scholar
[29] FDA, Draft Guidance. Adjusting for covariates in randomized clinical trials for drugs and biological products; 2021. Search in Google Scholar
[30] FDA, Guidance for Industry. COVID-19: Developing drugs and biological products for treatment or prevention; 2021. Search in Google Scholar
[31] Freedman DA. Randomization does not justify logistic regression. Statist Sci. 2008;23:237–49. 10.1017/CBO9780511815874.015Search in Google Scholar
[32] Moore KL, van der Laan MJ. Covariate adjustment in randomized trials with binary outcomes: Targeted maximum likelihood estimation. Statistic Med. 2009;28:39–64. 10.1002/sim.3445Search in Google Scholar PubMed PubMed Central
[33] Daniel R, Zhang J, Farewell D. Making apples from oranges: Comparing noncollapsible effect estimators and their standard errors after adjustment for different covariate sets. Biometric J. 2021;63:528–57. 10.1002/bimj.201900297Search in Google Scholar PubMed PubMed Central
[34] Permutt T. Do covariates change the estimand? Statistic Biopharmaceutic Res. 2020;12:45–53. 10.1080/19466315.2019.1647874Search in Google Scholar
[35] Wilf HS. Generating functionology. Boca Raton, Florida, USA: CRC Press; 2005. 10.1201/b10576Search in Google Scholar
[36] Wilks SS. The large-sample distribution of the likelihood ratio for testing composite hypotheses. Ann Math Statistic. 1938;9:60–2. 10.1214/aoms/1177732360Search in Google Scholar
[37] Maathuis MH, Kalisch M, Bühlmann P. Estimating high-dimensional intervention effects from observational data. Ann Statistic. 2009;37:3133–64. 10.1214/09-AOS685Search in Google Scholar
[38] Spirtes P, Glymour CN, Scheines R. Causation, prediction, and search. Cambridge, Massachusetts, USA: MIT Press; 2000. 10.7551/mitpress/1754.001.0001Search in Google Scholar
[39] Kalisch M, Bühlmann P. Estimating high-dimensional directed acyclic graphs with the PC-Algorithm. J Machine Learn Res. 2007;8:613–36. Search in Google Scholar
[40] Chickering DM. Optimal structure identification with greedy search. J Machine Learn Res. 2002;3:507–54. Search in Google Scholar
[41] Friedman N, Koller D. Being Bayesian about network structure. A Bayesian approach to structure discovery in Bayesian networks. Machine Learn. 2003;50:95–125. 10.1023/A:1020249912095Search in Google Scholar
[42] Kuipers J, Moffa G. Partition MCMC for inference on acyclic digraphs. J Am Statistic Assoc. 2017;112:282–99. 10.1080/01621459.2015.1133426Search in Google Scholar
[43] Kuipers J, Suter P, Moffa G. Efficient sampling and structure learning of Bayesian networks. J Comput Graphic Statist. 2022. 10.1080/10618600.2021.2020127. Search in Google Scholar
[44] Zhang M, Zhang B. Discussion of “Improving precision and power in randomized trials for COVID-19 treatments using covariate adjustment, for binary, ordinal, and time-to-event outcomes”. Biometrics. 2021;77:1485–8. 10.1111/biom.13492Search in Google Scholar PubMed PubMed Central
© 2022 Jack Kuipers and Giusi Moffa, published by De Gruyter
This work is licensed under the Creative Commons Attribution 4.0 International License.
Articles in the same Issue
- Editorial
- Causation and decision: On Dawid’s “Decision theoretic foundation of statistical causality”
- Research Articles
- Simple yet sharp sensitivity analysis for unmeasured confounding
- Decomposition of the total effect for two mediators: A natural mediated interaction effect framework
- Causal inference with imperfect instrumental variables
- A unifying causal framework for analyzing dataset shift-stable learning algorithms
- The variance of causal effect estimators for binary v-structures
- Treatment effect optimisation in dynamic environments
- Optimal weighting for estimating generalized average treatment effects
- A note on efficient minimum cost adjustment sets in causal graphical models
- Estimating marginal treatment effects under unobserved group heterogeneity
- Properties of restricted randomization with implications for experimental design
- Clarifying causal mediation analysis: Effect identification via three assumptions and five potential outcomes
- A generalized double robust Bayesian model averaging approach to causal effect estimation with application to the study of osteoporotic fractures
- Sensitivity analysis for causal effects with generalized linear models
- Individualized treatment rules under stochastic treatment cost constraints
- A Lasso approach to covariate selection and average treatment effect estimation for clustered RCTs using design-based methods
- Bias attenuation results for dichotomization of a continuous confounder
- Review Article
- Causal inference in AI education: A primer
- Commentary
- Comment on: “Decision-theoretic foundations for statistical causality”
- Decision-theoretic foundations for statistical causality: Response to Shpitser
- Decision-theoretic foundations for statistical causality: Response to Pearl
- Special Issue on Integration of observational studies with randomized trials
- Identifying HIV sequences that escape antibody neutralization using random forests and collaborative targeted learning
- Estimating complier average causal effects for clustered RCTs when the treatment affects the service population
- Causal effect on a target population: A sensitivity analysis to handle missing covariates
- Doubly robust estimators for generalizing treatment effects on survival outcomes from randomized controlled trials to a target population
Articles in the same Issue
- Editorial
- Causation and decision: On Dawid’s “Decision theoretic foundation of statistical causality”
- Research Articles
- Simple yet sharp sensitivity analysis for unmeasured confounding
- Decomposition of the total effect for two mediators: A natural mediated interaction effect framework
- Causal inference with imperfect instrumental variables
- A unifying causal framework for analyzing dataset shift-stable learning algorithms
- The variance of causal effect estimators for binary v-structures
- Treatment effect optimisation in dynamic environments
- Optimal weighting for estimating generalized average treatment effects
- A note on efficient minimum cost adjustment sets in causal graphical models
- Estimating marginal treatment effects under unobserved group heterogeneity
- Properties of restricted randomization with implications for experimental design
- Clarifying causal mediation analysis: Effect identification via three assumptions and five potential outcomes
- A generalized double robust Bayesian model averaging approach to causal effect estimation with application to the study of osteoporotic fractures
- Sensitivity analysis for causal effects with generalized linear models
- Individualized treatment rules under stochastic treatment cost constraints
- A Lasso approach to covariate selection and average treatment effect estimation for clustered RCTs using design-based methods
- Bias attenuation results for dichotomization of a continuous confounder
- Review Article
- Causal inference in AI education: A primer
- Commentary
- Comment on: “Decision-theoretic foundations for statistical causality”
- Decision-theoretic foundations for statistical causality: Response to Shpitser
- Decision-theoretic foundations for statistical causality: Response to Pearl
- Special Issue on Integration of observational studies with randomized trials
- Identifying HIV sequences that escape antibody neutralization using random forests and collaborative targeted learning
- Estimating complier average causal effects for clustered RCTs when the treatment affects the service population
- Causal effect on a target population: A sensitivity analysis to handle missing covariates
- Doubly robust estimators for generalizing treatment effects on survival outcomes from randomized controlled trials to a target population