Estimating Case-Fatality Reduction from Randomized Screening Trials

Sudipta Saha; Zhihui (Amy) Liu; Olli Saarela

doi:10.1515/em-2018-0007

Article Publicly Available

Estimating Case-Fatality Reduction from Randomized Screening Trials

Sudipta Saha , Zhihui (Amy) Liu and Olli Saarela

Published/Copyright: November 7, 2018

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Epidemiologic Methods Volume 7 Issue 1

Abstract

In randomized cancer screening trials where asymptomatic individuals are assigned to undergo a regimen of screening examinations or standard care, the primary objective typically is to estimate the effect of screening assignment on cancer-specific mortality by carrying out an ’intention-to-screen’ analysis. However, most of the participants in the trial will be cancer-free; only those developing a genuine cancer that is screening-detectable can potentially benefit from screening induced early treatments. Here we consider measuring the effect of early treatments in this partially latent subpopulation in terms of reduction in case fatality. To formalize the estimands and identifying assumptions in a causal modeling framework, we first define two measures, namely proportional and absolute case-fatality reduction, using potential outcomes notation. We re-derive an earlier proposed estimator for the former, and propose a new estimator for the latter motivated by the instrumental variable approach. The methods are illustrated using data from the US National Lung Screening Trial, with specific attention to estimation in the presence of censoring and competing risks.

Keywords: randomized screening trials; case-fatality reduction; causal inference; instrumental variable estimator

1 Introduction

The benefits of cancer screening are ideally studied through randomized screening trials, where asymptomatic individuals are randomly assigned to either the study arm or control. The study arm receives a regimen of periodic screening examinations, and those diagnosed with cancer through screening will (usually) receive immediate treatment, while the control arm receives standard care, that is, treatment only when clinically diagnosed. The primary objective of such a trial is to determine whether screening (through subsequent early treatments) reduces cancer-specific mortality by carrying out an ‘intention-to-treat’ or ’intention-to-screen’ (ITT) analysis. However, only those developing a genuine (not overdiagnosed) screening-detectable cancer would be able to benefit from the screening and early treatments. Here we are interested in estimating the causal effect of early treatments in the subpopulation of individuals who can be early diagnosed through screening (from here on, ‘screening-detectable’, i.e. those who would receive a positive diagnosis preceded by a positive screening test if offered screening).

The motivation to study this quantity is the fact that the ITT effect in screening trials is a combination of two things; (I) the probability of early diagnosis in the screening-eligible population, which depends on a variety of factors such as the cancer incidence, screening technology and screening regimen employed in the trial, and (II) the early treatment effect among the screening-detectable subpopulation. The latter addresses a more focused clinical question about the effectiveness of the early treatment in reducing cancer mortality. In this paper we decompose the ITT effect into two components that can be estimated separately, namely the probability of getting screening-detected in the presence of screening and the early treatment effect.

The estimation of the early treatment effect is complicated by the fact that the subpopulation is partially latent; it is unobservable at baseline, and it cannot be identified from the control arm, since screening is assigned only to the study arm, but is observed in the screening arm over the course of the repeated screening examinations. Estimation of treatment effect in a latent subpopulation has been considered by Altstein and Li (2013) and Altstein, Li, and Elashoff (2011) in the context of a cancer trial where the study arm receives biopsy for nodal metastases and those with a positive biopsy receive an immediate lymph node surgery while patients with a negative biopsy receive clinical observation, which is the control regimen. They presented a framework to estimate a causal effect of immediate surgery on the latent subgroup of patients with a positive biopsy. In the context of cancer screening, the subpopulation of screening-detectable individuals accumulates over time due to multiple screening examinations, and thus it is not straightforward to define the intervention of interest.

Miettinen (2015) has argued that a relevant measure of benefits of screening at individual (as opposed to population) level is “reduction in the cancer’s case-fatality rate resulting from its early clinical care replacing the late counterpart of this”. To formulate the corresponding intervention of interest, as a thought experiment he proposed a hypothetical trial where individuals diagnosed with the cancer following a positive screen are randomly assigned to either immediate treatment or delayed treatment; both arms are followed long enough to determine whether the cancer is cured. As such a trial is not feasible in practice, he also proposed an estimator for the same quantity based on data from a conventional randomized screening trial (Miettinen 2014b, p. 157).

The present problem is related to the instrumental variable (IV) approach, which has been usually applied in therapeutic trials to adjust for non-adherence and treatment contamination (Angrist, Imbens, and Rubin 1996; Imbens and Rubin 1997). The latent subpopulation of interest in this context is the subpopulation of compliers, i.e. participants who always adhere to the assigned treatment. In the context of non-adherence to screening assignment, the IV approach has been considered for example by Baker (1998) and Roemeling et al. (2007). In the present setting, we are not interested in the adherence to screening assignment, but rather aim to adapt the IV approach to estimate the causal effect of early treatments in the screening-detectable subpopulation.

Our aim is to formalize this effect in a causal modeling framework and consider its estimation based on conventional randomized screening trials where asymptomatic individuals are assigned to undergo a regimen screening examinations or control. Specifically, our objectives are to (a) define proportional and absolute case-fatality reduction using potential outcomes notation/Rubin’s causal model (Holland 1986; Rubin 1978), (b) re-derive Miettinen’s estimator for proportional case-fatality reduction making explicit the assumptions involved in this, (c) propose a new estimator for absolute reduction in case fatality based on the IV approach, and (d) illustrate the estimators using data from the National Lung Screening Trial (NLST) (NLST Research Team 2011) with particular attention to estimating the quantities in the presence of censoring and competing risks.

The remainder of the paper is outlined as follows: in Section 2 we present the necessary notation to define the causal estimands of interest, and assumptions to estimate these. Using this framework, in Section 3 we derive estimators for proportional and absolute reduction in case fatality. In Section 4, we illustrate the estimators in a reanalysis of the NLST data followed by a brief discussion in Section 5.

2 Causal contrasts in screening trials – theoretical framework

2.1 Notation and assumptions

We develop our notation in the context of NLST, where eligible individuals were 55–74 year old heavy smokers (current or having quit within the last 15 years, with 30+ pack years of smoking history). The participants were assigned at random to undergo either three annual low-dose helical CT (screening arm) scans or standard chest X-rays (control arm), and followed up for 7 years (NLST Research Team 2011). A simplified representation (ignoring time, corresponding to one baseline screening examination) of the causal relationships in a randomized screening trial is presented in the directed acyclic graph of Figure 1. This resembles the setting considered by Altstein, Li, and Elashoff (2011), where the intermediate variable was the result of a one-time diagnostic procedure. However, in the screening context, our notation has to take into account that both the mortality outcome and the early diagnosis are time-dependent.

Figure 1:

A simplified schematic of the causal relationships in a conventional randomized cancer screening trial. Here Z is an indicator of assignment to (one or more rounds) of screening or control, S is an indicator of receiving screening, P is an indicator of a positive screening result, D is an indicator of screening-induced early diagnosis (e.g. positive screening test followed by a positive cytology or biopsy), R an indicator of subsequent referral to early treatment, and Y an indicator for cancer-specific mortality outcome. Due to the randomization, the relationship between Z and Y can be estimated without confounding bias, but the relationship between R and Y is confounded by the observed covariates X and the unobserved factors U. Even if we ignore the intermediate variables S and P, the screening assignment can still influence the outcome only through early diagnosis, which in turn prompts early treatments, which motivates the use of Z as an instrumental variable. The ellipses and rectangles represent unobserved and observed variables, respectively. Unshaded dashed rectangles represent observed variables that have not been used directly in the analysis, whereas shaded rectangles represent variables used in the estimation.

Following the convention in cancer screening trials (NLST Research Team 2011; Steele and Brewster 2011) and because our focus is on case-fatality, we consider cancer-specific mortality outcomes. However, because several authors have recommended using all-cause mortality as a measure of benefits (Black, Haggstrom, and Gilbert Welch 2002; Penston 2011; Welch and Black 1997), we note that the methods that follow could be easily adapted to all-cause mortality outcomes. The results could also be presented for other-cause mortality if this is of interest. To represent the outcome, let Y1i(t) and Y0i(t)∈{0,1} be the potential (underlying, in the absence of censoring) counting processes for cancer-specific death by time t for individual i under assignment to screening and control respectively. For simplicity, where possible, we suppress the subscript i from the subsequent notation. The process notation is used to shorten the notation in the formulas that follow; the connection to potential event times and types is Yz(t)≡1{Tz≤t,Ez=1}, z ∈{0,1}, where Tz is the potential time of death, and Ez cause of death indicator, with Ez=1 indicating cancer and Ez=2 other cause of death. We suppress the corresponding process notation for other-cause mortality, but note that Y1(t) and Y0(t) remain at zero if other cause death has taken place before cancer-specific death. The observed cancer-specific mortality outcome is given by Y(t)=ZY1(t)+(1−Z)Y0(t) (counterfactual consistency). The intention-to-screen estimand in the conventional screening trial can be defined as the proportional reduction in cancer mortality (preventive etiologic proportion) 1−E[Y1(t)]/E[Y0(t)], or absolute reduction/risk difference E[Y0(t)]−E[Y1(t)], usually estimated at the end of the follow-up period of the trial at t = τ (in NLST the main analysis was carried out after 6 years). In the presence of censoring, which in the context of NLST is mostly administrative due to late entry into the study, the quantities E[Yz(t)], z ∈{0,1} can be estimated through non-parametric estimators for cancer-specific cumulative incidence, taking into account the competing causes of death. In the presence of covariate-dependent censoring, estimates for covariate conditional cumulative incidences E[Yz(t)∣X], z ∈{0,1} can be obtained for example by fitting subdistribution hazard models (Fine and Gray 1999) separately in the screening and control arms. The flow of a conventional screening trial is illustrated in panel (a) of Figure 2.

Instead of the causal effect of screening assignment among the asymptomatic population eligible for screening, here we are interested in the effectiveness of early (following screening induced diagnosis) versus delayed (following symptom induced diagnosis) treatment, in the screening-detectable subpopulation. Such a latent subpopulation cannot be identified at the baseline, but rather, accumulates over time through the repeated screening examinations. To reflect this, we introduce (underlying, in the absence of censoring) counting processes D1(t) and D0(t)∈{0,1} for cancer diagnosis following a positive screen (using the technology in the screening arm of the trial, e.g. CT in NLST), related to the observed process by D(t)=ZD1(t)+(1−Z)D0(t). These processes remain at zero if a death due to any cause, or a non-screening based diagnosis, takes place first. Based on the screening assignment and diagnosis status at time t, we can identify four distinct latent subpopulations (Table 1). The screening assignment can benefit individuals for whom D1(t)=1 and D0(t)=0, but this subpopulation is unobservable. However, if the specific screening technology used in the trial is not readily available outside the screening arm, this will be approximately equivalent to the subpopulation for whom D1(t)=1. For the purposes of estimation, we assume (i) that P(D0(t)=1)≈0.

Table 1:

Latent subpopulations in a screening trial.

	D0(t)=0	D0(t)=1
D1(t)=0	Never screening-detected	Can be harmed by screening assignment
D1(t)=1	Can benefit from screening assignment	Always screening-detected

Now, we can consider the causal effect of early versus delayed treatments in the subpopulation defined by D1(t)=1 (screening-detectable case of cancer). For this purpose, since the diagnosis in itself (like screening) is not an intervention, we still need a well-defined intervention. This is provided by the hypothetical intervention screening trial outlined by Miettinen (2011, p. 117–118) and Miettinen (2015), illustrated in panel (b) of Figure 2. Here screening-detected individuals, accumulated through a predefined screening regimen, are randomized into immediate referral to treatment following the diagnosis (R = 1), or sent home without being informed about the result of the screening or subsequent diagnostic tests (R = 0). This hypothetical randomization of the action following a possible diagnosis can already take place at the onset of the trial, and thus the intervention is not time-dependent, even though the diagnosis status is. Thus, although obviously unethical in practice, this intervention and any potential outcomes indexed with respect to it are well-defined. Further, for the subpopulation for whom D1(t)=1, we can now introduce the potential outcomes

Y11(t)≡Y1∗(t)and Y10(t)≡Y0∗(t),

where the first subscript corresponds to the screening assignment and the second to the intervention.

Figure 2:

Illustration of conventional screening trial and Miettinen’s hypothetical intervention trial in terms of potential outcome variables.

For estimation purposes, we assume further (ii) that Y0∗(t)=Y0(t), that is, undergoing screening and diagnostic procedures and being withheld the result of these does not in itself influence the outcome, for example through looking for screening outside the assigned regimen (beyond what the individual would do in the control arm). In a conventional screening trial, R is not randomized, but for lung cancer it makes sense to assume (iii) that early diagnosis is always followed by immediate referral to treatment, so that D1(t)=1⇒R=1. In addition, we assume (iv) monotonicity Y1(t)≤Y0(t) and Y1∗(t)≤Y0∗(t), meaning that neither the screening examinations themselves, or the early treatments can harm the individual. Finally, we assume (v) that D1(t)=0⇒Y1(t)=Y0(t), meaning that screening can only influence the outcome through early diagnosis, the absence of which implying that the two potential outcomes are the same.

The usual causal assumptions, namely strong ignorability (vi)

(Y1(t),Y0(t),D1(t),D0(t),X)⊥⊥Zand0<P(Z=1)<1,

and its covariate conditional version

(Y1(t),Y0(t),D1(t),D0(t))⊥⊥Z∣Xand0<P(Z=1∣X)<1

are satisfied through the randomized screening assignment. The assumptions are summarized in Table 2.

Table 2:

A summary of the causal assumptions.

	Assumption	Meaning
(i)	P(D0(t)=1)≈0	No or limited access to the tested screening technology in the control arm.
(ii)	Y0∗(t)=Y0(t)	Potential outcome under not receiving early treatment in the presence of screening is equal to the potential outcome in the absence of screening.
(iii)	D1(t)=1⇒R=1	Early diagnosis always prompts early treatment.
(iv)	Y1(t)≤Y0(t) & Y1∗(t)≤Y0∗(t)	Monotonicity: Neither the screening itself or early treatment can harm the individual.
(v)	D1(t)=0⇒Y1(t)=Y0(t)	Screening can have effect on the outcome only through early diagnosis.
(vi)	(Y1(t),Y0(t),D1(t),D0(t),X)⊥Z&0<P(Z=1)<1	Strong ignorability: Screening assignment is randomized.
	⇒(Y1(t),Y0(t),D1(t),D0(t))⊥Z∣X&0<P(Z=1∣X)<1

3 Estimating case-fatality reduction based on conventional screening trials

3.1 Proportional case-fatality reduction

To motivate case-fatality reduction as the quantity of interest in screening trials, Miettinen (2014a) decomposed the joint probability of benefiting from screening into several conditional probabilities. To connect the notation introduced in Section 2.1 to this, we introduce a simplified version ignoring the overdiagnosis component (Figure 3). If N individuals are assigned to screening, the expected number of these benefiting from screening by time t is given by

P(Y1∗(t)=0∣Y0∗(t)=1,D1(t)=1)P(Y0∗(t)=1∣D1(t)=1)P(D1(t)=1)N,

where the three probabilities from right to left refer to the probability of early diagnosis due to screening (in any of the screening examinations in the interval [0,t]), the probability that the cancer would be fatal in the absence of early treatments, and finally, the probability that early treatments would avert cancer specific death in the interval [0,t]. Below we will show that the leftmost probability is the proportional reduction in case fatality (by time t).

We note that it may seem that conditioning D1(t)=1 will introduce lead time and immortal time into the causal contrast. While it is true that the individual was immortal until the diagnosis, immortality until the same time is present also under control, because, as discussed before, screening can avert death only due to earlier diagnosis (assumption (v)). In other words, before early diagnosis under screening, the two potential outcomes are the same, and immortal time does not imply immortal time bias. As for lead time, lead time bias is avoided because the time origin in the comparison is the same, the time of randomization.

Figure 3:

A probability tree illustrating decomposition of the joint probability of benefiting from screening into conditional probabilities.

As we want to allow for possible covariate-dependent censoring, we consider a covariate conditional version of the leftmost conditional probability. Under the monotonicity assumption (Section 2.1), this can be written as

(1)P(Y1∗(t)=0∣Y0∗(t)=1,D1(t)=1,X)=1−P(Y1∗(t)=1∣Y0∗(t)=1,D1(t)=1,X)=1−P(Y0∗(t)=1∣Y1∗(t)=1,D1(t)=1,X)P(Y1∗(t)=1∣D1(t)=1,X)P(Y0∗(t)=1∣D1(t)=1,X)=(iv)1−E[Y1∗(t)=1∣D1(t)=1,X]E[Y0∗(t)=1∣D1(t)=1,X],

which is equivalent to the proportional reduction in case fatality for screening-detectable cases of cancer. It is apparent from the last form that this quantity could be estimated directly from data produced by the hypothetical intervention trial depicted in Figure 2 (b), by plugging in estimators for cancer-specific cumulative incidence separately in the intervention (referral to early treatment) and control arms. However, direct estimation based on conventional screening trial data is not possible, since the potential outcome Y0∗(t) is unobserved in the screening arm. However, Miettinen (2014b, p. 157) derived an estimator in terms of quantities estimable from a conventional screening trial. We re-derive this estimator in the present notational framework, to make explicit the assumptions involved, and consider estimation in the presence of censoring and competing risks.

As a side note, while the intervention trial of Figure 2 (b) could be based on a single baseline screening examination, the quantity in eq. (1) in a conventional screening trial of Figure 2 (a) will depend on the screening regimen employed in the trial, due to the repeated screening examinations resulting in a different screening-detectable subpopulation. However, arguably the quantity in eq. (1) is less dependent on the screening regimen than the intention-to-screen estimand because, as we will show in the next section, the latter will directly depend on the probability of being diagnosed through screening.

To express the quantity in eq. (1) in terms of estimable quantities, we can write the t-year cumulative incidences in the screening and control arms as

P(Y1(t)=1∣X)=P(Y1(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)+P(Y1(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)=(iii)P(Y11(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)+P(Y1(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)=P(Y1∗(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)+P(Y1(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)

and

(2)P(Y0(t)=1∣X)=P(Y0(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)+P(Y0(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)=P(Y0(t)=1∣D0(t)=0,D1(t)=1,X)P(D0(t)=0∣D1(t)=1,X)+P(Y0(t)=1∣D0(t)=1,D1(t)=1,X)P(D0(t)=1∣D1(t)=1,X)+P(Y0(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)≈(i),(ii)P(Y0∗(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)+P(Y0(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)=(v)P(Y0∗(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)+P(Y1(t)=1∣D1(t)=0,X)P(D1(t)=0∣X),

to get

(3)P(Y0(t)=1X)−P(Y1(t)=1X)=P(Y0∗(t)=1D1(t)=1,X)P(D1(t)=1X)−P(Y1∗(t)=1D1(t)=1,X)P(D1(t)=1X).

We can now express the quantity in eq. (1) as

(4)1−E[Y1∗(t)=1∣D1(t)=1,X]E[Y0∗(t)=1∣D1(t)=1,X]=P(Y0∗(t)=1∣D1(t)=1,X)−P(Y1∗(t)=1∣D1(t)=1,X)P(Y0∗(t)=1∣D1(t)=1,X)=P(Y0∗(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)−P(Y1∗(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)P(Y0∗(t)=1∣D1(t)=1,X)P(D1(t)=1∣X)=(2),(3)P(Y0(t)=1∣X)−P(Y1(t)=1∣X)P(Y0(t)=1∣X)−P(Y1(t)=1∣D1(t)=0,X)P(D1(t)=0∣X)=P(Y0(t)=1∣X)−P(Y1(t)=1∣X)P(Y0(t)=1∣X)−P(Y1(t)=1,D1(t)=0∣X)=(vi)P(Y(t)=1∣Z=0,X)−P(Y(t)=1∣Z=1,X)P(Y(t)=1∣Z=0,X)−P(Y(t)=1,D(t)=0∣Z=1,X).

Here the terms P(Y(t)=1∣Z=z,X), z ∈{0,1} can be estimated using non-parametric estimators for cumulative incidence in screening and control arms separately (unconditional on the covariates), or Fine & Gray models, fitted separately in the screening and control arms (conditional on the covariates). The latter can accommodate censoring that is dependent on the observed covariates. The term P(Y(t)=1,D(t)=0∣Z=1,X) in the denominator can be estimated by cumulative incidence of cancer-specific death in the screening arm, considering both screening-based cancer diagnosis and other cause deaths as competing risks.

3.2 Absolute case-fatality reduction

For estimating the absolute reduction in case fatality in the latent subpopulation, we adopt the instrumental variable approach. Generally, IV is a variable that is associated with the exposure of interest, but not directly associated with the outcome (Sussman and Hayward 2010). Thus, as discussed before, random assignment in the randomized screening trial is a natural instrumental variable, since it can only influence the outcome through early diagnosis following a positive screen and the subsequent early treatment. As our estimand, we consider the quantity

(5)E[Y0∗(t)−Y1∗(t)∣D1(t)−D0(t)=1,X].

To express the quantity in eq. (5) in terms of estimable quantities, we can expand the covariate conditional intention-to-screen effect as

E[Y0(t)−Y1(t)∣X]=ED1(t),D0(t)∣X{E[Y0(t)−Y1(t)∣D1(t),D0(t),X]}=E[Y0(t)−Y1(t)∣D1(t)=0,D0(t)=0,X]P(D1(t)=0,D0(t)=0∣X)+E[Y0(t)−Y1(t)∣D1(t)=1,D0(t)=0,X]P(D1(t)=1,D0(t)=0∣X)+E[Y0(t)−Y1(t)∣D1(t)=0,D0(t)=1,X]P(D1(t)=0,D0(t)=1∣X)+E[Y0(t)−Y1(t)∣D1(t)=1,D0(t)=1,X]P(D1(t)=1,D0(t)=1∣X)≈(i)E[Y0(t)−Y1(t)∣D1(t)=0,D0(t)=0,X]P(D1(t)=0∣X)+E[Y0(t)−Y1(t)∣D1(t)=1,D0(t)=0,X]P(D1(t)=1∣X).

By applying the other assumptions discussed in Section 2.1, we further get

E[Y0(t)−Y1(t)∣X]=(v)E[Y0(t)−Y1(t)∣D1(t)=1,D0(t)=0,X]P(D1(t)=1∣X)=(iii)E[Y0(t)−Y11(t)∣D1(t)=1,D0(t)=0,X]P(D1(t)=1∣X)=(ii)E[Y0∗(t)−Y1∗(t)∣D1(t)−D0(t)=1,X]E[D1(t)∣X],

giving an expression for the quantity in eq. (5) as

(6)E[Y0∗(t)−Y1∗(t)∣D1(t)−D0(t)=1,X]=E[Y0(t)∣X]−E[Y1(t)∣X]E[D1(t)∣X]=(vi)E[Y(t)∣Z=0,X]−E[Y(t)∣Z=1,X]E[D(t)∣Z=1,X].

As in Section 3.1, the expectations E[Y(t)∣Z=z,X], z ∈{0,1} in eq. (6) can be estimated as cumulative incidences of cancer-specific mortality in the screening and control arm, considering other cause mortality as competing risks. The expectation E[D(t)∣Z=1,X] can be estimated as the cumulative incidence of screening-induced diagnosis of cancer in the screening arm, considering any-cause mortality and non-screening based cancer diagnoses as competing risks. This cumulative incidence can also be used to estimate the prevalence of being early diagnosed through screening at any given time during the follow-up.

3.3 Alternative assumptions

We note that deriving expression (6) required the exact same assumptions as the expression for proportional case-fatality reduction in Section 3.1, with the exception of the monotonicity assumption, which was needed in Section 3.1 to connect the one minus risk ratio quantity to the probability of benefiting from screening/preventive etiologic proportion. The risk ratio itself, as well as the risk difference herein, can be estimated without the monotonicity assumption.

To make the connection between eq. (6) and the IV estimator more obvious, we note that among those with D0(t)=1 we could introduce the hypothetical intervention and the corresponding potential outcomes Y01(t) and Y00(t), and replace assumption (i) with assumptions P(D1(t)=0,D0(t)=1∣X)=0 and Y01(t)=Y11(t), the former ruling out individuals who would only get screening-detected when not assigned to screening and the latter meaning that the effect of the intervention is the same irrespective of the screening assignment. The latter assumption would together with (ii) correspond to the usual ‘exclusion restriction’ assumption in instrumental variable estimation, and the former to the ‘absence of defiers’ (assumptions 3 and 5 of Angrist, Imbens, and Rubin 1996). Under the alternative assumptions we could express the quantity in eq. (5) as

E[Y0∗(t)−Y1∗(t)∣D1(t)−D0(t)=1,X]=E[Y0(t)∣X]−E[Y1(t)∣X]E[D1(t)∣X]−E[D0(t)∣X]=(vi)E[Y(t)∣Z=0,X]−E[Y(t)∣Z=1,X]E[D(t)∣Z=1,X]−E[D(t)∣Z=0,X].

which has the same form as the IV estimator Angrist, Imbens, and Rubin (1996) proposed for estimation under non-adherence in randomized trials. However, the assumption Y01(t)=Y11(t) would be difficult to justify here as the screening assignment will influence the timing of the early diagnosis and thus the timing of early treatment. Thus, we stick with the assumptions leading to eq. (6), but demonstrate in Section 4 how the alternative estimator above could be used for a sensitivity analysis concerning assumption (i).

We also note assumption (ii) was introduced in our context for the purpose of connecting the causal estimand to the hypothetical intervention in Miettinen’s intervention trial. Since we cannot actually obtain data from such a trial, this assumption mainly requires that we can hypothesize such an intervention (where screening itself and being withheld the result does not change the screenee’s behaviour). Had we instead specified our causal quantity of interest as E[Y0(t)−Y11(t)∣D1(t)−D0(t)=0] (meaning that the reference level of exposure is the control arm exposure), assumption (ii) would not be needed.

3.4 Formulas for marginal effects

We presented the covariate-conditional versions of the formulas to accommodate possible covariate-dependent censoring. However, we might still be interested in the marginal effects in the latent screening-detectable subpopulation. Estimators for these can be obtained through averaging over the covariate distribution of the screening-detected subpopulation in the screening arm, because

f(X∣D1(t)=1)=P(D1(t)=1∣X)f(X)P(D1(t)=1)=(vi)P(D(t)=1∣Z=1,X)f(X∣Z=1)P(D(t)=1∣Z=1)=f(X∣Z=1,D(t)=1).

For example, to get an estimate for the marginal probability of benefiting from screening in the latent subpopulation, we can write

(a)P(Y1∗(t)=0∣Y0∗(t)=1,D1(t)=1)=E[Y0∗(t)=1∣D1(t)=1]−E[Y1∗(t)=1∣D1(t)=1]E[Y0∗(t)=1∣D1(t)=1]=∫x{E[Y0∗(t)=1∣D1(t)=1,x]−E[Y1∗(t)=1∣D1(t)=1,x]}f(x∣D1(t)=1)dx∫xE[Y0∗(t)=1∣D1(t)=1,x]f(x∣D1(t)=1)dx≈∑{i:Zi=1,Di(t)=1}E[Y0i∗(t)=1∣D1i(t)=1,Xi]−E[Y1i∗(t)=1∣D1i(t)=1,Xi]∑{i:Zi=1,Di(t)=1}E[Y0i∗(t)=1∣D1i(t)=1,Xi],

where the sum is over the empirical covariate distribution of the screening-detected subpopulation in the screening arm, and the numerator and denominator are estimated through

E[Y0∗(t)∣D1(t)=1,X]−E[Y1∗(t)∣D1(t)=1,X]=P(Y0(t)=1∣X)−P(Y1(t)=1∣X)P(D1(t)=1∣X)=(vi)P(Y(t)=1∣Z=0,X)−P(Y(t)=1∣Z=1,X)P(D(t)=1∣Z=1,X)

and

E[Y0∗(t)=1∣D1(t)=1,X]=P(Y0(t)=1∣X)−P(Y1(t)=1,D1(t)=0∣X)P(D1(t)=1∣X)=(vi)P(Y(t)=1∣Z=0,X)−P(Y(t)=1,D(t)=0∣Z=1,X)P(D(t)=1∣Z=1,X).

We note that former is equivalent to eq. (6). This is because due to assumption (i), P(D1(t)=1,D0(t)=0∣X)≈P(D1(t)=1∣X), and the marginal causal contrast can again be estimated through averaging over the covariate distribution of the screening-detected population in the screening arm as

E[Y0∗(t)−Y1∗(t)∣D1(t)−D0(t)=1]≈∫xE[Y0∗(t)−Y1∗(t)∣D1(t)−D0(t)=1,X]f(x∣D1(t)=1)≈1∑i=1n1{Zi=1,Di(t)=1}∑{i:Zi=1,Di(t)=1}E[Y0i(t)∣Xi]−E[Y1i(t)∣Xi]E[D1i(t)∣Xi]=(vi)1∑i=1n1{Zi=1,Di(t)=1}∑{i:Zi=1,Di(t)=1}E[Yi(t)∣Zi=0,Xi]−E[Yi(t)∣Zi=1,Xi]E[Di(t)∣Zi=1,Xi].

4 Illustration in NLST data

The US National Lung Screening Trial was launched in 2002 to evaluate the effect of screening for lung cancer among 53,453 heavy smokers who were assigned to either three annual rounds of low-dose helical CT examinations (the screening arm) or three annual standard chest X-rays (the control arm) at random (NLST Research Team 2011). The effect was measured in terms of proportional reduction in lung cancer specific mortality. After randomizing smokers into two different arms, an initial screening examination was performed followed by two annual screenings, with a follow-up for cause-specific mortality for a total of 7 years. We carried out a secondary analysis of these data to demonstrate estimating case-fatality reduction from a conventional randomized screening trial. Even though the control arm also received screening in NLST, the effect of X-ray screening on lung cancer mortality has been demonstrated to be near null, and thus serves as control for our purposes (Oken et al. 2011). The NLST found a significant intention-to-screen effect, corresponding to a 20% proportional reduction in lung cancer mortality, and thus we are interested whether we can also detect a significant effect of early versus delayed treatments among the screening-detectable cases, given the additional uncertainty in estimating the denominator terms in the expressions (4) and (6).

We calculated the estimators as functions of time starting from the time of randomization. All the component probabilities were estimated through non-parametric cumulative incidence estimators, using the cuminc function of cmprsk package (Gray 2014) of R statistical environment (R Core Team 2017). We did not consider incorporating covariates, as censoring in NLST is mainly administrative, due to late entry into the study during the recruitment period. As descriptive statistics, we present the cumulative incidences of cancer mortality in the two arms, as well as the cumulative incidences of early diagnosis and cancer death before early diagnosis in the CT arm in Figure 4. The figure demonstrates how the early diagnoses follow the three annual rounds of screening. The cumulative incidence of lung cancer death before early diagnosis begins to increase in the later years of follow-up when CT screening is no longer offered.

Figure 4:

Cumulative incidences of lung cancer death in control and CT arms, as well as detection of lung cancer through screening, and lung cancer death before early diagnosis in the CT arm.

The time-specific estimates for proportional case-fatality reduction in eq. (4) among screening-detectable are presented in Figure 5. For comparison, we also present the estimates for the proportional lung cancer mortality reduction 1−E[Y1(t)]/E[Y0(t)] in the entire eligible population. The results for the latter replicate the original NLST results; a statistically significant maximum reduction of 20% is achieved at 6 years, after which the effect starts to dilute due to screening no longer being offered. In contrast, the case-fatality reduction estimate appears to stabilize at slightly below 30% at 7 years, meaning that early treatments prevented almost a third of the deaths among the screening-detectable in this time interval.

Figure 5:

Estimated proportional reduction in lung cancer mortality among screening-detectable individuals (case-fatality reduction) and among the eligible population as function of time, together with 95% bootstrap confidence bands. The estimates for the first year are unstable and are not shown.

The time-specific estimates for the lung cancer mortality risk difference among the screening-detectable are presented in Figure 6. For completeness, we also present estimates for the risk difference in the entire eligible population (E[Y1(t)]−E[Y0(t)]). The latter reaches a maximum of around 0.003, and the former a maximum of around 0.14 at 7 years, both being significantly different from zero. The inverses of the risk differences can be interpreted in the usual way as numbers needed to screen/treat, so that 1/0.003≈333 eligible individuals needed to be screened to prevent one lung cancer death, and 1/0.14≈7 screening-detectable individuals needed to be treated early to prevent one lung cancer death in this time interval. We note that while this subpopulation is latent, under the causal assumptions its size can be estimated from the screening arm, leading to decomposition of the intention-to-screen obtained by multiplying both sides of eq. (6) by the probability of early detection in the screening arm.

Figure 6:

Estimated absolute reduction in lung cancer mortality (risk difference) among screening-detectable individuals (case-fatality reduction) and among the eligible population as function of time, together with 95% bootstrap confidence bands.

As for the plausibility of the different causal assumptions in the NLST context, the conceptual discussion on assumption (ii) in Section 3.3 applies also here. Assumption (iii) is plausible as we are talking about lung cancer (treatment is likely initiated following the detection without active surveillance). The monotonicity assumption (iv) is not necessary for interpreting the proportional and absolute mortality reductions, so possible harms of the treatment are reflected in the mortality estimates. Violation of assumption (v) would be possible if undergoing screening and receiving a negative result would prompt the screenee to change behaviour, for instance through change in smoking habits (e.g. being reassured by the negative screen and smoking more as a result). Assumption (vi) is satisfied through design as NLST was a randomized screening trial.

Assumption (i) states that CT screening was rare in the control arm. We can evaluate the sensitivity of the results to this assumption using the alternative estimator discussed in Section 3.3, as long as we can estimate the probability of early detection in the control arm. While we do not have individual data on the uptake of CT screening in the control arm in NLST, we can perform a rough sensitivity analysis using the reported CT arm adherence of 95% and annual rate of CT screening of 4.3% outside the trial (NLST Research Team 2011). While those who seek CT screening outside the trial may in reality be either higher or lower risk than the CT arm, we can obtain a benchmark by assuming that a random 4.3% of the control arm participants underwent a similar screening regimen as the CT arm. With the same early detection probability, we would expect 0.1% of the control arm participants to be early detected after 7 years of follow up:

0.024×N0.95×N=p×N0.043×N⇔p=0.024×0.0430.95≈0.1%.

Subtracting this from the denominator of the IV-type estimator would change the risk difference estimate to 0.146 instead of 0.14, so the impact of the adjustment is small. If the CT screened control arm participants were twice as likely to be early detected (keeping in mind that we are still talking about asymptomatic individuals), the risk difference would estimate to 0.152. If the CT screened individuals in the control arm are lower risk than the CT arm, the difference would be smaller.

5 Discussion

Miettinen (2011, p. 5–6) defines case-fatality rate as

Case-fatality rate (synonyms: fatality rate, death rate) – Concerning cases of an illness in general, or recognized cases of it (ones with rule-in diagnosis about the illness), the proportion in which the illness is fatal; that is, such that the outcome of the course of the illness is fatality from it. (Cf. ‘Survival rate.’)
Note: For the concept to be truly meaningful, it commonly is to be specific to particulars of the case (broadly at least) and to the choice of treatment; and it also is to be conditional on absence of intercurrent death from some other, ‘competing’ cause.

We note that what constitutes a case is a matter of definition; in the present context we are specifically interested in case fatality among screening-detectable cases of lung cancer. This definition includes possibly overdiagnosable cases; the proportional measure is not affected by overdiagnosis since it additionally conditions on the cancer being fatal in the absence of screening. The absolute measure is necessarily affected by overdiagnosis, since it informs how many screening-detectable individuals would have to be treated to avert one cancer death.

Although the causal contrasts of interest here were formulated in a hypothetical subpopulation, our IV-type estimators are formulated based on the entire population, rather than a restricted subset based on observed data, with causal assumptions used to link the causal contrast to estimable quantities. Thus, our methodology does not introduce selection bias which could arise from restricting the analysis to a subset in IV estimation (Swanson et al. 2015).

In this paper, because our aim was to measure how many lung cancer deaths would actually be averted in a given time window, we have used the term case fatality in a way that deviates from the above definition in two respects; first, we defined our estimand as a function of time, and second, we considered it as a cumulative incidence function in the presence (rather than in the absence) of competing risks. Under censored data, alternatively to the competing risks model, estimation of case fatality could be considered in the cure fraction modeling framework.

Here we introduced the covariate-conditional versions of the estimators to allow for baseline covariate-dependent censoring. However, the estimators outlined would also allow for studying effect modification due to baseline covariates, for instance to identify predictors of the early treatment effect, or identifying subpopulations that most benefit from the early treatment. This is a topic for further work, as is extending the estimators to allow the censoring to depend on latent characteristics, similarly to Frangakis and Rubin (1999), or on time-dependent covariates.

We considered measuring the effect of early treatments solely in terms of mortality reduction in a given time interval. However, it is possible that in addition to curability, some of the effects of the early treatments manifest through gained lifetime which would be only partially reflected as mortality reduction. Application of accelerated failure time models, similar to Altstein and Li (2013) and Altstein, Li, and Elashoff (2011), in the present context is another topic for further research.

Funding statement: The corresponding author’s work was supported by a Discovery Grant from the Natural Sciences and Engineering Research Council of Canada (NSERC) (Funder Id: 10.13039/501100000038, Grant Number: 2014-04245 RGPIN). The authors thank the National Cancer Institute (NCI) for access to NCI’s data collected by the National Lung Screening Trial. The statements contained herein are solely those of the authors and do not represent or imply concurrence or endorsement by the NCI.

References

Altstein, L., and Li, G. (2013). Latent subgroup analysis of a randomized clinical trial through a semiparametric accelerated failure time mixture model. Biometrics, 69:52–61.10.1111/j.1541-0420.2012.01818.xSearch in Google Scholar PubMed PubMed Central

Altstein, L. L., Li, G., and Elashoff, R. M. (2011). A method to estimate treatment efficacy among latent subgroups of a randomized clinical trial. Statistics in Medicine, 30:709–717.10.1002/sim.4131Search in Google Scholar PubMed PubMed Central

Angrist, J. D., Imbens, G. W., and Rubin, D. B. (1996). Identification of causal effects using instrumental variables. Journal of the American statistical Association, 91:444–455.10.3386/t0136Search in Google Scholar

Baker, S. G. (1998). Analysis of survival data from a randomized trial with all-or-none compliance: estimating the cost-effectiveness of a cancer screening program. Journal of the American Statistical Association, 93:929–934.10.1080/01621459.1998.10473749Search in Google Scholar

Black, W. C., Haggstrom, D. A., and Gilbert Welch, H. (2002). All-cause mortality in randomized trials of cancer screening. Journal of the National Cancer Institute, 94:167–173.10.1093/jnci/94.3.167Search in Google Scholar PubMed

Fine, J. P., and Gray, R. J. (1999). A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association, 94:496–509.10.1080/01621459.1999.10474144Search in Google Scholar

Frangakis, C. E., and Rubin, D. B. (1999). Addressing complications of intention-to-treat analysis in the combined presence of all-or-none treatment-noncompliance and subsequent missing outcomes. Biometrika, 86:365–379.10.1093/biomet/86.2.365Search in Google Scholar

Gray, B. (2014). cmprsk: Subdistribution Analysis of Competing Risks. R package version 2.2-7.Search in Google Scholar

Holland, P. W. (1986). Statistics and causal inference. Journal of the American statistical Association, 81:945–960.10.1080/01621459.1986.10478354Search in Google Scholar

Imbens, G. W., and D. B. Rubin. 1997. “Bayesian inference for causal effects in randomized experiments with noncompliance.” The Annals of Statistics 25: 305–327.10.1214/aos/1034276631Search in Google Scholar

Miettinen, O. S. (2011). Epidemiological research: terms and concepts. Springer, Dordrecht.10.1007/978-94-007-1171-6Search in Google Scholar

Miettinen, O. S. (2014a). Screening for breast cancer: what truly is the benefit? Can J Public Health, 104:435–436.10.17269/cjph.104.4444Search in Google Scholar

Miettinen, O. S. (2014b). Toward Scientific Medicine. Springer, Dordrecht.10.1007/978-3-319-01671-9Search in Google Scholar

Miettinen, O. S. (2015). Breast cancer: Misguided research misinforming public policies. Epidemiologic Methods, 4:3–10.10.1515/em-2015-0020Search in Google Scholar

NLST Research Team. 2011. “Reduced lung-cancer mortality with low-dose computed tomographic screening.” N Engl J Med 365: 395–409.10.1056/NEJMoa1102873Search in Google Scholar PubMed PubMed Central

Oken, M. M., Hocking, W. G., Kvale, P. A., Andriole, G. L., Buys, S. S., Church, T. R., Crawford, E. D., Fouad, M. N., Isaacs, C., Reding, D. J., Weissfeld, J. L., Yokochi, L. A., O’Brien, B., Ragard, L. R., Rathmell, J. M., Riley, T. L., Wright, P., Caparaso, N., Hu, P., Izmirlian, G., Pinsky, P. F., Prorok, P. C., Kramer, B. S., Miller, A. B., Gohagan, J. K., and Berg, C. D. (2011). Screening by chest radiograph and lung cancer mortality: the Prostate, Lung, Colorectal, and Ovarian (PLCO) randomized trial. JAMA, 306:1865–1873.10.1001/jama.2011.1591Search in Google Scholar PubMed

Penston, J. (2011). Should we use total mortality rather than cancer specific mortality to judge cancer screening programmes? Yes. BMJ, 343:d6395.10.1136/bmj.d6395Search in Google Scholar PubMed

R Core Team. R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing, 2017 Vienna, Austria.Search in Google Scholar

Roemeling, S., Roobol, M. J., Otto, S. J., Habbema, D. F., Gosselaar, C., Lous, J. J., Cuzick, J., and Schröder, F. H. (2007). Feasibility study of adjustment for contamination and non-compliance in a prostate cancer screening trial. The Prostate, 67:1053–1060.10.1002/pros.20606Search in Google Scholar PubMed

Rubin, D. B. (1978). Bayesian inference for causal effects: The role of randomization. The Annals of statistics, 6:34–58.10.1214/aos/1176344064Search in Google Scholar

Steele, R. J., and Brewster, D. H. (2011). Should we use total mortality rather than cancer specific mortality to judge cancer screening programmes? No. BMJ, 343:d6397.10.1136/bmj.d6397Search in Google Scholar PubMed

Sussman, J. B., and Hayward, R. A. (2010). An IV for the RCT: using instrumental variables to adjust for treatment contamination in randomised controlled trials. BMJ, 340:c2073–c2073.10.1136/bmj.c2073Search in Google Scholar PubMed PubMed Central

Swanson, S. A., Robins, J. M., Miller, M., and Hernán, M. A. (2015). Selecting on treatment: a pervasive form of bias in instrumental variable analyses. American journal of epidemiology, 181:191–197.10.1093/aje/kwu284Search in Google Scholar PubMed PubMed Central

Welch, H. G., and Black, W. C. (1997). Evaluating randomized trials of screening. Journal of general internal medicine, 12:118–124.10.1007/s11606-006-5007-7Search in Google Scholar

Received: 2018-04-19

Revised: 2018-09-17

Accepted: 2018-10-04

Published Online: 2018-11-07

Articles in the same Issue

https://doi.org/10.1515/em-2018-0007

Keywords for this article

randomized screening trials; case-fatality reduction; causal inference; instrumental variable estimator