A Quantitative Concordance Measure for Comparing and Combining Treatment Selection Markers

Zhiwei Zhang; Shujie Ma; Lei Nie; Guoxing Soon

doi:10.1515/ijb-2016-0064

Artikel Öffentlich zugänglich

A Quantitative Concordance Measure for Comparing and Combining Treatment Selection Markers

Zhiwei Zhang , Shujie Ma , Lei Nie und Guoxing Soon

Veröffentlicht/Copyright: 25. März 2017

Veröffentlicht von

Veröffentlichen auch Sie bei De Gruyter Brill

Manuskript einreichen Informationen für Autor*innen

Aus der Zeitschrift The International Journal of Biostatistics Band 13 Heft 1

Abstract

Motivated by an HIV example, we consider how to compare and combine treatment selection markers, which are essential to the notion of precision medicine. The current literature on precision medicine is focused on evaluating and optimizing treatment regimes, which can be obtained by dichotomizing treatment selection markers. In practice, treatment decisions are based not only on efficacy but also on safety, cost and individual preference, making it difficult to choose a single cutoff value for all patients in all settings. It is therefore desirable to have a statistical framework for comparing and combining treatment selection markers without dichotomization. We provide such a framework based on a quantitative concordance measure, which quantifies the extent to which higher marker values are predictive of larger treatment effects. For a given marker, the proposed concordance measure can be estimated from clinical trial data using a U-statistic, which can incorporate auxiliary covariate information through an augmentation term. For combining multiple markers, we propose to maximize the estimated concordance measure among a specified family of combination markers. A cross-validation procedure can be used to remove any re-substitution bias in assessing the quality of an optimized combination marker. The proposed methodology is applied to the HIV example and evaluated in simulation studies.

Keywords: cross-validation; personalized medicine; predictive biomarker; precision medicine; treatment effect heterogeneity; U-statistic

1 Introduction

It is well recognized that treatment effects can be heterogeneous; that is, the same treatment can have different effects on different patients. The increasing awareness of treatment effect heterogeneity has motivated the development of predictive biomarkers for treatment selection [1, 2]. When two or more markers are available for the same treatment selection problem, methods are needed for comparing different markers and for combining multiple markers into a single marker. Our interest in these problems is motivated by two randomized clinical trials known as ECHO [3] and THRIVE [4], which compared the antiretroviral drugs rilpivirine and efavirenz for treatment-naive adult patients infected with human immunodeficiency virus (HIV). Both drugs are non-nucleoside reverse transcriptase inhibitors to be used in combination with other antiretroviral agents. The ECHO and THRIVE trials differed mainly in the use of background nucleoside/nucleotide reverse transcriptase inhibitors (N[t]RTIs). Except for that difference, the two trials followed nearly identical designs, produced very similar results [5], and are therefore combined in our analysis. The combined trial data show that rilpivirine and efavirenz are of comparable efficacy with respect to virologic response (viral load <50 copies/ml at week 48 of treatment). Exploratory analyses suggest that the relative efficacy of rilpivirine and efavirenz may depend on baseline viral load (BVL) and baseline CD4 cell count (BCD4) [6, 7]. Figure 1 shows nonparametric estimates of treatment-specific virologic response rates as functions of marker value, separately for each marker. A logistic regression analysis with treatment and log-transformed marker values as covariates shows significant interactions between treatment and the two markers (likelihood ratio test p=0.0018). These analyses suggest that BVL and BCD4 may be useful for treatment selection, and in this article we consider how to compare the two markers and how to combine them into a single marker.

Figure 1

Nonparametric regression analysis of the combined HIV trial data: smoothed estimates of treatment-specific virologic response rates as functions of BVL (left) and BCD4 (right).

The problems we consider are related to but different from evaluating and optimizing treatment regimes. A continuous or ordinal marker for treatment selection can be dichotomized into a treatment regime, which has a direct impact on the target population. The selection impact curve of Song and Pepe [8] can be used to visualize the impact of the resulting regime as a function of the cutoff value. The existing literature on precision medicine includes a variety of methods for estimating or approximating optimal treatment regimes that maximize a summary measure of efficacy e.g., [9, 10, 11, 12]. In practice, treatment decisions are based not only on efficacy but also on safety, cost and individual preference, which makes it difficult to apply a single cutoff value to all patients in all settings [13]. In our application, for example, rilpivirine is thought to be safer than efavirenz, and different patients may assign different weights to efficacy, safety and cost. Without a strong consensus on how to choose a cutoff, it is desirable to have a statistical framework for comparing and combining treatment selection markers without dichotomization.

There is an interesting analogy here between treatment selection and diagnostic medicine. A continuous or ordinal diagnostic marker can, and eventually will, be dichotomized into a binary diagnostic test. The choice of cutoff value involves balancing the two types of misclassifications, whose relative importance may depend on individual preference and circumstance. Without assuming a specific cutoff value, a continuous or ordinal diagnostic marker can be evaluated using the receiver operating characteristic (ROC) curve [14, 15]. Two diagnostic markers on different scales can be compared with respect to the area under the ROC curve (AUC), and multiple diagnostic markers can be combined by maximizing the AUC for the combination marker [16, 17, 18].

In this article, we propose a quantitative concordance measure for comparing and combining multiple treatment selection markers. The proposed concordance measure quantifies the extent to which higher marker values are predictive of larger treatment effects, and is closely related to the area under the selection impact curve defined by Song and Pepe [8]. For a given treatment selection marker, the proposed concordance measure can be estimated from clinical trial data using a U-statistic, which can incorporate auxiliary covariate information through an augmentation term e.g., [7, 19]. For combining multiple markers, we propose to maximize the estimated concordance measure among a specified family of combination markers. This is similar in spirit to the approach of Zhang et al. [11] for estimating optimal treatment regimes. When there are more than three markers to combine, it may be impractical to find the optimal combination through a grid search, but the objective function can be smoothed as in Ma and Huang [18] so that a gradient descent algorithm can be applied. In assessing the quality of the optimized combination marker, there can be a re-substitution bias if the same data are used to estimate and evaluate the optimal combination [1, 20, 21], and we propose to remove the re-substitution bias using a cross-validation procedure.

2 Methodology

2.1 A quantitative concordance measure

Consider a clinical trial in which each patient receives a randomized treatment T (1 experimental; −1 control). We assume for simplicity that P(T=1)=P(T=−1)=1/2, although the main ideas extend easily to different allocation probabilities. Let Y(t) denote the potential outcome that will realize if a patient receives treatment t∈{1,−1} [22]. Assuming consistency or stable unit treatment value, the actual outcome is just Y=Y(T). Without loss of generality, we assume that higher values of Y are desirable. Let X be a vector of baseline covariates including one or more markers that may be considered in choosing between the two treatments.

We start by considering how to assess the utility of one marker, say V, for treatment selection. To this end, it is important to consider the conditional effect of t=1 versus t=−1 given V, defined as

(2.1)δ(V)=E{Y(1)−Y(−1)|V}.

For a useless marker, the conditional effect function is constant: δ(V)≡E{Y(1)−Y(−1)}=:Δ. A useful marker should have a large amount of variation in the function δ. We assume throughout that higher marker values are intended to predict larger treatment effects, which implies that only “positive" variation in δ is desirable. This motivates our definition of γ=E(G12) with

G12=sgn(V1−V2){δ(V1)−δ(V2)},

where V1 and V2 are the marker values of two independent patients, and sgn is the sign function: sgn(u)=1,0,−1 according as u>,=,<0.

We call γ a quantitative concordance measure because G12 involves the actual difference δ(V1)−δ(V2) and not just the sign of the difference. An analogous qualitative concordance measure might be defined as γ∗=E(G12∗) with

G12∗=sgn(V1−V2)sgn{δ(V1)−δ(V2)}.

It is easy to see that γ∗=1 if V is continuous and δ is strictly increasing. Among such markers, it is certainly desirable to identify those with large variation in δ while considering the distribution of V. That extra information is captured by the quantitative concordance measure γ, which is therefore the focus of this article.

To understand what γ means, suppose the two patients are to receive treatments 1 and −1. If V1\gtV2, then

G12=δ(V1)−δ(V2)=E{Y1(1)−Y1(−1)|V1}−E{Y2(1)−Y2(−1)|V2}=E{Y1(1)+Y2(−1)|V1,V2}−E{Y1(−1)+Y2(1)|V1,V2},

where the last step makes use of the independence of the two patients. If V1\ltV2, the same argument shows that

G12=E{Y1(−1)+Y2(1)|V1,V2}−E{Y1(1)+Y2(−1)|V1,V2}.

If V1=V2, then G12=0. Thus, G12 measures the impact of giving treatment 1 to the patient with a larger V-value (if there is one) versus the opposite treatment allocation, and γ represents the population average impact of treatment allocation based on V versus −V. A large value of γ is clearly desirable. Note also that γ is invariant under a strictly increasing transformation of the marker.

Further insights into γ can be gained by considering the selection impact curve of Song and Pepe [8]. Assume for the moment that V is continuously distributed on an interval. Without loss of generality, we further assume that V is uniformly distributed on (0,1). The selection impact curve is the function

θ(v)=vE{Y(−1)|V\ltv}+(1−v)E{Y(1)|V\gtv},

which gives the mean outcome for a treatment regime that assigns treatment 1 to patients with V\gtv and treatment −1 to everyone else. Without specifying a cutoff v, a natural summary of the selection impact curve is its integral over the unit interval, which is analogous to the AUC for an ROC curve. In Appendix A, we show that

(1)γ=4cov{V,δ(V)}=4∫01θ(v)\/dv−2E{Y(−1)}−2E{Y(1)}.

Thus, for a given pair of treatments, γ is directly related to the area under the selection impact curve.

2.2 Estimating γ for a given marker

We now consider estimation of γ for a given marker using a random sample of (X,T,Y) (recall that X includes V as a component), with individual subjects denoted by subscripts (e.g., i=1,…,n). Estimation of γ is complicated by the fact that the conditional effect function δ is unknown. However, we note that

E(2TY|V)=2P(T=1|V)E{Y(1)|V}−2P(T=−1|V)E{Y(−1)|V}=δ(V),

which suggests that δ(V) may be estimated by 2TY [7, 19]. In fact, δ(V) could be estimated by 2T(Y−A) for any A=a(X), because E(TA|V)=E{E(TA|X)|V}=0 due to randomization. The inclusion of A is motivated by a potential gain in efficiency, as will become clear later.

The foregoing discussion, together with the definition of γ, suggests the following estimator:

(2)γˆ(a)=2n(n−1)∑i\ltjGˆij(a),

where

(3)Gˆij(a)=2sgn(Vi−Vj){Ti(Yi−Ai)−Tj(Yj−Aj)}.

Noting that γˆ(a) is a one-sample U-statistic of order 2, it follows from standard asymptotic theory e.g., van der Vaart [23] Chapter 12 that n{γˆ(a)−γ} converges to a normal distribution with mean zero and variance

σ2(a)=4cov{Gˆ12(a),Gˆ13(a)}.

A consistent estimator of σ2(a) is given by

σˆ2(a)=8n(n−1)(n−2)∑j/=i/=k,j\ltkGˆij(a)Gˆik(a)−4γˆ2(a).

In Appendix B, we show that the asymptotic variance σ2(a) is minimized by taking a(X) to be

(4)aopt(X)=E(Y|X).

To take advantage of this result, we can specify and estimate a working model E(Y|X)=m(X;α), where m is a known function and α is an unknown finite-dimensional parameter. For example, m(X;α) may be specified as a generalized linear model, and α may be estimated by (iteratively reweighted) least squares. Let αˆ denote the resulting estimate of α; then we can estimate aopt(X) by aˆopt(X)=m(X;αˆ). We show in Appendix B that, whether the model m(X;α) is correct or not, γˆ(aˆopt) is consistent for γ and asymptotically normal, and its asymptotic variance is given by σ2(m(⋅;α∞)), where α∞ is the probability limit of αˆ. If m(X;α) is correctly specified, then α∞ is the true value of α and γˆ(aˆopt) is asymptotically equivalent to γˆ(aopt). Regardless of the (in)correctness of m(X;α), the asymptotic variance σ2(m(⋅;α∞)) is consistently estimated by σˆ2(aˆopt).

2.3 Comparing two markers

We now consider the problem of comparing two markers, say Vb and Vc, both of which are included in X. Write γb=E(Gb12) with

Gb12=sgn(Vb1−Vb2){δb(Vb1)−δb(Vb2)},

where δb(v)=E{Y(1)−Y(−1)|Vb=v} and Vb1 and Vb2 are the values of Vb for two independent patients. For a given augmentation A=a(X), γb can be estimated by

γˆb(a)=2n(n−1)∑i\ltjGˆbij(a),

where

Gˆbij(a)=2sgn(Vbi−Vbj){Ti(Yi−Ai)−Tj(Yj−Aj)}.

Let γc and γˆc(a) be defined analogously for Vc. Our comparison of the two markers is based on the difference γb−γc, which is consistently estimated by γˆb(a)−γˆc(a). For any fixed a, it follows from standard asymptotic theory that n{γˆb(a)−γˆc(a)−(γb−γc)} converges to a normal distribution with mean zero and variance

σbc2(a)=4cov{Gˆb12(a),Gˆb13(a)}+cov{Gˆc12(a),Gˆc13(a)}−2cov{Gˆb12(a),Gˆc13(a)}.

A consistent estimator of σbc2(a) is given by

σˆbc2(a)=8n(n−1)(n−2)∑j/=i/=k,j\ltk{Gˆbij(a)Gˆbik(a)+Gˆcij(a)Gˆcik(a)−2Gˆbij(a)Gˆcik(a)}−4{γˆb(a)−γˆc(a)}2.

In Appendix C, we show that σbc2(a) is minimized by the same aopt defined by eq. (4). As in Section 2.2, we can estimate aopt(X) by aˆopt(X)=m(X;αˆ), where m(X;α) is a working model for E(Y|X) and αˆ is an estimate of α. The asymptotic behavior of γˆb(aˆopt)−γˆc(aˆopt) is similar to that of γˆ(aˆopt) and can be described by the last paragraph of Section 2.2 with some obvious modifications.

2.4 Combining multiple markers

Suppose V is a sub-vector of X consisting of two or more treatment selection markers. A natural question is how to combine the markers in V into a single marker S=s(V). We propose to combine V by maximizing γs, the γ-value of S. To minimize new notation, we will continue to use the notation in Section 2.1 with V replaced by V. For instance, δ(V) is the conditional effect of t=1 versus t=−1 given V. It is easy to see that γs is maximized by any s that preserves the ordering by δ:

s(V1)\gts(V2)wheneverδ(V1)>δ(V2).

Thus, δ(V) itself would be an ideal choice of S, as would any strictly increasing function of δ(V).

The foregoing discussion suggests that we could set S equal to an estimate of δ(V), which may be obtained under a model for E(Y|T,V) [24]. Noting that

(2.4)E(Y|T,V)=μ(V)+Tδ(V)/2,

where μ(V)=E{Y(1)+Y(−1)|V}/2, we might parameterize μ(V) and δ(V) separately and estimate the model parameters by (iteratively reweighted) least squares. The optimality of this approach generally depends on the correctness of modeling assumptions, although there may be some robustness with respect to μ(V) [7, 19]. If δ(V) is misspecified, this approach may lead to a sub-optimal choice of S whose interpretation is unclear.

In the rest of this subsection, we consider an alternative approach based on direct maximization of γs, similar in spirit to the approach of Zhang et al. [11] for estimating optimal treatment regimes. For practicality, we restrict attention to a parametric family of functions, s(V;β), indexed by a finite-dimensional parameter β. The specification of s(V;β) may be informed by a parametric model for δ(V), and may also incorporate practical considerations such as simplicity and interpretability. The parameter space for β may need to be restricted so as to ensure that s(V;β1) and s(V;β2) are not strictly increasing functions of each other if β1/=β2. For example, if s(V;β)=β′V, we might restrict β to the unit sphere or, alternatively, fix the first element of β at 1. Once such a parametric family is defined, the problem is then to estimate βopt=argmaxβγ(β), where we write γ(β)=γs(⋅;β) for convenience. The estimand βopt can be interpreted as the true value of β in the following single index model:

(2.4)δ(V)=ξ(s(V;β)),

where ξ is an unknown, strictly increasing function. Because ξ is unspecified, this approach is more flexible than the one based on a parametric model for δ(V). Even if the single index model is misspecfied, s(⋅;βopt) remains interpretable as a local maximizer of γs (within the specified parametric family).

We propose to estimate βopt by maximizing a consistent estimate of γ(β), which may be obtained as in Section 2.2. Let γˆ(β,a) be defined by eqs (2) and (3) with V replaced by s(V;β). The objective function β↦γˆ(β,a) can be maximized using a grid search algorithm when the effective dimension of β is one or two. For a higher-dimensional β, a grid search may be impractical, and the discreteness of the sign function in eq. (3) may create difficulties in applying a more efficient algorithm. To deal with this problem, we follow the approach of Ma and Huang [18] and apply a gradient descent algorithm to a smoothed version of γˆ(β,a). Specifically, we replace the sign function with λn=2K(⋅/ϵn)−1, where K is a smooth, symmetric distribution function and (ϵn) is a sequence of positive numbers decreasing to 0. This leads to the smoothed estimator

γ˜(β,a)=2n(n−1)∑i\ltjG˜ij(β,a),

where

G˜ij(β,a)=2λn(s(Vi;β)−s(Vj;β)){Ti(Yi−Ai)−Tj(Yj−Aj)}.

It can be shown as in Ma and Huang [18] online supplement that, under regularity conditions, γ˜(β,a) is asymptotically equivalent to γˆ(β,a); see Appendix D for details. Thus, the optimal choice of a and the related discussion in Section 2.2 remain applicable to γ˜(β,a). Empirical rules for choosing ϵn are given in Ma and Huang [18] Section 3.5.

Because of the asymptotic equivalence of γˆ(β,a) and γ˜(β,a), we will for simplicity focus on the unsmoothed version in the rest of this subsection. Let βˆopt be a maximizer of γˆ(⋅,aˆ), where aˆ may be fixed or estimated. The arguments of Han [25] and Sherman [26] can be adapted to show that, under regularity conditions, βˆopt is consistent for βopt and (for a suitable parameterization) asymptotically normal. Of particular interest to us is the quantity γ(βˆopt), which measures the performance of the estimated optimal combination. Asymptotic inference on this quantity can be based on the fact that

(5)n{γˆ(βˆopt,aˆ)−γ(βˆopt)}=n{γˆ(βopt,a∞)−γ(βopt)}+op(1),

where a∞ is the probability limit of aˆ (a sketch proof is given in Appendix D).

In finite samples, a re-substitution bias can arise from the fact that βˆopt is chosen to maximize γˆ(⋅,aˆ) [1]. In general, the maximum of an estimated objective function is an over-estimator of the true objective function evaluated at the maximizer [20, 21]. The re-substitution bias can be removed using the following cross-validation procedure. We propose to partition the study cohort randomly into a specified number, say K, of subsamples that are roughly equal in size. For each k∈{1,…,K}, we use the kth subsample as a validation sample and combine the other subsamples into a training sample. From the training sample we obtain

βˆopt(−k)=argmaxβγˆ(−k)(β,aˆ(−k))

using the exact same method for obtaining βˆopt. The superscript (−k) in the preceding display emphasizes that each estimate is based on the training sample alone. Next, we evaluate the estimated combination s(⋅;βˆopt(−k)) in the validation sample and obtain γˆ(k)(βˆopt(−k),aˆ(k)), where the superscript (k) indicates that the corresponding estimate is based on the validation sample alone. Because βˆopt(−k) and γˆ(k)(⋅,aˆ(k)) are based on independent data, γˆ(k)(βˆopt(−k),aˆ(k)) is free of re-substitution bias. The final cross-validated estimator of γ(βˆopt) is given by

γˆcv(βˆopt)=1K∑k=1Kγˆ(k)(βˆopt(−k),aˆ(k)).

Note that γˆ(k)(βˆopt(−k),aˆ(k)) is really an estimator of γ(βˆopt(−k)), which measures the utility of s(⋅;βˆopt(−k)) based on a training sample. Because s(⋅;βˆopt) is based on a larger sample and is expected to perform better, γˆ(k)(βˆopt(−k),aˆ(k)) and hence γˆcv(βˆopt) may be biased downward for estimating γ(βˆopt), especially for small K. The choice of K thus represents a trade-off between bias and computational demand – a larger K decreases the downward bias in γˆ(k)(βˆopt(−k),aˆ(k)) at the expense of more computation. Once K is chosen, the size of a subsample is determined as roughly n/K. If this number is large enough for reliable estimation of aopt, we can set aˆ(k) to be an estimate of aopt based on the kth subsample; otherwise we can substitute a fixed a for aˆ(k).

Because the terms γˆ(k)(βˆopt(−k),aˆ(k)), k=1,…,K, are not independent of each other, it is not straightforward to derive the variance of the cross-validated estimator γˆcv(βˆopt). Nonetheless, inference on γ(βˆopt) can be based on nonparametric bootstrap standard errors and confidence intervals. A nonparametric bootstrap sample of the same size as the original sample can be created by sampling with replacement. A cross-validated estimate of γ(βˆopt) can be obtained from each bootstrap sample, repeating all steps in optimization and cross-validation. This procedure can be repeated many times to produce a collection of bootstrap estimates, from which a bootstrap standard error can be obtained as a sample standard deviation and confidence limits as empirical quantiles.

3 Numerical results

3.1 Analysis of HIV trial data

The ECHO and THRIVE studies enrolled a combined total of 1,374 treatment-naive HIV-infected adults (694 ECHO; 680 THRIVE) in multiple countries. Main inclusion criteria included a screening viral load of at least 5,000 copies/ml and confirmed viral sensitivity to background N[t]RTIs. The study subjects were randomized 1:1 to rilpivirine 25 mg once daily or efavirenz 600 mg once daily, both in conjunction with a background N[t]RTI regimen which consisted of tenofovir disoproxil fumarate (TDF) and emtricitabine (FTC) in ECHO and was based on the investigator’s choice of TDF/FTC, zidovudine/lamivudine (3TC), or abacavir/3TC in THRIVE. Randomization was stratified by screening viral load (≤105, 105–5×105, >5×105 copies/ml) and background N[t]RTI regimen (in THRIVE). A double-dummy design was used to keep patients masked from the assigned treatment. Patients were followed for up to 96 weeks, and the primary endpoint was virologic response at week 48. The observed response rates were 82.2% for rilpivirine and 79.8% for efavirenz.

Here we use the proposed methodology to evaluate, compare and combine the two markers (BVL and BCD4) for choosing between rilpivirine (T=1) and efavirenz (T=−1). For ease of handling, we work with standardized versions of −log(BVL) (minus because higher values of BVL indicate smaller benefits of rilpivirine versus efavirenz) and log(BCD4+1) (where one is added to handle zero counts). The standardization consists of subtracting by the mean and dividing by the standard deviation. We note that γ is invariant under a strictly increasing transformation. The γ-values of the two markers are estimated with A=aˆ(X) equal to 0 (i.e., no augmentation) or an estimate of E(Y), E(Y|V) or E(Y|X), where V consists of the two transformed markers and X consists of age, gender, race, body mass index, and the two markers. We estimate E(Y) with the sample mean of Y, and estimate E(Y|V) and E(Y|X) with logistic regression models that include the specified covariates as linear terms (in addition to an intercept). For brevity, we denote these augmentation terms by Eˆ(Y), Eˆ(Y|V) and Eˆ(Y|X), respectively. Our analysis is restricted to a subset of 1,314 subjects with complete covariate and outcome data.

Table 1

Marker comparison based on HIV trial data: point estimates and analytical standard errors for the γ-values of BVL and BCD4 as well as their difference (BCD4−BVL), obtained with different choices of aˆ.

{aˆ(X)}	Point estimate			Standard error
	BVL	BCD4	Diff.	BVL	BCD4	Diff.
0	0.143	0.021	–0.122	0.056	0.057	0.070
Eˆ(Y)	0.045	0.084	0.039	0.026	0.026	0.028
Eˆ(Y\|V)	0.046	0.083	0.037	0.025	0.025	0.028
Eˆ(Y\|X)	0.043	0.082	0.039	0.025	0.025	0.028

Table 1 shows the point estimates and standard errors for the γ-values of the two markers as well as their difference. As one might expect, the standard errors do not increase when new information is added to the augmentation term. The reductions are quite large when A changes from 0 to Eˆ(Y), and negligible when A changes from Eˆ(Y|V) to Eˆ(Y|X), suggesting that the additional covariates in X (relative to V) are of limited prognostic value. The point estimates based on A=0 are visibly different from those based on non-zero augmentation terms, possibly due to the large variability for A=0. The non-zero augmentation terms lead to very similar results, which together indicate that BCD4 has a higher γ-value than BVL, although the difference is not statistically significant.

Next, we consider how to combine the two markers linearly, so s(V;β)=β′V with ∥β∥=1. Linear combinations are easy to use in practice, even though they may not attain the global optimum. Our approach is to directly maximize an estimate of γ, which may be obtained using one of the four methods described earlier (with different augmentation terms). The maximization is done using a grid search algorithm based on the representation β=(cosϕ,sinϕ)′, ϕ∈[0,2π). As a comparator, we also consider an alternative approach based on the logistic regression model

logitP(Y=1|T,X;η)=η1+ηX′X+T(ηT+ηTV′V),

where η=(η1,ηX′,ηT,ηTV′)′ is a vector of unknown regression parameters. Fitting this logistic regression model produces a maximum likelihood estimate of ηTV, which is then normalized to provide a candidate combination of the two markers. The γ-value of the candidate combination based on logistic regression is estimated with A=Eˆ(Y|X). A 5-fold cross-validation procedure is also used to estimate the γ-value for each candidate combination (from either approach), with A=Eˆ(Y|X) in the validation step. Bootstrap standard errors are obtained from 1,000 bootstrap samples.

Table 2

Marker combination based on HIV trial data: point estimates and bootstrap standard errors for βˆopt, γˆ(βˆopt,aˆ) and γˆcv(βˆopt), obtained with different choices of aˆ in the proposed approach.

{@{}lrrrrrrrr} {aˆ(X)}	Point estimate				Standard error
\\[-2mm]	βˆopt		r		βˆopt		r
	BVL	BCD4	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	BVL	BCD4	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)
Logistic regression
Eˆ(Y\|X)	0.445	0.895	0.078	0.079	0.343	0.209	0.025	0.028
Proposed approach
0	1.000	0.016	0.144	0.049	0.206	0.453	0.053	0.038
Eˆ(Y)	0.124	0.992	0.085	0.080	0.284	0.194	0.025	0.030
Eˆ(Y\|V)	0.186	0.983	0.085	0.083	0.280	0.196	0.024	0.029
Eˆ(Y\|X)	0.124	0.992	0.084	0.086	0.260	0.192	0.024	0.029

Table 2 shows the results for marker combination. For the proposed approach, the cross-validated γ-estimate tends to increase when new information is added to the augmentation term. Except in the no-augmentation case, the proposed approach produces higher cross-validated γ-estimates than the logistic regression approach, and both approaches suggest that BCD4 should play a much larger role than BVL in the linear combination. In the no-augmentation case, cross-validation reduces the estimate of γ dramatically for the proposed approach.

Example-based simulation

We now evaluate the performance of the proposed methodology in a simulation study mimicking the ECHO and THRIVE trials. We work with the same subset of 1,314 subjects analyzed in Section 3.1 with their original marker values, and generate T and Y randomly according to the observed pattern in the data. Specifically, we let P(T=1|V)=P(T=−1|V)=1/2 and, for each t∈{1,−1}, set P(Y=1|T=t,V)=P(Y(t)=1|V) equal to a nonparametric estimate based on tensor product splines. Because of the curse of dimensionality, this data generation mechanism does not involve the covariates in X that are not in V, which therefore will not be included in the analysis. One thousand datasets are simulated independently and analyzed as in Section 3.1 (except that X=V in this simulation study).

Table 3

Marker comparison in example-based simulation: empirical bias, standard deviation (SD), mean standard error (SE) and empirical coverage probability (CP) (of 95% Wald confidence intervals) for estimating the γ-values of BVL and BCD4 as well as their difference (BCD4−BVL) with different choices of aˆ. The true values are 0.046 for BVL, 0.081 for BCD4, and 0.035 for the difference.

{@{}lrrrrrrrrrrrr} {aˆ(X)}	Bias			SD			SE			CP
	BVL	BCD4	Diff.	BVL	BCD4	Diff.	BVL	BCD4	Diff.	BVL	BCD4	Diff.
0	–0.002	–0.001	0.001	0.059	0.057	0.070	0.057	0.057	0.069	0.937	0.948	0.945
Eˆ(Y)	–0.001	0.000	0.001	0.026	0.025	0.028	0.026	0.026	0.029	0.954	0.953	0.951
Eˆ(Y\|V)	–0.001	–0.001	0.000	0.024	0.024	0.028	0.025	0.025	0.029	0.953	0.958	0.952

Table 3 shows the results for marker comparison: empirical bias, standard deviation, mean standard error and empirical coverage probability (of 95% Wald confidence intervals) for estimating the γ-values of the two markers as well as their difference with different augmentation terms (0, Eˆ(Y) and Eˆ(Y|V) but not Eˆ(Y|X)). It is clear in Table 3 that the estimators are virtually unbiased and generally become more efficient with increasing information in the augmentation, although the improvement from Eˆ(Y) to Eˆ(Y|V) is quite modest (presumably due to low prognostic value of V). In all cases, the standard errors approximate the empirical standard deviations well on average, and the resulting confidence intervals have close-to-nominal coverage.

Table 4

Marker combination in example-based simulation: empirical means and standard deviations of βˆopt, γ(βˆopt), γˆ(βˆopt,aˆ) and γˆcv(βˆopt), obtained with different choices of aˆ in the proposed approach. The true optimum is given by βopt=(0.062,0.998)′ with γ(βopt)=0.081.

{@{}lrrrrrrrrrr} aˆ(V)	Mean					Standard deviation
	βˆopt		c			βˆopt		c
	BVL	BCD4	γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	BVL	BCD4	γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)
Logistic regression
Eˆ(Y\|V)	0.443	0.827	0.070	0.079	0.069	0.228	0.334	0.008	0.024	0.028
Proposed approach
0	0.258	0.702	0.063	0.112	0.056	0.537	0.519	0.033	0.046	0.042
Eˆ(Y)	0.186	0.921	0.077	0.087	0.073	0.250	0.327	0.011	0.023	0.029
Eˆ(Y\|V)	0.166	0.931	0.077	0.087	0.072	0.266	0.332	0.011	0.023	0.029

The results for marker combination are presented in Table 4, where the proposed approach with the aforementioned augmentation terms is compared with a logistic regression approach based on the following model:

logitP(Y=1|T,V;η)=η1+ηV′V+T(ηT+ηTV′V),

where η=(η1,ηV′,ηT,ηTV′)′. The proposed approach tends to perform better with increasing information in the augmentation, producing candidate combinations that are closer on average to the optimal combination with a higher mean of γ(βˆopt). By the same measures, logistic regression outperforms the proposed approach with no augmentation but not with non-zero augmentations. For the proposed approach, γˆ(βˆopt,aˆ) (without cross-validation) tends to over-estimate γ(βˆopt), especially in the no-augmentation case, and the re-substitution bias is effectively corrected by cross-validation. Although γˆcv(βˆopt) tends to under-estimate γ(βˆopt), it is on average closer to γ(βˆopt) than γˆ(βˆopt,aˆ) is. It is also worth noting that, for the proposed approach, the variability in βˆopt and γ(βˆopt) tends to decrease with increasing information in the augmentation.

3.3 Additional simulations

Here we report additional simulation experiments with a continuous outcome Y, two markers in V, and a separate baseline variable W such that X=(V′,W)′. The three components of X are independently distributed as standard normal. As before, treatment is randomized 1:1, so that P(T=1|X)=P(T=−1|X)=1/2. Given (X,T), we generate Y from the following single index model:

Y=α1+αX′X+Tξ(βV′V)+ϵ,

where α1=0, αX=(0.5,−0.5,1)′, βV=(1,1)′ or (1,2)′, ξ may be identity, expit (i.e., ξ(u)=1/{1+exp(−u)}) or sgn, and ϵ∼N(0,1) independently of (X,T). Thus, all three components of X are prognostic, and both components of V are effect-modifying. The value of βV determines the relative strength of the two markers, and the choice of ξ has a direct impact on δ(V), which describes the heterogeneity of the treatment effect. One thousand samples of size n=500 are generated independently and analyzed for marker comparison and combination.

Table 5

Marker comparison in additional simulations: empirical bias, standard deviation (SD), mean standard error (SE) and empirical coverage probability (CP) (of 95% Wald confidence intervals) for estimating γ1, γ2 and γ2−γ1 with different choices of aˆ.

{@{}lrrrrrrrrrrrr} {aˆ(X)}	Bias			SD			SE			CP
	γ1	γ2	γ2−γ1	γ1	γ2	γ2−γ1	γ1	γ2	γ2−γ1	γ1	γ2	γ2−γ1
ξ=identity, βV=(1,1)′, γ1=γ2≈2.247
0	0.020	0.015	–0.005	0.207	0.217	0.285	0.212	0.212	0.282	0.950	0.942	0.946
Eˆ(Y)	0.015	0.011	–0.005	0.206	0.217	0.285	0.212	0.212	0.282	0.957	0.945	0.944
Eˆ(Y\|V)	0.003	–0.006	–0.008	0.188	0.197	0.238	0.191	0.191	0.230	0.955	0.945	0.941
Eˆ(Y\|X)	–0.008	–0.014	–0.006	0.160	0.165	0.182	0.161	0.161	0.179	0.954	0.947	0.948
ξ=identity, βV=(1,2)′, γ1≈2.247, γ2≈4.494
0	0.019	0.026	0.007	0.275	0.244	0.330	0.278	0.246	0.327	0.950	0.946	0.942
Eˆ(Y)	0.015	0.016	0.002	0.275	0.244	0.331	0.277	0.246	0.327	0.949	0.947	0.941
Eˆ(Y\|V)	0.000	–0.021	–0.021	0.258	0.228	0.283	0.261	0.228	0.283	0.953	0.948	0.941
Eˆ(Y\|X)	–0.004	–0.027	–0.023	0.236	0.209	0.248	0.239	0.204	0.244	0.955	0.950	0.939
ξ=expit, βV=(1,1)′, γ1=γ2≈0.417
0	–0.005	–0.002	0.003	0.172	0.171	0.258	0.172	0.172	0.263	0.957	0.946	0.953
Eˆ(Y)	–0.006	–0.003	0.003	0.172	0.171	0.258	0.172	0.172	0.263	0.956	0.947	0.954
Eˆ(Y\|V)	–0.006	–0.009	–0.003	0.145	0.148	0.202	0.146	0.146	0.206	0.956	0.941	0.949
Eˆ(Y\|X)	–0.004	–0.007	–0.003	0.103	0.106	0.144	0.104	0.104	0.146	0.945	0.940	0.940
ξ=expit, βV=(1,2)′, γ1≈0.322, γ2≈0.669
0	–0.006	–0.003	0.004	0.171	0.176	0.266	0.174	0.172	0.264	0.950	0.950	0.953
Eˆ(Y)	–0.007	–0.004	0.003	0.171	0.177	0.265	0.174	0.172	0.264	0.949	0.947	0.953
Eˆ(Y\|V)	–0.007	–0.012	–0.005	0.142	0.153	0.206	0.148	0.146	0.207	0.960	0.939	0.944
Eˆ(Y\|X)	–0.003	–0.008	–0.005	0.107	0.103	0.143	0.107	0.104	0.148	0.946	0.954	0.953
ξ=sgn, βV=(1,1)′, γ1=γ2≈1.333
0	–0.010	0.008	0.018	0.183	0.189	0.292	0.185	0.185	0.288	0.958	0.938	0.947
Eˆ(Y)	–0.012	0.006	0.018	0.183	0.189	0.292	0.185	0.185	0.288	0.960	0.941	0.946
Eˆ(Y\|V)	–0.018	–0.006	0.012	0.160	0.160	0.234	0.161	0.161	0.236	0.949	0.941	0.949
Eˆ(Y\|X)	–0.016	–0.009	0.008	0.121	0.126	0.185	0.124	0.124	0.187	0.955	0.951	0.947
ξ=sgn, βV=(1,2)′, γ1≈0.818, γ2≈1.745
0	0.002	0.000	–0.003	0.196	0.184	0.295	0.194	0.176	0.285	0.948	0.932	0.940
Eˆ(Y)	0.000	–0.004	–0.005	0.196	0.183	0.295	0.194	0.176	0.285	0.946	0.936	0.941
Eˆ(Y\|V)	–0.001	–0.015	–0.014	0.171	0.152	0.238	0.171	0.150	0.234	0.950	0.944	0.946
Eˆ(Y\|X)	–0.003	–0.016	–0.013	0.139	0.110	0.184	0.137	0.110	0.183	0.950	0.946	0.936

The results for marker comparison are presented in Table 5, where γj (j=1,2) denotes the γ-value for the jth marker in V. Here, βV=(1,1)′ represents the null case (γ1=γ2) and βV=(1,2)′ represents the alternative case (γ1<γ2). The γ’s are estimated with the same choices of A=aˆ(X) as before (0, Eˆ(Y), Eˆ(Y|V) and Eˆ(Y|X)), expect that E(Y|V) and E(Y|X) are now estimated under linear regression models with V and X, respectively, as linear terms (in addition to an intercept). The results in Table 5 are consistent with those in Table 3 in the sense of small bias in point estimation and variance estimation as well as good coverage of confidence intervals. In addition, Table 5 demonstrates that the prognostic ability of X translates into improved precision when A changes from Eˆ(Y) to Eˆ(Y|V) and further to Eˆ(Y|X). Note that there is no improvement from A=0 to A=Eˆ(Y) because E(Y)=0 in this simulation study.

Table 6

Marker combination in additional simulations: empirical means and standard deviations of βˆopt, γ(βˆopt), γˆ(βˆopt,aˆ) and γˆcv(βˆopt), obtained with different choices of aˆ in the proposed approach.

{@{}lrrrrrrrrrr} {aˆ(X)}	Mean					Standard deviation
	βˆopt		γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	βˆopt		γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)
ξ=identity, βV=(1,1)′, βopt=(0.707,0.707)′, γ(βopt)=3.178
Linear regression
Eˆ(Y\|X)	0.706	0.708	3.176	3.160	3.018	0.022	0.022	0.002	0.146	0.151
Proposed approach
0	0.704	0.707	3.170	3.199	3.008	0.049	0.049	0.011	0.184	0.151
Eˆ(Y)	0.704	0.707	3.170	3.193	3.009	0.049	0.049	0.011	0.183	0.152
Eˆ(Y\|V)	0.705	0.707	3.173	3.169	3.013	0.039	0.039	0.007	0.175	0.154
Eˆ(Y\|X)	0.705	0.708	3.175	3.160	3.014	0.029	0.029	0.004	0.146	0.155
ξ=identity, βV=(1,2)′, βopt=(0.447,0.894)′, γ(βopt)=5.024
Linear regression
Eˆ(Y\|X)	0.448	0.894	5.023	4.988	4.763	0.018	0.009	0.001	0.193	0.201
Proposed approach
0	0.447	0.893	5.019	5.049	4.758	0.041	0.020	0.007	0.234	0.206
Eˆ(Y)	0.448	0.893	5.019	5.039	4.754	0.040	0.020	0.007	0.235	0.206
Eˆ(Y\|V)	0.448	0.893	5.021	4.998	4.759	0.033	0.017	0.005	0.220	0.207
Eˆ(Y\|X)	0.448	0.893	5.022	4.988	4.765	0.026	0.014	0.003	0.193	0.205
ξ=expit, βV=(1,1)′, βopt=(0.707,0.707)′, γ(βopt)=0.603
Linear regression
Eˆ(Y\|X)	0.689	0.703	0.593	0.599	0.554	0.125	0.121	0.014	0.105	0.110
Proposed approach
0	0.667	0.676	0.571	0.628	0.529	0.223	0.222	0.049	0.160	0.123
Eˆ(Y)	0.667	0.675	0.570	0.627	0.526	0.223	0.223	0.049	0.160	0.121
Eˆ(Y\|V)	0.674	0.693	0.581	0.609	0.540	0.184	0.181	0.034	0.142	0.116
Eˆ(Y\|X)	0.692	0.700	0.593	0.601	0.554	0.127	0.124	0.014	0.105	0.110
ξ=expit, βV=(1,2)′, βopt=(0.447,0.894)′, γ(βopt)=0.758
Linear regression
Eˆ(Y\|X)	0.457	0.878	0.749	0.753	0.708	0.126	0.068	0.014	0.104	0.108
Proposed approach
0	0.452	0.858	0.732	0.781	0.686	0.211	0.124	0.040	0.159	0.116
Eˆ(Y)	0.451	0.859	0.733	0.779	0.687	0.210	0.121	0.038	0.159	0.116
Eˆ(Y\|V)	0.445	0.875	0.742	0.766	0.697	0.168	0.090	0.023	0.142	0.111
Eˆ(Y\|X)	0.455	0.881	0.750	0.755	0.708	0.118	0.062	0.012	0.104	0.107
ξ=sgn, βV=(1,1)′, βopt=(0.707,0.707)′, γ(βopt)=2.000
Linear regression
Eˆ(Y\|X)	0.706	0.705	1.995	1.983	1.902	0.047	0.047	0.007	0.107	0.110
Proposed approach
0	0.700	0.709	1.990	2.011	1.895	0.062	0.062	0.014	0.162	0.111
Eˆ(Y)	0.700	0.709	1.990	2.007	1.895	0.063	0.062	0.014	0.162	0.111
Eˆ(Y\|V)	0.703	0.707	1.994	1.988	1.899	0.050	0.050	0.009	0.145	0.112
Eˆ(Y\|X)	0.705	0.707	1.996	1.985	1.903	0.038	0.038	0.005	0.106	0.110
ξ=sgn, βV=(1,2)′, βopt=(0.447,0.894)′, γ(βopt)=2.000
Linear regression
Eˆ(Y\|X)	0.445	0.893	1.995	1.979	1.899	0.059	0.029	0.008	0.103	0.106
Proposed approach
0	0.447	0.890	1.989	2.007	1.893	0.082	0.041	0.017	0.164	0.106
Eˆ(Y)	0.447	0.890	1.989	2.003	1.891	0.083	0.041	0.017	0.164	0.108
Eˆ(Y\|V)	0.445	0.893	1.993	1.989	1.896	0.066	0.033	0.011	0.147	0.107
Eˆ(Y\|X)	0.446	0.893	1.996	1.981	1.900	0.049	0.024	0.006	0.102	0.106

The results for marker combination are shown in Table 6, where the proposed approach is compared with a linear regression approach based on the following working model:

E(Y|T,X;η)=η1+ηX′X+T(ηT+ηTV′V),

where η=(η1,ηX′,ηT,ηTV′)′ is a vector of unknown regression parameters. This model is correct if ξ is identity and incorrect if ξ is expit or sgn. The working model is fitted by ordinary least squares and the resulting estimate of ηTV is normalized to provide a candidate combination of the two markers. The γ-value of the candidate combination based on linear regression is estimated with A=Eˆ(Y|X). A 5-fold cross-validation procedure is also used to estimate the γ-value for each candidate combination (from either approach), with A=Eˆ(Y|X) in the validation step. In Table 6, the proposed approach generally performs better in estimating βopt, in the sense of decreasing variability and (for ξ=expit) decreasing bias, when new information is added to the augmentation term. These trends correspond to decreasing variability and (for ξ=expit) increasing mean of γ(βˆopt) (calculated through numerical integration). In all cases considered here, the linear regression approach performs similarly to the proposed approach with A=Eˆ(Y|X). This phenomenon is reminiscent of the observation that logistic regression and direct maximization of AUC often perform similarly for combining multiple diagnostic markers [17, 27]. Table 6 also illustrates that the re-substitution bias may or may not be a serious issue and that cross-validation could introduce a substantial downward bias in estimating γ(βˆopt).

In Appendix E, we examine the performance of the proposed methods at a smaller sample size (n=100). The results for marker comparison indicate that estimation of the optimal augmentation can result in a small-sample bias in γˆ(aˆopt) and the associated standard error, and that in some cases the proposed Wald confidence intervals can have sub-nominal coverage probabilities. For marker combination, the smaller sample size provides an opportunity to evaluate the bootstrap standard errors and the associated confidence intervals for γ(βˆopt). The results show that the bootstrap standard errors (based on 200 bootstrap samples) perform well in most cases, although serious undercoverage could result from a substantial bias in the point estimate of γ(βˆopt). These findings suggest that extra caution should be taken when applying the methods to small samples and when making inference about γ(βˆopt).

4 Discussion

We have proposed a quantitative concordance measure, γ, for comparing and combining treatment selection markers without specifying a cutoff value. Not requiring a cutoff value is a desirable feature because in practice the choice of cutoff may depend on safety, cost and individual preference. In that sense, γ is similar to the AUC for evaluating and comparing diagnostic markers, and we have shown that γ is indeed related to the area under the selection impact curve. Like the AUC, γ is invariant to monotone transformations of marker values. Unlike the AUC, γ does not have a universal scale, which makes it difficult to make comparisons across different treatment selection problems. However, for a given patient population and a given pair of treatments, γ does have a fixed scale and can be used to compare different treatment selection markers.

For combining multiple treatment selection markers, our simulation results show that the proposed approach often performs similarly to a regression approach based on an appropriate model for E(Y|T,X), which is again analogous to the literature on diagnostic markers. The proposed approach does seem advantageous in the HIV trials that motivated this research. In other applications, a simulation study could be conducted to compare the two approaches. If, for a given application, neither approach clearly outperforms the other, both approaches could be included in a sensitivity analysis. Our simulation results also suggest that the re-substitution bias may or may not be a serious problem and that the cross-validated γ-estimate for a candidate combination could be less or more biased (in the opposite direction) than the un-cross-validated estimate. These issues need to be investigated further, perhaps by means of simulation for a given application. Another area of future research is how to incorporate variable selection techniques such as the lasso into the proposed marker combination procedure when the dimension of V is very high (possibly greater than n).

Acknowledgements

We thank two anonymous reviewers for constructive comments that have improved the manuscript greatly. The views expressed in this article do not represent the official position of the U.S. Food and Drug Administration.

1. Simon R. Development and validation of biomarker classifiers for treatment selection. J Stat Plan Inference 2008;138:308–20.10.1016/j.jspi.2007.06.010Suche in Google Scholar

2. Simon R. Clinical trials for predictive medicine: new challenges and paradigms. Clin Trials 2010;7:516–24.10.1177/1740774510366454Suche in Google Scholar

3. Molina JM, Cahn P, Grinsztejn B, Lazzarin A, Mills A, Saag M. Rilpivirine versus efavirenz with tenofovir and emtricitabine in treatment-naive adults infected with HIV-1 (ECHO): a phase 3 randomised double-blind active-controlled trial. Lancet 2011;378:238–46.10.1016/S0140-6736(11)60936-7Suche in Google Scholar

4. Cohen CJ, Andrade-Villanueva J, Clotet B, Fourie J, Johnson MA, Ruxrungtham K. Rilpivirine versus efavirenz with two background nucleoside or nucleotide reverse transcriptase inhibitors in treatment-naive adults infected with HIV-1 (THRIVE): a phase 3, randomised, non-inferiority trial. Lancet 2011;378:229–37.10.1016/S0140-6736(11)60983-5Suche in Google Scholar

5. Cohen CJ, Molina JM, Cahn P, Clotet B, Fourie J, Grinsztejn B. Efficacy and safety of rilpivirine (TMC278) versus efavirenz at 48 weeks in treatment-naive HIV-1-infected patients: pooled results from the phase 3 double-blind randomized ECHO and THRIVE trials. J AIDS 2012;60:33–42.10.1097/QAI.0b013e31824d006eSuche in Google Scholar PubMed

6. Zhang Z, Nie L, Soon G, Liu A. The use of covariates and random effects in evaluating predictive biomarkers under a potential outcome framework. Ann Appl Stat 2014;8:2336-55.10.1214/14-AOAS773Suche in Google Scholar PubMed PubMed Central

7. Zhang Z, Qu Y, Zhang B, Nie L, Soon G. Use of auxiliary covariates in estimating a biomarker-adjusted treatment effect model with clinical trial data. Stat Methods Med Res 2016;25:2103–19.10.1177/0962280213515572Suche in Google Scholar PubMed

8. Song X, Pepe MS. Evaluating markers for selecting a patients treatment. Biometrics 2004;60:874–83.10.1111/j.0006-341X.2004.00242.xSuche in Google Scholar PubMed

9. Qian M, Murphy SA. Performance guarantees for individualized treatment rules. Ann Stat 2011;39:1180–210.10.1214/10-AOS864Suche in Google Scholar PubMed PubMed Central

10. Zhao Y, Zeng D, Rush J, Kosorok MR. Estimating individualized treatment rules using outcome weighted learning. J Am Stat Assoc 2012;107:1106–18.10.1080/01621459.2012.695674Suche in Google Scholar PubMed PubMed Central

11. Zhang B, Tsiatis A, Laber E, Davidian M. A robust method for estimating optimal treatment regimes. Biometrics 2012;68:1010-8.10.1111/j.1541-0420.2012.01763.xSuche in Google Scholar PubMed PubMed Central

12. Kang C, Janes H, Huang Y. Combining biomarkers to optimize patient treatment recommendations. Biometrics 2014;70:695–707.10.1111/biom.12191Suche in Google Scholar PubMed PubMed Central

13. Janes H, Pepe MS, Bossuyt PM, Barlow WE. Measuring the performance of markers for guiding treatment decisions. Ann Intern Med 2011;154:253–9.10.7326/0003-4819-154-4-201102150-00006Suche in Google Scholar PubMed PubMed Central

14. Zhou XH, Obuchowski NA, McClish DK. Statistical methods in diagnostic medicine. New York: Wiley, 2002.10.1002/9780470317082Suche in Google Scholar

15. Pepe MS. The statistical evaluation of medical tests for classification and prediction. New York: Oxford University Press, 2003.Suche in Google Scholar

16. McIntosh MS, Pepe MS. Combining several screening tests: optimality of the risk score. Biometrics 2002;58:657–64.10.1111/j.0006-341X.2002.00657.xSuche in Google Scholar

17. Pepe MS, Cai T, Longton G. Combining predictors for classification using the area under the receiver operating characteristic curve. Biometrics 2006;62;221–9.10.1111/j.1541-0420.2005.00420.xSuche in Google Scholar PubMed

18. Ma S, Huang J. Combining multiple markers for classification using ROC. Biometrics 2007;63:751–7.10.1111/j.1541-0420.2006.00731.xSuche in Google Scholar PubMed

19. Tian L, Alizadeh AA, Gentles AJ, Tibshirani R. A simple method for estimating interactions between a treatment and a large number of covariates. J Am Stat Assoc 2014;109:1517–32.10.1080/01621459.2014.951443Suche in Google Scholar PubMed PubMed Central

20. Dawid AP. Selection paradoxes of Bayesian inference. Multivariate Anal Appl 1994;24:211–20.10.1214/lnms/1215463797Suche in Google Scholar

21. Senn S. A note concerning a selection “paradox” of Dawid’s. Am Stat 2008;62:206–10.10.1198/000313008X331530Suche in Google Scholar

22. Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Edu Psychol 1974;66:688–701.10.1037/h0037350Suche in Google Scholar

23. van der Vaart AW. Asymptotic statistics. Cambridge: Cambridge University Press, 1998.10.1017/CBO9780511802256Suche in Google Scholar

24. Cai T, Tian L, Wong PH, Wei LJ. Analysis of randomized comparative clinical trial data for personalized treatment selections. Biostatistics 2011;12:270–82.10.1093/biostatistics/kxq060Suche in Google Scholar

25. Han AK. Non-parametric analysis of a generalized regression model. J Econ 1987;35:303–16.10.1016/0304-4076(87)90030-3Suche in Google Scholar

26. Sherman RP. The limiting distribution of the maximum rank correlation estimator. Econometrica 1993;61:123–37.10.2307/2951780Suche in Google Scholar

27. Li KC, Duan N. Regression analysis under link violation. Ann Stat 1989;17:1009–52.10.1214/aos/1176347254Suche in Google Scholar

28. Nolan D, Pollard D. Functional limit theorems for $$U$$-processes. Ann Probab 1988;16:1291–8.10.1214/aop/1176991691Suche in Google Scholar

29. van der Vaart AW, Wellner JA. Weak convergence and empirical processes with applications to statistics. New York: Springer-Verlag, 1996.10.1007/978-1-4757-2545-2Suche in Google Scholar

Appendix A: Proof of (1)

We start by noting that

(6)γ=E{sgn(V1−V2)δ(V1)}+E{sgn(V2−V1)δ(V2)}=2E{sgn(V1−V2)δ(V1)}=2E[E{sgn(V1−V2)δ(V1)|V1}]=2E{(2V1−1)δ(V1)}=4E{Vδ(V)}−2E{δ(V)}=4E{Vδ(V)}−4E(V)E{δ(V)}=4cov{V,δ(V)},

which follows from the symmetry of (V1,V2) and the assumed uniform distribution of V. Next, we note that

θ(u)=∫0uE{Y(−1)|V=v}\/dv+∫u1E{Y(1)|V=v}\/dv=∫01E{Y(−1)|V=v}\/dv−∫u1E{Y(−1)|V=v}\/dv+∫u1E{Y(1)|V=v}\/dv=E{Y(−1)}+∫u1δ(v)\/dv,

which implies that

(7)∫01θ(u)\/du=E{Y(−1)}+∫01∫u1δ(v)\/dv\/du=E{Y(−1)}+∫01∫01I(v\gtu)δ(v)\/dv\/du=E{Y(−1)}+∫01∫01I(v\gtu)δ(v)\/du\/dv=E{Y(−1)}+∫01vδ(v)\/dv=E{Y(−1)}+E{Vδ(V)},

where I(⋅) is the indicator function. Combining eqs ({6}) with (7), we have

γ=4E{Vδ(V)}−4E(V)E{δ(V)}=4∫01θ(u)\/du−E{Y(−1)}−2Δ=4∫01θ(u)\/du−4E{Y(−1)}−2[E{Y(1)}−E{Y(−1)}]=4∫01θ(u)\/du−2E{Y(−1)}−2E{Y(1)}.

Appendix B: Theoretical Support for Section 2.2

First, we note that, for two independent subjects i/=j,

E{Gˆij(a)}=E[2sgn(Vi−Vj){Ti(Yi−Ai)−Tj(Yj−Aj)}]=E(E[2sgn(Vi−Vj){Ti(Yi−Ai)−Tj(Yj−Aj)}|Vi,Vj])=E(sgn(Vi−Vj)E[2{Ti(Yi−Ai)−Tj(Yj−Aj)}|Vi,Vj])=E(sgn(Vi−Vj)[E{2Ti(Yi−Ai)|Vi}−E{2Tj(Yj−Aj)|Vj}])=E[sgn(Vi−Vj){δ(Vi)−δ(Vj)}]=E{Gij}=γ,

where we use the independence between subjects and the fact that E{2T(Y−A)|V}=δ(V) (noted in Section 2.2). Therefore, γˆ(a) is unbiased for γ for any fixed a.

For a fixed a, it follows from Theorem 12.3 of van der Vaart [23] that n{γˆ(a)−γ} converges to a normal distribution with mean zero and variance

σ2(a)=4cov{Gˆ12(a),Gˆ13(a)}=4E{Gˆ12(a)Gˆ13(a)}−4E{Gˆ12(a)}E{Gˆ13(a)}=4E{Gˆ12(a)Gˆ13(a)}−4γ2.

A conditioning argument shows that

E{Gˆ12(a)Gˆ13(a)}=E[4sgn((V1−V2)(V1−V3)){T1(Y1−A1)−T2(Y2−A2)}{T1(Y1−A1)−T3(Y3−A3)}]=E(E[4sgn((V1−V2)(V1−V3))×{T1(Y1−A1)−T2(Y2−A2)}{T1(Y1−A1)−T3(Y3−A3)}|V1,V2,V3])=E(sgn((V1−V2)(V1−V3))×E[4{T1(Y1−A1)−T2(Y2−A2)}{T1(Y1−A1)−T3(Y3−A3)}|V1,V2,V3])=E(sgn((V1−V2)(V1−V3))×[E{4T12(Y1−A1)2|V1}−E{4T1(Y1−A1)T2(Y2−A2))|V1,V2}−E{4T1(Y1−A1)T3(Y3−A3))|V1,V3}+E{4T2(Y2−A2)T3(Y3−A3))|V2,V3}])=E(sgn((V1−V2)(V1−V3))×[var{2T1(Y1−A1)|V1}+δ2(V1)−δ(V1)δ(V2)−δ(V1)δ(V3)+δ(V2)δ(V3)])=E(sgn((V1−V2)(V1−V3))[{δ(V1)−δ(V2)}{δ(V1)−δ(V3)}+var{2T1(Y1−A1)|V1}])=E(G12G13)+E[sgn((V1−V2)(V1−V3))var{2T1(Y1−A1)|V1}].

We now consider minimizing σ2(a) with respect to a. Because E{G12(s)G13(s)} does not depend on a, we can focus on

E[sgn((V1−V2)(V1−V3))var{2T1(Y1−A1)|V1}]=E(E[sgn((V1−V2)(V1−V3))var{2T1(Y1−A1)|V1}|V1])=E{κ(V1)τ2(V1)},

where

κ(V1)=E{sgn((V1−V2)(V1−V3))|V1},τ2(V1)=var{2T1(Y1−A1)|V1}.

Note that κ(V1) does not depend on a. Moreover, for any v,

κ(v)={P(V\gtv)−P(V\ltv)}2≥0.

Thus, a choice of a that minimizes τ2(v) for every v would minimize E{κ(V1)τ2(V1)} and hence σ2(a). Such a choice exists and is given by aopt(X)=E(Y|X). To see this, we write

τ2(v)=var{2T(Y−A)|V=v}=E[var{2T(Y−A)|X}|V=v]+var[E{2T(Y−A)|X}|V=v],

where

E{2T(Y−A)|X}=E{Y(1)−Y(−1)|X}=:d(X)

does not depend on A, and

var{2T(Y−A)|X}=E{4T2(Y−A)2|X}−E{2T(Y−A)|X}2=4E{(Y−A)2|X}−d2(X)

is minimized by A=E(Y|X).

Finally, let us consider γˆ(aˆopt)=γˆ(m(⋅;αˆ)), where αˆ converges in probability to α∞. For any fixed α, it follows from the theory of U-statistics that

n{γˆ(m(⋅;α))−γ}=1n∑i=1nh(Xi,Ti,Yi;α)+op(1),

where

h(x,t,y;α)=4E(sgn(V−v)[T{Y−m(X;α)}−t{y−m(x;α)}])−2γ.

Under regularity conditions, we have

n{γˆ(m(⋅;αˆ))−γ}=1n∑i=1nh(Xi,Ti,Yi;αˆ)+op(1)=1n∑i=1nh(Xi,Ti,Yi;α∞)+E{h˙(X,T,Y;α∞)}n(αˆ−α∞)+op(1),

where

h˙(x,t,y;α)=∂h(x,t,y;α)/∂α=−4E[sgn(V−v){Tm˙(X;α)−tm˙(x;α)}],

with m˙(x;α)=∂m(x;α)/∂α. It is easy to see that

E{h˙(X,T,Y;α∞)}=−4E[sgn(V1−V2){T1m˙(X1;α∞)−T2m˙(X2;α∞)}]=0,

because of the independence between randomized treatment and baseline covariates. Therefore, γˆ(aˆopt) is asymptotically equivalent to γˆ(m(⋅;α∞)). If the model m(X;α) is correct, then α∞ is the true value of α and γˆ(aˆopt) is asymptotically equivalent to γˆ(aopt).

Appendix C: Theoretical Support for Section 2.3

For any fixed a, the asymptotic variance of γˆb(a)−γˆc(a) is given by

σbc2(a)=4[cov{Gˆb12(a),Gˆb13(a)}+cov{Gˆc12(a),Gˆc13(a)}−2cov{Gˆb12(a),Gˆc13(a)}]=4[E{Gˆb12(a)Gˆb13(a)}+E{Gˆc12(a)Gˆc13(a)}−2E{Gˆb12(a)Gˆc13(a)}]−4(γb−γc)2.

Write V=(Vb,Vc). It can be shown as in Appendix B that

({8})E{Gˆb12(a)Gˆb13(a)}=E(Gb12Gb13)+E{κb(V1)τ2(V1)},

where

κb(V1)=E{sgn((Vb1−Vb2)(Vb1−Vb3))|V1},τ2(V1)=var{2T1(Y1−A1)|V1}.

Note that we are now conditioning on V1=(Vb1,Vc1) instead of Vb1. Equation ({8}) remains true with the subscript b replaced by c and with κc defined analogously. A similar argument shows that

E{Gˆb12(a)Gˆc13(a)}=E(Gb12Gc13)+E{κbc(V1)τ2(V1)},

where

κbc(V1)=E{sgn((Vb1−Vb2)(Vc1−Vc3))|V1}.

Note that E(Gb12Gb13), E(Gc12Gc13) and E(Gb12Gc13) do not depend on a. Therefore, for the purpose of minimizing σbc2(a) with respect to a, we can focus on minimizing

Eκb(V)+κc(V)−2κbc(V)τ2(V).

Here, the κ’s do not depend on a. Moreover, for any v=(vb,vc),

κb(v)+κc(v)−2κbc(v)={P(Vb\gtvb)−P(Vb\ltvb)+P(Vc\gtvc)−P(Vc\ltvc)}2≥0.

Therefore, a choice of a that minimizes τ2(v) for every v would minimize σbc2(a). Such a choice exists and is given by aopt(X)=E(Y|X), as we have shown in Appendix B.

Let α∞ denote the probability limit of αˆ. By applying the last paragraph of Appendix B to Vb and Vc separately, we obtain

n{γˆb(aˆopt)−γb}=n{γˆb(m(⋅;α∞))−γb}+op(1),n{γˆc(aˆopt)−γc}=n{γˆc(m(⋅;α∞))−γc}+op(1),

and hence

n{γˆb(aˆopt)−γˆc(aˆopt)−(γb−γc)}=n{γˆb(m(⋅;α∞))−γˆc(m(⋅;α∞))−(γb−γc)}+op(1).

This shows that γˆb(aˆopt)−γˆc(aˆopt) is asymptotically equivalent to γˆb(m(⋅;α∞))−γˆc(m(⋅;α∞)).

Appendix D: Theoretical Support for Section 2.4

First, we demonstrate the asymptotical equivalence of γ˜(β,a) to γˆ(β,a) by showing that

(9)n{γ˜(β,a)−γˆ(β,a)}=op(1).

To fix ideas, we work with a specific kernel function, K(u)=1/{1+exp(−u)}, although the argument extends easily to other kernel functions. We also assume for simplicity that |Y−A|≤L<∞ with probability 1, and write S=s(V;β). For any δn>0, we have

({10})|γ˜(β,a)−γˆ(β,a)|≤|1n(n−1)∑i/=jI(|Si−Sj|≥δn){λn(Si−Sj)−sgn(Si−Sj)}{Ti(Yi−Ai)−Tj(Yj−Aj)}|+|1n(n−1)∑i/=jI(|Si−Sj|<δn){λn(Si−Sj)−sgn(Si−Sj)}{Ti(Yi−Ai)−Tj(Yj−Aj)}|≤2Ln(n−1)∑i/=jI(|Si−Sj|≥δn)|λn(Si−Sj)−sgn(Si−Sj)|+4Ln(n−1)∑i/=jI(|Si−Sj|<δn)=:Dn1+Dn2.

For |u|≥δn, we have |λn(u)−sgn(u)|<2exp(−|u|/ϵn)<2exp(−δn/ϵn), so that

({11})nDn1≤4Lexp(2−1logn−δn/ϵn).

It follows from Nolan and Pollard [28] proof of Theorem 5 that

supδn|∑i/=j{I(|Si−Sj|<δ)−P(|S1−S2|<δ)}n(n−1)−2∑i=1n{F−(Si+δ)−F(Si−δ)−P(|S1−S2|<δ)}n|=op(1),

where F is the distribution function of S and F− is the left-hand limit of F. This implies that

(12)n{Dn2−4LP(|S1−S2|<δn)}=8Ln∑i=1n{F−(Si+δn)−F(Si−δn)−P(|S1−S2|<δn)}+op(1)=op(1),

where the second step follows from van der Vaart [23] Theorem 19.24. If S1−S2 is continuously distributed in a neighborhood of 0 with continuous density, then nP(|S1−S2|<δn)→0 if we set δn=nd2 for some d2<−1/2. For this choice of δn, eq. ({12}) implies that nDn2=op(1). With ϵn=nd1 for some d1\ltd2, eq. (11) implies that nDn1=op(1). Substituting these results into eq. (10) completes the proof of eq. ({9}).

Next, we give a sketch proof of eq. (5). It follows from Appendix B that, for any fixed β,

n{γˆ(β,aˆ)−γ(β)}=1n∑i=1nℏ(Xi,Ti,Yi;β,a∞)+op(1),

where

ℏ(x,t,y;β,a)=4E(sgn(s(V;β)−s(v;β))[T{Y−a(X)}−t{y−a(x)}])−2γ(β).

Under regularity conditions, we have

n{γˆ(βˆopt,aˆ)−γ(βˆopt)}=1n∑i=1nℏ(Xi,Ti,Yi;βˆopt,a∞)+op(1)=1n∑i=1nℏ(Xi,Ti,Yi;βopt,a∞)+op(1)=n{γˆ(βopt,aˆ)−γ(βopt)}+op(1),

where the second step can be established using Theorem 19.24 of van der Vaart [23], assuming that

{ℏ(X,T,Y;β,a∞):∥β−βopt∥<ϵ}

is a Donsker class [29] for some ϵ>0 and that

∥ℏ(X,T,Y;βˆopt,a∞)−ℏ(X,T,Y;βopt,a∞)∥2,P=op(1).

Incidentally, eq. (5) remains valid with γ(βˆopt) replaced by γ(βopt), because

n{γˆ(βˆopt,aˆ)−γ(βopt)}=n{γˆ(βˆopt,aˆ)−γ(βˆopt)}+n{γ(βˆopt)−γ(βopt)}=1n∑i=1nℏ(Xi,Ti,Yi;βopt,a∞)+n(βˆopt−βopt)′∂γ(βopt)/∂β+op(1)=1n∑i=1nℏ(Xi,Ti,Yi;βopt,a∞)+op(1),

where the last step follows from the fact that βopt maximizes γ(β), which implies that ∂γ(βopt)/∂β=0. A similar argument has been made by Zhang et al. [11] in the context of estimating optimal treatment regimes. This result implies that an asymptotic confidence interval for γ(βˆopt) based on eq. (5) is also an asymptotic confidence interval for γ(βopt).

Appendix E: Simulation Results for n=100

Here we report further simulation results obtained in the situation of Section 3.3 with n=100. Table 7 presents the results for marker combination in the same format as Table 5. As expected, the methods generally have inferior performance at n=100 as compared to n=500, especially when a regression model is fitted to estimate E(Y|V) or E(Y|X). Specifically, the point estimates are visibly biased, the standard errors tend to under-estimate the true standard deviations, and the confidence intervals have sub-nominal coverage probabilities in some cases. Table 8 shows the results for marker combination in a similar format to Table 6. In terms of γ(βˆopt), the proposed approach generally performs better with increasing information in A, and the linear regression approach performs similarly to the proposed approach with A=Eˆ(Y|X). As in Table 6, the re-substitution bias may or may not be a serious issue, and cross-validation could introduce a substantial downward bias in estimating γ(βˆopt). The smaller sample size provides an opportunity to evaluate the bootstrap standard errors and the associated confidence intervals for γ(βˆopt). The corresponding results in Table 8 show that the bootstrap standard errors (based on 200 bootstrap samples) perform well in most cases, although serious undercoverage (say, less than 92%) could result from a substantial bias in the point estimate of γ(βˆopt).

Table

Further simulation results for marker comparison at n=100: empirical bias, standard deviation (SD), mean standard error (SE) and empirical coverage probability (CP) (of 95% Wald confidence intervals) for estimating γ1, γ2 and γ2−γ1 with different choices of aˆ.

aˆ(X)}	Bias			SD			SE			CP
	γ1	γ2	γ2−γ1	γ1	γ2	γ2−γ1	γ1	γ2	γ2−γ1	γ1	γ2	γ2−γ1
ξ=identity, βV=(1,1)′, γ1=γ2≈2.247
0	0.002	–0.017	–0.018	0.483	0.472	0.639	0.463	0.466	0.620	0.935	0.946	0.941
Eˆ(Y)	–0.022	–0.039	–0.017	0.481	0.471	0.637	0.462	0.464	0.619	0.933	0.943	0.943
Eˆ(Y\|V)	–0.113	–0.103	0.010	0.439	0.437	0.533	0.409	0.410	0.501	0.911	0.922	0.931
Eˆ(Y\|X)	–0.133	–0.118	0.014	0.368	0.373	0.415	0.344	0.344	0.391	0.906	0.909	0.935
ξ=identity, βV=(1,2)′, γ1≈2.247, γ2≈4.494
0	–0.013	0.002	0.015	0.638	0.563	0.768	0.604	0.538	0.712	0.933	0.929	0.928
Eˆ(Y)	–0.038	–0.047	–0.009	0.635	0.562	0.767	0.601	0.535	0.712	0.935	0.920	0.926
Eˆ(Y\|V)	–0.121	–0.191	–0.069	0.603	0.540	0.670	0.556	0.490	0.615	0.915	0.883	0.914
Eˆ(Y\|X)	–0.134	–0.247	–0.113	0.544	0.493	0.575	0.509	0.436	0.530	0.914	0.844	0.907
ξ=expit, βV=(1,1)′, γ1=γ2≈0.417
0	0.004	0.008	0.004	0.395	0.383	0.611	0.377	0.378	0.573	0.946	0.952	0.931
Eˆ(Y)	0.000	0.005	0.005	0.394	0.382	0.610	0.376	0.376	0.571	0.945	0.955	0.931
Eˆ(Y\|V)	–0.006	–0.009	–0.003	0.329	0.324	0.472	0.316	0.317	0.444	0.931	0.955	0.942
Eˆ(Y\|X)	–0.021	–0.010	0.011	0.233	0.235	0.327	0.225	0.225	0.316	0.941	0.942	0.940
ξ=expit, βV=(1,2)′, γ1≈0.322, γ2≈0.669
0	–0.015	0.032	0.048	0.386	0.396	0.597	0.382	0.378	0.577	0.948	0.932	0.949
Eˆ(Y)	–0.021	0.024	0.045	0.385	0.394	0.595	0.380	0.376	0.575	0.945	0.934	0.951
Eˆ(Y\|V)	–0.025	–0.007	0.018	0.325	0.321	0.456	0.320	0.316	0.448	0.942	0.947	0.937
Eˆ(Y\|X)	–0.015	–0.016	–0.001	0.238	0.237	0.332	0.232	0.225	0.321	0.939	0.942	0.936
ξ=sgn, βV=(1,1)′, γ1=γ2≈1.333
0	0.000	0.021	0.021	0.419	0.410	0.641	0.406	0.405	0.628	0.937	0.946	0.949
Eˆ(Y)	–0.013	0.005	0.018	0.421	0.411	0.641	0.405	0.404	0.626	0.931	0.946	0.951
Eˆ(Y\|V)	–0.044	–0.035	0.010	0.359	0.360	0.525	0.349	0.347	0.508	0.938	0.933	0.939
Eˆ(Y\|X)	–0.054	–0.058	–0.004	0.280	0.290	0.415	0.268	0.267	0.400	0.929	0.926	0.934
ξ=sgn, βV=(1,2)′, γ1≈0.818, γ2≈1.745
0	0.015	–0.032	–0.046	0.434	0.389	0.622	0.424	0.386	0.622	0.940	0.941	0.950
Eˆ(Y)	0.008	–0.048	–0.056	0.431	0.388	0.618	0.422	0.385	0.620	0.944	0.939	0.949
Eˆ(Y\|V)	–0.029	–0.076	–0.047	0.383	0.332	0.503	0.368	0.326	0.502	0.934	0.936	0.944
Eˆ(Y\|X)	–0.044	–0.076	–0.033	0.303	0.248	0.394	0.293	0.239	0.393	0.939	0.922	0.942

Table

Further simulation results for marker combination at n=100: empirical means and standard deviations (SD)of βˆopt, γ(βˆopt), γˆ(βˆopt,aˆ) and γˆcv(βˆopt), obtained with different choices of aˆ in the proposed approach, as well as mean standard errors (SE; based on 200 bootstrap samples) for γˆ(βˆopt,aˆ) and γˆcv(βˆopt) and empirical coverage probabilities (CP) of 95% Wald confidence intervals for γ(βˆopt).

aˆ(X)	Mean					SD					SE		CP
	βˆopt		γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	βˆopt		γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)
ξ=identity, βV=(1,1)′, βopt=(0.707,0.707)′, γ(βopt)=3.178
linear regression
Eˆ(Y\|X)	0.708	0.703	3.169	3.046	3.010	0.053	0.053	0.013	0.344	0.451	0.342	0.448	0.919	0.912
proposed approach
0	0.704	0.692	3.138	3.225	2.980	0.111	0.112	0.053	0.423	0.461	0.415	0.462	0.948	0.940
Eˆ(Y)	0.701	0.695	3.138	3.189	2.980	0.111	0.112	0.052	0.422	0.464	0.415	0.463	0.948	0.943
Eˆ(Y\|V)	0.701	0.701	3.150	3.080	2.989	0.094	0.094	0.041	0.422	0.445	0.409	0.457	0.924	0.919
Eˆ(Y\|X)	0.705	0.702	3.162	3.052	2.996	0.071	0.071	0.023	0.343	0.453	0.338	0.446	0.919	0.917
ξ=identity, βV=(1,2)′, βopt=(0.447,0.894)′, γ(βopt)=5.024
linear regression
Eˆ(Y\|X)	0.448	0.893	5.019	4.785	4.773	0.039	0.020	0.007	0.460	0.551	0.476	0.551	0.922	0.908
proposed approach
0	0.449	0.888	5.001	5.082	4.755	0.085	0.045	0.033	0.512	0.552	0.511	0.558	0.957	0.922
Eˆ(Y)	0.450	0.888	5.000	5.028	4.740	0.087	0.046	0.034	0.513	0.552	0.510	0.555	0.953	0.918
Eˆ(Y\|V)	0.446	0.891	5.005	4.856	4.764	0.078	0.039	0.028	0.503	0.539	0.519	0.553	0.948	0.922
Eˆ(Y\|X)	0.447	0.892	5.012	4.790	4.771	0.063	0.032	0.018	0.459	0.557	0.474	0.551	0.932	0.907
ξ=expit, βV=(1,1)′, βopt=(0.707,0.707)′, γ(βopt)=0.603
linear regression
Eˆ(Y\|X)	0.636	0.659	0.550	0.633	0.513	0.290	0.277	0.089	0.209	0.429	0.216	0.444	0.930	0.953
proposed approach
0	0.527	0.537	0.450	0.797	0.425	0.479	0.453	0.221	0.314	0.524	0.323	0.512	0.840	0.947
Eˆ(Y)	0.538	0.533	0.453	0.795	0.427	0.474	0.450	0.218	0.310	0.536	0.319	0.514	0.838	0.947
Eˆ(Y\|V)	0.554	0.586	0.483	0.723	0.439	0.443	0.393	0.188	0.282	0.524	0.273	0.510	0.870	0.947
Eˆ(Y\|X)	0.625	0.661	0.546	0.651	0.503	0.302	0.286	0.096	0.203	0.434	0.205	0.447	0.917	0.951

Table

(Continued): Further simulation results for marker combination at n=100: empirical means and standard deviations (SD)of βˆopt, γ(βˆopt), γˆ(βˆopt,aˆ) and γˆcv(βˆopt), obtained with different choices of aˆ in the proposed approach, as well as mean standard errors (SE; based on 200 bootstrap samples) for γˆ(βˆopt,aˆ) and γˆcv(βˆopt) and empirical coverage probabilities (CP) of 95% Wald confidence intervals for γ(βˆopt).

aˆ(X)	Mean						SD					r	SE		r	CP
c	βˆopt		γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	c	βˆopt		γ(βˆopt)	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	c	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)	c	γˆ(βˆopt,aˆ)	γˆcv(βˆopt)
ξ=expit, βV=(1,2)′, βopt=(0.447,0.894)′, γ(βopt)=0.758
linear regression
Eˆ(Y\|X)	0.433	0.836	0.709	0.750	0.648		0.290	0.169	0.068	0.233	0.421		0.220	0.428		0.932	0.944
proposed approach
0	0.402	0.739	0.630	0.892	0.553		0.424	0.337	0.193	0.337	0.533		0.328	0.499		0.870	0.942
Eˆ(Y)	0.400	0.735	0.627	0.886	0.565		0.430	0.339	0.195	0.335	0.527		0.323	0.500		0.870	0.948
Eˆ(Y\|V)	0.398	0.774	0.653	0.824	0.575		0.395	0.295	0.170	0.312	0.498		0.290	0.478		0.885	0.944
Eˆ(Y\|X)	0.439	0.829	0.707	0.767	0.653		0.294	0.182	0.079	0.229	0.418		0.211	0.432		0.923	0.939
ξ=sgn, βV=(1,1)′, βopt=(0.707,0.707)′, γ(βopt)=2.000
linear regression
Eˆ(Y\|X)	0.699	0.700	1.972	1.908	1.856		0.106	0.106	0.040	0.235	0.401		0.245	0.404		0.953	0.937
proposed approach
0	0.690	0.688	1.937	2.054	1.820		0.157	0.161	0.094	0.359	0.413		0.363	0.429		0.941	0.948
Eˆ(Y)	0.691	0.687	1.937	2.035	1.823		0.158	0.162	0.095	0.360	0.415		0.361	0.431		0.945	0.941
Eˆ(Y\|V)	0.696	0.694	1.958	1.960	1.847		0.128	0.133	0.065	0.327	0.408		0.323	0.419		0.939	0.942
Eˆ(Y\|X)	0.697	0.703	1.976	1.921	1.869		0.099	0.098	0.037	0.234	0.395		0.241	0.403		0.950	0.933
ξ=sgn, βV=(1,2)′, βopt=(0.447,0.894)′, γ(βopt)=2.000
linear regression
Eˆ(Y\|X)	0.441	0.885	1.973	1.921	1.880		0.131	0.066	0.036	0.227	0.393		0.246	0.407		0.955	0.938
proposed approach
0	0.461	0.864	1.948	2.073	1.841		0.178	0.098	0.076	0.356	0.414		0.366	0.432		0.943	0.948
Eˆ(Y)	0.462	0.863	1.947	2.051	1.831		0.179	0.099	0.079	0.356	0.416		0.365	0.435		0.950	0.945
Eˆ(Y\|V)	0.453	0.874	1.961	1.981	1.862		0.155	0.083	0.064	0.307	0.410		0.324	0.420		0.965	0.945
Eˆ(Y\|X)	0.449	0.884	1.979	1.934	1.879		0.116	0.059	0.033	0.225	0.394		0.244	0.407		0.945	0.950

Published Online: 2017-3-25

Artikel in diesem Heft

https://doi.org/10.1515/ijb-2016-0064

Schlagwörter für diesen Artikel

cross-validation; personalized medicine; predictive biomarker; precision medicine; treatment effect heterogeneity; U-statistic