Abstract
Motivated by an HIV example, we consider how to compare and combine treatment selection markers, which are essential to the notion of precision medicine. The current literature on precision medicine is focused on evaluating and optimizing treatment regimes, which can be obtained by dichotomizing treatment selection markers. In practice, treatment decisions are based not only on efficacy but also on safety, cost and individual preference, making it difficult to choose a single cutoff value for all patients in all settings. It is therefore desirable to have a statistical framework for comparing and combining treatment selection markers without dichotomization. We provide such a framework based on a quantitative concordance measure, which quantifies the extent to which higher marker values are predictive of larger treatment effects. For a given marker, the proposed concordance measure can be estimated from clinical trial data using a U-statistic, which can incorporate auxiliary covariate information through an augmentation term. For combining multiple markers, we propose to maximize the estimated concordance measure among a specified family of combination markers. A cross-validation procedure can be used to remove any re-substitution bias in assessing the quality of an optimized combination marker. The proposed methodology is applied to the HIV example and evaluated in simulation studies.
1 Introduction
It is well recognized that treatment effects can be heterogeneous; that is, the same treatment can have different effects on different patients. The increasing awareness of treatment effect heterogeneity has motivated the development of predictive biomarkers for treatment selection [1, 2]. When two or more markers are available for the same treatment selection problem, methods are needed for comparing different markers and for combining multiple markers into a single marker. Our interest in these problems is motivated by two randomized clinical trials known as ECHO [3] and THRIVE [4], which compared the antiretroviral drugs rilpivirine and efavirenz for treatment-naive adult patients infected with human immunodeficiency virus (HIV). Both drugs are non-nucleoside reverse transcriptase inhibitors to be used in combination with other antiretroviral agents. The ECHO and THRIVE trials differed mainly in the use of background nucleoside/nucleotide reverse transcriptase inhibitors (N[t]RTIs). Except for that difference, the two trials followed nearly identical designs, produced very similar results [5], and are therefore combined in our analysis. The combined trial data show that rilpivirine and efavirenz are of comparable efficacy with respect to virologic response (viral load

Nonparametric regression analysis of the combined HIV trial data: smoothed estimates of treatment-specific virologic response rates as functions of BVL (left) and BCD4 (right).
The problems we consider are related to but different from evaluating and optimizing treatment regimes. A continuous or ordinal marker for treatment selection can be dichotomized into a treatment regime, which has a direct impact on the target population. The selection impact curve of Song and Pepe [8] can be used to visualize the impact of the resulting regime as a function of the cutoff value. The existing literature on precision medicine includes a variety of methods for estimating or approximating optimal treatment regimes that maximize a summary measure of efficacy e.g., [9, 10, 11, 12]. In practice, treatment decisions are based not only on efficacy but also on safety, cost and individual preference, which makes it difficult to apply a single cutoff value to all patients in all settings [13]. In our application, for example, rilpivirine is thought to be safer than efavirenz, and different patients may assign different weights to efficacy, safety and cost. Without a strong consensus on how to choose a cutoff, it is desirable to have a statistical framework for comparing and combining treatment selection markers without dichotomization.
There is an interesting analogy here between treatment selection and diagnostic medicine. A continuous or ordinal diagnostic marker can, and eventually will, be dichotomized into a binary diagnostic test. The choice of cutoff value involves balancing the two types of misclassifications, whose relative importance may depend on individual preference and circumstance. Without assuming a specific cutoff value, a continuous or ordinal diagnostic marker can be evaluated using the receiver operating characteristic (ROC) curve [14, 15]. Two diagnostic markers on different scales can be compared with respect to the area under the ROC curve (AUC), and multiple diagnostic markers can be combined by maximizing the AUC for the combination marker [16, 17, 18].
In this article, we propose a quantitative concordance measure for comparing and combining multiple treatment selection markers. The proposed concordance measure quantifies the extent to which higher marker values are predictive of larger treatment effects, and is closely related to the area under the selection impact curve defined by Song and Pepe [8]. For a given treatment selection marker, the proposed concordance measure can be estimated from clinical trial data using a U-statistic, which can incorporate auxiliary covariate information through an augmentation term e.g., [7, 19]. For combining multiple markers, we propose to maximize the estimated concordance measure among a specified family of combination markers. This is similar in spirit to the approach of Zhang et al. [11] for estimating optimal treatment regimes. When there are more than three markers to combine, it may be impractical to find the optimal combination through a grid search, but the objective function can be smoothed as in Ma and Huang [18] so that a gradient descent algorithm can be applied. In assessing the quality of the optimized combination marker, there can be a re-substitution bias if the same data are used to estimate and evaluate the optimal combination [1, 20, 21], and we propose to remove the re-substitution bias using a cross-validation procedure.
2 Methodology
2.1 A quantitative concordance measure
Consider a clinical trial in which each patient receives a randomized treatment
We start by considering how to assess the utility of one marker, say
For a useless marker, the conditional effect function is constant:
where
We call
It is easy to see that
To understand what
where the last step makes use of the independence of the two patients. If
If
Further insights into
which gives the mean outcome for a treatment regime that assigns treatment 1 to patients with
Thus, for a given pair of treatments,
2.2 Estimating γ for a given marker
We now consider estimation of
which suggests that
The foregoing discussion, together with the definition of
where
Noting that
A consistent estimator of
In Appendix B, we show that the asymptotic variance
To take advantage of this result, we can specify and estimate a working model
2.3 Comparing two markers
We now consider the problem of comparing two markers, say
where
where
Let
A consistent estimator of
In Appendix C, we show that
2.4 Combining multiple markers
Suppose
Thus,
The foregoing discussion suggests that we could set
where
In the rest of this subsection, we consider an alternative approach based on direct maximization of
where
We propose to estimate
where
It can be shown as in Ma and Huang [18] online supplement that, under regularity conditions,
Because of the asymptotic equivalence of
where
In finite samples, a re-substitution bias can arise from the fact that
using the exact same method for obtaining
Note that
Because the terms
3 Numerical results
3.1 Analysis of HIV trial data
The ECHO and THRIVE studies enrolled a combined total of 1,374 treatment-naive HIV-infected adults (694 ECHO; 680 THRIVE) in multiple countries. Main inclusion criteria included a screening viral load of at least 5,000 copies/ml and confirmed viral sensitivity to background N[t]RTIs. The study subjects were randomized
Here we use the proposed methodology to evaluate, compare and combine the two markers (BVL and BCD4) for choosing between rilpivirine (
Marker comparison based on HIV trial data: point estimates and analytical standard errors for the
{ | Point estimate | Standard error | ||||
BVL | BCD4 | Diff. | BVL | BCD4 | Diff. | |
0 | 0.143 | 0.021 | –0.122 | 0.056 | 0.057 | 0.070 |
0.045 | 0.084 | 0.039 | 0.026 | 0.026 | 0.028 | |
0.046 | 0.083 | 0.037 | 0.025 | 0.025 | 0.028 | |
0.043 | 0.082 | 0.039 | 0.025 | 0.025 | 0.028 |
Table 1 shows the point estimates and standard errors for the
Next, we consider how to combine the two markers linearly, so
where
Marker combination based on HIV trial data: point estimates and bootstrap standard errors for
{@{}lrrrrrrrr}
{ | Point estimate | Standard error | ||||||
\\[-2mm] | r | r | ||||||
BVL | BCD4 | BVL | BCD4 | |||||
Logistic regression | ||||||||
0.445 | 0.895 | 0.078 | 0.079 | 0.343 | 0.209 | 0.025 | 0.028 | |
Proposed approach | ||||||||
0 | 1.000 | 0.016 | 0.144 | 0.049 | 0.206 | 0.453 | 0.053 | 0.038 |
0.124 | 0.992 | 0.085 | 0.080 | 0.284 | 0.194 | 0.025 | 0.030 | |
0.186 | 0.983 | 0.085 | 0.083 | 0.280 | 0.196 | 0.024 | 0.029 | |
0.124 | 0.992 | 0.084 | 0.086 | 0.260 | 0.192 | 0.024 | 0.029 |
Table 2 shows the results for marker combination. For the proposed approach, the cross-validated
Example-based simulation
We now evaluate the performance of the proposed methodology in a simulation study mimicking the ECHO and THRIVE trials. We work with the same subset of 1,314 subjects analyzed in Section 3.1 with their original marker values, and generate
Marker comparison in example-based simulation: empirical bias, standard deviation (SD), mean standard error (SE) and empirical coverage probability (CP) (of 95% Wald confidence intervals) for estimating the
{@{}lrrrrrrrrrrrr}
{ | Bias | SD | SE | CP | ||||||||
BVL | BCD4 | Diff. | BVL | BCD4 | Diff. | BVL | BCD4 | Diff. | BVL | BCD4 | Diff. | |
0 | –0.002 | –0.001 | 0.001 | 0.059 | 0.057 | 0.070 | 0.057 | 0.057 | 0.069 | 0.937 | 0.948 | 0.945 |
–0.001 | 0.000 | 0.001 | 0.026 | 0.025 | 0.028 | 0.026 | 0.026 | 0.029 | 0.954 | 0.953 | 0.951 | |
–0.001 | –0.001 | 0.000 | 0.024 | 0.024 | 0.028 | 0.025 | 0.025 | 0.029 | 0.953 | 0.958 | 0.952 |
Table 3 shows the results for marker comparison: empirical bias, standard deviation, mean standard error and empirical coverage probability (of 95% Wald confidence intervals) for estimating the
Marker combination in example-based simulation: empirical means and standard deviations of
{@{}lrrrrrrrrrr}
| Mean | Standard deviation | ||||||||
c | c | |||||||||
BVL | BCD4 | BVL | BCD4 | |||||||
Logistic regression | ||||||||||
0.443 | 0.827 | 0.070 | 0.079 | 0.069 | 0.228 | 0.334 | 0.008 | 0.024 | 0.028 | |
Proposed approach | ||||||||||
0 | 0.258 | 0.702 | 0.063 | 0.112 | 0.056 | 0.537 | 0.519 | 0.033 | 0.046 | 0.042 |
0.186 | 0.921 | 0.077 | 0.087 | 0.073 | 0.250 | 0.327 | 0.011 | 0.023 | 0.029 | |
0.166 | 0.931 | 0.077 | 0.087 | 0.072 | 0.266 | 0.332 | 0.011 | 0.023 | 0.029 |
The results for marker combination are presented in Table 4, where the proposed approach with the aforementioned augmentation terms is compared with a logistic regression approach based on the following model:
where
3.3 Additional simulations
Here we report additional simulation experiments with a continuous outcome
where
Marker comparison in additional simulations: empirical bias, standard deviation (SD), mean standard error (SE) and empirical coverage probability (CP) (of 95% Wald confidence intervals) for estimating
{@{}lrrrrrrrrrrrr}
{ | Bias | SD | SE | CP | ||||||||
0 | 0.020 | 0.015 | –0.005 | 0.207 | 0.217 | 0.285 | 0.212 | 0.212 | 0.282 | 0.950 | 0.942 | 0.946 |
0.015 | 0.011 | –0.005 | 0.206 | 0.217 | 0.285 | 0.212 | 0.212 | 0.282 | 0.957 | 0.945 | 0.944 | |
0.003 | –0.006 | –0.008 | 0.188 | 0.197 | 0.238 | 0.191 | 0.191 | 0.230 | 0.955 | 0.945 | 0.941 | |
–0.008 | –0.014 | –0.006 | 0.160 | 0.165 | 0.182 | 0.161 | 0.161 | 0.179 | 0.954 | 0.947 | 0.948 | |
0 | 0.019 | 0.026 | 0.007 | 0.275 | 0.244 | 0.330 | 0.278 | 0.246 | 0.327 | 0.950 | 0.946 | 0.942 |
0.015 | 0.016 | 0.002 | 0.275 | 0.244 | 0.331 | 0.277 | 0.246 | 0.327 | 0.949 | 0.947 | 0.941 | |
0.000 | –0.021 | –0.021 | 0.258 | 0.228 | 0.283 | 0.261 | 0.228 | 0.283 | 0.953 | 0.948 | 0.941 | |
–0.004 | –0.027 | –0.023 | 0.236 | 0.209 | 0.248 | 0.239 | 0.204 | 0.244 | 0.955 | 0.950 | 0.939 | |
0 | –0.005 | –0.002 | 0.003 | 0.172 | 0.171 | 0.258 | 0.172 | 0.172 | 0.263 | 0.957 | 0.946 | 0.953 |
–0.006 | –0.003 | 0.003 | 0.172 | 0.171 | 0.258 | 0.172 | 0.172 | 0.263 | 0.956 | 0.947 | 0.954 | |
–0.006 | –0.009 | –0.003 | 0.145 | 0.148 | 0.202 | 0.146 | 0.146 | 0.206 | 0.956 | 0.941 | 0.949 | |
–0.004 | –0.007 | –0.003 | 0.103 | 0.106 | 0.144 | 0.104 | 0.104 | 0.146 | 0.945 | 0.940 | 0.940 | |
0 | –0.006 | –0.003 | 0.004 | 0.171 | 0.176 | 0.266 | 0.174 | 0.172 | 0.264 | 0.950 | 0.950 | 0.953 |
–0.007 | –0.004 | 0.003 | 0.171 | 0.177 | 0.265 | 0.174 | 0.172 | 0.264 | 0.949 | 0.947 | 0.953 | |
–0.007 | –0.012 | –0.005 | 0.142 | 0.153 | 0.206 | 0.148 | 0.146 | 0.207 | 0.960 | 0.939 | 0.944 | |
–0.003 | –0.008 | –0.005 | 0.107 | 0.103 | 0.143 | 0.107 | 0.104 | 0.148 | 0.946 | 0.954 | 0.953 | |
0 | –0.010 | 0.008 | 0.018 | 0.183 | 0.189 | 0.292 | 0.185 | 0.185 | 0.288 | 0.958 | 0.938 | 0.947 |
–0.012 | 0.006 | 0.018 | 0.183 | 0.189 | 0.292 | 0.185 | 0.185 | 0.288 | 0.960 | 0.941 | 0.946 | |
–0.018 | –0.006 | 0.012 | 0.160 | 0.160 | 0.234 | 0.161 | 0.161 | 0.236 | 0.949 | 0.941 | 0.949 | |
–0.016 | –0.009 | 0.008 | 0.121 | 0.126 | 0.185 | 0.124 | 0.124 | 0.187 | 0.955 | 0.951 | 0.947 | |
0 | 0.002 | 0.000 | –0.003 | 0.196 | 0.184 | 0.295 | 0.194 | 0.176 | 0.285 | 0.948 | 0.932 | 0.940 |
0.000 | –0.004 | –0.005 | 0.196 | 0.183 | 0.295 | 0.194 | 0.176 | 0.285 | 0.946 | 0.936 | 0.941 | |
–0.001 | –0.015 | –0.014 | 0.171 | 0.152 | 0.238 | 0.171 | 0.150 | 0.234 | 0.950 | 0.944 | 0.946 | |
–0.003 | –0.016 | –0.013 | 0.139 | 0.110 | 0.184 | 0.137 | 0.110 | 0.183 | 0.950 | 0.946 | 0.936 |
The results for marker comparison are presented in Table 5, where
Marker combination in additional simulations: empirical means and standard deviations of
{@{}lrrrrrrrrrr}
{ | Mean | Standard deviation | ||||||||
Linear regression | ||||||||||
0.706 | 0.708 | 3.176 | 3.160 | 3.018 | 0.022 | 0.022 | 0.002 | 0.146 | 0.151 | |
Proposed approach | ||||||||||
0 | 0.704 | 0.707 | 3.170 | 3.199 | 3.008 | 0.049 | 0.049 | 0.011 | 0.184 | 0.151 |
0.704 | 0.707 | 3.170 | 3.193 | 3.009 | 0.049 | 0.049 | 0.011 | 0.183 | 0.152 | |
0.705 | 0.707 | 3.173 | 3.169 | 3.013 | 0.039 | 0.039 | 0.007 | 0.175 | 0.154 | |
0.705 | 0.708 | 3.175 | 3.160 | 3.014 | 0.029 | 0.029 | 0.004 | 0.146 | 0.155 | |
Linear regression | ||||||||||
0.448 | 0.894 | 5.023 | 4.988 | 4.763 | 0.018 | 0.009 | 0.001 | 0.193 | 0.201 | |
Proposed approach | ||||||||||
0 | 0.447 | 0.893 | 5.019 | 5.049 | 4.758 | 0.041 | 0.020 | 0.007 | 0.234 | 0.206 |
0.448 | 0.893 | 5.019 | 5.039 | 4.754 | 0.040 | 0.020 | 0.007 | 0.235 | 0.206 | |
0.448 | 0.893 | 5.021 | 4.998 | 4.759 | 0.033 | 0.017 | 0.005 | 0.220 | 0.207 | |
0.448 | 0.893 | 5.022 | 4.988 | 4.765 | 0.026 | 0.014 | 0.003 | 0.193 | 0.205 | |
Linear regression | ||||||||||
0.689 | 0.703 | 0.593 | 0.599 | 0.554 | 0.125 | 0.121 | 0.014 | 0.105 | 0.110 | |
Proposed approach | ||||||||||
0 | 0.667 | 0.676 | 0.571 | 0.628 | 0.529 | 0.223 | 0.222 | 0.049 | 0.160 | 0.123 |
0.667 | 0.675 | 0.570 | 0.627 | 0.526 | 0.223 | 0.223 | 0.049 | 0.160 | 0.121 | |
0.674 | 0.693 | 0.581 | 0.609 | 0.540 | 0.184 | 0.181 | 0.034 | 0.142 | 0.116 | |
0.692 | 0.700 | 0.593 | 0.601 | 0.554 | 0.127 | 0.124 | 0.014 | 0.105 | 0.110 | |
Linear regression | ||||||||||
0.457 | 0.878 | 0.749 | 0.753 | 0.708 | 0.126 | 0.068 | 0.014 | 0.104 | 0.108 | |
Proposed approach | ||||||||||
0 | 0.452 | 0.858 | 0.732 | 0.781 | 0.686 | 0.211 | 0.124 | 0.040 | 0.159 | 0.116 |
0.451 | 0.859 | 0.733 | 0.779 | 0.687 | 0.210 | 0.121 | 0.038 | 0.159 | 0.116 | |
0.445 | 0.875 | 0.742 | 0.766 | 0.697 | 0.168 | 0.090 | 0.023 | 0.142 | 0.111 | |
0.455 | 0.881 | 0.750 | 0.755 | 0.708 | 0.118 | 0.062 | 0.012 | 0.104 | 0.107 | |
Linear regression | ||||||||||
0.706 | 0.705 | 1.995 | 1.983 | 1.902 | 0.047 | 0.047 | 0.007 | 0.107 | 0.110 | |
Proposed approach | ||||||||||
0 | 0.700 | 0.709 | 1.990 | 2.011 | 1.895 | 0.062 | 0.062 | 0.014 | 0.162 | 0.111 |
0.700 | 0.709 | 1.990 | 2.007 | 1.895 | 0.063 | 0.062 | 0.014 | 0.162 | 0.111 | |
0.703 | 0.707 | 1.994 | 1.988 | 1.899 | 0.050 | 0.050 | 0.009 | 0.145 | 0.112 | |
0.705 | 0.707 | 1.996 | 1.985 | 1.903 | 0.038 | 0.038 | 0.005 | 0.106 | 0.110 | |
Linear regression | ||||||||||
0.445 | 0.893 | 1.995 | 1.979 | 1.899 | 0.059 | 0.029 | 0.008 | 0.103 | 0.106 | |
Proposed approach | ||||||||||
0 | 0.447 | 0.890 | 1.989 | 2.007 | 1.893 | 0.082 | 0.041 | 0.017 | 0.164 | 0.106 |
0.447 | 0.890 | 1.989 | 2.003 | 1.891 | 0.083 | 0.041 | 0.017 | 0.164 | 0.108 | |
0.445 | 0.893 | 1.993 | 1.989 | 1.896 | 0.066 | 0.033 | 0.011 | 0.147 | 0.107 | |
0.446 | 0.893 | 1.996 | 1.981 | 1.900 | 0.049 | 0.024 | 0.006 | 0.102 | 0.106 |
The results for marker combination are shown in Table 6, where the proposed approach is compared with a linear regression approach based on the following working model:
where
In Appendix E, we examine the performance of the proposed methods at a smaller sample size (
4 Discussion
We have proposed a quantitative concordance measure,
For combining multiple treatment selection markers, our simulation results show that the proposed approach often performs similarly to a regression approach based on an appropriate model for
Acknowledgements
We thank two anonymous reviewers for constructive comments that have improved the manuscript greatly. The views expressed in this article do not represent the official position of the U.S. Food and Drug Administration.
1. Simon R. Development and validation of biomarker classifiers for treatment selection. J Stat Plan Inference 2008;138:308–20.10.1016/j.jspi.2007.06.010Suche in Google Scholar
2. Simon R. Clinical trials for predictive medicine: new challenges and paradigms. Clin Trials 2010;7:516–24.10.1177/1740774510366454Suche in Google Scholar
3. Molina JM, Cahn P, Grinsztejn B, Lazzarin A, Mills A, Saag M. Rilpivirine versus efavirenz with tenofovir and emtricitabine in treatment-naive adults infected with HIV-1 (ECHO): a phase 3 randomised double-blind active-controlled trial. Lancet 2011;378:238–46.10.1016/S0140-6736(11)60936-7Suche in Google Scholar
4. Cohen CJ, Andrade-Villanueva J, Clotet B, Fourie J, Johnson MA, Ruxrungtham K. Rilpivirine versus efavirenz with two background nucleoside or nucleotide reverse transcriptase inhibitors in treatment-naive adults infected with HIV-1 (THRIVE): a phase 3, randomised, non-inferiority trial. Lancet 2011;378:229–37.10.1016/S0140-6736(11)60983-5Suche in Google Scholar
5. Cohen CJ, Molina JM, Cahn P, Clotet B, Fourie J, Grinsztejn B. Efficacy and safety of rilpivirine (TMC278) versus efavirenz at 48 weeks in treatment-naive HIV-1-infected patients: pooled results from the phase 3 double-blind randomized ECHO and THRIVE trials. J AIDS 2012;60:33–42.10.1097/QAI.0b013e31824d006eSuche in Google Scholar PubMed
6. Zhang Z, Nie L, Soon G, Liu A. The use of covariates and random effects in evaluating predictive biomarkers under a potential outcome framework. Ann Appl Stat 2014;8:2336-55.10.1214/14-AOAS773Suche in Google Scholar PubMed PubMed Central
7. Zhang Z, Qu Y, Zhang B, Nie L, Soon G. Use of auxiliary covariates in estimating a biomarker-adjusted treatment effect model with clinical trial data. Stat Methods Med Res 2016;25:2103–19.10.1177/0962280213515572Suche in Google Scholar PubMed
8. Song X, Pepe MS. Evaluating markers for selecting a patients treatment. Biometrics 2004;60:874–83.10.1111/j.0006-341X.2004.00242.xSuche in Google Scholar PubMed
9. Qian M, Murphy SA. Performance guarantees for individualized treatment rules. Ann Stat 2011;39:1180–210.10.1214/10-AOS864Suche in Google Scholar PubMed PubMed Central
10. Zhao Y, Zeng D, Rush J, Kosorok MR. Estimating individualized treatment rules using outcome weighted learning. J Am Stat Assoc 2012;107:1106–18.10.1080/01621459.2012.695674Suche in Google Scholar PubMed PubMed Central
11. Zhang B, Tsiatis A, Laber E, Davidian M. A robust method for estimating optimal treatment regimes. Biometrics 2012;68:1010-8.10.1111/j.1541-0420.2012.01763.xSuche in Google Scholar PubMed PubMed Central
12. Kang C, Janes H, Huang Y. Combining biomarkers to optimize patient treatment recommendations. Biometrics 2014;70:695–707.10.1111/biom.12191Suche in Google Scholar PubMed PubMed Central
13. Janes H, Pepe MS, Bossuyt PM, Barlow WE. Measuring the performance of markers for guiding treatment decisions. Ann Intern Med 2011;154:253–9.10.7326/0003-4819-154-4-201102150-00006Suche in Google Scholar PubMed PubMed Central
14. Zhou XH, Obuchowski NA, McClish DK. Statistical methods in diagnostic medicine. New York: Wiley, 2002.10.1002/9780470317082Suche in Google Scholar
15. Pepe MS. The statistical evaluation of medical tests for classification and prediction. New York: Oxford University Press, 2003.Suche in Google Scholar
16. McIntosh MS, Pepe MS. Combining several screening tests: optimality of the risk score. Biometrics 2002;58:657–64.10.1111/j.0006-341X.2002.00657.xSuche in Google Scholar
17. Pepe MS, Cai T, Longton G. Combining predictors for classification using the area under the receiver operating characteristic curve. Biometrics 2006;62;221–9.10.1111/j.1541-0420.2005.00420.xSuche in Google Scholar PubMed
18. Ma S, Huang J. Combining multiple markers for classification using ROC. Biometrics 2007;63:751–7.10.1111/j.1541-0420.2006.00731.xSuche in Google Scholar PubMed
19. Tian L, Alizadeh AA, Gentles AJ, Tibshirani R. A simple method for estimating interactions between a treatment and a large number of covariates. J Am Stat Assoc 2014;109:1517–32.10.1080/01621459.2014.951443Suche in Google Scholar PubMed PubMed Central
20. Dawid AP. Selection paradoxes of Bayesian inference. Multivariate Anal Appl 1994;24:211–20.10.1214/lnms/1215463797Suche in Google Scholar
21. Senn S. A note concerning a selection “paradox” of Dawid’s. Am Stat 2008;62:206–10.10.1198/000313008X331530Suche in Google Scholar
22. Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Edu Psychol 1974;66:688–701.10.1037/h0037350Suche in Google Scholar
23. van der Vaart AW. Asymptotic statistics. Cambridge: Cambridge University Press, 1998.10.1017/CBO9780511802256Suche in Google Scholar
24. Cai T, Tian L, Wong PH, Wei LJ. Analysis of randomized comparative clinical trial data for personalized treatment selections. Biostatistics 2011;12:270–82.10.1093/biostatistics/kxq060Suche in Google Scholar
25. Han AK. Non-parametric analysis of a generalized regression model. J Econ 1987;35:303–16.10.1016/0304-4076(87)90030-3Suche in Google Scholar
26. Sherman RP. The limiting distribution of the maximum rank correlation estimator. Econometrica 1993;61:123–37.10.2307/2951780Suche in Google Scholar
27. Li KC, Duan N. Regression analysis under link violation. Ann Stat 1989;17:1009–52.10.1214/aos/1176347254Suche in Google Scholar
28. Nolan D, Pollard D. Functional limit theorems for $$U$$-processes. Ann Probab 1988;16:1291–8.10.1214/aop/1176991691Suche in Google Scholar
29. van der Vaart AW, Wellner JA. Weak convergence and empirical processes with applications to statistics. New York: Springer-Verlag, 1996.10.1007/978-1-4757-2545-2Suche in Google Scholar
Appendix A: Proof of (1)
We start by noting that
which follows from the symmetry of
which implies that
where
Appendix B: Theoretical Support for Section 2.2
First, we note that, for two independent subjects
where we use the independence between subjects and the fact that
For a fixed
A conditioning argument shows that
We now consider minimizing
where
Note that
Thus, a choice of
where
does not depend on
is minimized by
Finally, let us consider
where
Under regularity conditions, we have
where
with
because of the independence between randomized treatment and baseline covariates. Therefore,
Appendix C: Theoretical Support for Section 2.3
For any fixed
Write
where
Note that we are now conditioning on
where
Note that
Here, the
Therefore, a choice of
Let
and hence
This shows that
Appendix D: Theoretical Support for Section 2.4
First, we demonstrate the asymptotical equivalence of
To fix ideas, we work with a specific kernel function,
For
It follows from Nolan and Pollard [28] proof of Theorem 5 that
where
where the second step follows from van der Vaart [23] Theorem 19.24. If
Next, we give a sketch proof of eq. (5). It follows from Appendix B that, for any fixed
where
Under regularity conditions, we have
where the second step can be established using Theorem 19.24 of van der Vaart [23], assuming that
is a Donsker class [29] for some
Incidentally, eq. (5) remains valid with
where the last step follows from the fact that
Appendix E: Simulation Results for n = 100
Here we report further simulation results obtained in the situation of Section 3.3 with
Further simulation results for marker comparison at
Bias | SD | SE | CP | |||||||||
0 | 0.002 | –0.017 | –0.018 | 0.483 | 0.472 | 0.639 | 0.463 | 0.466 | 0.620 | 0.935 | 0.946 | 0.941 |
–0.022 | –0.039 | –0.017 | 0.481 | 0.471 | 0.637 | 0.462 | 0.464 | 0.619 | 0.933 | 0.943 | 0.943 | |
–0.113 | –0.103 | 0.010 | 0.439 | 0.437 | 0.533 | 0.409 | 0.410 | 0.501 | 0.911 | 0.922 | 0.931 | |
–0.133 | –0.118 | 0.014 | 0.368 | 0.373 | 0.415 | 0.344 | 0.344 | 0.391 | 0.906 | 0.909 | 0.935 | |
0 | –0.013 | 0.002 | 0.015 | 0.638 | 0.563 | 0.768 | 0.604 | 0.538 | 0.712 | 0.933 | 0.929 | 0.928 |
–0.038 | –0.047 | –0.009 | 0.635 | 0.562 | 0.767 | 0.601 | 0.535 | 0.712 | 0.935 | 0.920 | 0.926 | |
–0.121 | –0.191 | –0.069 | 0.603 | 0.540 | 0.670 | 0.556 | 0.490 | 0.615 | 0.915 | 0.883 | 0.914 | |
–0.134 | –0.247 | –0.113 | 0.544 | 0.493 | 0.575 | 0.509 | 0.436 | 0.530 | 0.914 | 0.844 | 0.907 | |
0 | 0.004 | 0.008 | 0.004 | 0.395 | 0.383 | 0.611 | 0.377 | 0.378 | 0.573 | 0.946 | 0.952 | 0.931 |
0.000 | 0.005 | 0.005 | 0.394 | 0.382 | 0.610 | 0.376 | 0.376 | 0.571 | 0.945 | 0.955 | 0.931 | |
–0.006 | –0.009 | –0.003 | 0.329 | 0.324 | 0.472 | 0.316 | 0.317 | 0.444 | 0.931 | 0.955 | 0.942 | |
–0.021 | –0.010 | 0.011 | 0.233 | 0.235 | 0.327 | 0.225 | 0.225 | 0.316 | 0.941 | 0.942 | 0.940 | |
0 | –0.015 | 0.032 | 0.048 | 0.386 | 0.396 | 0.597 | 0.382 | 0.378 | 0.577 | 0.948 | 0.932 | 0.949 |
–0.021 | 0.024 | 0.045 | 0.385 | 0.394 | 0.595 | 0.380 | 0.376 | 0.575 | 0.945 | 0.934 | 0.951 | |
–0.025 | –0.007 | 0.018 | 0.325 | 0.321 | 0.456 | 0.320 | 0.316 | 0.448 | 0.942 | 0.947 | 0.937 | |
–0.015 | –0.016 | –0.001 | 0.238 | 0.237 | 0.332 | 0.232 | 0.225 | 0.321 | 0.939 | 0.942 | 0.936 | |
0 | 0.000 | 0.021 | 0.021 | 0.419 | 0.410 | 0.641 | 0.406 | 0.405 | 0.628 | 0.937 | 0.946 | 0.949 |
–0.013 | 0.005 | 0.018 | 0.421 | 0.411 | 0.641 | 0.405 | 0.404 | 0.626 | 0.931 | 0.946 | 0.951 | |
–0.044 | –0.035 | 0.010 | 0.359 | 0.360 | 0.525 | 0.349 | 0.347 | 0.508 | 0.938 | 0.933 | 0.939 | |
–0.054 | –0.058 | –0.004 | 0.280 | 0.290 | 0.415 | 0.268 | 0.267 | 0.400 | 0.929 | 0.926 | 0.934 | |
0 | 0.015 | –0.032 | –0.046 | 0.434 | 0.389 | 0.622 | 0.424 | 0.386 | 0.622 | 0.940 | 0.941 | 0.950 |
0.008 | –0.048 | –0.056 | 0.431 | 0.388 | 0.618 | 0.422 | 0.385 | 0.620 | 0.944 | 0.939 | 0.949 | |
–0.029 | –0.076 | –0.047 | 0.383 | 0.332 | 0.503 | 0.368 | 0.326 | 0.502 | 0.934 | 0.936 | 0.944 | |
–0.044 | –0.076 | –0.033 | 0.303 | 0.248 | 0.394 | 0.293 | 0.239 | 0.393 | 0.939 | 0.922 | 0.942 |
Further simulation results for marker combination at
Mean | SD | SE | CP | |||||||||||
linear regression | ||||||||||||||
0.708 | 0.703 | 3.169 | 3.046 | 3.010 | 0.053 | 0.053 | 0.013 | 0.344 | 0.451 | 0.342 | 0.448 | 0.919 | 0.912 | |
proposed approach | ||||||||||||||
0 | 0.704 | 0.692 | 3.138 | 3.225 | 2.980 | 0.111 | 0.112 | 0.053 | 0.423 | 0.461 | 0.415 | 0.462 | 0.948 | 0.940 |
0.701 | 0.695 | 3.138 | 3.189 | 2.980 | 0.111 | 0.112 | 0.052 | 0.422 | 0.464 | 0.415 | 0.463 | 0.948 | 0.943 | |
0.701 | 0.701 | 3.150 | 3.080 | 2.989 | 0.094 | 0.094 | 0.041 | 0.422 | 0.445 | 0.409 | 0.457 | 0.924 | 0.919 | |
0.705 | 0.702 | 3.162 | 3.052 | 2.996 | 0.071 | 0.071 | 0.023 | 0.343 | 0.453 | 0.338 | 0.446 | 0.919 | 0.917 | |
linear regression | ||||||||||||||
0.448 | 0.893 | 5.019 | 4.785 | 4.773 | 0.039 | 0.020 | 0.007 | 0.460 | 0.551 | 0.476 | 0.551 | 0.922 | 0.908 | |
proposed approach | ||||||||||||||
0 | 0.449 | 0.888 | 5.001 | 5.082 | 4.755 | 0.085 | 0.045 | 0.033 | 0.512 | 0.552 | 0.511 | 0.558 | 0.957 | 0.922 |
0.450 | 0.888 | 5.000 | 5.028 | 4.740 | 0.087 | 0.046 | 0.034 | 0.513 | 0.552 | 0.510 | 0.555 | 0.953 | 0.918 | |
0.446 | 0.891 | 5.005 | 4.856 | 4.764 | 0.078 | 0.039 | 0.028 | 0.503 | 0.539 | 0.519 | 0.553 | 0.948 | 0.922 | |
0.447 | 0.892 | 5.012 | 4.790 | 4.771 | 0.063 | 0.032 | 0.018 | 0.459 | 0.557 | 0.474 | 0.551 | 0.932 | 0.907 | |
linear regression | ||||||||||||||
0.636 | 0.659 | 0.550 | 0.633 | 0.513 | 0.290 | 0.277 | 0.089 | 0.209 | 0.429 | 0.216 | 0.444 | 0.930 | 0.953 | |
proposed approach | ||||||||||||||
0 | 0.527 | 0.537 | 0.450 | 0.797 | 0.425 | 0.479 | 0.453 | 0.221 | 0.314 | 0.524 | 0.323 | 0.512 | 0.840 | 0.947 |
0.538 | 0.533 | 0.453 | 0.795 | 0.427 | 0.474 | 0.450 | 0.218 | 0.310 | 0.536 | 0.319 | 0.514 | 0.838 | 0.947 | |
0.554 | 0.586 | 0.483 | 0.723 | 0.439 | 0.443 | 0.393 | 0.188 | 0.282 | 0.524 | 0.273 | 0.510 | 0.870 | 0.947 | |
0.625 | 0.661 | 0.546 | 0.651 | 0.503 | 0.302 | 0.286 | 0.096 | 0.203 | 0.434 | 0.205 | 0.447 | 0.917 | 0.951 |
(Continued): Further simulation results for marker combination at
Mean | SD | r | SE | r | CP | ||||||||||||
c | c | c | c | ||||||||||||||
linear regression | |||||||||||||||||
0.433 | 0.836 | 0.709 | 0.750 | 0.648 | 0.290 | 0.169 | 0.068 | 0.233 | 0.421 | 0.220 | 0.428 | 0.932 | 0.944 | ||||
proposed approach | |||||||||||||||||
0 | 0.402 | 0.739 | 0.630 | 0.892 | 0.553 | 0.424 | 0.337 | 0.193 | 0.337 | 0.533 | 0.328 | 0.499 | 0.870 | 0.942 | |||
0.400 | 0.735 | 0.627 | 0.886 | 0.565 | 0.430 | 0.339 | 0.195 | 0.335 | 0.527 | 0.323 | 0.500 | 0.870 | 0.948 | ||||
0.398 | 0.774 | 0.653 | 0.824 | 0.575 | 0.395 | 0.295 | 0.170 | 0.312 | 0.498 | 0.290 | 0.478 | 0.885 | 0.944 | ||||
0.439 | 0.829 | 0.707 | 0.767 | 0.653 | 0.294 | 0.182 | 0.079 | 0.229 | 0.418 | 0.211 | 0.432 | 0.923 | 0.939 | ||||
linear regression | |||||||||||||||||
0.699 | 0.700 | 1.972 | 1.908 | 1.856 | 0.106 | 0.106 | 0.040 | 0.235 | 0.401 | 0.245 | 0.404 | 0.953 | 0.937 | ||||
proposed approach | |||||||||||||||||
0 | 0.690 | 0.688 | 1.937 | 2.054 | 1.820 | 0.157 | 0.161 | 0.094 | 0.359 | 0.413 | 0.363 | 0.429 | 0.941 | 0.948 | |||
0.691 | 0.687 | 1.937 | 2.035 | 1.823 | 0.158 | 0.162 | 0.095 | 0.360 | 0.415 | 0.361 | 0.431 | 0.945 | 0.941 | ||||
0.696 | 0.694 | 1.958 | 1.960 | 1.847 | 0.128 | 0.133 | 0.065 | 0.327 | 0.408 | 0.323 | 0.419 | 0.939 | 0.942 | ||||
0.697 | 0.703 | 1.976 | 1.921 | 1.869 | 0.099 | 0.098 | 0.037 | 0.234 | 0.395 | 0.241 | 0.403 | 0.950 | 0.933 | ||||
linear regression | |||||||||||||||||
0.441 | 0.885 | 1.973 | 1.921 | 1.880 | 0.131 | 0.066 | 0.036 | 0.227 | 0.393 | 0.246 | 0.407 | 0.955 | 0.938 | ||||
proposed approach | |||||||||||||||||
0 | 0.461 | 0.864 | 1.948 | 2.073 | 1.841 | 0.178 | 0.098 | 0.076 | 0.356 | 0.414 | 0.366 | 0.432 | 0.943 | 0.948 | |||
0.462 | 0.863 | 1.947 | 2.051 | 1.831 | 0.179 | 0.099 | 0.079 | 0.356 | 0.416 | 0.365 | 0.435 | 0.950 | 0.945 | ||||
0.453 | 0.874 | 1.961 | 1.981 | 1.862 | 0.155 | 0.083 | 0.064 | 0.307 | 0.410 | 0.324 | 0.420 | 0.965 | 0.945 | ||||
0.449 | 0.884 | 1.979 | 1.934 | 1.879 | 0.116 | 0.059 | 0.033 | 0.225 | 0.394 | 0.244 | 0.407 | 0.945 | 0.950 |
© 2017 Walter de Gruyter GmbH, Berlin/Boston
Artikel in diesem Heft
- Commentary
- Big Data, Small Sample
- Research Articles
- Parameter Estimation of a Two-Colored Urn Model Class
- Combinatorial Mixtures of Multiparameter Distributions: An Application to Bivariate Data
- On the Conditional Power in Survival Time Analysis Considering Cure Fractions
- Comparing Four Methods for Estimating Tree-Based Treatment Regimes
- On Stratified Adjusted Tests by Binomial Trials
- Improvement Screening for Ultra-High Dimensional Data with Censored Survival Outcomes and Varying Coefficients
- Bayesian Variable Selection Methods for Matched Case-Control Studies
- Testing Equality of Treatments under an Incomplete Block Crossover Design with Ordinal Responses
- Empirical Likelihood in Nonignorable Covariate-Missing Data Problems
- A Quantitative Concordance Measure for Comparing and Combining Treatment Selection Markers
- Median Analysis of Repeated Measures Associated with Recurrent Events in Presence of Terminal Event
- A Theorem at the Core of Colliding Bias
- Group Tests for High-dimensional Failure Time Data with the Additive Hazards Models
- Characterizing Highly Benefited Patients in Randomized Clinical Trials
Artikel in diesem Heft
- Commentary
- Big Data, Small Sample
- Research Articles
- Parameter Estimation of a Two-Colored Urn Model Class
- Combinatorial Mixtures of Multiparameter Distributions: An Application to Bivariate Data
- On the Conditional Power in Survival Time Analysis Considering Cure Fractions
- Comparing Four Methods for Estimating Tree-Based Treatment Regimes
- On Stratified Adjusted Tests by Binomial Trials
- Improvement Screening for Ultra-High Dimensional Data with Censored Survival Outcomes and Varying Coefficients
- Bayesian Variable Selection Methods for Matched Case-Control Studies
- Testing Equality of Treatments under an Incomplete Block Crossover Design with Ordinal Responses
- Empirical Likelihood in Nonignorable Covariate-Missing Data Problems
- A Quantitative Concordance Measure for Comparing and Combining Treatment Selection Markers
- Median Analysis of Repeated Measures Associated with Recurrent Events in Presence of Terminal Event
- A Theorem at the Core of Colliding Bias
- Group Tests for High-dimensional Failure Time Data with the Additive Hazards Models
- Characterizing Highly Benefited Patients in Randomized Clinical Trials