Abstract
The generalized odds ratio (GOR) for paired sample is considered to measure the relative treatment effect on patient responses in ordinal data. Under a three-treatment two-period incomplete block crossover design, both asymptotic and exact procedures are developed for testing equality between treatments with ordinal responses. Monte Carlo simulation is employed to evaluate and compare the finite-sample performance of these test procedures. A discussion on advantages and disadvantages of the proposed test procedures based on the GOR versus those based on Wald’s tests under the normal random effects proportional odds model is provided. The data taken as a part of a crossover trial studying the effects of low and high doses of an analgesic versus a placebo for the relief of pain in primary dysmenorrhea over the first two periods are applied to illustrate the use of these test procedures.
1 Introduction
When studying non-curable chronic diseases, including angina pectoris, epilepsy, hypertension, asthma, etc., we may often consider using a crossover design to reduce the number of patients needed for a parallel group design [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14]. When there are more than two treatments under comparison, the trial duration for a crossover design can be much longer than that for a parallel group design if each patient is to receive every treatment in use of the former. The longer the duration of a trial, the higher is the patient risk of being lost to follow-up. Furthermore, a lengthy trial duration can cause the difficulty in recruiting patients into a trial and ensuring patients to closely follow a study protocol. To alleviate these concerns, we may consider using an incomplete block crossover design, in which each patient is to receive only a subset of treatments [1]. For example, consider the double-blind placebo controlled crossover trial comparing 12
Because ordinal responses are not on an interval scale, it is generally not appropriate to apply arithmetic operation to ordinal data [19]. In practice, we may commonly assign arbitrary scores to ordinal responses and do hypothesis testing with use of the t-test. Since the relative distances between consecutive categories in ordinal data are not truly comparable, converting ordinal responses into a universally agreeable score scale is difficult. Also, how to provide a meaningful and easily-understood summary measure based on these arbitrarily assigned scores to quantify the relative treatment effect can be challenging. On the other hand, if we dichotomize the ordinal responses into binary outcomes, the test procedures for binary data will probably lose efficiency.
In this paper, we propose use of the generalized odds ratio (GOR) for paired samples [20, 21, 22] to measure the relative treatment effect on patient responses in ordinal data. We focus our discussion on an incomplete block two-period crossover trial comparing three treatments with ordinal responses. We derive asymptotic test procedures based on the weighted-least-squares (WLS) and Mantel-Haenszel (MH) estimators [23] for testing equality between treatments. We further derive the exact test procedures for testing equality of treatments for small-sample cases. We employ Monte Carlo simulation to evaluate the finite-sample performance of these test procedures in a variety of situations. We use the data taken as a part of the crossover trial [24] comparing the low and high doses of an analgesic with a placebo for the relief of pain in primary dysmenorrhea over the first two periods to illustrate the use of these test procedures.
2 Notation, assumption and methods
Consider comparing two experimental treatments A and B with a placebo P under an incomplete block crossover design with two periods. We let X-Y denote the treatment-receipt sequence of receiving treatment X at period 1 and then crossover to receive treatment Y at period 2. Suppose that we randomly assign
where
When
On the basis of model (1), for a randomly selected patient
Similarly, for a randomly selected patient
Note that because we do not assume any parametric p.d.f. for
For simplicity in notation, we define
We denote for a randomly selected patient
These represent the probability that a randomly selected patient
Let
For convenience, we define three 2 × 2 tables consisting of cell frequencies
and
When testing
where
for k = 2, 3,
When the observed marginal frequencies
When both the numbers of patients
where
Thus, the joint conditional probability distribution of
where
where C =
As shown in Appendix I, we can easily modify asymptotic test procedures (10) and (11) and the exact test procedure (14) to account for testing
3 Monte Carlo simulation
To evaluate and compare the performance of the WLS, MH and exact procedures for testing equality between treatments, we employ Monte Carlo simulation. By use of the conditional arguments, we do not need to estimate the nuisance period effect
4 Results
We summarize in Table 1 the estimated Type I error (in boldface) and power of using the WLS, MH and exact procedures for testing
The estimated Type I error (in boldface) and power of using the WLS, MH and Exact tests for testing
Testing | Testing | ||||||||
---|---|---|---|---|---|---|---|---|---|
n | WLS | MH | Exact | WLS | MH | Exact | |||
0.5 | 0.00 | 0.00 | 10 | 0.020 | 0.050 | 0.046 | 0.022 | 0.051 | 0.048 |
15 | 0.028 | 0.048 | 0.049 | 0.031 | 0.051 | 0.049 | |||
25 | 0.037 | 0.048 | 0.049 | 0.034 | 0.046 | 0.050 | |||
1.00 | 10 | 0.016 | 0.047 | 0.040 | 0.259 | 0.369 | 0.253 | ||
15 | 0.029 | 0.051 | 0.051 | 0.458 | 0.520 | 0.401 | |||
25 | 0.034 | 0.048 | 0.047 | 0.750 | 0.749 | 0.636 | |||
0.50 | 0.50 | 10 | 0.073 | 0.129 | 0.089 | 0.073 | 0.131 | 0.090 | |
15 | 0.131 | 0.174 | 0.122 | 0.134 | 0.178 | 0.125 | |||
25 | 0.251 | 0.269 | 0.195 | 0.255 | 0.271 | 0.199 | |||
1.50 | 10 | 0.058 | 0.125 | 0.093 | 0.528 | 0.649 | 0.501 | ||
15 | 0.124 | 0.175 | 0.132 | 0.781 | 0.825 | 0.731 | |||
25 | 0.226 | 0.250 | 0.195 | 0.970 | 0.973 | 0.938 | |||
1.0 | 0.00 | 0.00 | 10 | 0.018 | 0.052 | 0.043 | 0.017 | 0.046 | 0.044 |
15 | 0.025 | 0.048 | 0.030 | 0.053 | 0.053 | ||||
25 | 0.036 | 0.051 | 0.051 | 0.033 | 0.047 | 0.049 | |||
1.00 | 10 | 0.013 | 0.050 | 0.037 | 0.208 | 0.330 | 0.222 | ||
15 | 0.021 | 0.049 | 0.047 | 0.401 | 0.477 | 0.361 | |||
25 | 0.035 | 0.052 | 0.047 | 0.681 | 0.697 | 0.583 | |||
0.50 | 0.50 | 10 | 0.054 | 0.117 | 0.079 | 0.059 | 0.121 | 0.083 | |
15 | 0.114 | 0.159 | 0.112 | 0.115 | 0.160 | 0.109 | |||
25 | 0.219 | 0.247 | 0.169 | 0.213 | 0.240 | 0.169 | |||
1.50 | 10 | 0.043 | 0.112 | 0.074 | 0.457 | 0.597 | 0.447 | ||
15 | 0.093 | 0.150 | 0.114 | 0.732 | 0.789 | 0.672 | |||
25 | 0.198 | 0.230 | 0.174 | 0.947 | 0.951 | 0.904 |
5 An example
Consider the data (Table 2) taken as a part of a crossover trial comparing an analgesic at low (L) and high (H) doses with a placebo (P) for the relief of pain in primary dysmenorrhea patients over the first two-periods [24]. Here, we refer the low and high doses as treatments A and B. There were 86 patients randomly assigned to the six groups: P-L (g = 1); L-P (g = 2); P-H (g = 3); H-P (g = 4); L-H (g = 5); and H-L (g = 6). At the end of each treatment period, each patient was assessed the extent of relief on the ordinal scale: none (coded as 1), moderate (coded as 2) and complete (coded as 3). When applying the WLS and MH procedures, as well as the exact procedure to test
The frequency of patients for the relief of pain (1: none or minimal; 2: moderate; 3: complete) in primary dysmenorrhea at the first two periods versus the groups determined by the treatment-receipt sequence (P: placebo; L: low dose; H: high dose).
Group of Treatment-Receipt Sequence | ||||||
---|---|---|---|---|---|---|
g = | 1 | 2 | 3 | 4 | 5 | 6 |
Responses | P-L | L-P | P-H | H-P | L-H | H-L |
(1,1) | 2 | 1 | 2 | 3 | 1 | 1 |
(1,2) | 9 | 2 | 3 | 2 | 0 | 0 |
(1,3) | 2 | 2 | 6 | 0 | 1 | 1 |
(2,1) | 0 | 4 | 1 | 1 | 0 | 4 |
(2,2) | 2 | 1 | 2 | 1 | 6 | 1 |
(2,3) | 0 | 0 | 2 | 1 | 1 | 2 |
(3,1) | 0 | 5 | 0 | 6 | 1 | 4 |
(3,2) | 0 | 0 | 0 | 0 | 2 | 0 |
(3,3) | 0 | 0 | 0 | 0 | 0 | 1 |
15 | 15 | 16 | 14 | 12 | 14 |
Note that if we assume the proportional odds model using cumulative logit [28, 31] with normal random effects [19] due to patients, we can apply Proc GLIMMIX [31] to analyze the data in Table 2. When employing this SAS procedure (http://edoras.sdsu.edu/~kjl/exaordex1.htm), we obtain the parameter estimates (and their estimated standard error (SD)) of the relative effect for the low dose versus the placebo, the high dose versus the placebo, and the high dose versus the low dose are: –1.7656 (SD = 0.3898), –2.3404 (SD = 0.4011), and –0.5748 (SD = 0.3537). On the basis of these estimates (of which the signs are all < 0), we may conclude that both the low dose and high dose can significantly improve, as compared with the placebo, the relief of pain at the 5 % level; both p-values are < 0.0001. Furthermore, though the high dose can improve the outcome of patients as compared with the low dose, this improvement is not significant at the 5 %-level. All the above results are essentially similar to those reported previously for the complete block crossover design over a three-period trial [32].
6 Discussion
We do not recommend, as noted previously [1, 4, 11, 12, 25, 26, 27], use of the crossover design if we cannot ensure ourselves to nullify the carry-over effects with an adequate wash-out period. On the other hand, if there are carry-over effects due to earlier treatments, we note that the test procedures proposed here can still be valid for use under the simple carry-over model (Appendix II). Also, we note that although one may apply the estimator as given in Appendix II for the difference in carry-over effects to test whether there are differential carry-over effects, we do not recommend using this test to determine whether the assumption of no carry-over effects holds. This is because the concerns raised by Freeman [33] and Senn [34] for using the two-stage test procedure suggested by Grizzle [2].
When employing the test procedures developed here, we do not need to assume any parametric distribution for the random effects due to patients. Thus, our procedures is, as noted before, semi-parametric. Also, the number of patients for a crossover trial is often small. The exact test procedure (14) can be of use in practice. By contrast, one needs to assume the random effects due to patients independently follow the normal distribution in use of Proc GLIMMIX [35]. Also, note that the proportional odds model can be badly violated by many bivariate distributions [20, 36, 37]. Furthermore, note that Wald’s test can be invalid for use if the number of patients in a trial is small and the data are sparse.
We note that when the number of subjects per group n is small (say, 10), use of the WLS procedure can be conservative, while the MH and exact test procedure can perform well (Table 1). We further note that the MH test procedure is generally of more power than the other two procedures in the situations considered here. Because use of the MH procedure does not involve any sophisticated numerical procedure and can be calculated by a hand calculator, we may recommend the MH test procedure for general use when n is not large. We may use of the exact test procedure if one has the concern of normal approximation for a small n. When the number of subjects n per group is large (say, 40), however, we want to note that the WLS procedure can be of more power than the MH and exact procedures on the basis of Monte Carlo simulation (not shown here).
If we wish to study the relative period effect, we may apply similar ideas as above to derive the corresponding procedures for testing
Following similar arguments as for comparing the treatment effect, we can derive on the basis of (15) the WLS, MH and exact procedures for testing
In summary, we have derived the WLS, MH and exact procedures for testing equality between treatments under an incomplete block crossover design with ordinal responses. We have evaluated and compare their performance in a variety of situations based on Monte Carlo simulation. We have noted that the proposed test procedures are valid for use in the presence of simple carry-over effects. We have compared the proposed test procedures with use of Wald’s test procedures assuming the normal random effects proportional odds model. We have noted that the proposed test procedures include those for testing equality of treatments in binary data as special cases. The results, findings and discussions should have use for biostatisticians and clinicians when they employ a two-period crossover design to compare three treatments in ordinal data.
Acknowledgements
The author wishes to thank the reviewer for valuable and useful comments and suggestions to improve the contents and clarity of this article.
Appendix I
On the basis of the assumed model
On the basis of eqs (16) and (17), we have
which is free from the period effect. Similarly, we can see from eqs (18) and (20) that
Furthermore, we can see from eqs (21) and (19) that
Following similar arguments as for deriving eqs (22)–(24), we obtain the following three consistent estimators for the GOR of responses between treatment B and placebo P as
Again, for convenience in the following discussion we define three 2 × 2 tables consisting of frequencies
and
Again, using the same arguments as above, we may further obtain the following three consistent estimators for
Also, we define the following three 2 × 2 tables consisting of frequencies
and
Appendix II
Using the simple carryover model, we assume that
and
where
On the basis of the assumed model (29), we obtain
On the basis of model (30), we may obtain the GOR of responses between periods 2 and 1 in group g (= 1, 2, 3, 4, 5, 6) as
From eqs (31) and (32), we have
Similarly, from eqs (33)–(36), we obtain
and
Under
We define
where
,
Note that the weights (41) are function of unknown variances
References
1. Senn S. Cross-over trials in clinical research, 2nd ed. Chichester: Wiley; 2002.10.1002/0470854596Suche in Google Scholar
2. Grizzle JE. The two-period change-over design and its use in clinical trials. Biometrics. 1965;21:467–480.10.2307/2528104Suche in Google Scholar
3. Hill M, Armitage P. The two-period cross-over clinical trial. Br J Clin Pharmacol. 1979;8:7–20.10.1111/j.1365-2125.1979.tb05903.xSuche in Google Scholar PubMed PubMed Central
4. Fleiss JL. The design and analysis of clinical experiments. New York: Wiley; 1986.Suche in Google Scholar
5. Lui K-J, Chang K-C. Hypothesis testing and estimation in ordinal data under a simple crossover design. J Biopharm Stat. 2012;22:1137–1147.10.1080/10543406.2011.574326Suche in Google Scholar PubMed
6. Jones B, Kenward MG. Modelling binary data from a three-period cross-over trial. Stat Med. 1987;6:555–564.10.1002/sim.4780060504Suche in Google Scholar PubMed
7. Jones B, Kenward MG. Design and analysis of cross-over trials, 3rd ed. London: Chapman and Hall, Taylor & Francis Group; 2014.10.1201/b17537Suche in Google Scholar
8. Senn S. Cross-over trials in Statistics in Medicine: The first ‘25’ years. Stat Med. 2006;25:3430–3442.10.1002/sim.2706Suche in Google Scholar PubMed
9. Gart JJ. An exact test for comparing matched proportions in crossover designs. Biometrika. 1969;56:75–80.10.1093/biomet/56.1.75Suche in Google Scholar
10. Ezzet F, Whitehead J. A random effects model for binary data from cross-over trials. Appl Stat. 1992;41:117–126.10.2307/2347622Suche in Google Scholar
11. Schouten H, Kester A. A simple analysis of a simple crossover trial with a dichotomous outcome measure. Stat Med. 2010;29:193–198.10.1002/sim.3771Suche in Google Scholar PubMed
12. Lui K-J. Crossover designs: Testing, estimation, and sample size. New York: Wiley; 2016.10.1002/9781119114710Suche in Google Scholar
13. Lui K-J, Chang K-C. Test non-inferiority (and equivalence) based on the odds ratio under a simple crossover trial. Stat Med. 2011;30:1230–1242.10.1002/sim.4166Suche in Google Scholar PubMed
14. Lui K-J, Chang K-C. Exact sample size determination in testing non-inferiority under a simple crossover design. Pharm Stat. 2012;11:129–134.10.1002/pst.506Suche in Google Scholar PubMed
15. Lui K-J. Notes on testing carry-over effects in continuous data under an incomplete block crossover design. Commun Stat Simul Comput. 2016.. . doi:10.1080/03610918.2016.1146764.Suche in Google Scholar
16. Lui K-J, Chang K-C. Test and estimation in binary data analysis under an incomplete block crossover design. Comput Stat Data Anal. 2015;81:130–138.10.1016/j.csda.2014.07.017Suche in Google Scholar
17. Lui K-J. Estimation of the treatment effect under an incomplete block crossover design in binary data –a conditional likelihood approach. Stat Methods Med Res. 2015. doi:10.1177/0962280215595434.Suche in Google Scholar PubMed
18. Lui K-J, Chang K-C. Exact tests in binary data under an incomplete block crossover design. Stat Methods Med Res. 2016. doi:10.1177/0962280216638382.Suche in Google Scholar PubMed
19. Ezzet F, Whitehead J. A random effects model for ordinal responses from a crossover trial. Stat Med. 1991;10:901–907.10.1002/sim.4780100611Suche in Google Scholar PubMed
20. Agresti A. generalized odds ratios for ordinal data. Biometrics. 1980;36:59–67.10.2307/2530495Suche in Google Scholar
21. Lui K-J. Notes on estimation of the general odds ratio and the general risk difference for paired-sample data. Biometrical J. 2002;44:957–968.10.1002/bimj.200290007Suche in Google Scholar
22. Lui K-J. Statistical estimation of epidemiology risk. Wiley: New York; 2004.10.1002/0470094087Suche in Google Scholar
23. Fleiss JL, Levin B, Paik MC. Statistical methods for rates and proportions, 3rd ed.. New York: Wiley; 2003.10.1002/0471445428Suche in Google Scholar
24. Kenward M, Jones B. The analysis of categorical data from cross-over trials using a latent variable model. Stat Med. 1991;10:1607–1619.10.1002/sim.4780101012Suche in Google Scholar
25. Senn SJ. Is the simple carry-over model useful? Stat Med. 1992;11:715–726.10.1002/sim.4780110603Suche in Google Scholar
26. Fleiss JL. On multiperiod crossover studies. Biometrics (Correspondence). 1986;42:449–450.Suche in Google Scholar
27. Fleiss JL. A critique of recent research on the two-treatment crossover design. Control Clin Trials. 1989;10:237–243.10.1016/0197-2456(89)90065-2Suche in Google Scholar
28. Agresti A. Categorical data analysis. New York: Wiley; 1990.Suche in Google Scholar
29. Ne B, Ne D. Statistical methods in cancer research, volume 1. The analysis of case-control studies. Lyon: International Agency for Research on Cancer; 1980.Suche in Google Scholar
30. Gart JJ. Point and interval estimation of the common odds ratio in the combination of 2x2 tables with fixed margins. Biometrika. 1970;57:471–475.Suche in Google Scholar
31. SAS Institute, Inc. User’s Guide, 2nd ed. Cary, NC: SAS Institute; 2009.Suche in Google Scholar
32. Lui K-J, Chang K-C, Lin C-D. Testing equality and interval estimation of the generalized odds ratio in ordinal data under a three-period crossover design. Stat Methods Med Res. 2015. doi:10.1177/0962280215569623.Suche in Google Scholar PubMed
33. Freeman P. The performance of the two-stage analysis of two-treatment, two-period cross-over trials. Stat Med. 1989;8:1421–1432.10.1002/sim.4780081202Suche in Google Scholar PubMed
34. Senn SJ. Cross-over trials, carry-over effects and the art of self-delusion. Stat Med. 1988;7:1099–1101.10.1002/sim.4780071010Suche in Google Scholar PubMed
35. Clayton D, Cuzick J. Multivariate generalizations of the proportional hazards model. J R Stat Soc Ser. 1985;148:82–117.10.2307/2981943Suche in Google Scholar
36. Fleiss JL. On the asserted invariance of the odds ratio. Br J Prev Soc Med. 1970;24:45–46.10.1136/jech.24.1.45Suche in Google Scholar PubMed PubMed Central
37. Mosteller F. Association and estimation in contingency tables. J Am Stat Assoc. 1968;63:1–28.10.1080/01621459.1968.11009219Suche in Google Scholar
© 2017 Walter de Gruyter GmbH, Berlin/Boston
Artikel in diesem Heft
- Commentary
- Big Data, Small Sample
- Research Articles
- Parameter Estimation of a Two-Colored Urn Model Class
- Combinatorial Mixtures of Multiparameter Distributions: An Application to Bivariate Data
- On the Conditional Power in Survival Time Analysis Considering Cure Fractions
- Comparing Four Methods for Estimating Tree-Based Treatment Regimes
- On Stratified Adjusted Tests by Binomial Trials
- Improvement Screening for Ultra-High Dimensional Data with Censored Survival Outcomes and Varying Coefficients
- Bayesian Variable Selection Methods for Matched Case-Control Studies
- Testing Equality of Treatments under an Incomplete Block Crossover Design with Ordinal Responses
- Empirical Likelihood in Nonignorable Covariate-Missing Data Problems
- A Quantitative Concordance Measure for Comparing and Combining Treatment Selection Markers
- Median Analysis of Repeated Measures Associated with Recurrent Events in Presence of Terminal Event
- A Theorem at the Core of Colliding Bias
- Group Tests for High-dimensional Failure Time Data with the Additive Hazards Models
- Characterizing Highly Benefited Patients in Randomized Clinical Trials
Artikel in diesem Heft
- Commentary
- Big Data, Small Sample
- Research Articles
- Parameter Estimation of a Two-Colored Urn Model Class
- Combinatorial Mixtures of Multiparameter Distributions: An Application to Bivariate Data
- On the Conditional Power in Survival Time Analysis Considering Cure Fractions
- Comparing Four Methods for Estimating Tree-Based Treatment Regimes
- On Stratified Adjusted Tests by Binomial Trials
- Improvement Screening for Ultra-High Dimensional Data with Censored Survival Outcomes and Varying Coefficients
- Bayesian Variable Selection Methods for Matched Case-Control Studies
- Testing Equality of Treatments under an Incomplete Block Crossover Design with Ordinal Responses
- Empirical Likelihood in Nonignorable Covariate-Missing Data Problems
- A Quantitative Concordance Measure for Comparing and Combining Treatment Selection Markers
- Median Analysis of Repeated Measures Associated with Recurrent Events in Presence of Terminal Event
- A Theorem at the Core of Colliding Bias
- Group Tests for High-dimensional Failure Time Data with the Additive Hazards Models
- Characterizing Highly Benefited Patients in Randomized Clinical Trials