Abstract
In randomized clinical trials, we often encounter ordinal categorical responses with repeated measurements. We propose a model-free approach with using the generalized odds ratio (GOR) to measure the relative treatment effect. We develop procedures for testing equality of treatment effects and derive interval estimators for the GOR. We further develop a simple procedure for testing the treatment-by-period interaction. To illustrate the use of test procedures and interval estimators developed here, we consider two real-life data sets, one studying the gender effect on pain scores on an ordinal scale after hip joint resurfacing surgeries, and the other investigating the effect of an active hypnotic drug in insomnia patients on ordinal categories of time to falling asleep.
1 Introduction
In clinical trials or health-related studies, we often encounter the patient response on an ordinal scale, for example, worse, same or better. Arbitrarily assigning scores (such as –1, 0, 1) to these ordinal categories for arithmetic operation can be inappropriate due to the fact that the relative distance between any two successive ordinal categories is not really equal or even comparable. One may also have difficulty in interpreting the mean of these arbitrary scores in terms of practical meaning. Grouping multiple categories into a single category to reduce ordinal outcomes into dichotomous responses can cause the loss of efficiency.
To reduce the number of patients in clinical trials, taking more than one measurement on each patient frequently arises in practice. Research on repeated ordinal responses has been intensive [1, 2, 3–5, 6, 7, 8, 9]. Francom et al. [2] applied a family of structural log-linear models and required investigators to assign the scores to represent the relative distance between ordinal categories. Agresti [1] addressed use of generalized linear models with different link functions and linear predictors. Ware et al. [9] as well as Kenward and Jones [4] discussed various approaches to analyze repeated categorical measurements. Parsons et al. [7] considered the proportional odds logistic regression with a range of working correlation models. All these publications focused discussions on model-based methods. By contrast, the methods proposed here is model-free and does not assume any parametric form for the data structure. Furthermore, there is no need to assume or specify any particular dependence structure between repeated measurements. Also, rather than concentrating attentions on testing equality between treatments in ordinal data with repeated measurements, there were publications [10, 11] discussing and deriving procedures for testing positive (quadrant) dependence under various parameter constraints and marginal modeling in two-way tables with ordinal categories. Agresti and Coull [12] focused discussion on model-based approach as well and addressed testing hypothesis against order-restricted alternatives. The purposes of these papers are different from what we focus here is to derive model-free procedures for testing equality of treatments, while the dependence between measurements taken within patients is nuisance effect.
Using the generalized odds ratio (GOR) [13], we develop in this paper model-free procedures for testing equality of treatments and derive interval estimators for the relative treatment effect in ordinal data with repeated measurements. We further develop a simple procedure for testing the treatment-by-period interaction. We discuss the usefulness and limitations of test procedures developed here. To illustrate the use of these procedures, we consider two real-life data sets, one taken from a trial studying the gender effect on pain scores on an ordinal scale after hip joint resurfacing surgeries [7] and the other taken from a double-blind randomized trial comparing an active hypnotic drug with a placebo in insomnia patients with respect to the ordinal category of time to falling asleep [2].
2 Notation and Methods
Consider comparing two treatments in a randomized clinical trial, in which we randomly assign
Note that we can estimate
where
Note that we define
Following the same arguments as for deriving
where
delta method, we can show that an estimated asymptotic variance of
Also, we define
Note that
2.1 Test non-equality between treatments in the absence of interactions
When treatments received at the two periods in a group g are the same and there is no treatment-by-period interaction (i. e.,
where
2.2 Test non-equality between treatments in the presence of interactions
When treatments received at the two periods in a group are not the same or there is a treatment-by-period interaction (i. e.,
where
2.3 Procedure for testing the group-by-period interaction
When wishing to study whether there is a group-by-period interaction, we may consider testing
where
2.4 Interval estimation of the relative treatment effect
When treatments received at the two periods in a group are the same and there is no treatment-by-period interaction, we let
where
When treatments received at the two periods are not the same or there is a treatment-by-period interaction, we may wish to obtain an interval estimator for
3 Examples
To illustrate the use of point estimators
Frequency distribution of patients with pain scores coded as: none (1), slight (2), mild, moderate or marked pain (3) taken at two and five years post-surgery between females and males.
| Gender | At Two Years | At Five Years | ||||
|---|---|---|---|---|---|---|
| None | Slight | Mild, Moderate or Marked Pain | Marginal Total | Marginal Percentage | ||
| Female | None | 7 | 1 | 2 | 10 | 0.476 |
| Slight | 3 | 1 | 3 | 7 | 0.333 | |
| Mild, Moderate or Marked Pain | 0 | 1 | 3 | 4 | 0.190 | |
| Marginal Total | 10 | 3 | 8 | 21 | ||
| Marginal Percentage | 0.476 | 0.143 | 0.381 | 1.000 | ||
| None | Slight | Mild, Moderate or Marked Pain | Marginal Total | Marginal Percentage | ||
| Male | None | 19 | 7 | 2 | 28 | 0.757 |
| Slight | 1 | 3 | 1 | 5 | 0.135 | |
| Mild, Moderate or Marked Pain | 0 | 0 | 4 | 4 | 0.108 | |
| Marginal Total | 20 | 10 | 7 | 37 | ||
| Marginal Percentage | 0.541 | 0.270 | 0.189 | 1.000 | ||
[0.476×(0.135+0.108)+0.333×0.108]/[0.333×0.757+0.190×(0.757+0.135)]=0.360. Similarly, using marginal percentages for columns between the two sub-tables, we obtain
When assuming a normal random effects proportional odds model [17], we may employ Proc Glimmix in SAS [18] to study the difference in pain scores between genders on the basis of the model-based approach. Using the data in Table 1, we have obtained the parameter estimate (and its estimated standard error (SE)) for the relative gender effect of males versus females to be 0.961 (SE=0.6148). This leads the p-value for testing the equality of pain scores across the two periods between genders is 0.124, which is similar to those obtained by use of test procedures (5) and (6). Also, note that the parameter estimate 0.961 is larger than 0. Thus, males tend to fall in categories with lower pain scores than females. This inference is identical to that obtained on the basis of the GOR focused here.
We may sometimes encounter a trial in which the patient response taken at period (z=) 1 actually represents the baseline response. In this case, we may apply the test procedure (7), developed for testing group-by-period interaction, to study whether there is a relative treatment effect. To illustrate this point, we consider the data (Table 2) taken from a double-blind randomized clinical trial comparing an active hypnotic drug (g=1) with a placebo (g=2) in patients with insomnia [1, 14, 2]. The outcome of interest is to respond the question “How quickly did you fall asleep after going to bed?” and is recorded on a four-point ordinal scale (< 20, 20–30, 30–60, > 60 in minutes). Each participated subject was asked this question twice, one after a one-week placebo washout period for both groups and the other at the conclusion of a two-week treatment period. Using the data in Table 2, we obtain
Frequency distribution of patients with time to falling asleep (in minutes) taken at the end of one-week washout period and at the conclusion of two-week treatment period.
| Treatment | At One-week Period | At Two-week Treatment Period | ||||
|---|---|---|---|---|---|---|
| <20 | 20-30 | 30-60 | >60 | Total | ||
| Active | < 20 | 7 | 4 | 1 | 0 | 12 |
| 20-30 | 11 | 5 | 2 | 2 | 20 | |
| 30-60 | 13 | 23 | 3 | 1 | 40 | |
| > 60 | 9 | 17 | 13 | 8 | 47 | |
| Total | 40 | 49 | 19 | 11 | 119 | |
| <20 | 20-30 | 30-60 | >60 | Total | ||
| Placebo | < 20 | 7 | 4 | 2 | 1 | 14 |
| 20-30 | 14 | 5 | 1 | 0 | 20 | |
| 30-60 | 6 | 9 | 18 | 2 | 35 | |
| > 60 | 4 | 11 | 14 | 22 | 51 | |
| Total | 31 | 29 | 35 | 25 | 120 | |
4 Discussion
We can employ, as demonstrated here, the GOR to measure the relative treatment effect without the need to assume any specific parametric model. Since the GOR has a simple interpretation and is easily understood, the GOR is of use in ordinal data. In fact, the GOR is closely related to the gamma correlation [19], a commonly-used measure of the strength of association between two ordinal variables. We refer readers to some publications on estimation of the GOR and its applications under other situations [20, 21, 22–24]. When repeated measurements are taken at the same time (or there are no period effects), one may employ the Dirichlet-multinomial distribution to model the intraclass correlation between repeated measurements [25]. As considered in the above two examples, however, repeated measurements on patients are often taken at different time intervals in clinical trials. The period effect is likely to exist and is required to be incorporated to avoid bias in data analysis [26, 27]. The methods based on the Dirichlet-multinomial model for cluster sampling without accounting for the period effect would not be appropriate for use in situations focused here.
The proportional odds model is probably one of the most commonly-used models to analyze ordinal data. Just like all model-based approaches, we can easily extend the proportional odds model to account for confounders (if there were) or accommodate other general situations. However, the implicit assumption of the proportional odds model can be badly violated by many bivariate distributions [13, 28, 29]. Furthermore, when applying Proc Glimmix in SAS [18] based on the random effects proportional odds model to ordinal data with repeated measurements, we need to assume that the random effects (accounting for the intraclass correlation between repeated measurements) due to patients follow a normal distribution. This normal assumption for random effects can be difficult to be justified. By contrast, the proposed method is model-free. It does not require the random effects due to patients to follow the normal distribution, not does assume any parametric models for the data structure. Thus, our methods are applicable despite of various parametric models for the underlying data structure and distribution assumptions for the patient random effects. Furthermore, the point estimators, test procedures and interval estimators developed here can all be expressed in closed forms. Readers may employ these test procedures and estimators by use of a pocket calculator even without knowledge of any statistical software. The interpretation of the GOR is, as illustrated in examples, easily understood. When there are confounders in a trial of a large size, we may extend the methods proposed here by use of stratified analysis with strata determined by the combined levels of confounders [23]. But we want to note that the model-based approach can be preferable to the model-free approach proposed here if there are many covariates to adjust for a trial of a small or moderate size.
Finally, we note that using similar arguments as above, it is straightforward to extend the results to accommodate the cases with three or more periods. We outlines the extension of results presented here to accommodate three periods in Appendix II.
In summary, we have developed model-free test procedures for testing equality of treatments in ordinal data with repeated measurements. We have further derived interval estimators for the relative treatment effect measured by the GOR. We recommend use of the summary test procedure to improve power when treatments received at two periods in a group are the same and there is no treatment-by-period interaction. However, we should cautiously employ this summary test procedure when there is a treatment-by-period interaction. The bivariate test procedure can be of use in this case. We further outline the extension of results to accommodate three periods. The results, findings and discussions should have use for biostatisticians and clinicians when they encounter ordinal responses with repeated measurements.
Funding statement: Funding: The research received no specific grant from any funding agency in the public, commercial, or not for-for-profit sectors.
Acknowledgements
The author wishes to thank the associate editor and two reviewers for many valuable comments and suggestions to improve the clarity and contents of this article.
Appendix I
Suppose that random vectors
On the basis of (A.1), we have
Using the delta method, we may obtain the asymptotic covariance
We can obtain an estimated asymptotic covariance
Appendix II
For a randomly selected patient
where
Using the delta method [14], we obtain an estimated asymptotic variance for
where
where
where
Note that the covariance
Thus, from (A.1) we have
Using the delta method, we obtain the asymptotic covariance
We obtain the estimated asymptotic covariance
where
References
1. Agresti A. A survey of models for repeated ordered categorical response data. Stat Med 1989;8:1209–24.10.1002/sim.4780081005Search in Google Scholar PubMed
2. Francom SF, Chuang-Stein C, Landis JR. A log-linear model for ordinal data to characterize differential change among treatments. Stat Med 1989;8:571–82.10.1002/sim.4780080506Search in Google Scholar PubMed
3. Kenward MG, Jones B. The analysis of categorical data from cross-over trials using a latent variable model. Stat Med 1991;10:1607–19.10.1002/sim.4780101012Search in Google Scholar PubMed
4. Kenward MG, Jones B. Alternative approaches to the analysis of binary and categorical repeated measurements. J Biopharm Stat 1992;2:137–70.10.1080/10543409208835036Search in Google Scholar PubMed
5. Koch GG, Landis JR, Freeman JL, Freeman JrDH, Lehnen RG. A general methodology for the analysis of experiments with repeated measurement of categorical data. Biometrics 1977;33:133–58.10.2307/2529309Search in Google Scholar
6. McCullagh P. A logistic model for paired comparisons with ordered categorical data. Biometrika 1977;64:449–53.10.2307/2345320Search in Google Scholar
7. Parsons NR, Costa ML, Achten J, Stallard N. Repeated measures proportional odds logistic regression analysis of ordinal score data in the statistical software package R. Comput Stat Data Anal 2009;53:632–41.10.1016/j.csda.2008.08.004Search in Google Scholar
8. Stram DO, Wei LJ, Ware JH. The analysis of repeated ordinal categorical outcomes with possible missing observations and time-dependent covariates. J Am Stat Assoc 1989;83:631–7.10.1080/01621459.1988.10478642Search in Google Scholar
9. Ware JH, Lipsitz S, Speizer FE. Issues in the analysis of repeated categorical outcomes. Stat Med 1988;7:95–107.10.1002/sim.4780070113Search in Google Scholar PubMed
10. Bartolucci F, Forcina A, Dardanoni V. Positive quadrant dependence and marginal modeling in two-way tales with ordered margins. J Am Stat Assoc 2001;96:1497–505.10.1198/016214501753382390Search in Google Scholar
11. Bartolucci F, Forcina A. Extended RC association models allowing for order restrictions and marginal modeling. J Am Stat Assoc 2002;97:1192–9.10.1198/016214502388618988Search in Google Scholar
12. Agresti A, Coull BA. The analysis of contingency tables under inequality constraints. J Stat Plan Infernce 2002;107:45–73.10.1016/S0378-3758(02)00243-4Search in Google Scholar
13. Agresti A. Generalized odds ratios for ordinal data. Biometrics 1980;36:59–67.10.2307/2530495Search in Google Scholar
14. Agresti A. Categorical Data Analysis. New York: Wiley, 1990.Search in Google Scholar
15. Lui K-J. Statistical Estimation of Epidemiology Risk. New York: Wiley, 2004.10.1002/0470094087Search in Google Scholar
16. Harris WH. Traumatic arthritis of the hip after dislocation and acetabular fractures: Treatment by mold arthroplasty. An end-result study using a new method of result evaluation. J Bone Joint Surg 1969;51-A:737–55.10.2106/00004623-196951040-00012Search in Google Scholar
17. Ezzet F, Whitehead J. A random effects model for ordinal responses from a crossover trial. Stat Med 1991;10:901–7.10.1002/sim.4780100611Search in Google Scholar
18. SAS Institute, Inc. User’s Guide, 2nd ed. Cary, NC: SAS Institute, 2009.Search in Google Scholar
19. Goodman LA, Kruskal WH. Measure of association for cross classification. J Am Stat Assoc 1954;49:732–64.10.1080/01621459.1954.10501231Search in Google Scholar
20. Edwardes MD, Baltzan M. The generalization of the odds ratio, risk ratio and risk difference to r k tables. Stat Med 2000;19:1901–14.10.1002/1097-0258(20000730)19:14<1901::AID-SIM514>3.0.CO;2-VSearch in Google Scholar
21. Lui K-J. Notes on estimation of the general odds ratio and the general risk difference for paired-sample data. Biometric J 2002a;44:957–68.10.1002/bimj.200290007Search in Google Scholar
22. Lui K-J, Chang K-C. Hypothesis testing and estimation in ordinal data under a simple crossover design. J Biopharm Stat 2012;22:1137–47.10.1080/10543406.2011.574326Search in Google Scholar
23. Lui K-J, Chang K-C. Notes on interval estimation of the generalized odds ratio under stratified random sampling. J Biopharm Stat 2013a;23:513–25.10.1080/10543406.2011.616977Search in Google Scholar
24. Lui K-J, Chang K-C. Notes on testing noninferiority in ordinal data under the parallel groups design. J Biopharm Stat 2013b;23:1294–307.10.1080/10543406.2013.834923Search in Google Scholar PubMed
25. Lui K-J. Interval estimation of generalized odds ratio in data with repeated measurements. Stat Med 2002b;21:3107–17.10.1002/sim.1239Search in Google Scholar PubMed
26. Gart JJ. An exact test for comparing matched proportions in crossover designs. Biometrika 1969;56:75–80.10.1093/biomet/56.1.75Search in Google Scholar
27. Senn S. Cross-over Trials in Clinical Research, 2nd ed. Chichester: Wiley, 2002.10.1002/0470854596Search in Google Scholar
28. Fleiss JL. On the asserted invariance of the odds ratio. Br J Preventive Soc Med 1970;24:45–6.10.1136/jech.24.1.45Search in Google Scholar PubMed PubMed Central
29. Mosteller F. Association and estimation in contingency tables. J Am Stat Assoc 1968;63:1–28.10.1080/01621459.1968.11009219Search in Google Scholar
© 2016 Walter de Gruyter GmbH, Berlin/Boston
Articles in the same Issue
- Research Articles
- A Comparison of Some Approximate Confidence Intervals for a Single Proportion for Clustered Binary Outcome Data
- Effect of Smoothing in Generalized Linear Mixed Models on the Estimation of Covariance Parameters for Longitudinal Data
- Adaptive Design for Staggered-Start Clinical Trial
- A Binomial Integer-Valued ARCH Model
- Testing Equality in Ordinal Data with Repeated Measurements: A Model-Free Approach
- Mendelian Randomization using Public Data from Genetic Consortia
- Tree Based Method for Aggregate Survival Data Modeling
- Multi-locus Test and Correction for Confounding Effects in Genome-Wide Association Studies
- Semiparametric Regression Estimation for Recurrent Event Data with Errors in Covariates under Informative Censoring
- Joint Model for Mortality and Hospitalization
- Effect Estimation in Point-Exposure Studies with Binary Outcomes and High-Dimensional Covariate Data – A Comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weighting
- Sample Size for Assessing Agreement between Two Methods of Measurement by Bland−Altman Method
- Using Relative Statistics and Approximate Disease Prevalence to Compare Screening Tests
- Multiple Comparisons Using Composite Likelihood in Clustered Data
Articles in the same Issue
- Research Articles
- A Comparison of Some Approximate Confidence Intervals for a Single Proportion for Clustered Binary Outcome Data
- Effect of Smoothing in Generalized Linear Mixed Models on the Estimation of Covariance Parameters for Longitudinal Data
- Adaptive Design for Staggered-Start Clinical Trial
- A Binomial Integer-Valued ARCH Model
- Testing Equality in Ordinal Data with Repeated Measurements: A Model-Free Approach
- Mendelian Randomization using Public Data from Genetic Consortia
- Tree Based Method for Aggregate Survival Data Modeling
- Multi-locus Test and Correction for Confounding Effects in Genome-Wide Association Studies
- Semiparametric Regression Estimation for Recurrent Event Data with Errors in Covariates under Informative Censoring
- Joint Model for Mortality and Hospitalization
- Effect Estimation in Point-Exposure Studies with Binary Outcomes and High-Dimensional Covariate Data – A Comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weighting
- Sample Size for Assessing Agreement between Two Methods of Measurement by Bland−Altman Method
- Using Relative Statistics and Approximate Disease Prevalence to Compare Screening Tests
- Multiple Comparisons Using Composite Likelihood in Clustered Data