Abstract
Efficacy estimation of medical interventions in clinical trials requires diagnostics to assess the study endpoint. The imperfect nature of diagnostics often leads to biased efficacy estimations, usually underestimation, that could result in failure of clinical trials. Adjustment methods can be used if sensitivity and specificity are known, but they are not regularly applied. Double-arm clinical trials are the standard for demonstrating the superiority of a medical intervention. However, sometimes single-arm trials are the only option: for example, if a parallel group trial design with a control group is regarded as unethical. Based on the Gart and Buck prevalence estimator (aka the Rogan-Gladen estimator), we demonstrate how the concept of diagnostic accuracy-adjusted effect estimation can be used for estimating the efficacy in single-arm trials. The accuracy of the diagnostic measure (sensitivity and specificity) used for endpoint assessment as well as the predictive accuracy of a predictor and its allocation among the study population have to be known to obtain consistent efficacy estimation of a medical intervention. If double-arm trials are not feasible, an approach such as that described here can provide evidence regarding the efficacy of a medical intervention based on a single-arm trial.
Brief summary
Especially in situations where a double-arm trial is not feasible or is regarded as ethically unacceptable, options for obtaining reliable efficacy estimates from single-arm trials can be very valuable to provide evidence. Accordingly, we suggest a procedure that allows for obtaining efficacy estimates in single-arm trials using methods to adjust for imperfect predictive accuracy of diagnostic tests.
Efficacy estimations in clinical trials rely on diagnostic results that confirm the endpoint. The sensitivity and specificity of diagnostic tests used for endpoint definitions are usually lower than 1, which generates misclassification. An early discussion of the impact of sensitivity and specificity on the reliability of prevalence measures was published in 1966 [1] and 12 years later [2]. The authors of both studies [1], [2] presented an identical prevalence estimator adjusted for diagnostic accuracy. In 1998, Lachenbruch [3] published an adjusted estimator for intervention efficacy based on the adjusted prevalence estimator. This estimator is important for clinical trials as efficacy estimations in such trials are biased if the impact of diagnostic sensitivity and specificity on effect estimations is neglected. If the sum of sensitivity and specificity is greater than 1 but smaller than 2, the measured (unadjusted) effect underestimates the true effect of a medical intervention. If the sum of sensitivity and specificity is even smaller than 1, the measured (unadjusted) effect of a medical intervention is actually in the opposite direction to the real effect. Accordingly, the diagnostic accuracy has to be taken into account while planning clinical trials.
Our approach of estimating the efficacy of medical interventions in single-arm trials is based on the Gart and Buck prevalence estimator that is commonly known as the Rogan-Gladen estimator and which is adjusted for diagnostic accuracy. So far, a sensitivity- and specificity-adjusted estimator for treatment efficacy has been presented only for double-arm trials [3].
For estimation of the efficacy of medical interventions in single-arm trials, the predictive accuracy (sensitivity and specificity) of an additional endpoint-associated predictor (apart from the intervention under investigation) and its distribution in the trial population have to be known. Such predictors can be any information of relevance for which the predictive accuracy for the endpoint is known. Further, we adapted the concept of adjusting prevalence and efficacy estimates of medical interventions for diagnostic accuracy to single-arm trials using the prognostic accuracy of a predictor for the endpoint. In the course of this, we will use the following definitions:
| n | Number of individuals tested |
| PrevE | Point estimator for prevalence |
| prevE | Estimated prevalence for a study endpoint |
| seE | Sensitivity of a diagnostic test for the study endpoint |
| seP | Sensitivity of a diagnostic test for the predictor |
| spE | Specificity of a diagnostic test for the study endpoint |
| spP | Specificity of a diagnostic test for the predictor |
| TE | Random variable for the rate of positively tested individuals for the endpoint E, i.e. n*TE follows a binomial distribution where n is the number of tested individuals |
| tE | Rate of individuals tested positive for the study endpoint as realization of TE |
| TP | Random variable for the rate of positively tested individuals for the predictor P, i.e. n*TP follows a binomial distribution where n is the number of tested individuals |
| tP | Rate of individuals tested positive for the predictor as realization of TP |
| ME | Point estimator for efficacy of the medical intervention regarding the study endpoint |
| IM | Point estimator of incidence of the study endpoint with medical intervention |
| iM | Estimated incidence of the study endpoint with medical intervention |
| Point estimator of incidence of the study endpoint without medical intervention | |
| Estimated incidence of the study endpoint without medical intervention | |
| PPVP | P (incident|predictor present) |
| P (incident|predictor absent) | |
| Var() | Variance of the random variable in parentheses |
With the binomial-distributed random variable TE for the rate of positively tested individuals for an endpoint E given, an unbiased point estimator for the prevalence of E that is adjusted for diagnostic sensitivity and specificity was introduced in 1966 [1] as well as in 1978 [2]. It is known as the Rogan-Gladen estimator which is given as follows:
Note that the sum of the sensitivity and specificity has to be unequal to 1. For the accuracy of a prevalence estimation based on an adjustment for diagnostic accuracy, it is more important that the sum of sensitivity and specificity is considerably different from 1 than simply that it be larger than 1. So, a diagnostic test with a sum of sensitivity and specificity that is even smaller than 1 would lead to smaller confidence intervals than a diagnostic test with “better” accuracy but a sum of sensitivity and specificity that is close to 1.
The variance of PrevE as published by Rogan and Gladen is given as follows:
Lachenbruch [3] used the aforementioned estimator and introduced an adjusted estimator for treatment efficacy based on identical diagnostic accuracy over the study arms. Lachenbruch’s diagnostic accuracy-adjusted efficacy estimator of a medical intervention ME for varying diagnostic accuracy over the study arms can be given as follows:
with
with the variances as given by equation (2).
We demonstrate how to estimate the efficacy of medical interventions in single-arm trials by the following approach: (i) using the prevalence estimator adjusted for diagnostic inaccuracy using known sensitivity and specificity of the diagnostic test for the endpoint; (ii) using the knowledge of an endpoint predictor and its accuracy; (iii) using the prevalence estimator adjusted for diagnostic inaccuracy using known sensitivity and specificity of the diagnostic test for the predictor. Of course, the survey of the predictor distribution among the study population should be done before baseline and blinded.
In a population without medical intervention, for predictor P and incidence I holds (law of total probability):
With known predictive accuracy of the predictor for the endpoint, an unbiased and consistent estimator for incidence without medical intervention is given as follows:
where PPVP is the positive predictive value for the predictor being present,
To estimate the efficacy of a medical intervention in a single-arm trial, all individuals initially tested for a predictor with known predictive accuracy for the endpoint will be subjected to the medical intervention and followed up for disease independently of the initial diagnostic test result for the predictor.
The estimator for the efficacy of the medical intervention ME in a single-arm trial is now given as follows:
where IM is the incidence of the population being subjected to the medical intervention and
and
The variance of ME is given by
with
It should be noted that the predictive accuracy (sensitivity and specificity) of the predictor for the endpoint has to be well known. Furthermore, the evaluated incidence period in the study should be the same as the predicted incidence period. An example is provided in Supplementary Material 1.
Single-arm trials are unlikely to replace the generally accepted standard of investigator-blinded, controlled double-arm trials but they can be used if circumstances make a double-arm trial very difficult or even impossible. Efficacy estimators can be considered if the impact of a predictor on the risk to reach the study endpoint is known. A decision analysis model supplemented by a Monte Carlo simulation could be such an approach to estimate the accuracy of possible predictors. Several methods for the estimation of diagnostic accuracy have been published [4] and these could also be adapted.
Also, precise knowledge of the sensitivity and specificity of all diagnostic tests conducted is essential for reliably estimating the efficacy of a medical intervention. This requirement is not limited to single-arm trials.
Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
Research funding: None declared.
Employment or leadership: None declared.
Honorarium: None declared.
Competing interests: The funding organization(s) played no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the report for publication.
Ethical statement: Not applicable.
References
1. Gart JJ, Buck AA. Comparison of a screening test and a reference test in epidemiologic studies. II. A probabilistic model for the comparison of diagnostic tests. Am J Epidemiol 1966;83:593–602.10.1093/oxfordjournals.aje.a120610Search in Google Scholar PubMed
2. Rogan WJ, Gladen B. Estimating prevalence from the results of a screening test. Am J Epidemiol 1978;107:71–6.10.1093/oxfordjournals.aje.a112510Search in Google Scholar PubMed
3. Lachenbruch PA. Sensitivity, specificity, and vaccine efficacy. Control Clin Trials 1998;19:569–74.10.1016/S0197-2456(98)00042-7Search in Google Scholar
4. Enøe C, Georgiadis MP, Johnson WO. Estimation of sensitivity and specificity of diagnostic tests and disease prevalence when the true disease state is unknown. Prev Vet Med 2000;45:61–81.10.1016/S0167-5877(00)00117-3Search in Google Scholar
Supplementary Material
The online version of this article offers supplementary material (https://doi.org/10.1515/labmed-2019-0091).
©2019 Walter de Gruyter GmbH, Berlin/Boston
This article is distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Articles in the same Issue
- Frontmatter
- Laboratory Management
- Recommended changes of the current version of the German Rili-BAEK
- Analysis of a 6-year pilot external quality assurance survey of free light chain using Sigma metrics
- Allergy and Autoimmunity
- Different vitamin D status in common multiorgan autoimmune disease patients
- Investigation of the dual cascade algorithm in the diagnosis of antinuclear antibodies
- Neurology Laboratory
- Ischemia-modified albumin (IMA) and dynamic thiol-disulfide homeostasis in patients with postherpetic neuralgia
- Endocrinology
- Effect of hemoglobin F and A2 on hemoglobin A1c determined by cation exchange high-performance liquid chromatography
- Original Article
- A colorimetric method to measure oxidized, reduced and total glutathione levels in erythrocytes
- Short Communication
- Sparing the control arm using well-characterized diagnostic approaches – the Gart and Buck prevalence estimator for efficacy estimation in single-arm trials
- Letter to the Editor
- Ambiguous pharmacogenetic genotyping results in a patient with bone marrow transplantation
Articles in the same Issue
- Frontmatter
- Laboratory Management
- Recommended changes of the current version of the German Rili-BAEK
- Analysis of a 6-year pilot external quality assurance survey of free light chain using Sigma metrics
- Allergy and Autoimmunity
- Different vitamin D status in common multiorgan autoimmune disease patients
- Investigation of the dual cascade algorithm in the diagnosis of antinuclear antibodies
- Neurology Laboratory
- Ischemia-modified albumin (IMA) and dynamic thiol-disulfide homeostasis in patients with postherpetic neuralgia
- Endocrinology
- Effect of hemoglobin F and A2 on hemoglobin A1c determined by cation exchange high-performance liquid chromatography
- Original Article
- A colorimetric method to measure oxidized, reduced and total glutathione levels in erythrocytes
- Short Communication
- Sparing the control arm using well-characterized diagnostic approaches – the Gart and Buck prevalence estimator for efficacy estimation in single-arm trials
- Letter to the Editor
- Ambiguous pharmacogenetic genotyping results in a patient with bone marrow transplantation