Abstract:
The DGKL Working Group Guide Limits (Arbeitsgruppe Richtwerte) has published a proposal for deriving permissible analytical uncertainty limits related to biological variation data. Reference intervals were used to estimate biological variation. Biological variation data as basis for permissible uncertainty limits are generally accepted. These concepts usually apply a fixed factor leading to unrealistic stringent limits for quantities with a relatively small biological variation and to very permissive limits for quantities with relatively large biological variation. The working group has suggested a non-linear relation between biological variation and permissible uncertainty limits. The new approach has been exemplified with 84 quantities listed in the RiliBÄK (official German guidelines). The algorithms published allowed to derive permissible limits for all quantitative measurands in laboratory medicine. After its publication, three supplements appear necessary: 1. additional specifications of standard uncertainty, 2. a discussion on permissible limits for diagnosis and monitoring purposes, and 3. a discussion on circular reasoning in our approach.
Zusammenfassung:
Die DGKL Arbeitsgruppe Richtwerte hat einen Vorschlag zur Ableitung zulässiger Messunsicherheit auf der Basis biologischer Varianz publiziert. Die biologische Varianz wurde aus den Referenzintervallen geschätzt. Biologische Varianzdaten als Basis von zulässigen Grenzen der Messunsicherheit sind allgemein akzeptiert. Die vorgeschlagenen Konzepte wenden fixe Faktoren an, die zu unrealistisch stringenten Grenzen für Messgrößen mit relativ kleiner biologischer Varianz und zu wenig stringenten Grenzen für Messgrößen mit großer biologischer Varianz führen. Die Arbeitsgruppe empfahl statt fixer Faktoren eine nicht-lineare Beziehung zwischen biologischer Varianz und zulässiger Messunsicherheit. Der neue Vorschlag der Arbeitsgruppe wurde an 84 Messgrößen getestet, die in der RiliBÄK aufgelistet sind. Die publizierten Algorithmen erlauben zulässige Messunsicherheiten für alle quantitativen Messgrößen der Laboratoriumsmedizin abzuleiten. Nach der Publikation erscheinen nun drei Ergänzungen erforderlich: 1. Zusätzliche Spezifikationen der Standard-Messunsicherheit, 2. eine Diskussion über zulässige Messunsicherheiten für diagnostische und monitoring Zwecke, und 3. über eine Zirkelargumentation in dem vorgeschlagenen Konzept.
Abbreviations: GUM, Guide to measurement uncertainty; MU, measurement uncertainty; CVA, analytical coefficient of variation; CVB, biological coefficient of variation; CVC, combined intra- and inter-individual variation; CVE, empirical CVB; CVE*empirical CVB derived of the logarithmic scale; CVG, inter-individual variation; CVI, intra-individual variation; sA, analytical standard deviation; psA, permissible analytical standard deviation; pCVA, permissible analytical CVA; sE, empirical standard deviation (linear scale from Gaussian distribution); sE,ln, empirical standard deviation (logarithmic scale); sE,lin, empirical standard deviation (linear scale from logarithmic scale); RI, reference interval; RL, reference limit; RL1, lower reference limit (RL2.5); RL2, upper reference limit (RL97.5); RiliBÄK, Richtlinie der Bundesärztekammer, official German guidelines; RMSD, root mean square of measurement deviation; TE, total error; uP, pre-examination uncertainty; uB, bias uncertainty; uS, standard uncertainty; uC, combined uncertainty; puC, permissible combined uncertainty; U, expanded uncertainty; pU, permissible expanded uncertainty.
Introduction
The DGKL Working Group Guide Limits (Arbeitsgruppe Richtwerte) recently has published a proposal for deriving permissible uncertainty limits of biological variation data [1]. Reference intervals were used to estimate biological variation.
Biological variation data as the basis for permissible uncertainty limits are generally accepted. These concepts usually apply a fixed factor leading to unrealistic stringent limits for quantities with a relatively small biological variation (e.g. plasma sodium) and to very permissive limits for quantities with relatively large biological variation (e.g. plasma triglycerides). To avoid these drawbacks, the working group has suggested a non-linear relation between biological variation and permissible uncertainty limits [1]. This proposal led to a compromise between the biological variation model and the state-of-the-art concept.
The new approach has been published in detail and exemplified with 84 quantities listed in the RiliBÄK [2]. The algorithms published allow to derive permissible limits for all quantitative measurands in laboratory medicine. The concept was based on intermediate imprecision. After its publication, three supplements appear necessary: 1. additional specifications of standard uncertainty, 2. a discussion on permissible limits for diagnosis and monitoring purposes, and 3. a discussion on circular reasoning in our approach.
Additional specifications of standard uncertainty
Recently, we published a proposal for permissible uncertainty of quantitative measurements in laboratory medicine [1]. One component of the uncertainty can be inferred from the statistical variation of the results of a series of measurements [category A of the Guide to Measurement Uncertainty (GUM) concept] [3]. In terms of laboratory medicine, it means the analytical imprecision and can be expressed by the experimental standard deviation. The term imprecision should be further specified.
Various international standard organizations [3–5] specified various types of standard uncertainty (imprecision) as summarized in Table 1. The maximum time span of repeatability is 1 day. Intermediate imprecision covers measurements from several days. The position of the control material in one run should always be the same, preferably before unknown samples are determined. If several control samples are measured during 1 day, only the first result should be included for calculating the imprecision. Further details have been published by Thompson [6].
ISO | Synonyms sometimes found in the literature |
---|---|
1. Repeatability | Immediate imprecision, within-run imprecision, short-term variability |
2. Intermediate imprecision | Within-laboratory imprecision, day-to-day imprecision |
3. Reproducibility | Inter-laboratory imprecision |
The imprecision has several operational sources of uncertainty including, e.g. pipetting and temperature controlling, variations caused by operators (mainly calibration and maintenance) and manufacturer caused variations. The imprecision is expressed as standard deviation s or coefficient of variation CV. Evaluating s and CV, different lengths of series are used. However, both s and CV are estimations with confidence intervals decreasing with an increasing amount of repeats (Figure 1). For n=20 measurements, the additive effect of the confidence interval decreases to about 10% and the effect of additional repeats is small. The choice of n=20 is a practical compromise followed by many medical laboratories worldwide.
![Figure 1: Confidence limits (95%) for an estimated standard deviation (broken lines), obtained by Monte Carlo simulation (20,000 replicates per n value).Limits were calculated as described in ref. [7], equation 6.102, p. 344). Horizontal axis: Number of values from which the standard deviation is calculated. Drawn line: mean estimated standard deviation.](/document/doi/10.1515/labmed-2015-0112/asset/graphic/j_labmed-2015-0112_fig_001.jpg)
Confidence limits (95%) for an estimated standard deviation (broken lines), obtained by Monte Carlo simulation (20,000 replicates per n value).
Limits were calculated as described in ref. [7], equation 6.102, p. 344). Horizontal axis: Number of values from which the standard deviation is calculated. Drawn line: mean estimated standard deviation.
Our proposal (1) was based on intermediate imprecision which is determined during one control cycle (e.g. 20 d×1=20 days, one measurement per day), with one instrument/analytical system and the same control material as also claimed by CLSI [5]. Several operators may be involved. Variations caused by operators are included, but variations caused by the manufacturer (e.g. between reagent lot variation) are excluded. However, they are tolerated as long as the permissible expanded uncertainty is not exceeded. If calibration and maintenance actions are undertaken only occasionally, the control cycle can be extended to, e.g. a 3-month period to include these effects on s. The longer series result in smaller confidence intervals estimating s. However, for series lengths above 20 this effect is small (Figure 1).
The quantities to be assured should be close to the most appropriate reference limits. That means more than one control material should be applied. The German RiliBÄK requires at least two materials for each measurand [2].
The imprecision as determined by control materials is an estimate of the actual imprecision designed for internal quality assurance purposes. It does not provide an estimate of the overall imprecision of the results received by the requesting physicians for materials taken from patients. For the latter purpose, the critical difference (also called reference change value) concept provides more meaningful information to the requester. Then, the uncertainty may be expressed as 1.64·(sA*+sI*)* [sA analytical standard deviation, sI intra-individual standard deviation].
If estimating imprecision, it is sometimes appropriate to eliminate suspect results which do not belong to the underlying distribution (outliers). Often, they are identified by subjective expertise. However, it is preferable to apply a statistical test. The literature contains a lot of more or less complicated outlier tests [8]. A simple and often used test was proposed by Youden [9]: elimination of all values outside mean±3s and recalculation of s (s=standard deviation). Probably, more reliable are the r-test [10, 11] or the Tukey Test [12].
Different permissible uncertainty limits for diagnostic and monitoring purposes?
Various concepts are suggested for uncertainty limits in laboratory medicine. Most proposals [13, 14] are based on biological variation (CVB). If CVB is multiplied by a fixed factor common to all measurements, quantities with a relatively small CVB (e.g. plasma sodium) lead to a permissible analytical uncertainty (pU%) which is unrealistic low under the present technology, and quantities with a relatively large CVB have a pU% which is too permissive (e.g. plasma triglycerides). Therefore, Fraser [14] proposed three different factors and we proposed even five different factors [15]. If the analytical uncertainty obtained by the majority of medical laboratories is presented in relation to CVB, different factors lead to artificial jumps (as indicated by vertical black lines in Figure 2). An alternative is to use a curve with a continuous relation between CVB and pU% [1, 16] as demonstrated by blue diamonds in Figure 2.
![Figure 2: The relation between empirical biological variation (CVE*), the permissible extended combined uncertainty pU% (diamonds) and the permissible RiliBÄK requirements (RMSD, root mean square of measurement deviation, brown rectangles) [1, 16].The straight blue lines represent the permissible uncertainty limits derived of CVE* by fixed factors (lower line: 0.25·CVE, middle line: 0.5·CVE*, upper line: 0.75·CVE*). CVE* means CVE calculated from ln values of the reference limits. For further explanation see ref. 1. The blue diamonds follow the equation pU%=2.39·(CVE*–0.25)0.5 [1].](/document/doi/10.1515/labmed-2015-0112/asset/graphic/j_labmed-2015-0112_fig_002.jpg)
The relation between empirical biological variation (CVE*), the permissible extended combined uncertainty pU% (diamonds) and the permissible RiliBÄK requirements (RMSD, root mean square of measurement deviation, brown rectangles) [1, 16].
The straight blue lines represent the permissible uncertainty limits derived of CVE* by fixed factors (lower line: 0.25·CVE, middle line: 0.5·CVE*, upper line: 0.75·CVE*). CVE* means CVE calculated from ln values of the reference limits. For further explanation see ref. 1. The blue diamonds follow the equation pU%=2.39·(CVE*–0.25)0.5 [1].
So far, no consensus exists which biological variation should be applied to derive permissible limits for uncertainty. Different CVB concepts have been described: CVI means intra-individual variation, CVG inter-individual variation, CVC combined intra- and inter-individual variation, and CVE empirical biological variation (CVC and CVA, analytical variation).
Several authors proposed that CVI may be applied for monitoring purposes and CVG for diagnostic purposes [14, 17]. Although this idea can be scientifically justified, it does not appear realistic under routine conditions. Usually most medical laboratories serve diagnostic and monitoring purposes. In most cases, the laboratory does not know for which purpose a test is requested. Therefore, only the monitoring purpose should be chosen because it is most restrictive. Then, the adequate CVB would be CVI. The relation between CVI and pU% is also non-linear. In Figure 3, the circles show a similar curved pattern than the rectangles in Figure 2. The curved behavior is demonstrated by a local regression line in Figure 3. Because the curves in Figures 2 and 3 are oriented on the technical reality (technical feasibility), the permissible U% obtained is similar. In this case, both CVB (CVI and CVE) can be used to derive permissible uncertainty limits.
![Figure 3: Intra-individual variation (CVI, data taken from ref. 18) vs. the permissible expanded uncertainty (RMSD of the RiliBÄK, Table B1a, column 3 [2]).The black line is the local regression.](/document/doi/10.1515/labmed-2015-0112/asset/graphic/j_labmed-2015-0112_fig_003.jpg)
Intra-individual variation (CVI, data taken from ref. 18) vs. the permissible expanded uncertainty (RMSD of the RiliBÄK, Table B1a, column 3 [2]).
The black line is the local regression.
We prefer to use the model based on CVE for several reasons [1]: it realizes a compromise between technical reality and biological variation, considers imprecision profiles, considers normal and skewed distributions, takes advantage of the fact that laboratories are obliged to use reference limits for all quantities determined either by establishing their own reference limits or to verify the transferability of reference limits chosen from outside sources. According to existing guidelines, all reference limits must be validated and periodically verified [4].
Because permissible limits can be derived from any CVB concept and should be oriented on the technical reality, the purpose for its application (monitoring or diagnostic) should be of secondary concern in our model.
Discussion of circular reasoning
A frequently raised critical comment on our proposal is that the combination of CVA and CVC in CVE leads to a circular reasoning. Although this argument is correct, it can be neglected if the analytical variation is small in relation to the biological variation. This is the case for most measurands in clinical chemistry (Table 2).
Influence of imprecision on the permissible standard uncertainty shown for several plasma quantities.
A | B | C | D | E | F | G | H | I | J | K |
---|---|---|---|---|---|---|---|---|---|---|
Quantity | Lower RL | Upper RL | CVE* | pCVA | CVC=CVE* (if pCVA=0) | CVA | CVE*, new (CVC+CVA) | pCVA, new | I minus E | J in % of E |
Sodium | 135 | 145 | 1.82 | 1.26 | 1.82 | 1.00 | 2.08 | 1.36 | 0.09522 | 7.56 |
AST | 10 | 35 | 32.8 | 5.71 | 32.8 | 3.50 | 32.99 | 5.72 | 0.01629 | 0.29 |
TSH | 0.5 | 2.5 | 42.8 | 6.52 | 42.8 | 5.00 | 43.09 | 6.55 | 0.02227 | 0.34 |
hCG | 0.75 | 5 | 51.4 | 7.15 | 51.4 | 7.00 | 51.87 | 7.19 | 0.03571 | 0.50 |
CRP | 1.5 | 10 | 51.4 | 7.15 | 51.4 | 7.00 | 51.87 | 7.19 | 0.03571 | 0.50 |
For details see ref. [1].
A relative large empirical biological variation was encountered for h-choriongonadotropine (hCG) and C-reactive protein with a CVE*=51.4 and a pCVA=7.15 [16]. Assuming a CVA=7.0, and CVc =CVE* and CVC+pCVA= (54.42+7.02)0.5=51.9. The corresponding pCVA value of 7.19 means a negligible increase (+0.5%).
With plasma sodium (a quantity with a relatively small biological variation), the increase of CVE* is 7.6% due to the non-linear relation mentioned above (Table 2). However, the pCVA derived of our model is equal to the permissible limit of the RiliBÄK [2].
Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
Research funding: None declared.
Employment or leadership: None declared.
Honorarium: None declared.
Competing interests: The funding organization(s) played no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the report for publication.
References
1. Haeckel R, Wosniok W, Gurr E, Peil B. Permissible limits for uncertainty of measurement in laboratory medicine. Clin Chem Lab Med 2015;53:1161–71.10.1515/cclm-2014-0874Search in Google Scholar PubMed
2. Richtlinie der Bundesaerztekammer zur Qualitätssicherung laboratoriumsmedizinischer Untersuchungen. Dt Aerzteblatt 2008;105:C301-13. www.aerzteblatt.de/plus1308.Search in Google Scholar
3. International vocabulary of basic and general terms in metrology (VIM), 3rd ed. JCGM 200:2008, http://www.bipm.org/vim.Search in Google Scholar
4. International Standard Medical laboratories – Requirements for quality and competence, ISO 15189-2012(E), 1–39.Search in Google Scholar
5. CLSI: Evaluation of precision of quantitative measurement procedures; approved guideline-third edition. CLSI document EP05-A3. Wayne, PA: Clinical and Laboratory Standard Institute; October 2014.Search in Google Scholar
6. Thompson M. Precision in chemical analysis: a critical survey of uses and abuses. Anal Methods 2012;4:1598–611.10.1039/c2ay25083gSearch in Google Scholar
7. Hedderich J, Sachs L. Angewandte Statistik, 14th ed. Berlin: Springer-Verlag, 2012:1–881.10.1007/978-3-642-24401-8Search in Google Scholar
8. Bablok W, Haeckel R, Meyers W, Wosniok W: Outlier detection. In: R.Haeckel, editor. Evaluation methods in laboratory medicine. Weinheim, New York: VCH publishers; 1993:227–38.Search in Google Scholar
9. Grant EL. Statistical quality control, 3rd ed. New York: McGraw-Hill Book Comp.; 1964:561–3.Search in Google Scholar
10. Gottschalk K. Statistical evaluation of investigation results. Z Analyt Chem 1968:243:793–801.Search in Google Scholar
11. Dixon WJ. Analysis of extreme values. Ann Math Statist 1950;21:488–506.10.1214/aoms/1177729747Search in Google Scholar
12. Tukey JW. Exploratory data analysis. Reading, MA: Addison Wesley, 1977:688.Search in Google Scholar
13. Cotlove E, Harris EK, Williams GZ. Biological and analytic components of variation in long-term studies of serum constituents in normal subjects. Clin Chem 1970;16:1028–32.10.1093/clinchem/16.12.1028Search in Google Scholar
14. Fraser CG. Biological variation: from principles to practice. Washington DC, AACC Press, 2001:1–151.Search in Google Scholar
15. Haeckel R, Wosniok W. A new concept to derive permissible limits for analytical imprecision and bias considering diagnostic requirements and technical state-of-the-art. Clin Chem Lab Med 2011;49:623–35.10.1515/CCLM.2011.116Search in Google Scholar PubMed
16. Permissible imprecision (pCVA) and combined uncertainty (pU%) for a particular measurand (xi). www.dgkl.de, assessed 01.10.2014.Search in Google Scholar
17. Oosterhuis WP, Sandberg S. Proposal for the modification of the conventional model for establishing performance specifications. Clin Chem Lab Med 2015;53:925–37.10.1515/cclm-2014-1146Search in Google Scholar PubMed
18. Ricos C, Garcia-Lario JV, Alvarez V, Cava F, Domenech M, Hernandez A, et al. 2008. www.westgard.com/guest17.htm, updated 2008.Search in Google Scholar
©2016 by De Gruyter
This article is distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Articles in the same Issue
- Frontmatter
- Allergie und Autoimmunität/Allergy and Autoimmunity / Redaktion: U. Sack/K. Conrad
- Qualitätskontrolle und Validierung in der diagnostischen Durchflusszytometrie
- Quality management in IgE-based allergy diagnostics
- Labordiagnostik bei systemischen Autoimmunerkrankungen
- Geriatrisches Labor/Geriatrics Laboratory / Redaktion: P. Schuff-Werner
- Surrogatmarker der Insulinresistenz bei Studienteilnehmern mit metabolischem Syndrom – Daten der Berliner Altersstudie II
- Neurologisches Labor/Neurological Laboratory / Edited by: H. Tumani/U.K. Zettl
- Expression of interferon type-I receptor isoforms, clinical response and development of neutralizing antibodies in multiple sclerosis patients – results of a prospective study
- Fallbericht/Case Report
- Labordiagnostik bei der monoklonalen Gammopathie unklarer Signifikanz (MGUS)
- Originalartikel/Original Articles
- Comparison between capillary glucose measured with a Contour glucometer and plasma glucose in a population survey
- Supplements to a recent proposal for permissible uncertainty of measurements in laboratory medicine
- Butyrylcholinesterase as an additional marker in the diagnostic network of acute myocardial infarction
- Buchbesprechung/Book Review
- Molekulare Allergiediagnostik
Articles in the same Issue
- Frontmatter
- Allergie und Autoimmunität/Allergy and Autoimmunity / Redaktion: U. Sack/K. Conrad
- Qualitätskontrolle und Validierung in der diagnostischen Durchflusszytometrie
- Quality management in IgE-based allergy diagnostics
- Labordiagnostik bei systemischen Autoimmunerkrankungen
- Geriatrisches Labor/Geriatrics Laboratory / Redaktion: P. Schuff-Werner
- Surrogatmarker der Insulinresistenz bei Studienteilnehmern mit metabolischem Syndrom – Daten der Berliner Altersstudie II
- Neurologisches Labor/Neurological Laboratory / Edited by: H. Tumani/U.K. Zettl
- Expression of interferon type-I receptor isoforms, clinical response and development of neutralizing antibodies in multiple sclerosis patients – results of a prospective study
- Fallbericht/Case Report
- Labordiagnostik bei der monoklonalen Gammopathie unklarer Signifikanz (MGUS)
- Originalartikel/Original Articles
- Comparison between capillary glucose measured with a Contour glucometer and plasma glucose in a population survey
- Supplements to a recent proposal for permissible uncertainty of measurements in laboratory medicine
- Butyrylcholinesterase as an additional marker in the diagnostic network of acute myocardial infarction
- Buchbesprechung/Book Review
- Molekulare Allergiediagnostik