Startseite Before defining performance criteria we must agree on what a “qualitative test procedure” is
Artikel Öffentlich zugänglich

Before defining performance criteria we must agree on what a “qualitative test procedure” is

  • Gunnar Nordin EMAIL logo
Veröffentlicht/Copyright: 16. April 2015
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

The term “qualitative test” is ambiguous and should not be used for nominal (classification) or ordinal (grading) tests. Characteristics for nominal and ordinal scale test results, such as traceability and uncertainty, remains to be established before general performance criteria can be agreed upon. For ordinal binary test with a quantitative back ground scale an assay could be characterized with the three quantities “C5”, “C50”, and “C95”. The C50-value, or the equivalence point, ought to be declared by the manufacturers of diagnostic products. For the correct understanding and communication of results from ordinal and nominal tests the way of expressing the results also need to be harmonized.

What is a qualitative test?

Rapid tests of various types such as screening tests for drugs of abuse or for Strep A antigen are numerously performed laboratory investigations. Sometimes these types of tests are called semi-quantitative and sometimes qualitative. Both terms are ambiguous. The International Union of Pure and Applied Chemistry (IUPAC) defines qualitative analysis as tests for classification of nominal properties; “Analysis in which substances are identified or classified on the basis of their chemical or physical properties, such as …” [1]. This differs from the Clinical and Laboratory Standards Institute (CLSI) use of qualitative tests for “… methods that provide only categorical responses (i.e., positive/negative or yes/no)” [2].

Another way to categorize these types of examinations is according to the results, which can be either nominal (a result of a classification) or ordinal (a grading). Nominal and ordinal values can be very accurately defined, and are thus not by definition semi-quantitative in the sense that the results have a large uncertainty. Therefore, the use of the term semi-quantitative is not recommended. Nominal and ordinal values are not restricted to binary scale examinations. A urine test strip for albumin is an example of a “multinary” ordinal examination procedure with a possible outcome of more than two alternative values.

Results for an ordinal quantity are measurement results [3], although the results have no measurement units. Ordinal values are ordered in some way by magnitude. This excludes, e.g., alphabetical ordering of the values. The values “yes” and “no”, included in the CLSI use above, cannot be ordered by magnitude, and are therefore not ordinal values. The results “positive” and “negative” are usually regarded as ordered by magnitude (or 0/1), in the sense that positive implies more of something, e.g., a concentration above a certain cut-off limit. A pregnancy test might be either negative or positive, depending on the concentration of hCG in a urine sample. The results “negative” and “positive” can be interpreted as ordinal values, e.g., <5 mIU/L and >5 mIU/L, respectively.

Results with the binary outcome yes/no are often nominal. Results from a PCR test that detects a specified mutation, e.g., for BCR-ABL1 gene can be reported with “yes” (with the interpretation “…gene is present”) or “no” (with the meaning the “…gene is not present”), without any respect to a possible concentration of the gene. In this case the outcome of the test is not an ordinal measurement but a classification, or a nominal examination [4]. One characteristic of these types of nominal examination results, e.g., for the appearance of gene or classification of a disease, is that values have no individual biological variation. The results are discrete and a true value in between “yes” and “no” is not possible.

Urine test strips is a classic example of “multinary” ordinal test. The possible test values are categorized in multiple arbitrary scale steps with result classes without measurement units [5], although the name of the scale steps may mimic measurement units.

Measurement characteristics of ordinal tests

The concepts of precision, trueness, uncertainty and traceability for ordinal and nominal examination results are not clearly defined. The counterpart to the metrological terminology and description of measurement traceability and uncertainty of results for ratio scale results that is given in ISO 17511:2003 [6] remains to be developed for nominal and ordinal results. Uncertainties for ordinal and nominal examinations might be expressed as probabilities of correct or false results, which are concepts that can be understood for a set of results, e.g., 1000 samples, but is more difficult to understand in relation to a single positive or negative result.

For ordinal examination results with a quantitative, often ratio type, background scale we also lack a clear definition of the detection limit under which most of the results are expected to be negative, and an upper concentration limit above which most test results are expected to be positive. In the interval between these two limits either negative or positive results can be expected by chance. This interval might be termed the gray zone, unreliability region [7], or the “C5–C95 interval” as suggested by CLSI.

C5 describes the concentration below which <5% of the results are positive, C95 is the concentration above which >95% results are positive when many replicates are measured.

The C50-value, the equivalence point or the cut-off level, is the concentration where half of the test results become positive and half negative. One method to estimate the equivalence point has been developed through rankit transformation [8] of the accumulated frequency of positive results in relation the logarithmic concentration. The same principles can also be applied to calculate C5–C95 interval. The C50-value has the benefit, compared to the C5- and C95-values, that it can be estimated with higher precision. For statistical reasons a much larger number of measurements is needed in order to estimate C5 and C95 with certainty.

When the C5–C95 interval is narrow, the ordinal test is sometime called more precise. For tests with multiple scale steps a C5–C95 interval can be described for each of the steps. For a typical urine test strip the C5–C95 interval for each step covers the range of concentration corresponding to several scale steps [5]. For multiple scale steps to be meaningful the test must be precise enough with narrow C5–C95 interval.

As mentioned above traceability and other metrological terms for nominal and ordinal results have not yet been clearly defined. For ordinal results the relation between the C50-value and a reference method value seems to be an important part of the description of the traceability for the ordinal results. Ordinal measurement trueness can be described as closeness of agreement between the equivalence point for a binary ordinal test and a reference quantity value.

Analytical performance goals for ordinal tests

Internal quality assessment (IQA) consists of samples for which the expected results are known and predictable. For ordinal tests the internal quality control therefore must consist of samples with concentrations outside the C5–C95 interval. Samples with concentrations within this interval will give results that are unpredictable by definition. Therefore these are not useful as internal quality control samples, for the purpose of a check that the measurement procedure is under control.

For samples used in external quality assessment (EQA) the values must be unknown for the participants before examination. Performance for samples with concentrations outside the C5–C95 interval is a measure of performance of the individual laboratory or participant. However, contrary to the case with IQA, results for samples within the C5–C95 interval are also informative in the context of EQA. The outcome of these allows the estimation of the C50 value for a specific test procedure, which is often an in vitro diagnostic product on the market. In this way EQA for an ordinal test becomes a measure of ordinal trueness for the procedure.

In addition to estimation of trueness, the robustness and selectivity must be challenged in EQA. The analytical specificity should be checked either by using native sample materials, with natural occurrences of molecular species and antigens other than the target analyte, for which it is important that the test is specific.

Analytical performance goals for ordinal scale test procedures must be set according to the intended use of the procedure. When the test is used for a screening purpose, and a positive test will be confirmed with a more specific secondary test, then the performance goal for the screening procedure can be a very high sensitivity. In the case of a screening test for drugs of abuse, the cut-off limit and other performance criteria might be set for legal reasons.

In other situations an ordinal scale test is primarily used to exclude a positive result. This might be the case for a Strep A antigen rapid detection test, that is used with the purpose to avoid antibiotic treatment for patients with sore throat, but without group A Streptococcus antigen. In this situation a high specificity is preferred to a high sensitivity. A negative result should be reliable.

It is usually not possible to base performance goal for ordinal test on biological variation. The gray zone, or the C5–C95 interval, is probably larger than the biological variation for many measurands.

The performance goals have so far probably been based mainly on state-of-the-art for ordinal scale tests, e.g., urine test strips. With the future technological development the performance of many ordinal scale tests can be expected to improve.

Another area in need of standardization is the mode of reporting ordinal test results. Results for test strips are today reported in several different ways: positive/negative, or 0/1, –/+/++, “approx. 10–20” etc. These multiple ways to report the same result is confusing and leads to risk for misinterpretation when results are registered in health records and communicated.

Analytical performance goals for nominal tests

Performance goals for nominal examination procedures must be set with respect to the type of test. For genetic tests and, e.g., blood group determination, a negligible minimal examination uncertainty is essential for the proper use of the results. For other types of nominal examinations, e.g., cell classification based on morphologic appearance, the uncertainty might be considerable.

Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

Financial support: None declared.

Employment or leadership: None declared.

Honorarium: None declared.

Competing interests: The funding organization(s) played no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the report for publication.


Corresponding author: Gunnar Nordin, Equalis, Uppsala, Sweden, E-mail:

References

1. IUPAC Gold Book. Available from: http://goldbook.iupac.org/. Accessed November, 2014.Suche in Google Scholar

2. CLSI. User protocol for evaluation of qualitative test performance: approved guideline, 2nd ed. EP12-A2. Wayne, PA: CLSI, 2008.Suche in Google Scholar

3. JCGM. International vocabulary of metrology – basic and general concepts and associated terms (VIM), 2008. JGGM 200:2008.Suche in Google Scholar

4. Nordin G, Dybkaer R, Forsum U, Fuentes-Arderiu X, Schadow G, Pontet F, on behalf of the IFCC-IUPAC Committee on Nomenclature, Properties, and Units (C-NPU). An outline for a vocabulary of nominal properties and examinations – basic and general concepts and associated terms. Clin Chem Lab Med 2010;48:1553–66.10.1515/CCLM.2010.325Suche in Google Scholar PubMed

5. Hyltoft Petersen P, Gade Christensen N, Sandberg S, Nordin G, Pedersen M, Nordic Control Organizations. How to deal with semi-quantitative tests? Application of an ordinal scale model to measurements of urine glucose. Scand J Clin Lab Invest 2009;69:662–72.10.3109/00365510902968756Suche in Google Scholar PubMed

6. ISO 17511:2003 In vitro diagnostic medical devices – measurement of quantities in biological samples – metrological traceability of values assigned to calibrators and control materials. Geneva: ISO, 2003.Suche in Google Scholar

7. Ríos A, Barceló D, Buydens L, Cárdenas S, Heydorn K, Karlberg B, et al. Quality assurance of qualitative analysis in the framework of the European project ‘MEQUALAN’. Accred Qual Assur 2003;8:68–77.10.1007/s00769-002-0556-xSuche in Google Scholar

8. Hyltoft Petersen P, Gade Christensen N, Sandberg S, Nordin G, Pedersen M, Nordic Control Organizations. How to deal with dichotomous tests? Application of a rankit ordinal scale model with examples from the Nordic ordinal scale project on screening tests. Scand J Clin Lab Invest 2008;68:298–311.10.1080/00365510701742592Suche in Google Scholar PubMed

Received: 2015-1-2
Accepted: 2015-3-12
Published Online: 2015-4-16
Published in Print: 2015-5-1

©2015 by De Gruyter

Artikel in diesem Heft

  1. Frontmatter
  2. Editorial
  3. Defining analytical performance specifications 15 years after the Stockholm conference
  4. Consensus Statement
  5. Defining analytical performance specifications: Consensus Statement from the 1st Strategic Conference of the European Federation of Clinical Chemistry and Laboratory Medicine
  6. Opinion Papers
  7. The 1999 Stockholm Consensus Conference on quality specifications in laboratory medicine
  8. Setting analytical performance specifications based on outcome studies – is it possible?
  9. Performance criteria based on true and false classification and clinical outcomes. Influence of analytical performance on diagnostic outcome using a single clinical component
  10. Analytical performance specifications based on how clinicians use laboratory tests. Experiences from a post-analytical external quality assessment programme
  11. Rationale for using data on biological variation
  12. Reliability of biological variation data available in an online database: need for improvement
  13. A checklist for critical appraisal of studies of biological variation
  14. Optimizing the use of the “state-of-the-art” performance criteria
  15. Are regulation-driven performance criteria still acceptable? – The German point of view
  16. Performance criteria for reference measurement procedures and reference materials
  17. Performance criteria for combined uncertainty budget in the implementation of metrological traceability
  18. How to define a significant deviation from the expected internal quality control result
  19. Analytical performance specifications for EQA schemes – need for harmonisation
  20. Proposal for the modification of the conventional model for establishing performance specifications
  21. Before defining performance criteria we must agree on what a “qualitative test procedure” is
  22. Performance criteria and quality indicators for the pre-analytical phase
  23. Performance criteria of the post-analytical phase
Heruntergeladen am 2.12.2025 von https://www.degruyterbrill.com/document/doi/10.1515/cclm-2015-0005/html
Button zum nach oben scrollen