Home Medicine Statistical pearls: Importance of effect-size, blinding, randomization, publication bias, and the overestimated p-values
Article Publicly Available

Statistical pearls: Importance of effect-size, blinding, randomization, publication bias, and the overestimated p-values

  • Harald Breivik EMAIL logo , Leiv Arne Rosseland and Audun Stubhaug
Published/Copyright: October 1, 2013
Become an author with De Gruyter Brill

In this issue of the Scandinavian Journal of Pain, our associate editor for medical statistics, Professor Eva Skovlund focuses on some of the important, and frequently misunderstood, basic issues of statistical analyses of medical research data [1].

1 The over-rated p-value is often misunderstood

p-values are always there in scientific publications from medical researchers. They impress reviewers, editors, and readers alike, more than they deserve. A low p-value is taken as a “proof” of the truth of the research-question, but the p-value only indicates the likelihood of finding the observed difference between treatments by random/by chance, even if there is no difference. Still, the observed difference can be of limited clinical relevance, in spite of a low p-value [1].

2 Effect-size and clinical significance

As Eva Skovlund writes in her review [1]: “a p-value does neither assess the size of an effect as such nor whether a statistically significant result is of any clinical relevance”.

For clinically meaningful information from the statistical analyses, the size of the effect and its 95% confidence interval are needed.

The effect-size is the difference between the observed value(s) and the expected value(s). If the null-hypothesis is true, i.e. that there is no difference between the therapies, the effect-size will be zero. Therefore, the effect-size should always be included in clinical trial reports [1].

It is important that the observed effect size is valid (only) for the population studied, with its baseline severity of illness, such as pain intensity. Thus, the results and the effect size may not be the same in a different study population or in a population with less or more severe illness (pain).

3 Pre-study estimate of sample size

The clinician knows the size of the effect that is interesting and relevant for the new intervention compared with standard treatment or placebo. With this knowledge, it is possible to estimate the number of patients needed in a trial in order to find such a difference in effect with reasonable certainty. A power of about 80 will ensure that if the researcher repeats the trial 10 times, a statistically significant difference will be detected in 8 of the 10 trials.

It is not always easy to define one primary outcome-measurement on which value the sample-size estimate should be based. Sometimes the outcome of interest is repeated measurements of a variable. Statistical methods based on regression or multi level modelling, have become more common in clinical pain trials [2, 3]. Multi level modelling is powerful, but sample size calculations are more complicated because one has to take into consideration the correlation within the participating individuals.

4 Ethical aspects of planning clinical trials

It is ethically unacceptable to expose patients to possible risks of new interventions in a clinical trial of insufficient size. A trial unable to detect a real difference between treatment groups due to poor planning and group-size calculation is not only unethical but also a waste of resources. An Ethical Research Board cannot approve a clinical trial protocol that has no estimate of the sample size that is needed to find a difference between treatments if in fact a clinically important difference is expected. We agree with Eva Skovlund that an Ethical Research Board also should evaluate the planned statistical methods [1]. Missing values are the rule in clinical research, and she describes clearly important risks of biased analyses because of missing values [1]. This loss of validity of outcome-estimates can, in addition to the strategies outlined by Eva Skovlund, be reduced by using multi level modelling since all participants will be included, even if some values are missing [2].

Another ethical issue in clinical pain intervention trials is the patients’ baseline pain intensity. It is a striking observation that numerous pain studies attempt to measure pain relief when there is no certainty about the level of pain that the treatment is expected to reduce. A trial giving pain-relieving medication before surgery for postoperative pain is bound to include many patients who in fact do not need any pain medication after surgery. It is documented that including only patients with relevant (high enough) baseline pain-intensity will reduce the variance, and because the necessary sample size is directly related to the variance [1] the necessary sample size will be much smaller [4]. Agood example of the importance of including only patients with sufficient baseline pain is the series of trials of pain relief from intra-articular morphine administered after knee arthroscopy only when the patients reported postoperative pain [5]. All other such studies had administered the morphine (or placebo) into the knee-joint at wound-closure to all patients and therefore included a majority of patients with no pain or only minor pain after arthroscopic procedures (see next paragraph) [3].

5 Publication bias is a major problem, and false medical “truths” based on publication bias are difficult to erase

Small trials that by chance, find spurious positive (p <0.05!) outcome-differences are more often accepted by reviewers and editors of scientific journals than “negative” outcome studies. This publication bias has a tendency to grow and cause false medical “truths”. An illustrative example is the result of many small studies falsely concluding that a dose of morphine injected into the knee joint at the end of the procedure caused prolonged postoperative analgesia. Editors, who believed that this was a fact, accepted more such false positive trial reports for publication. Researchers, who did not find positive outcome, thought maybe that they had done something wrong in their trial, and they did not bother to submit a trial report. Alternatively, if they did submit a manuscript, their trial reports were likely to be rejected by reviewers and editors.

In a systematic review where scientific quality and group size were emphasized, we found no evidence of analgesic effect of saline with 0.5–5 mg morphine into the knee joint, compared with saline alone [3]. Interestingly, a new meta-analysis (August 2013) concludes that there might be a positive effect [6]. However, the number of well designed trials with sufficient sample size is almost the same as 10 years ago and only one trial included patients with sufficient baseline pain [7]. This illustrates how difficult it is to remove a “false” medical truth, created by small-sample trials with large type I errors and publication bias – see also Hopewell et al. [15].

6 Obligations of journal editors to publish “negative” study-reports

This is one reason that the editors of the Scandinavian Journal of Pain accept for publication well planned, well conducted, and correctly analyzed trials with “negative” findings, i.e. they have to accept the truth of the null-hypothesis [8, 9]. They can be even more important than trials that reject their null-hypothesis and report a positive trial outcome.

7 Blinding is necessary for documenting specific treatment effect

The context-sensitive therapeutic effect (often called a “placebo”-effect), is a powerful positive effect when patients with a long-lasting pain-condition are treated well in a team of dedicated nurses, physiotherapists, and pain doctors (“multidisciplinary pain management”). This context-sensitive effect goes on and on, as long as patients and treatment providers are convinced that what is being done is helpful (see also Breivik et al. [14]).

Comparing baseline-data with outcome-data can thus give low p-values, even if the treatment effect is mostly due to unspecific (but good) context-sensitive effects. This is the conclusion of Stephen Morley when estimating outcomes of cognitive-behavioural-therapy (CBT) for chronic pain [10].

Therefore, to be able to show any specific pain-relieving effect of an intervention or a new drug, it is mandatory to have watertight blinding of patients as well as the team involved in outcome assessment. If possible, all treatment providers should also be blinded.

8 Randomization, when done correctly, reduces selection bias

Conscious or unconscious selection bias can be reduced by randomization of patients to standard treatment (or placebo) and new treatment. However, there are “true” randomizations and there are “pseudo”-randomizations. An example of pseudo-randomization, that still occurs, is to give standard treatment on even-numbered dates, new treatment on odd-numbered dates [11]. This means that the researchers selecting candidates for inclusion (consciously or unconsciously) may include “good-responders” on days when the new treatment is given.

Descriptive statistics are needed for demographics and basic variables such as age, gender, weight etc. However, when p-values are printed for differences between groups, one may suspect that the authors do not trust their randomization-procedure [1].

9 Why placebo-treatment is ethically acceptable in pain trials

If two active treatments are compared and the tests find no difference between the treatment, this is not a proof of equivalence. First, we do not know if the study was able to find any difference at all. Including a placebo group allows the demonstration of study sensitivity when the analysis shows that active treatment differs from placebo. Secondly, the difference between a gold standard and placebo can serve as a yardstick in the evaluation of the new treatment [12, 13].

Patients, who do not respond, will soon enough receive a rescue analgesic. Therefore, it is good ethical trial practice to include placebo-treatment in pain studies.

10 More statistical pearls made easy

In her important review Eva Skovlund explains several more important statistical issues such as non-inferiority trials, multiple testing, what to do with missing values, intention to treat-analyses, and more [1]. This review should be an obligatory piece for all novice clinical researchers. We are sure the many-seasoned clinical researcher also will benefit from reading this paper.


DOI of refers to article: http://dx.doi.Org/10.1016/j.sjpain.2013.07.023



Department of Pain Management and Research, Rikshospitalet, Oslo University Hospital, PB 4950 Nydalen, 0424 Oslo, Norway. Tel.: +47 23073691; fax: +47 23073690; mobile: +47 95865323

  1. Conflict of interest

    No conflict of interests declared.

References

[1] Skovlund E. Significance tests in clinical research – challenges and pitfalls. Scand J Pain 2013;4:220–3.Search in Google Scholar

[2] Ma Ys, Mazumdar M, Memtsoudis SG. Beyond repeated-measures analysis of variance: advanced statistical methods for the analysis of longitudinal data in anesthesia research. Reg Anesth Pain Med 2012;37:99–105.Search in Google Scholar

[3] Rosseland LA. No evidence for analgesic effect of intra-articular morphine after knee arthroscopy: a qualitative systematic review. Reg Anesth Pain Med 2005;30:83–98.Search in Google Scholar

[4] Breivik EK, Haanaes HR, Barkvoll P. Upside assay sensitivity in a dental pain model. Eur J Pain 1998;2:179–86.Search in Google Scholar

[5] Rosseland LA, Stubhaug A, Sandberg L, Breivik H. Intra-articular (IA) catheter administration of postoperative analgesics. A new trial design allows evaluation of baseline pain, demonstrates large variation in need of analgesics, and finds no analgesic effect of IA ketamine compared with IA saline. Pain 2003;104:25–34.Search in Google Scholar

[6] Chao Zeng C, Gao S, Cheng L, Luo W, Li Y, Tu M, Tian J, Xu M, Zhang F, Jiang W, Wei L, Lei G. Single-dose intra-articular morphine after arthroscopic knee surgery: a meta-analysis of randomized placebo-controlled studies. Arthroscopy 2013;1:1–11.Search in Google Scholar

[7] Solheim N, Rosseland LA, Stubhaug A. Intra-articular morphine 5 mg after knee arthroscopy does not produce significant pain relief when administered to patients with moderate to severe pain via an intra-articular catheter. Reg Anesth Pain Med 2006;31:506–13.Search in Google Scholar

[8] Kontinen V, Kalso E. Why we are proud to publish well-performed negative clinical studies? Scand J Pain 2013;4:15–6.Search in Google Scholar

[9] Breivik H, Stubhaug A, Hals EKB, Rosseland LA. Why we publish negative studies prescriptions on how to do clinical pain trials well. Scand J Pain 2010;1: 98–9.Search in Google Scholar

[10] Morley S. Efficacy and effectiveness of cognitive behaviour therapy for chronic pain: progress and some challenges. Pain 2011;152(Suppl. 1): S99–106.Search in Google Scholar

[11] Bernard SA, Gray TW, Buist MD, Jones BM, Silvester W, Gutteridge G, Smith K. Treatment of comatose survivors of out-of-hospital cardiac. Arrest with induced hypothermia. N Engl J Med 2002;346:557–63.Search in Google Scholar

[12] Stubhaug A, Breivik H. Clinical trials: acute and chronic pain. In: Breivik H, Campbell WI, Nicholas MK, editors. Clinical Pain Management – Practice and Procedures. 2nd ed. London: Hodder Arnold; 2008. p. 514–28.Search in Google Scholar

[13] Breivik Hals EK. Clinical trials: dental pain. In: Breivik H, Campbell WI, Nicholas MK, editors. Clinical Pain Management – Practice and Procedures. 2nd ed. London: Hodder Arnold; 2008. p. 529–37.Search in Google Scholar

[14] Breivik H, Ljossa TM, Stengaard-Pedersen K, Persson J, Aro H, Villumsen J, Tvinnemose D. A 6-months, randomised, placebo-controlled evaluation of efficacy and tolerability of a low-dose 7 day buprenorphine transdermal patch in osteoarthritis participants naive to potent opioids. Scand J Pain 2010;1:122–41.Search in Google Scholar

[15] Hopewell S, Loudon K, Clarke MJ, Oxman AD, Dickersin K. Publication bias in clinical trials due to statistical significance or direction of trial results (review). Cochrane Library 2009;(1).Search in Google Scholar

Published Online: 2013-10-01
Published in Print: 2013-10-01

© 2013 Scandinavian Association for the Study of Pain

Articles in the same Issue

  1. Editorial comment
  2. Chronic pain – The invisible disease? Not anymore!
  3. Clinical pain research
  4. New objective findings after whiplash injuries: High blood flow in painful cervical soft tissue: An ultrasound pilot study
  5. Editorial comment
  6. Chronic pain is strongly associated with work disability
  7. Observational studies
  8. Chronic pain: One year prevalence and associated characteristics (the HUNT pain study)
  9. Editorial comment
  10. Pain rehabilitation in general practice in rural areas? It works!
  11. Clinical pain research
  12. Effectiveness of multidisciplinary rehabilitation treatment for patients with chronic pain in a primary health care unit
  13. Editorial comment
  14. Mirror-therapy: An important tool in the management of Complex Regional Pain Syndrome (CRPS)
  15. Topical review
  16. Mirror therapy for Complex Regional Pain Syndrome (CRPS)—A literature review and an illustrative case report
  17. Editorial comment
  18. New insight in migraine pathogenesis: Vasoactive intestinal peptide (VIP) and pituitary adenylate cyclase-activating polypeptide (PACAP) in the circulation after sumatriptan
  19. Original experimental
  20. Vasoactive intestinal peptide (VIP) and pituitary adenylate cyclase-activating polypeptide (PACAP) in the circulation after sumatriptan
  21. Editorial comment
  22. Statistical pearls: Importance of effect-size, blinding, randomization, publication bias, and the overestimated p-values
  23. Topical review
  24. Significance tests in clinical research—Challenges and pitfalls
  25. Editorial comment
  26. Biomarkers of pain – Zemblanity?
  27. Topical review
  28. Mechanistic, translational, quantitative pain assessment tools in profiling of pain patients and for development of new analgesic compounds
  29. Editorial comment
  30. Chronic Benign Paroxysmal Positional Vertigo (BPPV): A possible cause of chronic, otherwise unexplained neck-pain, headache, and widespread pain and fatigue, which may respond positively to repeated particle repositioning manoeuvres (PRM)
  31. Observational studies
  32. Pain and other symptoms in patients with chronic benign paroxysmal positional vertigo (BPPV)
  33. Editorial comment
  34. The most important step forward in modern medicine, “a giant leap for mankind”: Insensibility to pain during surgery and painful procedures
  35. Topical review
  36. In praise of anesthesia: Two case studies of pain and suffering during major surgical procedures with and without anesthesia in the United States Civil War-1861–65
  37. Editorial comment
  38. Intravenous non-opioids for immediate postop pain relief in day-case programmes: Paracetamol (acetaminophen) and ketorolac are good choices reducing opioid needs and opioid side-effects
  39. Clinical pain research
  40. Intravenous acetaminophen vs. ketorolac for postoperative analgesia after ambulatory parathyroidectomy
  41. Editorial comment
  42. Scandinavian Association for the Study of Pain 2013—Annual scientific meeting abstracts of pain research presentations and greetings from incoming President
  43. Abstracts
  44. Why does the impact of multidisciplinary pain management on quality of life differ so much between chronic pain patients?
  45. Abstracts
  46. Health care utilization in chronic pain—A population based study
  47. Abstracts
  48. Pain treatment in rural Ghana—A qualitative study
  49. Abstracts
  50. Pain psychology specialist training 2012–2014
  51. Abstracts
  52. Pain assessment, documentation, and management in a university hospital
  53. Abstracts
  54. Promising effects of donepezil when added to patients treated with gabapentin for neuropathic pain
  55. Abstracts
  56. A pediatric patients’ pain evaluation in the emergency unit
  57. Abstracts
  58. Proteomic analysis of cerebrospinal fluid gives insight into the pain relief of spinal cord stimulation
  59. Abstracts
  60. The DQB1(*)03:02 HLA haplotype is associated with increased risk of chronic pain after inguinal hernia surgery and lumbar disc herniation
  61. Abstracts
  62. On the pharmacological effects of two lidocaine concentrations tested on spontaneous and evoked pain in human painful neuroma: A new clinical model of neuropathic pain
  63. Abstracts
  64. The mineralocorticoid receptor antagonist spironolactone enhances morphine antinociception
  65. Abstracts
  66. Expression of calcium/calmodulin-dependent protein kinase II in dorsal root ganglia in diabetic rats 6 months and 1 year after diabetes induction
  67. Abstracts
  68. Histamine in the locus coeruleus attenuates neuropathic hypersensitivity
  69. Abstracts
  70. Pronociceptive effects of a TRPA1 channel agonist methylglyoxal in healthy control and diabetic animals
  71. Abstracts
  72. Human inducible pluripotent stem cell-derived sensory neurons express multiple functional ion channels and GPCRs
Downloaded on 26.12.2025 from https://www.degruyterbrill.com/document/doi/10.1016/j.sjpain.2013.08.001/html?lang=en
Scroll to top button