Home Could generative artificial intelligence replace fieldwork in pain research?
Article Open Access

Could generative artificial intelligence replace fieldwork in pain research?

  • Suzana Bojic EMAIL logo , Nemanja Radovanovic , Milica Radovic and Dusica Stamenkovic
Published/Copyright: March 7, 2024
Become an author with De Gruyter Brill

Abstract

Background

Generative artificial intelligence (AI) models offer potential assistance in pain research data acquisition, yet concerns persist regarding data accuracy and reliability. In a comparative study, we evaluated open generative AI models’ capacity to acquire data on acute pain in rock climbers comparable to field research.

Methods

Fifty-two rock climbers (33 m/19 f; age 29.0 [24.0–35.75] years) were asked to report pain location and intensity during a single climbing session. Five generative pretrained transformer models were tasked with responses to the same questions.

Results

Climbers identified the back of the forearm (19.2%) and toes (17.3%) as primary pain sites, with reported median pain intensity at 4 [3–5] and median maximum pain intensity at 7 [5–8]. Conversely, AI models yielded divergent findings, indicating fingers, hands, shoulders, legs, and feet as primary pain localizations with average and maximum pain intensity ranging from 3 to 4.4 and 5 to 10, respectively. Only two AI models provided references that were untraceable in PubMed and Google searches.

Conclusion

Our findings reveal that, currently, open generative AI models cannot match the quality of field-collected data on acute pain in rock climbers. Moreover, the models generated nonexistent references, raising concerns about their reliability.

1 Introduction

Since their introduction a year ago, open generative artificial intelligence (AI) models have had a tremendous but controversial impact on science [1]. They are the first tool available to the general public, including the scientific community, which were trained through crowdsourcing, web crawling and scraping, public datasets, user-generated content, and customer data [2], giving them access to unprecedented amount of diverse data. Although generative AI models may not be able to access information from the internet in real time, they have had access to it within a relatively close time limit up until now, during their training.

Given this fact, we pondered if generative AI could be used for data acquisition in pain research. Data acquisition is often the most time-consuming, labor-intensive, and costly aspect of any research. While using generative AI to access data holds the potential to save significant resources, concerns persist regarding the accuracy and reliability of the data obtained in this manner.

In this study, we aimed to assess if currently available open generative AI models are able to produce data on the intensity and location of acute pain in rock climbers of similar quality to data collected through standard field research.

2 Methods

This comparative study was conducted as a two-step process.

First, we performed a small cross-sectional study involving all consenting rock climbers in the field researchers’ area of residence. This part of the research was approved by an Institutional Ethics Board (decision No. 01-49/1-2022). A written informed consent was obtained from all subjects. Climbers were asked to provide detailed reports on the primary location and maximum and average intensity of pain they experienced during a single indoor training session. Average and maximum pain levels were assessed using an 11-point numeric rating scale.

Second, we employed five open AI deep learning models developed using the generative pretrained transformer (GPT) architecture. The authors used all AI models that were available within their country of residence at the time of writing. These models were instructed to provide responses to the same questions as those presented to rock climbers, so that data on localization should be presented as percentages and data on pain intensity as mean ± SD or median and interquartile range. AI models were also instructed to disclose the source of the data.

2.1 Statistical analysis

Statistical analysis was performed in SPSS Statistics Version 25 software (SPSS Inc., Chicago, IL, USA). The normality of the data was assessed with the Kolmogorov–Smirnov test. The data are presented as median and 25th–75th quartiles or frequencies. AI models were asked to report results on pain intensity as median and 25th–75th quartiles or frequencies or mean and standard deviation.

3 Results

The study included 52 rock climbers (33 males and 19 females, with an average age of 29.0 [24.0–35.75] years) with 2 [1–6] years of climbing experience. They identified the back of the forearm (19.2%), toes (17.3%), fingertips (15.4%), volar forearm (9.6%), base of the thumb (5.8%), calves, low back, anterior, lateral and posterior shoulder (3.8%, each), and foot arch, hamstrings, and lateral and medial elbow (1.9%, each) as the primary sites where they experienced pain. The reported median pain intensity was 4 [3–5], while the median maximum pain intensity was 7 [5–8].

In contrast, the AI models presented divergent findings: three models indicated fingers and hands as the primary locations of pain, one model pointed to the shoulder, and another to legs and feet. The assessed average pain intensity by the AI models varied, with three models providing ratings between 3 and 4.4. The maximum pain intensity was assessed by four of the AI models, yielding values within a range of 5–10. Notably, only two of the AI models provided references for their results, which were untraceable in both PubMed and Google searches (Table 1).

Table 1

AI-generated outcomes for acute pain intensity and localization among rock climbers

ChatGPT (https://openai.com/blog/chatgpt) BARD AI (https://bard.google.com/chat) ChatSonic (https://chatsonic.pro/) CopyAI (https://www.copy.ai/) AiryChat (https://airychat.com/)
Average pain intensity (NRS)a 3–4 3.7 ± 1.8b 38% 4.4 Cannot produce number
Maximum pain intensity (NRS) 7–10 8 5 6 Cannot produce number
Localization of pain (%) Finger joints and tendons 60–70% Fingers and hands 40–60% Legs and feet 50% Shoulders Fingers
Hands
Wrists
Elbows 10–20% Knee Forearms
Forearms – pumps 20–30% Shoulders 10–20% Feet – calcaneus, talus Shoulders
Elbows 5–10% Knees 5–10% Cannot give % Elbow
Shoulders 1–5% Ankles 5–10% Cannot give %
References (Yes/No) no yes (4) yes (4) no no
Do references exist? (Yes/No) no no no NA NA

Values of average and maximum pain intensity, as well as pain localization, in rock climbers were generated by open AI deep learning models based on GPT architecture.

aNRS – numeric rating scale; bmean ± SD; AI: artificial intelligence; GPT: generative pretrained transformer; NA – not applicable.

4 Discussion

To test the assumption that open generative AI models could be used for data acquisition in pain research, we asked them to provide the answers to the simplest pain-related questions – pain intensity and localization – and compared the answers to those acquired by field researchers. Based on our results, current iterations of open generative AI models were not able to deliver data on the intensity and location of acute pain in rock climbers that could match the quality of the data collected by field researchers.

Average pain intensity values reported by AI models were fairly similar to those found by field researchers, while maximum pain intensity levels and pain localization varied greatly. Given that a Google search using “rock climbing” and “pain” results in 62 million hits, more uniform answers regarding pain intensity between different generative AI models could have been expected.

More concerning, however, is that these AI models provided incorrect information in the form of references that did not exist, which, if not double-checked, could have a detrimental effect on research credibility. GPT models actually have 11 identified categories of failures, including reasoning, factual errors, math, coding, and bias [3]. They are also known to produce compelling disinformation [4] and even hallucinations [5], as was the case in our research.

We argue that current versions of generative AI that are available to the general public could not be reliably used for data acquisition in pain research. Although web crawling and scraping, public datasets, user-generated content, and customer data are already used in pain research, a comprehensive and free-of-charge tool, similar to open GPT models, that would be conducive to these techniques in real time is not yet available. Beta versions of AI models that facilitate reference search or identify research questions, however, are already emerging.

Acknowledgement

The study employed ChatGPT, BARD AI, ChatSonic, CopyAI, and AiryChat, as detailed in the Methods section.

  1. Research ethics: Part of the research involving human subjects complied with all relevant national regulations and institutional policies, is in accordance with the tenets of the Helsinki Declaration (as amended in 2013), and has been approved by the authors Institutional Research Ethical Committee (No: 01-49/1-2022).

  2. Informed consent: Informed consent has been obtained from all individuals included in this study.

  3. Author contributions: S.B. conceived and designed the study and drafted the manuscript. M.R. and N.R. performed data acquisition, including field research and interaction with the generative AI models. All authors participated in data processing, discussed the results, and provided meaningful contribution to the manuscript writing. All authors have accepted responsibility for the entire content of this manuscript and approved its submission.

  4. Competing interests: The authors state that there is no conflict of interest.

  5. Research funding: The authors state that no funding is involved.

  6. Data availability: Not applicable.

References

[1] Stokel-Walker C, Van Noorden R. ChatGPT and generative AI’s impact on science. Nature. 2023;614(7947):214–6. 10.1038/d41586-023-00340-6.Search in Google Scholar PubMed

[2] Schedlbauer J, Raptis G, Ludwig B. Web crawling, web scraping, and text mining for medical informatics labor market analysis. Int J Med Inform. 2021;150:104453. 10.1016/j.ijmedinf.2021.104453.Search in Google Scholar PubMed

[3] Borji A. A categorical archive of ChatGPT failures. ArXiv; 2023. 10.48550/arXiv.2302.03494.Search in Google Scholar

[4] Spitale G, Biller-Andorno N, Germani F. GPT-3 AI model (dis)informs better than humans. Sci Adv. 2023;9(26):eadh1850. 10.1126/sciadv.adh1850.Search in Google Scholar PubMed PubMed Central

[5] Moskatel LS, Zhang N. An observational, qualitative study on the utility of ChatGPT in assessing literature on migraine prevention. Front Neurol. 2023;14:1225223. 10.3389/fneur.2023.1225223.Search in Google Scholar PubMed PubMed Central

Received: 2023-11-22
Revised: 2023-12-08
Accepted: 2023-12-11
Published Online: 2024-03-07

© 2024 the author(s), published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

  1. Editorial Comment
  2. From pain to relief: Exploring the consistency of exercise-induced hypoalgesia
  3. Christmas greetings 2024 from the Editor-in-Chief
  4. Original Articles
  5. The Scandinavian Society for the Study of Pain 2022 Postgraduate Course and Annual Scientific (SASP 2022) Meeting 12th to 14th October at Rigshospitalet, Copenhagen
  6. Comparison of ultrasound-guided continuous erector spinae plane block versus continuous paravertebral block for postoperative analgesia in patients undergoing proximal femur surgeries
  7. Clinical Pain Researches
  8. The effect of tourniquet use on postoperative opioid consumption after ankle fracture surgery – a retrospective cohort study
  9. Changes in pain, daily occupations, lifestyle, and health following an occupational therapy lifestyle intervention: a secondary analysis from a feasibility study in patients with chronic high-impact pain
  10. Tonic cuff pressure pain sensitivity in chronic pain patients and its relation to self-reported physical activity
  11. Reliability, construct validity, and factorial structure of a Swedish version of the medical outcomes study social support survey (MOS-SSS) in patients with chronic pain
  12. Hurdles and potentials when implementing internet-delivered Acceptance and commitment therapy for chronic pain: a retrospective appraisal using the Quality implementation framework
  13. Exploring the outcome “days with bothersome pain” and its association with pain intensity, disability, and quality of life
  14. Fatigue and cognitive fatigability in patients with chronic pain
  15. The Swedish version of the pain self-efficacy questionnaire short form, PSEQ-2SV: Cultural adaptation and psychometric evaluation in a population of patients with musculoskeletal disorders
  16. Pain coping and catastrophizing in youth with and without cerebral palsy
  17. Neuropathic pain after surgery – A clinical validation study and assessment of accuracy measures of the 5-item NeuPPS scale
  18. Translation, contextual adaptation, and reliability of the Danish Concept of Pain Inventory (COPI-Adult (DK)) – A self-reported outcome measure
  19. Cosmetic surgery and associated chronic postsurgical pain: A cross-sectional study from Norway
  20. The association of hemodynamic parameters and clinical demographic variables with acute postoperative pain in female oncological breast surgery patients: A retrospective cohort study
  21. Healthcare professionals’ experiences of interdisciplinary collaboration in pain centres – A qualitative study
  22. Effects of deep brain stimulation and verbal suggestions on pain in Parkinson’s disease
  23. Painful differences between different pain scale assessments: The outcome of assessed pain is a matter of the choices of scale and statistics
  24. Prevalence and characteristics of fibromyalgia according to three fibromyalgia diagnostic criteria: A secondary analysis study
  25. Sex moderates the association between quantitative sensory testing and acute and chronic pain after total knee/hip arthroplasty
  26. Tramadol-paracetamol for postoperative pain after spine surgery – A randomized, double-blind, placebo-controlled study
  27. Cancer-related pain experienced in daily life is difficult to communicate and to manage – for patients and for professionals
  28. Making sense of pain in inflammatory bowel disease (IBD): A qualitative study
  29. Patient-reported pain, satisfaction, adverse effects, and deviations from ambulatory surgery pain medication
  30. Does pain influence cognitive performance in patients with mild traumatic brain injury?
  31. Hypocapnia in women with fibromyalgia
  32. Application of ultrasound-guided thoracic paravertebral block or intercostal nerve block for acute herpes zoster and prevention of post-herpetic neuralgia: A case–control retrospective trial
  33. Translation and examination of construct validity of the Danish version of the Tampa Scale for Kinesiophobia
  34. A positive scratch collapse test in anterior cutaneous nerve entrapment syndrome indicates its neuropathic character
  35. ADHD-pain: Characteristics of chronic pain and association with muscular dysregulation in adults with ADHD
  36. The relationship between changes in pain intensity and functional disability in persistent disabling low back pain during a course of cognitive functional therapy
  37. Intrathecal pain treatment for severe pain in patients with terminal cancer: A retrospective analysis of treatment-related complications and side effects
  38. Psychometric evaluation of the Danish version of the Pain Self-Efficacy Questionnaire in patients with subacute and chronic low back pain
  39. Dimensionality, reliability, and validity of the Finnish version of the pain catastrophizing scale in chronic low back pain
  40. To speak or not to speak? A secondary data analysis to further explore the context-insensitive avoidance scale
  41. Pain catastrophizing levels differentiate between common diseases with pain: HIV, fibromyalgia, complex regional pain syndrome, and breast cancer survivors
  42. Prevalence of substance use disorder diagnoses in patients with chronic pain receiving reimbursed opioids: An epidemiological study of four Norwegian health registries
  43. Pain perception while listening to thrash heavy metal vs relaxing music at a heavy metal festival – the CoPainHell study – a factorial randomized non-blinded crossover trial
  44. Observational Studies
  45. Cutaneous nerve biopsy in patients with symptoms of small fiber neuropathy: a retrospective study
  46. The incidence of post cholecystectomy pain (PCP) syndrome at 12 months following laparoscopic cholecystectomy: a prospective evaluation in 200 patients
  47. Associations between psychological flexibility and daily functioning in endometriosis-related pain
  48. Relationship between perfectionism, overactivity, pain severity, and pain interference in individuals with chronic pain: A cross-lagged panel model analysis
  49. Access to psychological treatment for chronic cancer-related pain in Sweden
  50. Validation of the Danish version of the knowledge and attitudes survey regarding pain
  51. Associations between cognitive test scores and pain tolerance: The Tromsø study
  52. Healthcare experiences of fibromyalgia patients and their associations with satisfaction and pain relief. A patient survey
  53. Video interpretation in a medical spine clinic: A descriptive study of a diverse population and intervention
  54. Role of history of traumatic life experiences in current psychosomatic manifestations
  55. Social determinants of health in adults with whiplash associated disorders
  56. Which patients with chronic low back pain respond favorably to multidisciplinary rehabilitation? A secondary analysis of a randomized controlled trial
  57. A preliminary examination of the effects of childhood abuse and resilience on pain and physical functioning in patients with knee osteoarthritis
  58. Differences in risk factors for flare-ups in patients with lumbar radicular pain may depend on the definition of flare
  59. Real-world evidence evaluation on consumer experience and prescription journey of diclofenac gel in Sweden
  60. Patient characteristics in relation to opioid exposure in a chronic non-cancer pain population
  61. Topical Reviews
  62. Bridging the translational gap: adenosine as a modulator of neuropathic pain in preclinical models and humans
  63. What do we know about Indigenous Peoples with low back pain around the world? A topical review
  64. The “future” pain clinician: Competencies needed to provide psychologically informed care
  65. Systematic Reviews
  66. Pain management for persistent pain post radiotherapy in head and neck cancers: systematic review
  67. High-frequency, high-intensity transcutaneous electrical nerve stimulation compared with opioids for pain relief after gynecological surgery: a systematic review and meta-analysis
  68. Reliability and measurement error of exercise-induced hypoalgesia in pain-free adults and adults with musculoskeletal pain: A systematic review
  69. Noninvasive transcranial brain stimulation in central post-stroke pain: A systematic review
  70. Short Communications
  71. Are we missing the opioid consumption in low- and middle-income countries?
  72. Association between self-reported pain severity and characteristics of United States adults (age ≥50 years) who used opioids
  73. Could generative artificial intelligence replace fieldwork in pain research?
  74. Skin conductance algesimeter is unreliable during sudden perioperative temperature increases
  75. Original Experimental
  76. Confirmatory study of the usefulness of quantum molecular resonance and microdissectomy for the treatment of lumbar radiculopathy in a prospective cohort at 6 months follow-up
  77. Pain catastrophizing in the elderly: An experimental pain study
  78. Improving general practice management of patients with chronic musculoskeletal pain: Interdisciplinarity, coherence, and concerns
  79. Concurrent validity of dynamic bedside quantitative sensory testing paradigms in breast cancer survivors with persistent pain
  80. Transcranial direct current stimulation is more effective than pregabalin in controlling nociceptive and anxiety-like behaviors in a rat fibromyalgia-like model
  81. Paradox pain sensitivity using cuff pressure or algometer testing in patients with hemophilia
  82. Physical activity with person-centered guidance supported by a digital platform or with telephone follow-up for persons with chronic widespread pain: Health economic considerations along a randomized controlled trial
  83. Measuring pain intensity through physical interaction in an experimental model of cold-induced pain: A method comparison study
  84. Pharmacological treatment of pain in Swedish nursing homes: Prevalence and associations with cognitive impairment and depressive mood
  85. Neck and shoulder pain and inflammatory biomarkers in plasma among forklift truck operators – A case–control study
  86. The effect of social exclusion on pain perception and heart rate variability in healthy controls and somatoform pain patients
  87. Revisiting opioid toxicity: Cellular effects of six commonly used opioids
  88. Letter to the Editor
  89. Post cholecystectomy pain syndrome: Letter to Editor
  90. Response to the Letter by Prof Bordoni
  91. Response – Reliability and measurement error of exercise-induced hypoalgesia
  92. Is the skin conductance algesimeter index influenced by temperature?
  93. Skin conductance algesimeter is unreliable during sudden perioperative temperature increase
  94. Corrigendum
  95. Corrigendum to “Chronic post-thoracotomy pain after lung cancer surgery: a prospective study of preoperative risk factors”
  96. Obituary
  97. A Significant Voice in Pain Research Björn Gerdle in Memoriam (1953–2024)
Downloaded on 11.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/sjpain-2023-0136/html
Scroll to top button