Lights and shadows of artificial intelligence in laboratory medicine

Giuseppe Lippi; Mario Plebani

doi:10.1515/almed-2025-0024

Article Open Access

Lights and shadows of artificial intelligence in laboratory medicine

Giuseppe Lippi and Mario Plebani

Published/Copyright: February 24, 2025

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Advances in Laboratory Medicine / Avances en Medicina de Laboratorio Volume 6 Issue 1

Keywords: artificial intelligence; laboratory medicine; deep learning; large language models

Although a universally accepted definition does not exist, artificial intelligence (AI) is commonly described as the use of computational systems to simulate human intelligence, with the purpose of performing a vast array of tasks that require reasoning, learning and decision-making [1]. The term “artificial intelligence” was originally coined by John McCarthy and colleagues in the mid-1950s, when it was defined as “the science and engineering of making intelligent machines that exhibit critical thinking comparable to humans” [2]. Since then, remarkable technological advancements have enhanced the power and use of AI tools, making them an integral part of both personal and professional domains. Key components of AI include machine learning (ML), which encompasses algorithms designed to improve autonomously through experience, and deep learning (DL), a subset of ML that uses neural networks with multiple layers to analyze complex data. Neural networks themselves are computational models that replicate the structure of the human brain, enabling capabilities such as pattern recognition and predictive modeling [3]. Generative AI, a specific subset of AI, uses DL models, and is hence primarily focused on generating novel content, including text (e.g., large language models, LLMs), images, music and so forth [3].

In healthcare, AI systems have been developed to improve efficiency and accuracy, particularly in tasks such as diagnosing diseases from imaging data and processing large amount of clinical and diagnostic information [4]. Due to the extensive use of technology and the potential to generate an immense amount of clinical information (commonly referred to as “big data”) in the form of laboratory test results, AI has gained significant momentum in the field of laboratory medicine [5]. A recent survey conducted by the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) revealed considerable interest in AI training among laboratory professionals, even if only a minority (approximately 25 %) of surveyed laboratories reported having active AI projects in place [6].

Regardless of the varying perceptions among laboratory professionals, it is indisputable that AI will continue to play an increasingly prominent role in organization and operations of medical laboratories. Several AI tools are already widely deployed in clinical laboratories, such as ML models for optimizing sample processing and quality control management [7], automated verification and validation modules integrated with the laboratory information system (LIS) [8], digital morphology techniques in laboratory hematology [9] and for assisting diagnostic reasoning across various medical fields [10], as well as tools for supporting creation and editing of documents. Additionally, DL models are playing an essential role in managing and interpreting the massive datasets generated through genomics and proteomics research.

A paradigmatic example of how AI can support clinical decision-making has been provided by a recent single-blind randomized clinical trial, in which physicians specializing in family, internal or emergency medicine were randomized to either use conventional resources alone or to have access to the same resources supplemented with LLM [10]. Although the use of LLM did not significantly improve clinical reasoning compared to the use of conventional resources alone (median diagnostic reasoning score per case: 76 vs. 74 %; p=0.60), the median diagnostic reasoning score for LLM alone was 92 %, thus offsetting the scores of both physician cohorts by over 16 %.

Although it is increasingly clear that AI will become an integral part of organization and activities of medical laboratory services worldwide, significant challenges remain (Table 1). The primary issue is the limited flexibility of AI in interpreting laboratory data. In contrast to human cognition, which can use intuitive judgment and gestalt reasoning [11], AI systems currently lack the adaptive capacity to interpret data with equivalent contextual understanding. The same principle applies to image recognition. While DL models can be trained to identify digital images of prototype cell types or other components in body fluids, the inherent heterogeneity in the presentation of these elements – especially the variability of abnormalities across different physiological or pathological states – limits the accuracy of classification. This challenge persists whether using specialized digital cell imaging systems or more general AI tools, like ChatGPT [12]. In all these cases the current generation of AI tools would still need interactive learning frameworks and human oversight for reviewing the initial classification. Analogously, the interpretation of human symptoms and individual laboratory data by AI often relies on predefined algorithms that are designed to identify patterns based on existing data and conditions. Nevertheless, diseases can present in many diverse ways, and the range of possible laboratory abnormalities associated with different conditions can vary widely, making it challenging to always provide accurate or comprehensive interpretations [5], 13]. Additional limitations include the potential for overreliance on AI systems, which could completely replace human reasoning; limited flexibility and adaptability, leading to errors in interpreting complex or rare laboratory abnormalities, as previously exemplified; the lack of standardization, as different AI tools may yield inconsistent answers to the same clinical question [14]; the regulatory challenges that must drive integration into routine laboratory practice; the inherent cost of acquiring and regularly updating the software; the complexity between human and AI interaction; the lack of transparency and explainability due to the often unidentified nature of the algorithms; accountability concerns when errors occur; ethical and legal issues related to privacy, data security and potential bias in decision-making [15]. Finally, the use of an external validation set is always needed to ensure reproducibility and generalizability of any AI tool or algorithm.

Table 1:

The lights and shadows of artificial intelligence (AI) in laboratory medicine.

Advantages
Possibility to streamline process and activities by automating routine tasks Autoverification and autovalidation of laboratory data, enhancing efficiency and accuracy Assistance in digital image recognition for improved diagnostic accuracy Support in interpretation of laboratory data, aiding clinical decision-making Facilitated creation and editing of documents Management and analysis of big data

Limitations

Limited flexibility and adaptability in handling unexpected or complex data Potential errors in interpreting complex or rare laboratory abnormalities Overreliance on AI, potentially diminishing human clinical judgment Lack of standardization, leading to inconsistent results between different AI tools Regulatory requirements, which may slow down AI integration High costs related to setup, maintenance and training for AI systems Complexity in human–AI interaction Lack of transparency and exploitability in AI decision-making processes Poor accountability for errors made in data interpretation Ethical and legal concerns, including data privacy issues and algorithmic bias Need for external validation to ensure reproducibility and generalizability

In conclusion, while AI holds promise to revolutionize laboratory medicine by enhancing efficiency, diagnostic accuracy and data management, it also carries significant challenges. The integration of AI into routine laboratory practice must be approached with caution, addressing key unresolved issues such as risk of overreliance, limited flexibility and potential for bias and errors. Regulatory, ethical and legal concerns must also be addressed to ensure that AI is deployed transparently and accountably. As AI continues to evolve, its role in laboratory medicine services will expand, but its successful implementation will require active collaboration among laboratory professionals, technology developers, policymakers and healthcare managers.

Corresponding author: Prof. Giuseppe Lippi, MD, Section of Clinical Biochemistry, University Hospital of Verona, Piazzale L.A. Scuro, 10, 37134 Verona, Italy, E-mail: giuseppe.lippi@univr.it

Research ethics: Not applicable.
Informed consent: Not applicable.
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
Use of Large Language Models, AI and Machine Learning Tools: None declared.
Conflict of interest: The authors state no conflict of interest.
Research funding: None declared.
Data availability: Not applicable.
Article Note: A translation of this article can be found here: https://doi.org/10.1515/almed-2025-0039.

References

1. Amisha, MP, Pathania, M, Rathaur, VK. Overview of artificial intelligence in medicine. J Fam Med Prim Care 2019;8:2328–31.10.4103/jfmpc.jfmpc_440_19Search in Google Scholar PubMed PubMed Central

2. McCarthy, J, Minsky, ML, Rochester, N, Shannon, CE. A proposal for the Dartmouth Summer Research Project on artificial intelligence, August 31, 1955. AI Mag 2006;27:12–4.Search in Google Scholar

3. Hulsen, T. Literature analysis of artificial intelligence in biomedicine. Ann Transl Med 2022;10:1284.10.21037/atm-2022-50Search in Google Scholar PubMed PubMed Central

4. Bekbolatova, M, Mayer, J, Ong, CW, Toma, M. Transformative potential of AI in healthcare: definitions, applications, and navigating the ethical landscape and public perspectives. Healthcare (Basel) 2024;12:125.10.3390/healthcare12020125Search in Google Scholar PubMed PubMed Central

5. Cadamuro, J, Cabitza, F, Debeljak, Z, De Bruyne, S, Frans, G, Perez, SM, et al.. Potentials and pitfalls of ChatGPT and natural-language artificial intelligence models for the understanding of laboratory medicine test results. An assessment by the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) Working Group on Artificial Intelligence (WG-AI). Clin Chem Lab Med 2023;61:1158–66.10.1515/cclm-2023-0355Search in Google Scholar PubMed

6. Cadamuro, J, Carobene, A, Cabitza, F, Debeljak, Z, De Bruyne, S, van Doorn, W, et al.. A comprehensive survey of artificial intelligence adoption in European laboratory medicine: current utilization and prospects. Clin Chem Lab Med 2025;63:692–703. https://doi.org/10.1515/cclm-2024-1016.Search in Google Scholar PubMed

7. You, J, Seok, HS, Kim, S, Shin, H. Advancing laboratory medicine practice with machine learning: swift yet exact. Ann Lab Med 2025;45:22–35.10.3343/alm.2024.0354Search in Google Scholar PubMed PubMed Central

8. Guidi, GC, Poli, G, Bassi, A, Giobelli, L, Benetollo, PP, Lippi, G. Development and implementation of an automatic system for verification, validation and delivery of laboratory test results. Clin Chem Lab Med 2009;47:1355–60.10.1515/CCLM.2009.316Search in Google Scholar PubMed

9. Kratz, A, Lee, SH, Zini, G, Riedl, JA, Hur, M, Machin, S. Digital morphology analyzers in hematology: ICSH review and recommendations. Int J Lab Hematol 2019;41:437–47.10.1111/ijlh.13042Search in Google Scholar PubMed

10. Goh, E, Gallo, R, Hom, J, Strong, E, Weng, Y, Kerman, H, et al.. Large language model influence on diagnostic reasoning: a randomized clinical trial. JAMA Netw Open 2024;7:e2440969.10.1001/jamanetworkopen.2024.40969Search in Google Scholar PubMed PubMed Central

11. Cervellin, G, Borghi, L, Lippi, G. Do clinicians decide relying primarily on Bayesians principles or on Gestalt perception? Some pearls and pitfalls of Gestalt perception in medicine. Intern Emerg Med 2014;9:513–9.10.1007/s11739-014-1049-8Search in Google Scholar PubMed

12. Negrini, D, Pighi, L, Tosi, M, Lippi, G. Evaluating the accuracy of ChatGPT in classifying normal and abnormal blood cell morphology. Clin Chem Lab Med 2025. https://doi.org/10.1515/cclm-2024-1469. [Epub ahead of print].Search in Google Scholar PubMed

13. Meyer, A, Soleman, A, Riese, J, Streichert, T. Comparison of ChatGPT, Gemini, and Le Chat with physician interpretations of medical laboratory questions from an online health forum. Clin Chem Lab Med 2024;62:2425–34.10.1515/cclm-2024-0246Search in Google Scholar PubMed

14. Lippi, G, Mattiuzzi, C, Favaloro, EJ. Reliability of generative artificial intelligence in identifying the major risk factors for venous thrombosis. Blood Coagul Fibrinolysis 2024;35:354–5.10.1097/MBC.0000000000001322Search in Google Scholar PubMed

15. Pennestrì, F, Banfi, G. Artificial intelligence in laboratory medicine: fundamental ethical issues and normative key-points. Clin Chem Lab Med 2022;60:1867–74.10.1515/cclm-2022-0096Search in Google Scholar PubMed

Published Online: 2025-02-24

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

https://doi.org/10.1515/almed-2025-0024

Keywords for this article

artificial intelligence; laboratory medicine; deep learning; large language models

Creative Commons

BY 4.0