A future role for health applications of large language models depends on regulators enforcing safety standards

Oscar Freyer; Isabella Catharina Wiest; Jakob Nikolas Kather; Stephen Gilbert

doi:10.1016/S2589-7500(24)00124-9

A future role for health applications of large language models depends on regulators enforcing safety standards

Research output: Contribution to journal › Review article › Contributed › peer-review

Contributors

Oscar Freyer - , Else Kröner Fresenius Center for Digital Health (Author)
Isabella Catharina Wiest - , Else Kröner Fresenius Center for Digital Health, Universitätsmedizin Mannheim (Author)
Jakob Nikolas Kather - , Else Kröner Fresenius Center for Digital Health, Department of Internal Medicine I, National Center for Tumor Diseases (NCT) Heidelberg (Author)
Stephen Gilbert - , Else Kröner Fresenius Center for Digital Health (Author)

Abstract

Among the rapid integration of artificial intelligence in clinical settings, large language models (LLMs), such as Generative Pre-trained Transformer-4, have emerged as multifaceted tools that have potential for health-care delivery, diagnosis, and patient care. However, deployment of LLMs raises substantial regulatory and safety concerns. Due to their high output variability, poor inherent explainability, and the risk of so-called AI hallucinations, LLM-based health-care applications that serve a medical purpose face regulatory challenges for approval as medical devices under US and EU laws, including the recently passed EU Artificial Intelligence Act. Despite unaddressed risks for patients, including misdiagnosis and unverified medical advice, such applications are available on the market. The regulatory ambiguity surrounding these tools creates an urgent need for frameworks that accommodate their unique capabilities and limitations. Alongside the development of these frameworks, existing regulations should be enforced. If regulators fear enforcing the regulations in a market dominated by supply or development by large technology companies, the consequences of layperson harm will force belated action, damaging the potentiality of LLM-based applications for layperson medical advice.

Details

Original language	English
Pages (from-to)	e662-e672
Journal	The Lancet Digital Health
Volume	6
Issue number	9
Publication status	Published - Sept 2024
Peer-reviewed	Yes

External IDs

PubMed	39179311
ORCID	/0000-0002-1997-1689/work/173517327
ORCID	/0000-0003-3323-2492/work/173517359
ORCID	/0000-0002-3730-5348/work/198594609

Keywords

ASJC Scopus subject areas

Library keywords

610 Medicine and health

Research Portal of the TU Dresden