A future role for health applications of large language models depends on regulators enforcing safety standards

Research output: Contribution to journalReview articleContributedpeer-review

Contributors

Abstract

Among the rapid integration of artificial intelligence in clinical settings, large language models (LLMs), such as Generative Pre-trained Transformer-4, have emerged as multifaceted tools that have potential for health-care delivery, diagnosis, and patient care. However, deployment of LLMs raises substantial regulatory and safety concerns. Due to their high output variability, poor inherent explainability, and the risk of so-called AI hallucinations, LLM-based health-care applications that serve a medical purpose face regulatory challenges for approval as medical devices under US and EU laws, including the recently passed EU Artificial Intelligence Act. Despite unaddressed risks for patients, including misdiagnosis and unverified medical advice, such applications are available on the market. The regulatory ambiguity surrounding these tools creates an urgent need for frameworks that accommodate their unique capabilities and limitations. Alongside the development of these frameworks, existing regulations should be enforced. If regulators fear enforcing the regulations in a market dominated by supply or development by large technology companies, the consequences of layperson harm will force belated action, damaging the potentiality of LLM-based applications for layperson medical advice.

Details

Original languageEnglish
Pages (from-to)e662-e672
JournalThe Lancet Digital Health
Volume6
Issue number9
Publication statusPublished - Sept 2024
Peer-reviewedYes

External IDs

PubMed 39179311
ORCID /0000-0002-1997-1689/work/173517327
ORCID /0000-0003-3323-2492/work/173517359