Privacy-preserving large language models for structured medical information retrieval

Isabella Catharina Wiest; Dyke Ferber; Jiefu Zhu; Marko van Treeck; Sonja K. Meyer; Radhika Juglan; Zunamys I. Carrero; Daniel Paech; Jens Kleesiek; Matthias P. Ebert; Daniel Truhn; Jakob Nikolas Kather

doi:10.1038/s41746-024-01233-2

Publication: Contribution to journal › Research article › Contributed › Peer-reviewed

Contributors

Isabella Catharina Wiest, Else Kröner Fresenius Zentrum für Digitale Gesundheit, Universitätsmedizin Mannheim (Author)
Dyke Ferber, Else Kröner Fresenius Zentrum für Digitale Gesundheit, Nationales Zentrum für Tumorerkrankungen (NCT) Heidelberg (Author)
Jiefu Zhu, Else Kröner Fresenius Zentrum für Digitale Gesundheit (Author)
Marko van Treeck, Else Kröner Fresenius Zentrum für Digitale Gesundheit (Author)
Sonja K. Meyer, Universitätsklinikum Würzburg (Author)
Radhika Juglan, Else Kröner Fresenius Zentrum für Digitale Gesundheit (Author)
Zunamys I. Carrero, Else Kröner Fresenius Zentrum für Digitale Gesundheit (Author)
Daniel Paech, Deutsches Krebsforschungszentrum (DKFZ), Universität Bonn (Author)
Jens Kleesiek, Technische Universität (TU) Dortmund, Universitätsklinikum Essen, Cancer Research Center Cologne Essen (Author)
Matthias P. Ebert, Universität Heidelberg, European Molecular Biology Laboratory (EMBL) Heidelberg, DKFZ-Hector Krebsinstitut an der Universitätsmedizin Mannheim (Author)
Daniel Truhn, Universitätsklinikum Aachen (Author)
Jakob Nikolas Kather, Else Kröner Fresenius Zentrum für Digitale Gesundheit, Medizinische Klinik und Poliklinik I, Nationales Zentrum für Tumorerkrankungen (NCT) Heidelberg (Author)

Abstract

Most clinical information is encoded as free text and is therefore not accessible for quantitative analysis. This study presents an open-source pipeline using the local large language model (LLM) “Llama 2” to extract quantitative information from clinical text and evaluates its performance in identifying features of decompensated liver cirrhosis. The LLM identified five key clinical features in a zero- and one-shot manner from 500 patient medical histories in the MIMIC IV dataset. We compared LLMs of three sizes and various prompt engineering approaches, evaluating predictions against ground truth from three blinded medical experts. Our pipeline achieved high accuracy, detecting liver cirrhosis with 100% sensitivity and 96% specificity. Using the 70 billion parameter model, which outperformed smaller versions, it also achieved high sensitivity and specificity for detecting ascites (95%, 95%), confusion (76%, 94%), abdominal pain (84%, 97%), and shortness of breath (87%, 97%). Our study demonstrates that locally deployed LLMs can extract clinical information from free text with low hardware requirements.
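
The sensitivity and specificity figures quoted in the abstract are ratios computed from paired binary labels (expert ground truth versus model prediction) for each clinical feature. The following is a minimal sketch of that calculation, not the authors' evaluation code: the function name `sensitivity_specificity` and the toy label lists are hypothetical and made up purely for illustration.

```python
from typing import Sequence


def sensitivity_specificity(
    truth: Sequence[bool], predicted: Sequence[bool]
) -> tuple[float, float]:
    """Return (sensitivity, specificity) for paired binary labels."""
    if len(truth) != len(predicted):
        raise ValueError("truth and predicted must have the same length")
    pairs = list(zip(truth, predicted))
    tp = sum(t and p for t, p in pairs)              # feature present, detected
    fn = sum(t and not p for t, p in pairs)          # feature present, missed
    tn = sum((not t) and (not p) for t, p in pairs)  # feature absent, not reported
    fp = sum((not t) and p for t, p in pairs)        # feature absent, reported anyway
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative ground-truth label")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy labels for a single feature; NOT data from the study.
truth = [True, True, True, True, False, False, False, False, False, False]
predicted = [True, True, True, False, False, False, False, False, False, True]

sens, spec = sensitivity_specificity(truth, predicted)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Sensitivity is the share of reports with the feature that were flagged, and specificity is the share of reports without the feature that were correctly left unflagged, so a pair such as "(95%, 95%)" describes both error directions for one feature.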

Details

Original language	English
Article number	257
Journal	npj digital medicine
Volume	7
Issue number	1
Publication status	Published - Dec. 2024
Peer-review status	Yes

External IDs

ORCID	/0000-0001-8501-1566/work/173517347

Research portal of TU Dresden
