Development and validation of an autonomous artificial intelligence agent for clinical decision-making in oncology

Dyke Ferber; Omar S.M. El Nahhas; Georg Wölflein; Isabella C. Wiest; Jan Clusmann; Marie Elisabeth Leßmann; Sebastian Foersch; Jacqueline Lammert; Maximilian Tschochohei; Dirk Jäger; Manuel Salto-Tellez; Nikolaus Schultz; Daniel Truhn; Jakob Nikolas Kather

doi:10.1038/s43018-025-00991-6

Development and validation of an autonomous artificial intelligence agent for clinical decision-making in oncology

Research output: Contribution to journal › Research article › Contributed › peer-review

Contributors

Dyke Ferber - , Else Kröner Fresenius Center for Digital Health, National Center for Tumor Diseases (NCT) Heidelberg (Author)
Omar S.M. El Nahhas - , Else Kröner Fresenius Center for Digital Health (Author)
Georg Wölflein - , University of St Andrews (Author)
Isabella C. Wiest - , Else Kröner Fresenius Center for Digital Health, Heidelberg University (Author)
Jan Clusmann - , Else Kröner Fresenius Center for Digital Health, RWTH Aachen University (Author)
Marie Elisabeth Leßmann - , Department of Internal Medicine I, Else Kröner Fresenius Center for Digital Health (Author)
Sebastian Foersch - , University Medical Center Mainz (Author)
Jacqueline Lammert - , Technical University of Munich, European Reference Network for Rare Cancers (EURACAN), German Cancer Consortium (DKTK) partner site Munich (Author)
Maximilian Tschochohei - , Google Munich Cloud Space (Author)
Dirk Jäger - , Heidelberg University (Author)
Manuel Salto-Tellez - , Institute of Cancer Research (Author)
Nikolaus Schultz - , Memorial Sloan-Kettering Cancer Center (Author)
Daniel Truhn - , RWTH Aachen University (Author)
Jakob Nikolas Kather - , Else Kröner Fresenius Center for Digital Health, National Center for Tumor Diseases (NCT) Heidelberg (Author)

Abstract

Clinical decision-making in oncology is complex, requiring the integration of multimodal data and multidomain expertise. We developed and evaluated an autonomous clinical artificial intelligence (AI) agent leveraging GPT-4 with multimodal precision oncology tools to support personalized clinical decision-making. The system incorporates vision transformers for detecting microsatellite instability and KRAS and BRAF mutations from histopathology slides, MedSAM for radiological image segmentation and web-based search tools such as OncoKB, PubMed and Google. Evaluated on 20 realistic multimodal patient cases, the AI agent autonomously used appropriate tools with 87.5% accuracy, reached correct clinical conclusions in 91.0% of cases and accurately cited relevant oncology guidelines 75.5% of the time. Compared to GPT-4 alone, the integrated AI agent drastically improved decision-making accuracy from 30.3% to 87.2%. These findings demonstrate that integrating language models with precision oncology and search tools substantially enhances clinical accuracy, establishing a robust foundation for deploying AI-driven personalized oncology support systems.

Details

Original language	English
Pages (from-to)	1337-1349
Number of pages	13
Journal	Nature cancer
Volume	6
Issue number	8
Publication status	Published - Aug 2025
Peer-reviewed	Yes

External IDs

ORCID	/0009-0005-7029-0028/work/188859431
PubMed	40481323
ORCID	/0000-0002-3730-5348/work/198594680

Research Portal of the TU Dresden

Development and validation of an autonomous artificial intelligence agent for clinical decision-making in oncology

Contributors

Abstract

Details

External IDs

Keywords

Sustainable Development Goals

ASJC Scopus subject areas