Prospective multicenter study using artificial intelligence to improve dermoscopic melanoma diagnosis in patient care

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

  • Lukas Heinlein - , German Cancer Research Center (DKFZ) (Author)
  • Roman C Maron - , German Cancer Research Center (DKFZ) (Author)
  • Achim Hekler - , German Cancer Research Center (DKFZ) (Author)
  • Sarah Haggenmüller - , German Cancer Research Center (DKFZ) (Author)
  • Christoph Wies - , Heidelberg University  (Author)
  • Jochen S Utikal - , DKFZ Hector Cancer Institute at the University Medical Center Mannheim (Author)
  • Friedegund Meier - , Department of Dermatology, Skin Tumor Center (Author)
  • Sarah Hobelsberger - , Department of Dermatology (Author)
  • Frank F Gellrich - , Department of Dermatology (Author)
  • Mildred Sergon - , Department of Dermatology (Author)
  • Axel Hauschild - , University Hospital Schleswig-Holstein Campus Kiel (Author)
  • Lars E French - , Hospital of the Ludwig-Maximilians-University (LMU) Munich (Author)
  • Lucie Heinzerling - , University Hospital at the Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Justin G Schlager - , Hospital of the Ludwig-Maximilians-University (LMU) Munich (Author)
  • Kamran Ghoreschi - , Charité – Universitätsmedizin Berlin (Author)
  • Max Schlaak - , Charité – Universitätsmedizin Berlin (Author)
  • Franz J Hilke - , Charité – Universitätsmedizin Berlin (Author)
  • Gabriela Poch - , Charité – Universitätsmedizin Berlin (Author)
  • Sören Korsing - , Charité – Universitätsmedizin Berlin (Author)
  • Carola Berking - , University Hospital at the Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Markus V Heppt - , University Hospital at the Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Michael Erdmann - , University Hospital at the Friedrich-Alexander University Erlangen-Nürnberg (Author)
  • Sebastian Haferkamp - , University Hospital Regensburg (Author)
  • Konstantin Drexler - , University Hospital Regensburg (Author)
  • Dirk Schadendorf - , University Hospital Essen (Author)
  • Wiebke Sondermann - , University Hospital Essen (Author)
  • Matthias Goebeler - , University Hospital of Würzburg, National Center for Tumor Diseases (NCT) WERA (Author)
  • Bastian Schilling - , University Hospital of Würzburg, National Center for Tumor Diseases (NCT) WERA (Author)
  • Eva Krieghoff-Henning - , German Cancer Research Center (DKFZ) (Author)
  • Titus J Brinker - , German Cancer Research Center (DKFZ) (Author)

Abstract

BACKGROUND: Early detection of melanoma, a potentially lethal type of skin cancer with high prevalence worldwide, improves patient prognosis. In retrospective studies, artificial intelligence (AI) has proven to be helpful for enhancing melanoma detection. However, there are few prospective studies confirming these promising results. Existing studies are limited by low sample sizes, too homogenous datasets, or lack of inclusion of rare melanoma subtypes, preventing a fair and thorough evaluation of AI and its generalizability, a crucial aspect for its application in the clinical setting.

METHODS: Therefore, we assessed "All Data are Ext" (ADAE), an established open-source ensemble algorithm for detecting melanomas, by comparing its diagnostic accuracy to that of dermatologists on a prospectively collected, external, heterogeneous test set comprising eight distinct hospitals, four different camera setups, rare melanoma subtypes, and special anatomical sites. We advanced the algorithm with real test-time augmentation (R-TTA, i.e., providing real photographs of lesions taken from multiple angles and averaging the predictions), and evaluated its generalization capabilities.

RESULTS: Overall, the AI shows higher balanced accuracy than dermatologists (0.798, 95% confidence interval (CI) 0.779-0.814 vs. 0.781, 95% CI 0.760-0.802; p = 4.0e-145), obtaining a higher sensitivity (0.921, 95% CI 0.900-0.942 vs. 0.734, 95% CI 0.701-0.770; p = 3.3e-165) at the cost of a lower specificity (0.673, 95% CI 0.641-0.702 vs. 0.828, 95% CI 0.804-0.852; p = 3.3e-165).

CONCLUSION: As the algorithm exhibits a significant performance advantage on our heterogeneous dataset exclusively comprising melanoma-suspicious lesions, AI may offer the potential to support dermatologists, particularly in diagnosing challenging cases.

Details

Original languageEnglish
Article number177
JournalCommunications medicine
Volume4
Issue number1
Publication statusPublished - 11 Sept 2024
Peer-reviewedYes

External IDs

PubMedCentral PMC11387610
Scopus 85203721327
ORCID /0000-0003-4340-9706/work/168720488
ORCID /0000-0002-2164-4644/work/168720648
ORCID /0000-0001-5703-324X/work/168720699