Uncertainty Estimation in Medical Image Classification: Systematic Review

Research output: Contribution to journalReview articleContributedpeer-review

Contributors

  • Alexander Kurz - , German Cancer Research Center (DKFZ) (Author)
  • Katja Hauser - , German Cancer Research Center (DKFZ) (Author)
  • Hendrik Alexander Mehrtens - , German Cancer Research Center (DKFZ) (Author)
  • Eva Krieghoff-Henning - , German Cancer Research Center (DKFZ) (Author)
  • Achim Hekler - , German Cancer Research Center (DKFZ) (Author)
  • Jakob Nikolas Kather - , RWTH Aachen University (Author)
  • Stefan Fröhling - , German Cancer Research Center (DKFZ) (Author)
  • Christof von Kalle - , Berlin Institute of Health at Charité (Author)
  • Titus Josef Brinker - , German Cancer Research Center (DKFZ) (Author)

Abstract

Background: Deep neural networks are showing impressive results in different medical image classification tasks. However, for real-world applications, there is a need to estimate the network's uncertainty together with its prediction. Objective: In this review, we investigate in what form uncertainty estimation has been applied to the task of medical image classification. We also investigate which metrics are used to describe the effectiveness of the applied uncertainty estimation Methods: Google Scholar, PubMed, IEEE Xplore, and ScienceDirect were screened for peer-reviewed studies, published between 2016 and 2021, that deal with uncertainty estimation in medical image classification. The search terms “uncertainty,” “uncertainty estimation,” “network calibration,” and “out-of-distribution detection” were used in combination with the terms “medical images,” “medical image analysis,” and “medical image classification.” Results: A total of 22 papers were chosen for detailed analysis through the systematic review process. This paper provides a table for a systematic comparison of the included works with respect to the applied method for estimating the uncertainty. Conclusions: The applied methods for estimating uncertainties are diverse, but the sampling-based methods Monte-Carlo Dropout and Deep Ensembles are used most frequently. We concluded that future works can investigate the benefits of uncertainty estimation in collaborative settings of artificial intelligence systems and human experts.

Details

Original languageEnglish
Article numbere36427
JournalJMIR medical informatics
Volume10
Issue number8
Publication statusPublished - 1 Aug 2022
Peer-reviewedYes
Externally publishedYes

Keywords

Keywords

  • deep learning, medical image classification, medical imaging, network calibration, out-of-distribution detection, uncertainty estimation