Why rankings of biomedical image analysis competitions should be interpreted with care

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

  • Lena Maier-Hein - , German Cancer Research Center (DKFZ) (Author)
  • Matthias Eisenmann - , German Cancer Research Center (DKFZ) (Author)
  • Annika Reinke - , German Cancer Research Center (DKFZ) (Author)
  • Sinan Onogur - , German Cancer Research Center (DKFZ) (Author)
  • Marko Stankovic - , German Cancer Research Center (DKFZ) (Author)
  • Patrick Scholz - , German Cancer Research Center (DKFZ) (Author)
  • Tal Arbel - , McGill University (Author)
  • Hrvoje Bogunovic - , Medical University of Vienna (Author)
  • Andrew P. Bradley - , Queensland University of Technology (Author)
  • Aaron Carass - , Johns Hopkins University (Author)
  • Carolin Feldmann - , German Cancer Research Center (DKFZ) (Author)
  • Alejandro F. Frangi - , University of Leeds (Author)
  • Peter M. Full - , German Cancer Research Center (DKFZ) (Author)
  • Bram van Ginneken - , Radboud University Nijmegen (Author)
  • Allan Hanbury - , Vienna University of Technology, Complexity Science Hub Vienna (Author)
  • Katrin Honauer - , Heidelberg University  (Author)
  • Michal Kozubek - , Masaryk University (Author)
  • Bennett A. Landman - , Vanderbilt University (Author)
  • Keno März - , German Cancer Research Center (DKFZ) (Author)
  • Oskar Maier - , University of Lübeck (Author)
  • Klaus Maier-Hein - , German Cancer Research Center (DKFZ) (Author)
  • Bjoern H. Menze - , Technical University of Munich (Author)
  • Henning Müller - , University of Applied Sciences and Arts of Western Switzerland (Author)
  • Peter F. Neher - , German Cancer Research Center (DKFZ) (Author)
  • Wiro Niessen - , Erasmus University Rotterdam (Author)
  • Nasir Rajpoot - , University of Warwick (Author)
  • Gregory C. Sharp - , Harvard University (Author)
  • Korsuk Sirinukunwattana - , University of Oxford (Author)
  • Stefanie Speidel - , National Center for Tumor Diseases Dresden (Author)
  • Christian Stock - , German Cancer Research Center (DKFZ) (Author)
  • Danail Stoyanov - , University College London (Author)
  • Abdel Aziz Taha - , Research Studios Austria (Author)
  • Fons van der Sommen - , Eindhoven University of Technology (Author)
  • Ching Wei Wang - , National Taiwan University of Science and Technology (Author)
  • Marc André Weber - , Rostock University Medical Centre (Author)
  • Guoyan Zheng - , University of Bern (Author)
  • Pierre Jannin - , Université de Rennes 1 (Author)
  • Annette Kopp-Schneider - , German Cancer Research Center (DKFZ) (Author)

Abstract

International challenges have become the standard for validation of biomedical image analysis methods. Given their scientific impact, it is surprising that a critical analysis of common practices related to the organization of challenges has not yet been performed. In this paper, we present a comprehensive analysis of biomedical image analysis challenges conducted up to now. We demonstrate the importance of challenges and show that the lack of quality control has critical consequences. First, reproducibility and interpretation of the results is often hampered as only a fraction of relevant information is typically provided. Second, the rank of an algorithm is generally not robust to a number of variables such as the test data used for validation, the ranking scheme applied and the observers that make the reference annotations. To overcome these problems, we recommend best practice guidelines and define open research questions to be addressed in the future.

Details

Original languageEnglish
Article number5217
JournalNature communications
Volume9
Issue number1
Publication statusPublished - 1 Dec 2018
Peer-reviewedYes

External IDs

PubMed 30523263
ORCID /0000-0002-4590-1908/work/163294102