Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge

Publication: Contribution to journal, Short survey/Review article, Contributed, Peer-reviewed

Contributors

  • Collaborators (Author)
  • Tobias Rueckert - Ostbayerische Technische Hochschule Regensburg, AKTORmed Robotic Surgery, Technische Universität München (Author)
  • David Rauber - Ostbayerische Technische Hochschule Regensburg (Author)
  • Raphaela Maerkl - Ostbayerische Technische Hochschule Regensburg (Author)
  • Leonard Klausmann - Ostbayerische Technische Hochschule Regensburg (Author)
  • Suemeyye R. Yildiran - Ostbayerische Technische Hochschule Regensburg (Author)
  • Max Gutbrod - Ostbayerische Technische Hochschule Regensburg (Author)
  • Danilo Weber Nunes - Ostbayerische Technische Hochschule Regensburg (Author)
  • Alvaro Fernandez Moreno - Medtronic Ltd., University College London (Author)
  • Imanol Luengo - Medtronic Ltd. (Author)
  • Danail Stoyanov - Medtronic Ltd., University College London (Author)
  • Nicolas Toussaint - Medtronic Ltd. (Author)
  • Enki Cho - Kyung Hee University (Author)
  • Hyeon Bae Kim - Kyung Hee University (Author)
  • Oh Sung Choo - Kyung Hee University (Author)
  • Ka Young Kim - Kyung Hee University (Author)
  • Seong Tae Kim - Kyung Hee University (Author)
  • Gonçalo Arantes - Universidade do Minho (Author)
  • Kehan Song - Hanglok Tech (Author)
  • Jianjun Zhu - Hanglok Tech (Author)
  • Junchen Xiong - Hanglok Tech (Author)
  • Tingyi Lin - Hanglok Tech (Author)
  • Shunsuke Kikuchi - Jmees Inc. (Author)
  • Hiroki Matsuzaki - Jmees Inc. (Author)
  • Atsushi Kouno - Jmees Inc. (Author)
  • João Renato Ribeiro Manesco - Universidade Estadual Paulista Júlio de Mesquita Filho (Author)
  • João Paulo Papa - Universidade Estadual Paulista Júlio de Mesquita Filho (Author)
  • Tae Min Choi - Korea Institute of Science and Technology (Author)
  • Tae Kyeong Jeong - Korea Institute of Science and Technology (Author)
  • Juyoun Park - Korea Institute of Science and Technology (Author)
  • Oluwatosin Alabi - King's College London (KCL) (Author)
  • Meng Wei - King's College London (KCL) (Author)
  • Tom Vercauteren - King's College London (KCL) (Author)
  • Runzhi Wu - Chinese University of Hong Kong (Author)
  • Mengya Xu - Chinese University of Hong Kong (Author)
  • An Wang - Chinese University of Hong Kong (Author)
  • Long Bai - Chinese University of Hong Kong (Author)
  • Hongliang Ren - Chinese University of Hong Kong (Author)
  • Amine Yamlahi - Deutsches Krebsforschungszentrum (DKFZ) (Author)
  • Jakob Hennighausen - Deutsches Krebsforschungszentrum (DKFZ) (Author)
  • Lena Maier-Hein - Deutsches Krebsforschungszentrum (DKFZ) (Author)
  • Satoshi Kondo - Muroran Institute of Technology (Author)
  • Satoshi Kasai - Niigata University of Health and Welfare (Author)
  • Kousuke Hirasawa - Konica Minolta Inc (Author)
  • Shu Yang - Hong Kong University of Science and Technology (Author)
  • Yihui Wang - Hong Kong University of Science and Technology (Author)
  • Hao Chen - Hong Kong University of Science and Technology, HKUST Shenzhen-Hong Kong Collaborative Innovation Research Institute (Author)
  • Santiago Rodríguez - Universidad de los Andes Colombia (Author)
  • Nicolás Aparicio - Universidad de los Andes Colombia (Author)
  • Leonardo Manrique - Universidad de los Andes Colombia (Author)
  • Stefanie Speidel - Exzellenzcluster CeTI: Zentrum für Taktiles Internet, Nationales Centrum für Tumorerkrankungen Dresden (Author)
  • Christoph Palm - Ostbayerische Technische Hochschule Regensburg (Author)

Abstract

Reliable recognition and localization of surgical instruments in endoscopic video recordings are foundational for a wide range of applications in computer- and robot-assisted minimally invasive surgery (RAMIS), including surgical training, skill assessment, and autonomous assistance. However, robust performance under real-world conditions remains a significant challenge. Incorporating surgical context – such as the current procedural phase – has emerged as a promising strategy to improve robustness and interpretability. To address these challenges, we organized the Surgical Procedure Phase, Keypoint, and Instrument Recognition (PhaKIR) sub-challenge as part of the Endoscopic Vision (EndoVis) challenge at MICCAI 2024. We introduced a novel, multi-center dataset comprising thirteen full-length laparoscopic cholecystectomy videos collected from three distinct medical institutions, with unified annotations for three interrelated tasks: surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation. Unlike existing datasets, ours enables joint investigation of instrument localization and procedural context within the same data while supporting the integration of temporal information across entire procedures. We report results and findings in accordance with the BIAS guidelines for biomedical image analysis challenges. The PhaKIR sub-challenge advances the field by providing a unique benchmark for developing temporally aware, context-driven methods in RAMIS and offers a high-quality resource to support future research in surgical scene understanding.

Details

Original language: English
Article number: 103945
Journal: Medical Image Analysis
Volume: 109
Publication status: Published - March 2026
Peer-review status: Yes

Keywords

  • Instrument instance segmentation, Instrument keypoint estimation, Robot-assisted surgery, Surgical phase recognition