Comparative validation of machine learning algorithms for surgical workflow and skill analysis with the HeiChole benchmark

Martin Wagner; Beat Peter Müller-Stich; Anna Kisilenko; Duc Tran; Patrick Heger; Lars Mündermann; David M. Lubotsky; Benjamin Müller; Tornike Davitashvili; Manuela Capek; Annika Reinke; Carissa Reid; Tong Yu; Armine Vardazaryan; Chinedu Innocent Nwoye; Nicolas Padoy; Xinyang Liu; Eung Joo Lee; Constantin Disch; Hans Meine; Tong Xia; Fucang Jia; Satoshi Kondo; Wolfgang Reiter; Yueming Jin; Yonghao Long; Meirui Jiang; Qi Dou; Pheng Ann Heng; Isabell Twick; Kadir Kirtac; Enes Hosgor; Jon Lindström Bolmgren; Michael Stenzel; Björn von Siemens; Long Zhao; Zhenxiao Ge; Haiming Sun; Di Xie; Mengqi Guo; Daochang Liu; Hannes G. Kenngott; Felix Nickel; Moritz von Frankenberg; Franziska Mathis-Ullrich; Annette Kopp-Schneider; Lena Maier-Hein; Stefanie Speidel; Sebastian Bodenstedt

doi:10.1016/j.media.2023.102770

Comparative validation of machine learning algorithms for surgical workflow and skill analysis with the HeiChole benchmark

Research output: Contribution to journal › Short survey/Review › Contributed › peer-review

Contributors

Martin Wagner - , Heidelberg University , National Center for Tumor Diseases (NCT) Heidelberg (Author)
Beat Peter Müller-Stich - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Anna Kisilenko - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Duc Tran - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Patrick Heger - , Heidelberg University (Author)
Lars Mündermann - , Karl Storz SE & Co. KG (Author)
David M. Lubotsky - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Benjamin Müller - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Tornike Davitashvili - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Manuela Capek - , Heidelberg University , German Cancer Research Center (DKFZ) (Author)
Annika Reinke - , German Cancer Research Center (DKFZ), Heidelberg University (Author)
Carissa Reid - , German Cancer Research Center (DKFZ) (Author)
Tong Yu - , University of Strasbourg, France (Author)
Armine Vardazaryan - , University of Strasbourg, France (Author)
Chinedu Innocent Nwoye - , University of Strasbourg, France (Author)
Nicolas Padoy - , University of Strasbourg, France (Author)
Xinyang Liu - , Children's National Hospital (Author)
Eung Joo Lee - , University of Maryland, College Park (Author)
Constantin Disch - , Fraunhofer Institute for Digital Medicine (Author)
Hans Meine - , Fraunhofer Institute for Digital Medicine, University of Bremen (Author)
Tong Xia - , Shenzhen Institute of Advanced Technology (SIAT) (Author)
Fucang Jia - , Shenzhen Institute of Advanced Technology (SIAT) (Author)
Satoshi Kondo - , Konica Minolta Inc (Author)
Wolfgang Reiter - , Wintegral GmbH (Author)
Yueming Jin - , Chinese University of Hong Kong (Author)
Yonghao Long - , Chinese University of Hong Kong (Author)
Meirui Jiang - , Chinese University of Hong Kong (Author)
Qi Dou - , Chinese University of Hong Kong (Author)
Pheng Ann Heng - , Chinese University of Hong Kong (Author)
Isabell Twick - , Caresyntax GmbH (Author)
Kadir Kirtac - , Caresyntax GmbH (Author)
Enes Hosgor - , Caresyntax GmbH (Author)
Jon Lindström Bolmgren - , Caresyntax GmbH (Author)
Michael Stenzel - , Caresyntax GmbH (Author)
Björn von Siemens - , Caresyntax GmbH (Author)
Long Zhao - , Hangzhou Hikvision Digital Technology Co. Ltd. (Author)
Zhenxiao Ge - , Hangzhou Hikvision Digital Technology Co. Ltd. (Author)
Haiming Sun - , Hangzhou Hikvision Digital Technology Co. Ltd. (Author)
Di Xie - , Hangzhou Hikvision Digital Technology Co. Ltd. (Author)
Mengqi Guo - , National University of Singapore (Author)
Daochang Liu - , Peking University (Author)
Hannes G. Kenngott - , Heidelberg University (Author)
Felix Nickel - , Heidelberg University (Author)
Moritz von Frankenberg - , Salem Hospital (Author)
Franziska Mathis-Ullrich - , Karlsruhe Institute of Technology (Author)
Annette Kopp-Schneider - , German Cancer Research Center (DKFZ) (Author)
Lena Maier-Hein - , German Cancer Research Center (DKFZ), Heidelberg University (Author)
Stefanie Speidel - , National Center for Tumor Diseases Dresden, Clusters of Excellence CeTI: Centre for Tactile Internet (Author)
Sebastian Bodenstedt - , TUD Dresden University of Technology (Author)

Abstract

Purpose: Surgical workflow and skill analysis are key technologies for the next generation of cognitive surgical assistance systems. These systems could increase the safety of the operation through context-sensitive warnings and semi-autonomous robotic assistance or improve training of surgeons via data-driven feedback. In surgical workflow analysis up to 91% average precision has been reported for phase recognition on an open data single-center video dataset. In this work we investigated the generalizability of phase recognition algorithms in a multicenter setting including more difficult recognition tasks such as surgical action and surgical skill. Methods: To achieve this goal, a dataset with 33 laparoscopic cholecystectomy videos from three surgical centers with a total operation time of 22 h was created. Labels included framewise annotation of seven surgical phases with 250 phase transitions, 5514 occurences of four surgical actions, 6980 occurences of 21 surgical instruments from seven instrument categories and 495 skill classifications in five skill dimensions. The dataset was used in the 2019 international Endoscopic Vision challenge, sub-challenge for surgical workflow and skill analysis. Here, 12 research teams trained and submitted their machine learning algorithms for recognition of phase, action, instrument and/or skill assessment. Results: F1-scores were achieved for phase recognition between 23.9% and 67.7% (n = 9 teams), for instrument presence detection between 38.5% and 63.8% (n = 8 teams), but for action recognition only between 21.8% and 23.3% (n = 5 teams). The average absolute error for skill assessment was 0.78 (n = 1 team). Conclusion: Surgical workflow and skill analysis are promising technologies to support the surgical team, but there is still room for improvement, as shown by our comparison of machine learning algorithms. This novel HeiChole benchmark can be used for comparable evaluation and validation of future work. In future studies, it is of utmost importance to create more open, high-quality datasets in order to allow the development of artificial intelligence and cognitive robotics in surgery.

Details

Original language	English
Article number	102770
Journal	Medical Image Analysis
Volume	86
Publication status	Published - May 2023
Peer-reviewed	Yes

External IDs

PubMed	36889206
ORCID	/0000-0002-4590-1908/work/163294007

Keywords

ASJC Scopus subject areas

Keywords

Endoscopic vision, Laparoscopic cholecystectomy, Surgical data science, Surgical workflow analysis

Research Portal of the TU Dresden

Contributors

Abstract

Details

External IDs

Keywords

ASJC Scopus subject areas

Keywords