SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

Abstract

Computer-assisted surgery (CAS) systems are designed to assist surgeons during procedures, thereby reducing complications and enhancing patient care. Training machine learning models for these systems requires a large corpus of annotated datasets, which is challenging to obtain in the surgical domain due to patient privacy concerns and the significant labeling effort required from doctors. Previous methods have explored unpaired image translation using generative models to create realistic surgical images from simulations. However, these approaches have struggled to produce high-quality, diverse surgical images. In this work, we introduce SurgicaL-CD, a consistency-distilled diffusion method to generate realistic surgical images with only a few sampling steps without paired data. We evaluate our approach on three datasets, assessing the generated images in terms of quality and utility as downstream training datasets. Our results demonstrate that our method outperforms GANs and diffusion-based approaches. Our code is available at https://gitlab.com/nct_tso_public/gan2diffusion.

Details

Original languageEnglish
Title of host publicationComputer Vision – ECCV 2024 Workshops
EditorsAlessio Del Bue, Cristian Canton, Jordi Pont-Tuset, Tatiana Tommasi
PublisherSpringer Science and Business Media B.V.
Pages218-235
Number of pages18
ISBN (electronic)978-3-031-91907-7
ISBN (print)978-3-031-91906-0
Publication statusE-pub ahead of print - May 2025
Peer-reviewedYes

Publication series

SeriesLecture Notes in Computer Science
Volume15642 LNCS
ISSN0302-9743

Conference

Title18th European Conference on Computer Vision
Abbreviated titleECCV 2024
Conference number18
Duration29 September - 4 October 2024
Website
LocationMiCo Milano
CityMilan
CountryItaly

External IDs

ORCID /0000-0002-4590-1908/work/192583275

Keywords

Keywords

  • Diffusion models, Surgical image generation, Unpaired image translation