Abnormality-Driven Representation Learning for Radiology Imaging

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Beitragende

Abstract

Radiology deep learning pipelines predominantly employ end-to-end 3D networks based on models pre-trained on other tasks, which are then fine-tuned on the task at hand. In contrast, adjacent medical fields such as pathology, which focus on 2D images, have effectively adopted task-agnostic foundational models based on self-supervised learning (SSL), combined with weakly-supervised deep learning (DL). However, the field of radiology still lacks task-agnostic representation models due to the computational and data demands of 3D imaging and the anatomical complexity inherent to radiology scans. To address this gap, we propose Clear, a framework for 3D radiology images that uses extracted embeddings from 2D slices along with attention-based aggregation to efficiently predict clinical endpoints. As part of this framework, we introduce Lecl, a novel approach to obtain visual representations driven by abnormalities in 2D axial slices across different locations of the CT scans. Specifically, we trained single-domain contrastive learning approaches using three different architectures: Vision Transformers, Vision State Space Models and Gated Convolutional Neural Networks. We evaluate our approach across three clinical tasks: tumor lesion location, lung disease detection, and patient staging, benchmarking against four state-of-the-art foundation models, including BiomedCLIP. Our findings demonstrate that Clear, using representations learned through Lecl, outperforms existing foundation models, while being substantially more compute- and data-efficient. The code is available at https://github.com/KatherLab/CLEAR.

Details

OriginalspracheEnglisch
TitelMedical Image Computing and Computer Assisted Intervention
Redakteure/-innenJames C. Gee, Jaesung Hong, Carole H. Sudre, Polina Golland, Daniel C. Alexander, Juan Eugenio Iglesias, Archana Venkataraman, Jong Hyo Kim
Herausgeber (Verlag)Springer Science and Business Media B.V.
Seiten14-24
Seitenumfang11
ISBN (elektronisch)978-3-032-04965-0
ISBN (Print)978-3-032-04964-3
PublikationsstatusVeröffentlicht - 2026
Peer-Review-StatusJa

Publikationsreihe

ReiheLecture notes in computer science
Band15963 LNCS
ISSN0302-9743

Konferenz

Titel28th International Conference on Medical Image Computing and Computer Assisted Intervention
KurztitelMICCAI 2025
Veranstaltungsnummer28
Dauer23 - 27 September 2025
Webseite
OrtDaejeon Convention Center
StadtDaejeon
LandSüdkorea

Externe IDs

ORCID /0000-0002-3730-5348/work/198594710

Schlagworte