Abnormality-Driven Representation Learning for Radiology Imaging

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

Abstract

Radiology deep learning pipelines predominantly employ end-to-end 3D networks based on models pre-trained on other tasks, which are then fine-tuned on the task at hand. In contrast, adjacent medical fields such as pathology, which focus on 2D images, have effectively adopted task-agnostic foundational models based on self-supervised learning (SSL), combined with weakly-supervised deep learning (DL). However, the field of radiology still lacks task-agnostic representation models due to the computational and data demands of 3D imaging and the anatomical complexity inherent to radiology scans. To address this gap, we propose Clear, a framework for 3D radiology images that uses extracted embeddings from 2D slices along with attention-based aggregation to efficiently predict clinical endpoints. As part of this framework, we introduce Lecl, a novel approach to obtain visual representations driven by abnormalities in 2D axial slices across different locations of the CT scans. Specifically, we trained single-domain contrastive learning approaches using three different architectures: Vision Transformers, Vision State Space Models and Gated Convolutional Neural Networks. We evaluate our approach across three clinical tasks: tumor lesion location, lung disease detection, and patient staging, benchmarking against four state-of-the-art foundation models, including BiomedCLIP. Our findings demonstrate that Clear, using representations learned through Lecl, outperforms existing foundation models, while being substantially more compute- and data-efficient. The code is available at https://github.com/KatherLab/CLEAR.

Details

Original languageEnglish
Title of host publicationMedical Image Computing and Computer Assisted Intervention
EditorsJames C. Gee, Jaesung Hong, Carole H. Sudre, Polina Golland, Daniel C. Alexander, Juan Eugenio Iglesias, Archana Venkataraman, Jong Hyo Kim
PublisherSpringer Science and Business Media B.V.
Pages14-24
Number of pages11
ISBN (electronic)978-3-032-04965-0
ISBN (print)978-3-032-04964-3
Publication statusPublished - 2026
Peer-reviewedYes

Publication series

SeriesLecture notes in computer science
Volume15963 LNCS
ISSN0302-9743

Conference

Title28th International Conference on Medical Image Computing and Computer Assisted Intervention
Abbreviated titleMICCAI 2025
Conference number28
Duration23 - 27 September 2025
Website
LocationDaejeon Convention Center
CityDaejeon
CountryKorea, Republic of

External IDs

ORCID /0000-0002-3730-5348/work/198594710

Keywords