Abnormality-Driven Representation Learning for Radiology Imaging
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
Radiology deep learning pipelines predominantly employ end-to-end 3D networks based on models pre-trained on other tasks, which are then fine-tuned on the task at hand. In contrast, adjacent medical fields such as pathology, which focus on 2D images, have effectively adopted task-agnostic foundational models based on self-supervised learning (SSL), combined with weakly-supervised deep learning (DL). However, the field of radiology still lacks task-agnostic representation models due to the computational and data demands of 3D imaging and the anatomical complexity inherent to radiology scans. To address this gap, we propose Clear, a framework for 3D radiology images that uses extracted embeddings from 2D slices along with attention-based aggregation to efficiently predict clinical endpoints. As part of this framework, we introduce Lecl, a novel approach to obtain visual representations driven by abnormalities in 2D axial slices across different locations of the CT scans. Specifically, we trained single-domain contrastive learning approaches using three different architectures: Vision Transformers, Vision State Space Models and Gated Convolutional Neural Networks. We evaluate our approach across three clinical tasks: tumor lesion location, lung disease detection, and patient staging, benchmarking against four state-of-the-art foundation models, including BiomedCLIP. Our findings demonstrate that Clear, using representations learned through Lecl, outperforms existing foundation models, while being substantially more compute- and data-efficient. The code is available at https://github.com/KatherLab/CLEAR.
Details
| Originalsprache | Englisch |
|---|---|
| Titel | Medical Image Computing and Computer Assisted Intervention |
| Redakteure/-innen | James C. Gee, Jaesung Hong, Carole H. Sudre, Polina Golland, Daniel C. Alexander, Juan Eugenio Iglesias, Archana Venkataraman, Jong Hyo Kim |
| Herausgeber (Verlag) | Springer Science and Business Media B.V. |
| Seiten | 14-24 |
| Seitenumfang | 11 |
| ISBN (elektronisch) | 978-3-032-04965-0 |
| ISBN (Print) | 978-3-032-04964-3 |
| Publikationsstatus | Veröffentlicht - 2026 |
| Peer-Review-Status | Ja |
Publikationsreihe
| Reihe | Lecture notes in computer science |
|---|---|
| Band | 15963 LNCS |
| ISSN | 0302-9743 |
Konferenz
| Titel | 28th International Conference on Medical Image Computing and Computer Assisted Intervention |
|---|---|
| Kurztitel | MICCAI 2025 |
| Veranstaltungsnummer | 28 |
| Dauer | 23 - 27 September 2025 |
| Webseite | |
| Ort | Daejeon Convention Center |
| Stadt | Daejeon |
| Land | Südkorea |
Externe IDs
| ORCID | /0000-0002-3730-5348/work/198594710 |
|---|