UniMiSS+: Universal Medical Self-Supervised Learning From Cross-Dimensional Unpaired Data

Yutong Xie, Jianpeng Zhang, Yong Xia, Qi Wu

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Self-supervised learning (SSL) opens up huge opportunities for medical image analysis that is well known for its lack of annotations. However, aggregating massive (unlabeled) 3D medical images like computerized tomography (CT) remains challenging due to its high imaging cost and privacy restrictions. In our pilot study, we advocated bringing a wealth of 2D images like X-rays as compensation for the lack of 3D data, aiming to build a universal medical self-supervised representation learning framework, called UniMiSS. Especially, we designed a pyramid U-like medical Transformer (MiT) as the backbone to make UniMiSS possible to perform SSL with both 2D and 3D images. UniMiSS surpasses current 3D-specific SSL in effectiveness and versatility, excelling in various downstream tasks and overcoming the limitations of dimensionality. However, the initial version did not fully explore the anatomical correlations between 2D and 3D images due to the absence of paired multi-modal patient data. In this extension, we introduce UniMiSS+, which leverages digitally reconstructed radiographs (DRR) technology to simulate X-rays from CT volumes, providing access to paired data. Benefiting from the paired group, we introduce an extra pair-wise constraint to boost the cross modality correlation learning, which also can be adopted as a cross dimension regularization to further improve the representations. We conduct expensive experiments on multiple 3D/2D medical image analysis tasks, including segmentation and classification. The results show that our UniMiSS+ achieves promising performance on various downstream tasks, not only outperforming ImageNet pre-training and other advanced SSL counterparts but also improving the predecessor UniMiSS pre-training.

Original languageEnglish
Pages (from-to)10021-10035
Number of pages15
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume46
Issue number12
DOIs
StatePublished - 2024

Keywords

  • Medical image analysis
  • self-supervised learning
  • transformer
  • universal learning

Fingerprint

Dive into the research topics of 'UniMiSS+: Universal Medical Self-Supervised Learning From Cross-Dimensional Unpaired Data'. Together they form a unique fingerprint.

Cite this