MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention

  • Tianyi Wang
  • , Jianan Fan
  • , Dingxin Zhang
  • , Dongnan Liu
  • , Yong Xia
  • , Heng Huang
  • , Weidong Cai

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Histopathology and transcriptomics are fundamental modalities in cancer diagnostics, encapsulating the morphological and molecular characteristics of the disease. Multi-modal self-supervised learning has demonstrated remarkable potential in learning pathological representations by integrating diverse data sources. Conventional multi-modal integration methods primarily emphasize modality alignment, while paying insufficient attention to retaining the modality-specific intrinsic structures. However, unlike conventional scenarios where multi-modal inputs often share highly overlapping features, histopathology and transcriptomics exhibit pronounced heterogeneity, offering orthogonal yet complementary insights. Histopathology data provides morphological and spatial context, elucidating tissue architecture and cellular topology, whereas transcriptomics data delineates molecular signatures through quantifying gene expression patterns. This inherent disparity introduces a major challenge in aligning these modalities while maintaining modality-specific fidelity. To address these challenges, we present MIRROR, a novel multi-modal representation learning framework designed to foster both modality alignment and retention. MIRROR employs dedicated encoders to extract comprehensive feature representations for each modality, which is further complemented by a modality alignment module to achieve seamless integration between phenotype patterns and molecular profiles. Furthermore, a modality retention module safeguards unique attributes from each modality, while a style clustering module mitigates redundancy and enhances disease-relevant information by modeling and aligning consistent pathological signatures within a clustering space. Extensive evaluations on The Cancer Genome Atlas (TCGA) cohorts for cancer subtyping and survival analysis highlight MIRROR’s superior performance, demonstrating its effectiveness in constructing comprehensive oncological feature representations and benefiting the cancer diagnosis.

Original languageEnglish
JournalIEEE Transactions on Medical Imaging
DOIs
StateAccepted/In press - 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Multimodal Learning
  • Pathology
  • Self-Supervised Learning (SSL)
  • Transcriptomics
  • Whole Slide Image (WSI)

Fingerprint

Dive into the research topics of 'MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention'. Together they form a unique fingerprint.

Cite this