跳到主要导航 跳到搜索 跳到主要内容

MedIM: Boost Medical Image Representation via Radiology Report-Guided Masking

  • Yutong Xie
  • , Lin Gu
  • , Tatsuya Harada
  • , Jianpeng Zhang
  • , Yong Xia
  • , Qi Wu
  • University of Adelaide
  • RIKEN
  • The University of Tokyo
  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

15 引用 (Scopus)

摘要

Masked image modelling (MIM)-based pre-training shows promise in improving image representations with limited annotated data by randomly masking image patches and reconstructing them. However, random masking may not be suitable for medical images due to their unique pathology characteristics. This paper proposes Masked medical Image Modelling (MedIM), a novel approach, to our knowledge, the first research that masks and reconstructs discriminative areas guided by radiological reports, encouraging the network to explore the stronger semantic representations from medical images. We introduce two mutual comprehensive masking strategies, knowledge word-driven masking (KWM) and sentence-driven masking (SDM). KWM uses Medical Subject Headings (MeSH) words unique to radiology reports to identify discriminative cues mapped to MeSH words and guide the mask generation. SDM considers that reports usually have multiple sentences, each of which describes different findings, and therefore integrates sentence-level information to identify discriminative regions for mask generation. MedIM integrates both strategies by simultaneously restoring the images masked by KWM and SDM for a more robust and representative medical visual representation. Our extensive experiments on various downstream tasks covering multi-label/class image classification, medical image segmentation, and medical image-text analysis, demonstrate that MedIM with report-guided masking achieves competitive performance. Our method substantially outperforms ImageNet pre-training, MIM-based pre-training, and medical image-report pre-training counterparts. Codes are available at https://github.com/YtongXie/MedIM.

源语言英语
主期刊名Medical Image Computing and Computer Assisted Intervention – MICCAI 2023 - 26th International Conference, Proceedings
编辑Hayit Greenspan, Hayit Greenspan, Anant Madabhushi, Parvin Mousavi, Septimiu Salcudean, James Duncan, Tanveer Syeda-Mahmood, Russell Taylor
出版商Springer Science and Business Media Deutschland GmbH
13-23
页数11
ISBN(印刷版)9783031439063
DOI
出版状态已出版 - 2023
活动26th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2023 - Vancouver, 加拿大
期限: 8 10月 202312 10月 2023

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14220 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议26th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2023
国家/地区加拿大
Vancouver
时期8/10/2312/10/23

指纹

探究 'MedIM: Boost Medical Image Representation via Radiology Report-Guided Masking' 的科研主题。它们共同构成独一无二的指纹。

引用此