SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues

Yuxin Xie; Tao Zhou; Yi Zhou; Geng Chen

doi:10.1007/978-3-031-72111-3_60

SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues

Yuxin Xie, Tao Zhou, Yi Zhou, Geng Chen

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

Weakly-supervised medical image segmentation is a challenging task that aims to reduce the annotation cost while keep the segmentation performance. In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. Our contribution consists of two key components: an effective Textual-to-Visual Cue Converter that produces visual prompts from text prompts on medical images, and a text-guided segmentation model with Text-Vision Hybrid Attention that fuses text and image features. We evaluate our framework on two medical image segmentation tasks: colonic polyp segmentation and MRI brain tumor segmentation, and achieve consistent state-of-the-art performance. Source code is available at: https://github.com/xyx1024/SimTxtSeg.

Original language	English
Title of host publication	Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings
Editors	Marius George Linguraru, Qi Dou, Aasa Feragen, Stamatia Giannarou, Ben Glocker, Karim Lekadir, Julia A. Schnabel
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	634-644
Number of pages	11
ISBN (Print)	9783031721106
DOIs	https://doi.org/10.1007/978-3-031-72111-3_60
State	Published - 2024
Event	27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024 - Marrakesh, Morocco Duration: 6 Oct 2024 → 10 Oct 2024

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	15008 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024
Country/Territory	Morocco
City	Marrakesh
Period	6/10/24 → 10/10/24

Keywords

Text-vision hybrid attention
Textual-to-visual cue converter
Weakly-supervised medical image segmentation

Access to Document

10.1007/978-3-031-72111-3_60

Cite this

Xie, Y., Zhou, T., Zhou, Y., & Chen, G. (2024). SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues. In M. G. Linguraru, Q. Dou, A. Feragen, S. Giannarou, B. Glocker, K. Lekadir, & J. A. Schnabel (Eds.), Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings (pp. 634-644). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15008 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-72111-3_60

Xie, Yuxin ; Zhou, Tao ; Zhou, Yi et al. / SimTxtSeg : Weakly-Supervised Medical Image Segmentation with Simple Text Cues. Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings. editor / Marius George Linguraru ; Qi Dou ; Aasa Feragen ; Stamatia Giannarou ; Ben Glocker ; Karim Lekadir ; Julia A. Schnabel. Springer Science and Business Media Deutschland GmbH, 2024. pp. 634-644 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{36dc38e6afeb4b188598b8dc81a0b8e6,

title = "SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues",

abstract = "Weakly-supervised medical image segmentation is a challenging task that aims to reduce the annotation cost while keep the segmentation performance. In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. Our contribution consists of two key components: an effective Textual-to-Visual Cue Converter that produces visual prompts from text prompts on medical images, and a text-guided segmentation model with Text-Vision Hybrid Attention that fuses text and image features. We evaluate our framework on two medical image segmentation tasks: colonic polyp segmentation and MRI brain tumor segmentation, and achieve consistent state-of-the-art performance. Source code is available at: https://github.com/xyx1024/SimTxtSeg.",

keywords = "Text-vision hybrid attention, Textual-to-visual cue converter, Weakly-supervised medical image segmentation",

author = "Yuxin Xie and Tao Zhou and Yi Zhou and Geng Chen",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.; 27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024 ; Conference date: 06-10-2024 Through 10-10-2024",

year = "2024",

doi = "10.1007/978-3-031-72111-3_60",

language = "英语",

isbn = "9783031721106",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "634--644",

editor = "Linguraru, {Marius George} and Qi Dou and Aasa Feragen and Stamatia Giannarou and Ben Glocker and Karim Lekadir and Schnabel, {Julia A.}",

booktitle = "Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings",

}

Xie, Y, Zhou, T, Zhou, Y & Chen, G 2024, SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues. in MG Linguraru, Q Dou, A Feragen, S Giannarou, B Glocker, K Lekadir & JA Schnabel (eds), Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 15008 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 634-644, 27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024, Marrakesh, Morocco, 6/10/24. https://doi.org/10.1007/978-3-031-72111-3_60

SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues. / Xie, Yuxin; Zhou, Tao; Zhou, Yi et al.
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings. ed. / Marius George Linguraru; Qi Dou; Aasa Feragen; Stamatia Giannarou; Ben Glocker; Karim Lekadir; Julia A. Schnabel. Springer Science and Business Media Deutschland GmbH, 2024. p. 634-644 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15008 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - SimTxtSeg

T2 - 27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024

AU - Xie, Yuxin

AU - Zhou, Tao

AU - Zhou, Yi

AU - Chen, Geng

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

PY - 2024

Y1 - 2024

N2 - Weakly-supervised medical image segmentation is a challenging task that aims to reduce the annotation cost while keep the segmentation performance. In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. Our contribution consists of two key components: an effective Textual-to-Visual Cue Converter that produces visual prompts from text prompts on medical images, and a text-guided segmentation model with Text-Vision Hybrid Attention that fuses text and image features. We evaluate our framework on two medical image segmentation tasks: colonic polyp segmentation and MRI brain tumor segmentation, and achieve consistent state-of-the-art performance. Source code is available at: https://github.com/xyx1024/SimTxtSeg.

AB - Weakly-supervised medical image segmentation is a challenging task that aims to reduce the annotation cost while keep the segmentation performance. In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. Our contribution consists of two key components: an effective Textual-to-Visual Cue Converter that produces visual prompts from text prompts on medical images, and a text-guided segmentation model with Text-Vision Hybrid Attention that fuses text and image features. We evaluate our framework on two medical image segmentation tasks: colonic polyp segmentation and MRI brain tumor segmentation, and achieve consistent state-of-the-art performance. Source code is available at: https://github.com/xyx1024/SimTxtSeg.

KW - Text-vision hybrid attention

KW - Textual-to-visual cue converter

KW - Weakly-supervised medical image segmentation

UR - http://www.scopus.com/inward/record.url?scp=85206897039&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-72111-3_60

DO - 10.1007/978-3-031-72111-3_60

M3 - 会议稿件

AN - SCOPUS:85206897039

SN - 9783031721106

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 634

EP - 644

BT - Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings

A2 - Linguraru, Marius George

A2 - Dou, Qi

A2 - Feragen, Aasa

A2 - Giannarou, Stamatia

A2 - Glocker, Ben

A2 - Lekadir, Karim

A2 - Schnabel, Julia A.

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 6 October 2024 through 10 October 2024

ER -

Xie Y, Zhou T, Zhou Y, Chen G. SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues. In Linguraru MG, Dou Q, Feragen A, Giannarou S, Glocker B, Lekadir K, Schnabel JA, editors, Medical Image Computing and Computer Assisted Intervention – MICCAI 2024 - 27th International Conference, Proceedings. Springer Science and Business Media Deutschland GmbH. 2024. p. 634-644. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-72111-3_60