Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation

Yutong Xie; Jianpeng Zhang; Yong Xia; Chunhua Shen

doi:10.1109/TPAMI.2023.3312587

Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation

Yutong Xie, Jianpeng Zhang, Yong Xia, Chunhua Shen

School of Computer Science

Research output: Contribution to journal › Article › peer-review

21 Scopus citations

Abstract

Medical image benchmarks for the segmentation of organs and tumors suffer from the partially labeling issue due to its intensive cost of labor and expertise. Current mainstream approaches follow the practice of one network solving one task. With this pipeline, not only the performance is limited by the typically small dataset of a single task, but also the computation cost linearly increases with the number of tasks. To address this, we propose a Transformer based dynamic on-demand network (TransDoDNet) that learns to segment organs and tumors on multiple partially labeled datasets. Specifically, TransDoDNet has a hybrid backbone that is composed of the convolutional neural network and Transformer. A dynamic head enables the network to accomplish multiple segmentation tasks flexibly. Unlike existing approaches that fix kernels after training, the kernels in the dynamic head are generated adaptively by the Transformer, which employs the self-attention mechanism to model long-range organ-wise dependencies and decodes the organ embedding that can represent each organ. We create a large-scale partially labeled Multi-Organ and Tumor Segmentation benchmark, termed MOTS, and demonstrate the superior performance of our TransDoDNet over other competitors on seven organ and tumor segmentation tasks. This study also provides a general 3D medical image segmentation model, which has been pre-trained on the large-scale MOTS benchmark and has demonstrated advanced performance over current predominant self-supervised learning methods.

Original language	English
Pages (from-to)	14905-14919
Number of pages	15
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume	45
Issue number	12
DOIs	https://doi.org/10.1109/TPAMI.2023.3312587
State	Published - 1 Dec 2023

Keywords

Dynamic convolution
medical image segmentation
partial label learning
transformer

Access to Document

10.1109/TPAMI.2023.3312587

Cite this

@article{46a94e3fd6ce47aa8d42ea33f674afcd,

title = "Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation",

abstract = "Medical image benchmarks for the segmentation of organs and tumors suffer from the partially labeling issue due to its intensive cost of labor and expertise. Current mainstream approaches follow the practice of one network solving one task. With this pipeline, not only the performance is limited by the typically small dataset of a single task, but also the computation cost linearly increases with the number of tasks. To address this, we propose a Transformer based dynamic on-demand network (TransDoDNet) that learns to segment organs and tumors on multiple partially labeled datasets. Specifically, TransDoDNet has a hybrid backbone that is composed of the convolutional neural network and Transformer. A dynamic head enables the network to accomplish multiple segmentation tasks flexibly. Unlike existing approaches that fix kernels after training, the kernels in the dynamic head are generated adaptively by the Transformer, which employs the self-attention mechanism to model long-range organ-wise dependencies and decodes the organ embedding that can represent each organ. We create a large-scale partially labeled Multi-Organ and Tumor Segmentation benchmark, termed MOTS, and demonstrate the superior performance of our TransDoDNet over other competitors on seven organ and tumor segmentation tasks. This study also provides a general 3D medical image segmentation model, which has been pre-trained on the large-scale MOTS benchmark and has demonstrated advanced performance over current predominant self-supervised learning methods.",

keywords = "Dynamic convolution, medical image segmentation, partial label learning, transformer",

author = "Yutong Xie and Jianpeng Zhang and Yong Xia and Chunhua Shen",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2023",

month = dec,

day = "1",

doi = "10.1109/TPAMI.2023.3312587",

language = "英语",

volume = "45",

pages = "14905--14919",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "12",

}

TY - JOUR

T1 - Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation

AU - Xie, Yutong

AU - Zhang, Jianpeng

AU - Xia, Yong

AU - Shen, Chunhua

PY - 2023/12/1

Y1 - 2023/12/1

N2 - Medical image benchmarks for the segmentation of organs and tumors suffer from the partially labeling issue due to its intensive cost of labor and expertise. Current mainstream approaches follow the practice of one network solving one task. With this pipeline, not only the performance is limited by the typically small dataset of a single task, but also the computation cost linearly increases with the number of tasks. To address this, we propose a Transformer based dynamic on-demand network (TransDoDNet) that learns to segment organs and tumors on multiple partially labeled datasets. Specifically, TransDoDNet has a hybrid backbone that is composed of the convolutional neural network and Transformer. A dynamic head enables the network to accomplish multiple segmentation tasks flexibly. Unlike existing approaches that fix kernels after training, the kernels in the dynamic head are generated adaptively by the Transformer, which employs the self-attention mechanism to model long-range organ-wise dependencies and decodes the organ embedding that can represent each organ. We create a large-scale partially labeled Multi-Organ and Tumor Segmentation benchmark, termed MOTS, and demonstrate the superior performance of our TransDoDNet over other competitors on seven organ and tumor segmentation tasks. This study also provides a general 3D medical image segmentation model, which has been pre-trained on the large-scale MOTS benchmark and has demonstrated advanced performance over current predominant self-supervised learning methods.

AB - Medical image benchmarks for the segmentation of organs and tumors suffer from the partially labeling issue due to its intensive cost of labor and expertise. Current mainstream approaches follow the practice of one network solving one task. With this pipeline, not only the performance is limited by the typically small dataset of a single task, but also the computation cost linearly increases with the number of tasks. To address this, we propose a Transformer based dynamic on-demand network (TransDoDNet) that learns to segment organs and tumors on multiple partially labeled datasets. Specifically, TransDoDNet has a hybrid backbone that is composed of the convolutional neural network and Transformer. A dynamic head enables the network to accomplish multiple segmentation tasks flexibly. Unlike existing approaches that fix kernels after training, the kernels in the dynamic head are generated adaptively by the Transformer, which employs the self-attention mechanism to model long-range organ-wise dependencies and decodes the organ embedding that can represent each organ. We create a large-scale partially labeled Multi-Organ and Tumor Segmentation benchmark, termed MOTS, and demonstrate the superior performance of our TransDoDNet over other competitors on seven organ and tumor segmentation tasks. This study also provides a general 3D medical image segmentation model, which has been pre-trained on the large-scale MOTS benchmark and has demonstrated advanced performance over current predominant self-supervised learning methods.

KW - Dynamic convolution

KW - medical image segmentation

KW - partial label learning

KW - transformer

UR - http://www.scopus.com/inward/record.url?scp=85171558783&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2023.3312587

DO - 10.1109/TPAMI.2023.3312587

M3 - 文章

C2 - 37672381

AN - SCOPUS:85171558783

SN - 0162-8828

VL - 45

SP - 14905

EP - 14919

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 12

ER -

Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this