Base and Meta: A New Perspective on Few-Shot Segmentation

Chunbo Lang; Gong Cheng; Binfei Tu; Chao Li; Junwei Han

doi:10.1109/TPAMI.2023.3265865

Base and Meta: A New Perspective on Few-Shot Segmentation

Chunbo Lang, Gong Cheng, Binfei Tu, Chao Li, Junwei Han

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

123 引用（Scopus）

摘要

Despite the progress made by few-shot segmentation (FSS) in low-data regimes, the generalization capability of most previous works could be fragile when countering hard query samples with seen-class objects. This paper proposes a fresh and powerful scheme to tackle such an intractable bias problem, dubbed base and meta (BAM). Concretely, we apply an auxiliary branch (base learner) to the conventional FSS framework (meta learner) to explicitly identify base-class objects, i.e., the regions that do not need to be segmented. Then, the coarse results output by these two learners in parallel are adaptively integrated to derive accurate segmentation predictions. Considering the sensitivity of meta learner, we further introduce adjustment factors to estimate the scene differences between support and query image pairs from both style and appearance perspectives, so as to facilitate the model ensemble forecasting. The remarkable performance gains on standard benchmarks (PASCAL-5i, COCO-20i, and FSS-1000) manifest the effectiveness, and surprisingly, our versatile scheme sets new state-of-the-arts even with two plain learners. Furthermore, in light of its unique nature, we also discuss several more practical but challenging extensions, including generalized FSS, 3D point cloud FSS, class-agnostic FSS, cross-domain FSS, weak-label FSS, and zero-shot segmentation. Our source code is available at https://github.com/chunbolang/BAM.

源语言	英语
页（从-至）	10669-10686
页数	18
期刊	IEEE Transactions on Pattern Analysis and Machine Intelligence
卷	45
期	9
DOI	https://doi.org/10.1109/TPAMI.2023.3265865
出版状态	已出版 - 1 9月 2023

访问文件

10.1109/TPAMI.2023.3265865

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{7ba216fff31a4e68aca7a0e4c614cf6d,

title = "Base and Meta: A New Perspective on Few-Shot Segmentation",

abstract = "Despite the progress made by few-shot segmentation (FSS) in low-data regimes, the generalization capability of most previous works could be fragile when countering hard query samples with seen-class objects. This paper proposes a fresh and powerful scheme to tackle such an intractable bias problem, dubbed base and meta (BAM). Concretely, we apply an auxiliary branch (base learner) to the conventional FSS framework (meta learner) to explicitly identify base-class objects, i.e., the regions that do not need to be segmented. Then, the coarse results output by these two learners in parallel are adaptively integrated to derive accurate segmentation predictions. Considering the sensitivity of meta learner, we further introduce adjustment factors to estimate the scene differences between support and query image pairs from both style and appearance perspectives, so as to facilitate the model ensemble forecasting. The remarkable performance gains on standard benchmarks (PASCAL-5i, COCO-20i, and FSS-1000) manifest the effectiveness, and surprisingly, our versatile scheme sets new state-of-the-arts even with two plain learners. Furthermore, in light of its unique nature, we also discuss several more practical but challenging extensions, including generalized FSS, 3D point cloud FSS, class-agnostic FSS, cross-domain FSS, weak-label FSS, and zero-shot segmentation. Our source code is available at https://github.com/chunbolang/BAM.",

keywords = "3D point cloud segmentation, Few-shot learning, few-shot segmentation, semantic segmentation",

author = "Chunbo Lang and Gong Cheng and Binfei Tu and Chao Li and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2023",

month = sep,

day = "1",

doi = "10.1109/TPAMI.2023.3265865",

language = "英语",

volume = "45",

pages = "10669--10686",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "9",

}

TY - JOUR

T1 - Base and Meta

T2 - A New Perspective on Few-Shot Segmentation

AU - Lang, Chunbo

AU - Cheng, Gong

AU - Tu, Binfei

AU - Li, Chao

AU - Han, Junwei

PY - 2023/9/1

Y1 - 2023/9/1

N2 - Despite the progress made by few-shot segmentation (FSS) in low-data regimes, the generalization capability of most previous works could be fragile when countering hard query samples with seen-class objects. This paper proposes a fresh and powerful scheme to tackle such an intractable bias problem, dubbed base and meta (BAM). Concretely, we apply an auxiliary branch (base learner) to the conventional FSS framework (meta learner) to explicitly identify base-class objects, i.e., the regions that do not need to be segmented. Then, the coarse results output by these two learners in parallel are adaptively integrated to derive accurate segmentation predictions. Considering the sensitivity of meta learner, we further introduce adjustment factors to estimate the scene differences between support and query image pairs from both style and appearance perspectives, so as to facilitate the model ensemble forecasting. The remarkable performance gains on standard benchmarks (PASCAL-5i, COCO-20i, and FSS-1000) manifest the effectiveness, and surprisingly, our versatile scheme sets new state-of-the-arts even with two plain learners. Furthermore, in light of its unique nature, we also discuss several more practical but challenging extensions, including generalized FSS, 3D point cloud FSS, class-agnostic FSS, cross-domain FSS, weak-label FSS, and zero-shot segmentation. Our source code is available at https://github.com/chunbolang/BAM.

AB - Despite the progress made by few-shot segmentation (FSS) in low-data regimes, the generalization capability of most previous works could be fragile when countering hard query samples with seen-class objects. This paper proposes a fresh and powerful scheme to tackle such an intractable bias problem, dubbed base and meta (BAM). Concretely, we apply an auxiliary branch (base learner) to the conventional FSS framework (meta learner) to explicitly identify base-class objects, i.e., the regions that do not need to be segmented. Then, the coarse results output by these two learners in parallel are adaptively integrated to derive accurate segmentation predictions. Considering the sensitivity of meta learner, we further introduce adjustment factors to estimate the scene differences between support and query image pairs from both style and appearance perspectives, so as to facilitate the model ensemble forecasting. The remarkable performance gains on standard benchmarks (PASCAL-5i, COCO-20i, and FSS-1000) manifest the effectiveness, and surprisingly, our versatile scheme sets new state-of-the-arts even with two plain learners. Furthermore, in light of its unique nature, we also discuss several more practical but challenging extensions, including generalized FSS, 3D point cloud FSS, class-agnostic FSS, cross-domain FSS, weak-label FSS, and zero-shot segmentation. Our source code is available at https://github.com/chunbolang/BAM.

KW - 3D point cloud segmentation

KW - Few-shot learning

KW - few-shot segmentation

KW - semantic segmentation

UR - http://www.scopus.com/inward/record.url?scp=85153394559&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2023.3265865

DO - 10.1109/TPAMI.2023.3265865

M3 - 文章

C2 - 37037231

AN - SCOPUS:85153394559

SN - 0162-8828

VL - 45

SP - 10669

EP - 10686

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 9

ER -

Base and Meta: A New Perspective on Few-Shot Segmentation

摘要

访问文件

其它文件与链接

指纹

引用此