TY - JOUR
T1 - Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation
AU - Wang, Qi
AU - Jia, Yuyu
AU - Huang, Wei
AU - Gao, Junyu
AU - Li, Qiang
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Few-shot segmentation (FSS) for remote sensing (RS) imagery leverages supporting information from limited annotated samples to achieve query segmentation of novel classes. Previous efforts are dedicated to mining segmentation-guiding visual cues from a constrained set of support samples. However, they still struggle to address the pronounced intra-class differences in RS images, as sparse visual cues make it challenging to establish robust class-specific representations. In this article, we propose a holistic semantic embedding (HSE) approach that effectively harnesses general semantic knowledge, i.e., class description (CD) embeddings. Instead of the naive combination of CD embeddings and visual features for segmentation decoding, we investigate embedding the general semantic knowledge during the feature extraction stage. Specifically, in HSE, a spatial dense interaction (SDI) module allows the interaction of visual support features with CD embeddings along the spatial dimension via self-attention. Furthermore, a global content modulation (GCM) module efficiently augments the global information of the target category in both support and query features, thanks to the transformative fusion of visual features and CD embeddings. These two components holistically synergize CD embeddings and visual cues, constructing a robust class-specific representation. Through extensive experiments on the standard FSS benchmark, the proposed HSE approach demonstrates superior performance compared to peer work, setting a new state-of-the-art.
KW - Class description (CD) embeddings
KW - few-shot segmentation (FSS)
KW - remote sensing (RS)
KW - semantic embedding
UR - http://www.scopus.com/inward/record.url?scp=85210504671&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2024.3519772
DO - 10.1109/TGRS.2024.3519772
M3 - Article
AN - SCOPUS:85210504671
SN - 0196-2892
VL - 63
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5603413
ER -