Semantic Segmentation of High-Resolution Remote Sensing Images Using an Improved Transformer

Yuheng Liu, Shaohui Mei, Shun Zhang, Ye Wang, Mingyi He, Qian Du

科研成果: 书/报告/会议事项章节会议稿件同行评审

8 引用 (Scopus)

摘要

Semantic segmentation has been widely researched for high level analysis of High Spatial Resolution (HSR) remote sensing images, where Convolutional Neural Network (CNN) is the mainstream method. However, the transformer with attention mechanism has its unique capacity of extracting global information which is generally ignored by CNN models. In this paper, a Swin Transformer with UPer head (STUP) is proposed to tackle with semantic segmentation problem on a challenging remote sensing land-cover dataset called LoveDA, which owns complex background samples and inconsistent classes distributions. The proposed STUP combines the Swin Transformer with Uper Head in the form of an encoder-decoder structure, to extract features of HSR images for segmentation. Furthermore, Focal Loss is adopted to handle the unbalanced distribution problem in the training step. Experimental results demonstrate that the proposed STUP clearly outperforms several state-of-the-art models.

源语言英语
主期刊名IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
出版商Institute of Electrical and Electronics Engineers Inc.
3496-3499
页数4
ISBN(电子版)9781665427920
DOI
出版状态已出版 - 2022
活动2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022 - Kuala Lumpur, 马来西亚
期限: 17 7月 202222 7月 2022

出版系列

姓名International Geoscience and Remote Sensing Symposium (IGARSS)
2022-July

会议

会议2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022
国家/地区马来西亚
Kuala Lumpur
时期17/07/2222/07/22

指纹

探究 'Semantic Segmentation of High-Resolution Remote Sensing Images Using an Improved Transformer' 的科研主题。它们共同构成独一无二的指纹。

引用此