TY - GEN
T1 - Semantic Segmentation of High-Resolution Remote Sensing Images Using an Improved Transformer
AU - Liu, Yuheng
AU - Mei, Shaohui
AU - Zhang, Shun
AU - Wang, Ye
AU - He, Mingyi
AU - Du, Qian
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Semantic segmentation has been widely researched for high level analysis of High Spatial Resolution (HSR) remote sensing images, where Convolutional Neural Network (CNN) is the mainstream method. However, the transformer with attention mechanism has its unique capacity of extracting global information which is generally ignored by CNN models. In this paper, a Swin Transformer with UPer head (STUP) is proposed to tackle with semantic segmentation problem on a challenging remote sensing land-cover dataset called LoveDA, which owns complex background samples and inconsistent classes distributions. The proposed STUP combines the Swin Transformer with Uper Head in the form of an encoder-decoder structure, to extract features of HSR images for segmentation. Furthermore, Focal Loss is adopted to handle the unbalanced distribution problem in the training step. Experimental results demonstrate that the proposed STUP clearly outperforms several state-of-the-art models.
AB - Semantic segmentation has been widely researched for high level analysis of High Spatial Resolution (HSR) remote sensing images, where Convolutional Neural Network (CNN) is the mainstream method. However, the transformer with attention mechanism has its unique capacity of extracting global information which is generally ignored by CNN models. In this paper, a Swin Transformer with UPer head (STUP) is proposed to tackle with semantic segmentation problem on a challenging remote sensing land-cover dataset called LoveDA, which owns complex background samples and inconsistent classes distributions. The proposed STUP combines the Swin Transformer with Uper Head in the form of an encoder-decoder structure, to extract features of HSR images for segmentation. Furthermore, Focal Loss is adopted to handle the unbalanced distribution problem in the training step. Experimental results demonstrate that the proposed STUP clearly outperforms several state-of-the-art models.
KW - High Spatial Resolution
KW - Remote Sensing
KW - Semantic Segmentation
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85140414119&partnerID=8YFLogxK
U2 - 10.1109/IGARSS46834.2022.9884103
DO - 10.1109/IGARSS46834.2022.9884103
M3 - 会议稿件
AN - SCOPUS:85140414119
T3 - International Geoscience and Remote Sensing Symposium (IGARSS)
SP - 3496
EP - 3499
BT - IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022
Y2 - 17 July 2022 through 22 July 2022
ER -