Semantic Segmentation of High-Resolution Remote Sensing Images Using an Improved Transformer

Yuheng Liu, Shaohui Mei, Shun Zhang, Ye Wang, Mingyi He, Qian Du

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Semantic segmentation has been widely researched for high level analysis of High Spatial Resolution (HSR) remote sensing images, where Convolutional Neural Network (CNN) is the mainstream method. However, the transformer with attention mechanism has its unique capacity of extracting global information which is generally ignored by CNN models. In this paper, a Swin Transformer with UPer head (STUP) is proposed to tackle with semantic segmentation problem on a challenging remote sensing land-cover dataset called LoveDA, which owns complex background samples and inconsistent classes distributions. The proposed STUP combines the Swin Transformer with Uper Head in the form of an encoder-decoder structure, to extract features of HSR images for segmentation. Furthermore, Focal Loss is adopted to handle the unbalanced distribution problem in the training step. Experimental results demonstrate that the proposed STUP clearly outperforms several state-of-the-art models.

Original languageEnglish
Title of host publicationIGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3496-3499
Number of pages4
ISBN (Electronic)9781665427920
DOIs
StatePublished - 2022
Event2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022 - Kuala Lumpur, Malaysia
Duration: 17 Jul 202222 Jul 2022

Publication series

NameInternational Geoscience and Remote Sensing Symposium (IGARSS)
Volume2022-July

Conference

Conference2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022
Country/TerritoryMalaysia
CityKuala Lumpur
Period17/07/2222/07/22

Keywords

  • High Spatial Resolution
  • Remote Sensing
  • Semantic Segmentation
  • Transformer

Fingerprint

Dive into the research topics of 'Semantic Segmentation of High-Resolution Remote Sensing Images Using an Improved Transformer'. Together they form a unique fingerprint.

Cite this