Hierarchical Shared Architecture Search for Real-Time Semantic Segmentation of Remote Sensing Images

Wenna Wang, Lingyan Ran, Hanlin Yin, Mingjun Sun, Xiuwei Zhang, Yanning Zhang

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Real-time semantic segmentation of remote-sensing images demands a trade-off between speed and accuracy, which makes it challenging. Apart from manually designed networks, researchers seek to adopt neural architecture search (NAS) to discover a real-time semantic segmentation model with optimal performance automatically. Most existing NAS methods stack up no more than two types of searched cells, omitting the characteristics of resolution variation. This article proposes the hierarchical shared architecture search (HAS) method to automatically build a real-time semantic segmentation model for remote sensing images. Our model contains a lightweight backbone and a multiscale feature fusion module. The lightweight backbone is carefully designed with low computational cost. The multiscale feature fusion module is searched using the NAS method, where only the blocks from the same layer share identical cells. Extensive experiments reveal that our searched real-time semantic segmentation model of remote sensing images achieves the state-of-the-art trade-off between accuracy and speed. Specifically, on the LoveDA, Potsdam, and Vaihingen datasets, the searched network achieves 54.5% mIoU, 87.8% mIoU, and 84.1% mIoU, respectively, with an inference speed of 132.7 FPS. Besides, our searched network achieves 72.6% mIoU at 164.0 FPS on the CityScapes dataset and 72.3% mIoU at 186.4 FPS on the CamVid dataset.

Original languageEnglish
Pages (from-to)1-13
Number of pages13
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume62
DOIs
StatePublished - 2024

Keywords

  • Feature aggregation module
  • hierarchical shared search strategy
  • neural architecture search (NAS)
  • real-time semantic segmentation

Fingerprint

Dive into the research topics of 'Hierarchical Shared Architecture Search for Real-Time Semantic Segmentation of Remote Sensing Images'. Together they form a unique fingerprint.

Cite this