Skip to main navigation Skip to search Skip to main content

Semantic Segmentation of Remote Sensing Images With Inconsistent Resolutions via a Spectral-Geometric Iterative Fusion Network

  • Northwestern Polytechnical University Xian

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The fusion of optical, hyperspectral, and synthetic aperture radar (SAR) images is essential for semantic segmentation in remote sensing, enabling more comprehensive land cover classification through multimodal data integration. However, disparities in spatial resolution and imaging characteristics across modalities impede effective feature alignment and fusion, degrading segmentation performance. To address this problem, we propose a novel spectral-geometric iterative fusion network (SGIFN), specifically designed to handle multimodal semantic segmentation with inconsistent resolutions. The core innovation of SGIFN lies in its unified architecture that progressively aligns, integrates, and enhances multimodal features through three newly designed modules. The spectral-spatial iterative decoupling (SSID) module introduces a novel iterative mechanism to adaptively align and decouple optical and hyperspectral features. The spectral-geometric synergistic conditional random field (SGS-CRF) module captures both local and long-range spatial dependencies by synergizing geometric (SAR) and spectral information. The class-guided multiscale contrastive aggregation (CG-MCA) module further strengthens feature representation across scales via multiclass, contrastive learning. We constructed a new multimodal remote sensing dataset comprising optical, hyperspectral, and SAR images with varying resolutions collected from Wuhan and Suzhou regions. Experimental results show that SGIFN achieves a mean Intersection over Union (mIoU) of 69.61% on Suzhou dataset and 63.96% on Wuhan dataset. These results demonstrate the effectiveness of SGIFN in handling multimodal data with inconsistent resolutions.

Original languageEnglish
Article number4419314
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume63
DOIs
StatePublished - 2025

Keywords

  • Inconsistent resolutions
  • multimodal fusion
  • remote sensing
  • semantic segmentation

Fingerprint

Dive into the research topics of 'Semantic Segmentation of Remote Sensing Images With Inconsistent Resolutions via a Spectral-Geometric Iterative Fusion Network'. Together they form a unique fingerprint.

Cite this