PIFRNet: Position Information Guided Feature Reconstruction Network for Salient Object Detection in Remote Sensing Images

Zhen Wang, Ruixiang Li, Xiaotian Wang, Nan Xu, Zhuhong You

Research output: Contribution to journal › Article › peer-review

Abstract

Benefiting from the success of deep learning, salient object detection in natural scene images (NSI-SOD) has rapidly advanced. However, salient object detection for remote sensing images (RSI-SOD) faces unique challenges, including high resolution, diverse object scales, and cluttered backgrounds, which limit the effectiveness of existing methods. To overcome these issues, we propose a Position Information Guided Feature Reconstruction Network (PIFRNet), where each module is specifically designed to address a core RSI-SOD challenge. First, a hybrid dual-branch encoder integrates convolutional neural networks (CNNs) for robust local feature extraction and Transformers for capturing global contextual information, enabling simultaneous modeling of fine details and large-scale object relationships. Next, the Spatial Coordinate Attention Mechanism (SCAM) leverages positional correlations between spatial and channel dimensions to accurately highlight salient regions and suppress background noise. The Position-Sensitive Self-Attention Mechanism (PSSAM) further refines feature representation by modeling pixel-level spatial relationships, enhancing the network's ability to distinguish complex object boundaries. To address multi-scale object variation, the Multi-Scale Attention Mechanism (MSAM) adaptively aggregates information across scales, improving detection robustness for objects of all sizes. Finally, the Feature Reconstruction Module (FRM) restores fine-grained details and sharp boundaries in the predicted saliency maps by leveraging spatial position information. Extensive experiments on three public RSI-SOD datasets demonstrate that our method achieves significant improvements over 36 state-of-the-art approaches, validating the effectiveness of each proposed module.
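
The abstract does not give the internals of SCAM, so the following is only a generic illustration of position-aware attention, not the authors' module. It is a minimal NumPy sketch that assumes a coordinate-attention-style design: the feature map is pooled separately along width and height, each pooled descriptor is turned into a gate, and the gates rescale the input. The function name `coordinate_style_attention` and the weight matrices `w_h` and `w_w` are hypothetical.

```python
import numpy as np


def sigmoid(x):
    """Elementwise logistic function, mapping values into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def coordinate_style_attention(feat, w_h, w_w):
    """Rescale a feature map with row-wise and column-wise gates.

    feat: array of shape (C, H, W).
    w_h, w_w: hypothetical (C, C) channel-mixing weights (assumed, not
    taken from the paper).
    """
    # Direction-aware pooling keeps position along one axis at a time.
    pooled_h = feat.mean(axis=2)  # (C, H): average over width
    pooled_w = feat.mean(axis=1)  # (C, W): average over height

    # Mix channels, then squash to (0, 1) to obtain gates.
    gate_h = sigmoid(w_h @ pooled_h)  # (C, H)
    gate_w = sigmoid(w_w @ pooled_w)  # (C, W)

    # Broadcast the two gates back over the full (C, H, W) grid.
    return feat * gate_h[:, :, None] * gate_w[:, None, :]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat = rng.standard_normal((8, 16, 16))
    w_h = rng.standard_normal((8, 8)) * 0.1
    w_w = rng.standard_normal((8, 8)) * 0.1
    out = coordinate_style_attention(feat, w_h, w_w)
    print(out.shape)
```

Because both gates lie strictly between 0 and 1, this sketch can only attenuate activations; it shows how row and column position can modulate channel responses, and nothing more.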

Keywords

  • dual-branch encoder
  • feature reconstruction
  • optical remote sensing image (RSI)
  • position information
  • salient object detection (SOD)
