MFSonar: Multiscale Frequency-Domain Contextual Denoising for Forward-Looking Sonar Image Semantic Segmentation

Jiayuan Li, Zhen Wang, Shen Ao Yuan, Zhu Hong You

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Semantic segmentation of forward-looking sonar (FLS) images is crucial for enhancing situational awareness in marine environments. However, FLS images are often degraded by environmental noise, similarity noise, and shading effects, which result in low resolution, poor signal-to-noise ratio, and suboptimal image quality. These issues significantly hinder the accuracy of semantic segmentation in FLS images. To address these challenges, we propose a novel method called MFSonar, which is based on the Transformer-Mamba architecture. MFSonar incorporates a context channel denoising module (CCDM) that exploits the similarity characteristics of local and global features to effectively suppress similarity noise and enhance target features. Additionally, the Multiscale Frequency-Domain Decoding Module integrates multiscale frequency-domain convolution with visual state-space (VSS) blocks, leveraging frequency-domain characteristics to mitigate environmental noise and occlusion shadows. Furthermore, our approach prioritizes local features before global features to achieve effective fusion and enhancement of global semantic features and multiscale local visual information. Extensive comparative experiments across multiple datasets demonstrate that MFSonar achieves state-of-the-art performance. Moreover, ablation studies and visual comparisons on a primary dataset validate the superiority, effectiveness, and uniqueness of our approach. Our implementation is available at https://github.com/NWPUFranklee/PVSonar.git.

Original language: English
Pages (from-to): 11792-11808
Number of pages: 17
Journal: IEEE Sensors Journal
Volume: 25
Issue number: 7
Publication status: Published - 2025
