TY - JOUR
T1 - U-SAS
T2 - U-Shape Network with Multilevel Enhancement and Global Decoding for Synthetic Aperture Sonar Image Semantic Segmentation
AU - Li, Jiayuan
AU - Wang, Zhen
AU - You, Zhuhong
AU - Zhao, Zhengyang
AU - Yuan, Zhanbin
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Compared with side-scan sonar (SSS) and forward-looking sonar (FLS), synthetic aperture sonar (SAS) devices can generate high-resolution underwater images, which is important for marine topographic mapping. Nevertheless, existing deep learning (DL) methods face challenges in extracting detailed feature information from underwater SAS images for semantic segmentation tasks, primarily due to the significant interference from the complex underwater environment and seabed reverberation noise. To address these challenges, we propose a novel network named U-SAS for SAS image semantic segmentation, which uses the hybrid convolution and attention mechanism architecture to extract rich multilevel features from SAS images. Specifically, U-SAS separates the extracted features into complementary representations, namely salient features and abstract location features. To enhance the distinctive representation of multilevel features, U-SAS incorporates the max-min module (MMM) and convolutional block attention mechanism (CBAM). The MMM effectively emphasizes robust and significant underwater acoustic features, while the CBAM suppresses seabed reverberation noise and enhances the representation of positional information. Besides, we construct the global decoding module (GDM) to fuse salient features and abstract location features to enhance the correlation between local features and achieve global semantic understanding. To verify the effectiveness and feasibility of the proposed U-SAS network, we conducted extensive experiments on the SAS image dataset of complex underwater scenes. The experimental results show that U-SAS achieves 78.58% and 58.59% of mean accuracy (mAcc) and mean intersection over union (mIoU), respectively, which outperforms other state-of-the-art methods. Implementation codes will be available on https://github.com/darkseid-arch/U-SAS.
AB - Compared with side-scan sonar (SSS) and forward-looking sonar (FLS), synthetic aperture sonar (SAS) devices can generate high-resolution underwater images, which is important for marine topographic mapping. Nevertheless, existing deep learning (DL) methods face challenges in extracting detailed feature information from underwater SAS images for semantic segmentation tasks, primarily due to the significant interference from the complex underwater environment and seabed reverberation noise. To address these challenges, we propose a novel network named U-SAS for SAS image semantic segmentation, which uses the hybrid convolution and attention mechanism architecture to extract rich multilevel features from SAS images. Specifically, U-SAS separates the extracted features into complementary representations, namely salient features and abstract location features. To enhance the distinctive representation of multilevel features, U-SAS incorporates the max-min module (MMM) and convolutional block attention mechanism (CBAM). The MMM effectively emphasizes robust and significant underwater acoustic features, while the CBAM suppresses seabed reverberation noise and enhances the representation of positional information. Besides, we construct the global decoding module (GDM) to fuse salient features and abstract location features to enhance the correlation between local features and achieve global semantic understanding. To verify the effectiveness and feasibility of the proposed U-SAS network, we conducted extensive experiments on the SAS image dataset of complex underwater scenes. The experimental results show that U-SAS achieves 78.58% and 58.59% of mean accuracy (mAcc) and mean intersection over union (mIoU), respectively, which outperforms other state-of-the-art methods. Implementation codes will be available on https://github.com/darkseid-arch/U-SAS.
KW - Convolutional block attention mechanism (CBAM)
KW - global decoding
KW - max-min module (MMM)
KW - multilevel feature enhancement
KW - semantic segmentation
KW - synthetic aperture sonar (SAS)
UR - http://www.scopus.com/inward/record.url?scp=85209095363&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2024.3491095
DO - 10.1109/JSEN.2024.3491095
M3 - 文章
AN - SCOPUS:85209095363
SN - 1530-437X
VL - 25
SP - 1799
EP - 1813
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 1
ER -