Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling

Rui Tang, Yimin Chen, Jian Gao, Shaowen Hao, Hunhui He

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Side-scan sonar (SSS) images present unique challenges to computer vision due to their lower resolution, smaller targets, and fewer features. Although the mainstream backbone networks have shown promising results on traditional vision tasks, they utilize traditional convolution to reduce the dimensionality of feature maps, which may cause information loss for small targets and decrease performance in SSS images. To address this problem, based on the yolov8 network, we proposed a new underwater target detection model based on upsampling and downsampling. Firstly, we introduced a new general downsampling module called shallow robust feature downsampling (SRFD) and a receptive field convolution (RFCAConv) in the backbone network. Thereby multiple feature maps extracted by different downsampling techniques can be fused to create a more robust feature map with a complementary set of features. Additionally, an ultra-lightweight and efficient dynamic upsampling module (Dysample) is introduced to improve the accuracy of the feature pyramid network (FPN) in fusing different levels of features. On the underwater shipwreck dataset, our improved model’s mAP50 increased by 4.4% compared to the baseline model.

Original languageEnglish
Article number3874
JournalElectronics (Switzerland)
Volume13
Issue number19
DOIs
StatePublished - Oct 2024

Keywords

  • neural network
  • side-scan sonar image
  • underwater target detection

Fingerprint

Dive into the research topics of 'Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling'. Together they form a unique fingerprint.

Cite this