Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling

Rui Tang; Yimin Chen; Jian Gao; Shaowen Hao; Hunhui He

doi:10.3390/electronics13193874

Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling

Rui Tang, Yimin Chen, Jian Gao, Shaowen Hao, Hunhui He

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Side-scan sonar (SSS) images present unique challenges to computer vision due to their lower resolution, smaller targets, and fewer features. Although the mainstream backbone networks have shown promising results on traditional vision tasks, they utilize traditional convolution to reduce the dimensionality of feature maps, which may cause information loss for small targets and decrease performance in SSS images. To address this problem, based on the yolov8 network, we proposed a new underwater target detection model based on upsampling and downsampling. Firstly, we introduced a new general downsampling module called shallow robust feature downsampling (SRFD) and a receptive field convolution (RFCAConv) in the backbone network. Thereby multiple feature maps extracted by different downsampling techniques can be fused to create a more robust feature map with a complementary set of features. Additionally, an ultra-lightweight and efficient dynamic upsampling module (Dysample) is introduced to improve the accuracy of the feature pyramid network (FPN) in fusing different levels of features. On the underwater shipwreck dataset, our improved model’s mAP50 increased by 4.4% compared to the baseline model.

Original language	English
Article number	3874
Journal	Electronics (Switzerland)
Volume	13
Issue number	19
DOIs	https://doi.org/10.3390/electronics13193874
State	Published - Oct 2024

Keywords

neural network
side-scan sonar image
underwater target detection

Access to Document

10.3390/electronics13193874

Cite this

@article{af87eb59b528478096984249b78a4a28,

title = "Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling",

abstract = "Side-scan sonar (SSS) images present unique challenges to computer vision due to their lower resolution, smaller targets, and fewer features. Although the mainstream backbone networks have shown promising results on traditional vision tasks, they utilize traditional convolution to reduce the dimensionality of feature maps, which may cause information loss for small targets and decrease performance in SSS images. To address this problem, based on the yolov8 network, we proposed a new underwater target detection model based on upsampling and downsampling. Firstly, we introduced a new general downsampling module called shallow robust feature downsampling (SRFD) and a receptive field convolution (RFCAConv) in the backbone network. Thereby multiple feature maps extracted by different downsampling techniques can be fused to create a more robust feature map with a complementary set of features. Additionally, an ultra-lightweight and efficient dynamic upsampling module (Dysample) is introduced to improve the accuracy of the feature pyramid network (FPN) in fusing different levels of features. On the underwater shipwreck dataset, our improved model{\textquoteright}s mAP50 increased by 4.4% compared to the baseline model.",

keywords = "neural network, side-scan sonar image, underwater target detection",

author = "Rui Tang and Yimin Chen and Jian Gao and Shaowen Hao and Hunhui He",

note = "Publisher Copyright: {\textcopyright} 2024 by the authors.",

year = "2024",

month = oct,

doi = "10.3390/electronics13193874",

language = "英语",

volume = "13",

journal = "Electronics (Switzerland)",

issn = "2079-9292",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "19",

}

TY - JOUR

T1 - Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling

AU - Tang, Rui

AU - Chen, Yimin

AU - Gao, Jian

AU - Hao, Shaowen

AU - He, Hunhui

PY - 2024/10

Y1 - 2024/10

N2 - Side-scan sonar (SSS) images present unique challenges to computer vision due to their lower resolution, smaller targets, and fewer features. Although the mainstream backbone networks have shown promising results on traditional vision tasks, they utilize traditional convolution to reduce the dimensionality of feature maps, which may cause information loss for small targets and decrease performance in SSS images. To address this problem, based on the yolov8 network, we proposed a new underwater target detection model based on upsampling and downsampling. Firstly, we introduced a new general downsampling module called shallow robust feature downsampling (SRFD) and a receptive field convolution (RFCAConv) in the backbone network. Thereby multiple feature maps extracted by different downsampling techniques can be fused to create a more robust feature map with a complementary set of features. Additionally, an ultra-lightweight and efficient dynamic upsampling module (Dysample) is introduced to improve the accuracy of the feature pyramid network (FPN) in fusing different levels of features. On the underwater shipwreck dataset, our improved model’s mAP50 increased by 4.4% compared to the baseline model.

AB - Side-scan sonar (SSS) images present unique challenges to computer vision due to their lower resolution, smaller targets, and fewer features. Although the mainstream backbone networks have shown promising results on traditional vision tasks, they utilize traditional convolution to reduce the dimensionality of feature maps, which may cause information loss for small targets and decrease performance in SSS images. To address this problem, based on the yolov8 network, we proposed a new underwater target detection model based on upsampling and downsampling. Firstly, we introduced a new general downsampling module called shallow robust feature downsampling (SRFD) and a receptive field convolution (RFCAConv) in the backbone network. Thereby multiple feature maps extracted by different downsampling techniques can be fused to create a more robust feature map with a complementary set of features. Additionally, an ultra-lightweight and efficient dynamic upsampling module (Dysample) is introduced to improve the accuracy of the feature pyramid network (FPN) in fusing different levels of features. On the underwater shipwreck dataset, our improved model’s mAP50 increased by 4.4% compared to the baseline model.

KW - neural network

KW - side-scan sonar image

KW - underwater target detection

UR - http://www.scopus.com/inward/record.url?scp=85206563971&partnerID=8YFLogxK

U2 - 10.3390/electronics13193874

DO - 10.3390/electronics13193874

M3 - 文章

AN - SCOPUS:85206563971

SN - 2079-9292

VL - 13

JO - Electronics (Switzerland)

JF - Electronics (Switzerland)

IS - 19

M1 - 3874

ER -

Underwater Target Detection Using Side-Scan Sonar Images Based on Upsampling and Downsampling

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this