ADAPTIVE DEEP NEURAL NETWORK DESIGN METHOD FOR UNDERWATER ACOUSTIC TARGET RECOGNITION

Qing Huang; Xiangyang Zeng

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In recent years, following the remarkable achievements of deep learning in computer vision (CV), many researchers have applied it to underwater acoustic target recognition. To transfer advanced CV models directly, researchers have used various time-frequency feature extraction methods to turn ship-radiated noise into three-dimensional data. This approach still requires effort to design features and needs a large time or frequency range, and it fails to fully utilize the learning ability of deep learning. Based on the limited duration and significant low-frequency effects of ship-radiated noise, this paper proposes a one-dimensional network design method for underwater acoustic target recognition (UATR-ND1D); combined with the fast Fourier transform (FFT), it is abbreviated as FFT-UATR-ND1D. Applying the FFT-UATR-ND1D method to the entry-level network ResNet as an example, 4320 and 360 experiments were carried out on two mainstream datasets, ShipsEar and DeepShip, respectively. On the ShipsEar dataset, an extremely lightweight model with only 0.17M parameters and 3.4M FLOPs achieves an average recognition rate of 97.13% ± 0.43%; with 2.1M parameters and 5.0M FLOPs, the best recognition rate of 98.89% is achieved. On the DeepShip dataset, an extremely lightweight model with only 0.17M parameters and 6.8M FLOPs achieves an average recognition rate of 95.30% ± 0.28%; with 2.1M parameters and 13.3M FLOPs, the best recognition rate of 98.36% is achieved. Compared with methods in the existing literature, those with a similar number of parameters have recognition rates more than 3% to 5% lower, while those with similar recognition rates require parameters and FLOPs at least 1 to 2 orders of magnitude higher.
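The abstract describes feeding a one-dimensional FFT representation of ship-radiated noise to the network, with emphasis on low frequencies. The following is only a minimal sketch of what such a front end might look like, not the authors' implementation: the function name `low_freq_spectrum`, the Hann window, the log scaling, the 1 s segment length, the 8 kHz sampling rate, and the 1 kHz cutoff are all illustrative assumptions that the abstract does not specify.

```python
import numpy as np


def low_freq_spectrum(segment, sample_rate, max_freq_hz):
    """Return (freqs, features): a 1-D log-magnitude spectrum up to max_freq_hz.

    Hypothetical helper for illustration; window, scaling, and cutoff
    are assumptions, not values taken from the paper.
    """
    segment = np.asarray(segment, dtype=float)
    # Taper the segment to reduce spectral leakage (assumed choice: Hann).
    window = np.hanning(len(segment))
    # One-sided FFT magnitude of the real-valued signal.
    magnitude = np.abs(np.fft.rfft(segment * window))
    # Frequency in Hz of each FFT bin.
    freqs = np.fft.rfftfreq(len(segment), d=1.0 / sample_rate)
    # Keep only the low-frequency band (assumed cutoff).
    keep = freqs <= max_freq_hz
    # Compress dynamic range; log1p stays finite for zero-magnitude bins.
    return freqs[keep], np.log1p(magnitude[keep])


# Synthetic stand-in for a noise segment: two tonal lines at 50 Hz and 120 Hz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate  # 1 second of samples
segment = np.sin(2 * np.pi * 50 * t) + 0.5 * np.sin(2 * np.pi * 120 * t)

freqs, feat = low_freq_spectrum(segment, sample_rate, max_freq_hz=1000.0)
peak_hz = freqs[np.argmax(feat)]  # strongest line, near 50 Hz for this signal
```

The resulting `feat` vector is one-dimensional and much shorter than the raw waveform, which is the kind of input a 1-D convolutional network such as a 1-D ResNet variant could consume directly.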

Original language	English
Title of host publication	Proceedings of the 30th International Congress on Sound and Vibration, ICSV 2024
Editors	Wim van Keulen, Jim Kok
Publisher	Society of Acoustics
ISBN (Electronic)	9789090390581
Publication status	Published - 2024
Event	30th International Congress on Sound and Vibration, ICSV 2024 - Amsterdam, Netherlands; Duration: 8 Jul 2024 → 11 Jul 2024

Publication series

Name	Proceedings of the International Congress on Sound and Vibration
ISSN (Electronic)	2329-3675

Conference

Conference	30th International Congress on Sound and Vibration, ICSV 2024
Country/Territory	Netherlands
City	Amsterdam
Period	8/07/24 → 11/07/24

Cite this

@inproceedings{0d57dfcc4966471eb8d18e05887495d9,

title = "ADAPTIVE DEEP NEURAL NETWORK DESIGN METHOD FOR UNDERWATER ACOUSTIC TARGET RECOGNITION",

abstract = "In recent years, following the remarkable achievements of deep learning in computer vision (CV), many researchers have applied it to underwater acoustic target recognition. To transfer advanced CV models directly, researchers have used various time-frequency feature extraction methods to turn ship-radiated noise into three-dimensional data. This approach still requires effort to design features and needs a large time or frequency range, and it fails to fully utilize the learning ability of deep learning. Based on the limited duration and significant low-frequency effects of ship-radiated noise, this paper proposes a one-dimensional network design method for underwater acoustic target recognition (UATR-ND1D); combined with the fast Fourier transform (FFT), it is abbreviated as FFT-UATR-ND1D. Applying the FFT-UATR-ND1D method to the entry-level network ResNet as an example, 4320 and 360 experiments were carried out on two mainstream datasets, ShipsEar and DeepShip, respectively. On the ShipsEar dataset, an extremely lightweight model with only 0.17M parameters and 3.4M FLOPs achieves an average recognition rate of 97.13% ± 0.43%; with 2.1M parameters and 5.0M FLOPs, the best recognition rate of 98.89% is achieved. On the DeepShip dataset, an extremely lightweight model with only 0.17M parameters and 6.8M FLOPs achieves an average recognition rate of 95.30% ± 0.28%; with 2.1M parameters and 13.3M FLOPs, the best recognition rate of 98.36% is achieved. Compared with methods in the existing literature, those with a similar number of parameters have recognition rates more than 3% to 5% lower, while those with similar recognition rates require parameters and FLOPs at least 1 to 2 orders of magnitude higher.",

keywords = "one-dimensional network design, underwater target recognition",

author = "Qing Huang and Xiangyang Zeng",

note = "Publisher Copyright: {\textcopyright} 2024 Proceedings of the International Congress on Sound and Vibration. All rights reserved.; 30th International Congress on Sound and Vibration, ICSV 2024 ; Conference date: 08-07-2024 Through 11-07-2024",

year = "2024",

language = "English",

series = "Proceedings of the International Congress on Sound and Vibration",

publisher = "Society of Acoustics",

editor = "{van Keulen}, Wim and Jim Kok",

booktitle = "Proceedings of the 30th International Congress on Sound and Vibration, ICSV 2024",

}

Huang, Q & Zeng, X 2024, ADAPTIVE DEEP NEURAL NETWORK DESIGN METHOD FOR UNDERWATER ACOUSTIC TARGET RECOGNITION. in W van Keulen & J Kok (eds), Proceedings of the 30th International Congress on Sound and Vibration, ICSV 2024. Proceedings of the International Congress on Sound and Vibration, Society of Acoustics, 30th International Congress on Sound and Vibration, ICSV 2024, Amsterdam, Netherlands, 8/07/24.

ADAPTIVE DEEP NEURAL NETWORK DESIGN METHOD FOR UNDERWATER ACOUSTIC TARGET RECOGNITION. / Huang, Qing; Zeng, Xiangyang.
Proceedings of the 30th International Congress on Sound and Vibration, ICSV 2024. ed. / Wim van Keulen; Jim Kok. Society of Acoustics, 2024. (Proceedings of the International Congress on Sound and Vibration).

TY - GEN

T1 - ADAPTIVE DEEP NEURAL NETWORK DESIGN METHOD FOR UNDERWATER ACOUSTIC TARGET RECOGNITION

AU - Huang, Qing

AU - Zeng, Xiangyang

PY - 2024

Y1 - 2024

AB - In recent years, following the remarkable achievements of deep learning in computer vision (CV), many researchers have applied it to underwater acoustic target recognition. To transfer advanced CV models directly, researchers have used various time-frequency feature extraction methods to turn ship-radiated noise into three-dimensional data. This approach still requires effort to design features and needs a large time or frequency range, and it fails to fully utilize the learning ability of deep learning. Based on the limited duration and significant low-frequency effects of ship-radiated noise, this paper proposes a one-dimensional network design method for underwater acoustic target recognition (UATR-ND1D); combined with the fast Fourier transform (FFT), it is abbreviated as FFT-UATR-ND1D. Applying the FFT-UATR-ND1D method to the entry-level network ResNet as an example, 4320 and 360 experiments were carried out on two mainstream datasets, ShipsEar and DeepShip, respectively. On the ShipsEar dataset, an extremely lightweight model with only 0.17M parameters and 3.4M FLOPs achieves an average recognition rate of 97.13% ± 0.43%; with 2.1M parameters and 5.0M FLOPs, the best recognition rate of 98.89% is achieved. On the DeepShip dataset, an extremely lightweight model with only 0.17M parameters and 6.8M FLOPs achieves an average recognition rate of 95.30% ± 0.28%; with 2.1M parameters and 13.3M FLOPs, the best recognition rate of 98.36% is achieved. Compared with methods in the existing literature, those with a similar number of parameters have recognition rates more than 3% to 5% lower, while those with similar recognition rates require parameters and FLOPs at least 1 to 2 orders of magnitude higher.

KW - one-dimensional network design

KW - underwater target recognition

UR - http://www.scopus.com/inward/record.url?scp=85205372500&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85205372500

T3 - Proceedings of the International Congress on Sound and Vibration

BT - Proceedings of the 30th International Congress on Sound and Vibration, ICSV 2024

A2 - van Keulen, Wim

A2 - Kok, Jim

PB - Society of Acoustics

T2 - 30th International Congress on Sound and Vibration, ICSV 2024

Y2 - 8 July 2024 through 11 July 2024

ER -
