Underwater Acoustic Target Recognition Method Based on Feature Fusion and Residual CNN

Yixin Yang; Qihai Yao; Yong Wang

doi:10.1109/JSEN.2024.3464754

Underwater Acoustic Target Recognition Method Based on Feature Fusion and Residual CNN

Yixin Yang, Qihai Yao, Yong Wang

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

This article presents an underwater acoustic target recognition method using feature fusion and residual convolutional neural network (CNN). Mel-frequency cepstrum coefficient (MFCC), Gammatone frequency cepstral coefficient (GFCC), low-frequency analyzer and recorder (LOFAR) spectrum, and constant Q transform (CQT) are extracted and fused first. On this basis, their Delta features are calculated and fused second. The feature dimension is reduced by neighborhood component analysis (NCA). With the fused features after the dimensionality reduction as input features, the residual CNN based on the ResNet18 model is used as classifier to recognize the underwater acoustic target. The other machine-learning models, such as support vector machine (SVM), VGG19, and common CNN, are also compared for inputting different features separately. Experimental results show that, MGCL-Delta-NCA-ResNet18 has the best recognition results among these models, with the recognition accuracy of 97.29%, because this model allows full play to the rich information advantages of feature fusion, advantages of feature dimensionality reduction by the NCA and the ability of ResNet18 to extract abundant characteristics. It can also realize the recognition effectively at low signal-to-noise ratio (SNR). Especially at 0 dB, the recognition accuracy can still reach 86.25%. The proposed method can also recognize multitarget signal effectively in the multiple target scenario. Although this model is used in ship and natural voice recognition, it can also be applied to the recognition of other target sounds, such as marine mammals.

Original language	English
Pages (from-to)	37342-37357
Number of pages	16
Journal	IEEE Sensors Journal
Volume	24
Issue number	22
DOIs	https://doi.org/10.1109/JSEN.2024.3464754
State	Published - 2024

Keywords

Acoustic target recognition
feature fusion
machine learning
residual convolutional neural network (CNN)

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/JSEN.2024.3464754

Cite this

@article{0f368a2417d244f5a69035f2386ed32e,

title = "Underwater Acoustic Target Recognition Method Based on Feature Fusion and Residual CNN",

abstract = "This article presents an underwater acoustic target recognition method using feature fusion and residual convolutional neural network (CNN). Mel-frequency cepstrum coefficient (MFCC), Gammatone frequency cepstral coefficient (GFCC), low-frequency analyzer and recorder (LOFAR) spectrum, and constant Q transform (CQT) are extracted and fused first. On this basis, their Delta features are calculated and fused second. The feature dimension is reduced by neighborhood component analysis (NCA). With the fused features after the dimensionality reduction as input features, the residual CNN based on the ResNet18 model is used as classifier to recognize the underwater acoustic target. The other machine-learning models, such as support vector machine (SVM), VGG19, and common CNN, are also compared for inputting different features separately. Experimental results show that, MGCL-Delta-NCA-ResNet18 has the best recognition results among these models, with the recognition accuracy of 97.29%, because this model allows full play to the rich information advantages of feature fusion, advantages of feature dimensionality reduction by the NCA and the ability of ResNet18 to extract abundant characteristics. It can also realize the recognition effectively at low signal-to-noise ratio (SNR). Especially at 0 dB, the recognition accuracy can still reach 86.25%. The proposed method can also recognize multitarget signal effectively in the multiple target scenario. Although this model is used in ship and natural voice recognition, it can also be applied to the recognition of other target sounds, such as marine mammals.",

keywords = "Acoustic target recognition, feature fusion, machine learning, residual convolutional neural network (CNN)",

author = "Yixin Yang and Qihai Yao and Yong Wang",

note = "Publisher Copyright: {\textcopyright} 2001-2012 IEEE.",

year = "2024",

doi = "10.1109/JSEN.2024.3464754",

language = "英语",

volume = "24",

pages = "37342--37357",

journal = "IEEE Sensors Journal",

issn = "1530-437X",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "22",

}

TY - JOUR

T1 - Underwater Acoustic Target Recognition Method Based on Feature Fusion and Residual CNN

AU - Yang, Yixin

AU - Yao, Qihai

AU - Wang, Yong

PY - 2024

Y1 - 2024

N2 - This article presents an underwater acoustic target recognition method using feature fusion and residual convolutional neural network (CNN). Mel-frequency cepstrum coefficient (MFCC), Gammatone frequency cepstral coefficient (GFCC), low-frequency analyzer and recorder (LOFAR) spectrum, and constant Q transform (CQT) are extracted and fused first. On this basis, their Delta features are calculated and fused second. The feature dimension is reduced by neighborhood component analysis (NCA). With the fused features after the dimensionality reduction as input features, the residual CNN based on the ResNet18 model is used as classifier to recognize the underwater acoustic target. The other machine-learning models, such as support vector machine (SVM), VGG19, and common CNN, are also compared for inputting different features separately. Experimental results show that, MGCL-Delta-NCA-ResNet18 has the best recognition results among these models, with the recognition accuracy of 97.29%, because this model allows full play to the rich information advantages of feature fusion, advantages of feature dimensionality reduction by the NCA and the ability of ResNet18 to extract abundant characteristics. It can also realize the recognition effectively at low signal-to-noise ratio (SNR). Especially at 0 dB, the recognition accuracy can still reach 86.25%. The proposed method can also recognize multitarget signal effectively in the multiple target scenario. Although this model is used in ship and natural voice recognition, it can also be applied to the recognition of other target sounds, such as marine mammals.

AB - This article presents an underwater acoustic target recognition method using feature fusion and residual convolutional neural network (CNN). Mel-frequency cepstrum coefficient (MFCC), Gammatone frequency cepstral coefficient (GFCC), low-frequency analyzer and recorder (LOFAR) spectrum, and constant Q transform (CQT) are extracted and fused first. On this basis, their Delta features are calculated and fused second. The feature dimension is reduced by neighborhood component analysis (NCA). With the fused features after the dimensionality reduction as input features, the residual CNN based on the ResNet18 model is used as classifier to recognize the underwater acoustic target. The other machine-learning models, such as support vector machine (SVM), VGG19, and common CNN, are also compared for inputting different features separately. Experimental results show that, MGCL-Delta-NCA-ResNet18 has the best recognition results among these models, with the recognition accuracy of 97.29%, because this model allows full play to the rich information advantages of feature fusion, advantages of feature dimensionality reduction by the NCA and the ability of ResNet18 to extract abundant characteristics. It can also realize the recognition effectively at low signal-to-noise ratio (SNR). Especially at 0 dB, the recognition accuracy can still reach 86.25%. The proposed method can also recognize multitarget signal effectively in the multiple target scenario. Although this model is used in ship and natural voice recognition, it can also be applied to the recognition of other target sounds, such as marine mammals.

KW - Acoustic target recognition

KW - feature fusion

KW - machine learning

KW - residual convolutional neural network (CNN)

UR - http://www.scopus.com/inward/record.url?scp=85205756227&partnerID=8YFLogxK

U2 - 10.1109/JSEN.2024.3464754

DO - 10.1109/JSEN.2024.3464754

M3 - 文章

AN - SCOPUS:85205756227

SN - 1530-437X

VL - 24

SP - 37342

EP - 37357

JO - IEEE Sensors Journal

JF - IEEE Sensors Journal

IS - 22

ER -

Underwater Acoustic Target Recognition Method Based on Feature Fusion and Residual CNN

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this