TY - JOUR
T1 - Underwater Acoustic Target Recognition Method Based on Feature Fusion and Residual CNN
AU - Yang, Yixin
AU - Yao, Qihai
AU - Wang, Yong
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - This article presents an underwater acoustic target recognition method using feature fusion and residual convolutional neural network (CNN). Mel-frequency cepstrum coefficient (MFCC), Gammatone frequency cepstral coefficient (GFCC), low-frequency analyzer and recorder (LOFAR) spectrum, and constant Q transform (CQT) are extracted and fused first. On this basis, their Delta features are calculated and fused second. The feature dimension is reduced by neighborhood component analysis (NCA). With the fused features after the dimensionality reduction as input features, the residual CNN based on the ResNet18 model is used as classifier to recognize the underwater acoustic target. The other machine-learning models, such as support vector machine (SVM), VGG19, and common CNN, are also compared for inputting different features separately. Experimental results show that, MGCL-Delta-NCA-ResNet18 has the best recognition results among these models, with the recognition accuracy of 97.29%, because this model allows full play to the rich information advantages of feature fusion, advantages of feature dimensionality reduction by the NCA and the ability of ResNet18 to extract abundant characteristics. It can also realize the recognition effectively at low signal-to-noise ratio (SNR). Especially at 0 dB, the recognition accuracy can still reach 86.25%. The proposed method can also recognize multitarget signal effectively in the multiple target scenario. Although this model is used in ship and natural voice recognition, it can also be applied to the recognition of other target sounds, such as marine mammals.
AB - This article presents an underwater acoustic target recognition method using feature fusion and residual convolutional neural network (CNN). Mel-frequency cepstrum coefficient (MFCC), Gammatone frequency cepstral coefficient (GFCC), low-frequency analyzer and recorder (LOFAR) spectrum, and constant Q transform (CQT) are extracted and fused first. On this basis, their Delta features are calculated and fused second. The feature dimension is reduced by neighborhood component analysis (NCA). With the fused features after the dimensionality reduction as input features, the residual CNN based on the ResNet18 model is used as classifier to recognize the underwater acoustic target. The other machine-learning models, such as support vector machine (SVM), VGG19, and common CNN, are also compared for inputting different features separately. Experimental results show that, MGCL-Delta-NCA-ResNet18 has the best recognition results among these models, with the recognition accuracy of 97.29%, because this model allows full play to the rich information advantages of feature fusion, advantages of feature dimensionality reduction by the NCA and the ability of ResNet18 to extract abundant characteristics. It can also realize the recognition effectively at low signal-to-noise ratio (SNR). Especially at 0 dB, the recognition accuracy can still reach 86.25%. The proposed method can also recognize multitarget signal effectively in the multiple target scenario. Although this model is used in ship and natural voice recognition, it can also be applied to the recognition of other target sounds, such as marine mammals.
KW - Acoustic target recognition
KW - feature fusion
KW - machine learning
KW - residual convolutional neural network (CNN)
UR - http://www.scopus.com/inward/record.url?scp=85205756227&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2024.3464754
DO - 10.1109/JSEN.2024.3464754
M3 - 文章
AN - SCOPUS:85205756227
SN - 1530-437X
VL - 24
SP - 37342
EP - 37357
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 22
ER -