TY - JOUR
T1 - Discriminative ensemble loss for deep neural network on classification of ship-radiated noise
AU - He, Lei
AU - Shen, Xiaohong
AU - Zhang, Muhang
AU - Wang, Haiyan
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2021
Y1 - 2021
N2 - Despite the remarkable progress of deep learning on speech recognition and music processing, it is still challenging to classify general audio signals due to the high cost of collection and annotation of the samples. The ability to learn discriminative features from a small dataset makes deep metric learning a promising method for general audio classification. However, because of the difficulty in mining informative sample pairs, it usually suffers from slow convergence or even poor local minima. In this letter, to improve classification performance by exploiting the advantages of both the weight-based loss and the metric-based loss, we proposed a multi-positive metric loss and a framework to joint it with the common softmax loss. The proposed method eliminates the need for sub-loss weighting by measuring the similarity between samples in a consistent probabilistic form. It also enhances the classification performance by improving the estimation of the intra-class and inter-class relationships from multiple positive samples. Finally, we evaluated the proposed method on the ShipsEar dataset and the Ocean Networks Canada dataset, and the results verified its effectiveness.
AB - Despite the remarkable progress of deep learning on speech recognition and music processing, it is still challenging to classify general audio signals due to the high cost of collection and annotation of the samples. The ability to learn discriminative features from a small dataset makes deep metric learning a promising method for general audio classification. However, because of the difficulty in mining informative sample pairs, it usually suffers from slow convergence or even poor local minima. In this letter, to improve classification performance by exploiting the advantages of both the weight-based loss and the metric-based loss, we proposed a multi-positive metric loss and a framework to joint it with the common softmax loss. The proposed method eliminates the need for sub-loss weighting by measuring the similarity between samples in a consistent probabilistic form. It also enhances the classification performance by improving the estimation of the intra-class and inter-class relationships from multiple positive samples. Finally, we evaluated the proposed method on the ShipsEar dataset and the Ocean Networks Canada dataset, and the results verified its effectiveness.
KW - Audio classification
KW - deep metric learning
KW - loss ensemble
KW - ship-radiated noise
UR - http://www.scopus.com/inward/record.url?scp=85100842535&partnerID=8YFLogxK
U2 - 10.1109/LSP.2021.3057539
DO - 10.1109/LSP.2021.3057539
M3 - 文章
AN - SCOPUS:85100842535
SN - 1070-9908
VL - 28
SP - 449
EP - 453
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
M1 - 9349209
ER -