Discriminative ensemble loss for deep neural network on classification of ship-radiated noise

Lei He; Xiaohong Shen; Muhang Zhang; Haiyan Wang

doi:10.1109/LSP.2021.3057539

School of Marine Science and Technology

Research output: Contribution to journal › Article › peer-review

7 Scopus citations

Abstract

Despite the remarkable progress of deep learning in speech recognition and music processing, it is still challenging to classify general audio signals because of the high cost of collecting and annotating samples. The ability to learn discriminative features from a small dataset makes deep metric learning a promising method for general audio classification. However, because of the difficulty in mining informative sample pairs, deep metric learning usually suffers from slow convergence or even poor local minima. In this letter, to improve classification performance by exploiting the advantages of both the weight-based loss and the metric-based loss, we propose a multi-positive metric loss and a framework to combine it with the common softmax loss. The proposed method eliminates the need for sub-loss weighting by measuring the similarity between samples in a consistent probabilistic form. It also enhances classification performance by improving the estimation of the intra-class and inter-class relationships from multiple positive samples. Finally, we evaluated the proposed method on the ShipsEar dataset and the Ocean Networks Canada dataset, and the results verified its effectiveness.
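The abstract does not give the loss formulas, so the following is only a rough sketch of the general idea, not the authors' formulation. It assumes dot-product similarity, and it assumes that "a consistent probabilistic form" means both sub-losses are negative log-probabilities, which is why they are summed here without a weighting coefficient. All function names (`softmax_loss`, `multi_positive_loss`, `ensemble_loss`) are hypothetical.

```python
import math


def dot(u, v):
    """Dot-product similarity (an assumption; the letter may use another measure)."""
    return sum(a * b for a, b in zip(u, v))


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v)))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def softmax_loss(x, label, class_weights):
    """Weight-based loss: -log p(label | x) under a linear softmax classifier."""
    logits = [dot(w, x) for w in class_weights]
    return log_sum_exp(logits) - logits[label]


def multi_positive_loss(i, embeddings, labels):
    """Metric-based loss (assumed form): -log of the probability mass that
    sample i assigns to ALL of its same-class samples in the batch."""
    sims = {j: dot(embeddings[i], e) for j, e in enumerate(embeddings) if j != i}
    positives = [s for j, s in sims.items() if labels[j] == labels[i]]
    if not positives:
        return 0.0  # no positive in the batch, nothing to pull together
    return log_sum_exp(list(sims.values())) - log_sum_exp(positives)


def ensemble_loss(embeddings, labels, class_weights):
    """Unweighted sum of the two sub-losses, averaged over the batch."""
    total = 0.0
    for i, (x, y) in enumerate(zip(embeddings, labels)):
        total += softmax_loss(x, y, class_weights)
        total += multi_positive_loss(i, embeddings, labels)
    return total / len(embeddings)


if __name__ == "__main__":
    batch = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
    weights = [[1.0, 0.0], [0.0, 1.0]]
    print(ensemble_loss(batch, [0, 0, 1, 1], weights))
```

In this toy setup, a batch whose labels agree with the embedding geometry gets a lower loss than the same batch with mismatched labels, which is the behaviour one would expect from any loss of this family.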

Original language	English
Article number	9349209
Pages (from-to)	449-453
Number of pages	5
Journal	IEEE Signal Processing Letters
Volume	28
DOIs	https://doi.org/10.1109/LSP.2021.3057539
State	Published - 2021

Keywords

Audio classification
deep metric learning
loss ensemble
ship-radiated noise

Cite this

@article{6ccfceb2f8ea498a91fc4a9c11a2c8f4,
title = "Discriminative ensemble loss for deep neural network on classification of ship-radiated noise",
abstract = "Despite the remarkable progress of deep learning in speech recognition and music processing, it is still challenging to classify general audio signals because of the high cost of collecting and annotating samples. The ability to learn discriminative features from a small dataset makes deep metric learning a promising method for general audio classification. However, because of the difficulty in mining informative sample pairs, deep metric learning usually suffers from slow convergence or even poor local minima. In this letter, to improve classification performance by exploiting the advantages of both the weight-based loss and the metric-based loss, we propose a multi-positive metric loss and a framework to combine it with the common softmax loss. The proposed method eliminates the need for sub-loss weighting by measuring the similarity between samples in a consistent probabilistic form. It also enhances classification performance by improving the estimation of the intra-class and inter-class relationships from multiple positive samples. Finally, we evaluated the proposed method on the ShipsEar dataset and the Ocean Networks Canada dataset, and the results verified its effectiveness.",
keywords = "Audio classification, deep metric learning, loss ensemble, ship-radiated noise",
author = "Lei He and Xiaohong Shen and Muhang Zhang and Haiyan Wang",
note = "Publisher Copyright: {\textcopyright} 1994-2012 IEEE.",
year = "2021",
doi = "10.1109/LSP.2021.3057539",
language = "English",
volume = "28",
pages = "449--453",
journal = "IEEE Signal Processing Letters",
issn = "1070-9908",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
}

TY - JOUR
T1 - Discriminative ensemble loss for deep neural network on classification of ship-radiated noise
AU - He, Lei
AU - Shen, Xiaohong
AU - Zhang, Muhang
AU - Wang, Haiyan
PY - 2021
Y1 - 2021
AB - Despite the remarkable progress of deep learning in speech recognition and music processing, it is still challenging to classify general audio signals because of the high cost of collecting and annotating samples. The ability to learn discriminative features from a small dataset makes deep metric learning a promising method for general audio classification. However, because of the difficulty in mining informative sample pairs, deep metric learning usually suffers from slow convergence or even poor local minima. In this letter, to improve classification performance by exploiting the advantages of both the weight-based loss and the metric-based loss, we propose a multi-positive metric loss and a framework to combine it with the common softmax loss. The proposed method eliminates the need for sub-loss weighting by measuring the similarity between samples in a consistent probabilistic form. It also enhances classification performance by improving the estimation of the intra-class and inter-class relationships from multiple positive samples. Finally, we evaluated the proposed method on the ShipsEar dataset and the Ocean Networks Canada dataset, and the results verified its effectiveness.
KW - Audio classification
KW - deep metric learning
KW - loss ensemble
KW - ship-radiated noise
UR - http://www.scopus.com/inward/record.url?scp=85100842535&partnerID=8YFLogxK
U2 - 10.1109/LSP.2021.3057539
DO - 10.1109/LSP.2021.3057539
M3 - Article
AN - SCOPUS:85100842535
SN - 1070-9908
VL - 28
SP - 449
EP - 453
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
M1 - 9349209
ER -
