A Weighted Binary Cross-Entropy for Sound Event Representation Learning and Few-Shot Classification

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

The performance of sound event classification, including detection or tagging, depends heavily on the number of training samples and the quality of the training data. This paper presents an approach to improving sound event classification performance for events with limited training samples through using a weighted binary cross-entropy loss function. This function aims to constrain the representation space to have lower intra-class variance and higher inter-class differences by mining difficult samples and applying stricter penalties. Experiments demonstrate that the proposed method outperforms the existing ones, and the improvement is particularly significant in scenarios with limited training samples.

Original languageEnglish
Title of host publication2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1069-1074
Number of pages6
ISBN (Electronic)9798350300673
DOIs
StatePublished - 2023
Event2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 - Taipei, Taiwan, Province of China
Duration: 31 Oct 20233 Nov 2023

Publication series

Name2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023

Conference

Conference2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
Country/TerritoryTaiwan, Province of China
CityTaipei
Period31/10/233/11/23

Fingerprint

Dive into the research topics of 'A Weighted Binary Cross-Entropy for Sound Event Representation Learning and Few-Shot Classification'. Together they form a unique fingerprint.

Cite this