Environmental sound classification using central auditory representations and deep neural networks

Kean Chen, Lixue Yang, Zhiming Sang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This study presents an effective environmental sound classification method that combines central auditory representations and deep neural networks. The central auditory representation characterizes the responses of sound in primary auditory cortex and is related directly to human auditory perception. Meanwhile, the deep neural network is composed by no less than two hidden layers and is good at detecting high-level invariant features associated with the objects. An environmental sound classification task with 11 distinct classes is used for evaluation, and the timbre features and deep neural networks are combined and used as a baseline. The results show that the proposed method outperforms the baseline, and the highest accuracy of 89.94% is achieved by a two-layer network with 300 hidden nodes each layer.

源语言英语
主期刊名ICSV 2016 - 23rd International Congress on Sound and Vibration
主期刊副标题From Ancient to Modern Acoustics
出版商International Institute of Acoustics and Vibrations
ISBN(电子版)9789609922623
出版状态已出版 - 2016
活动23rd International Congress on Sound and Vibration, ICSV 2016 - Athens, 希腊
期限: 10 7月 201614 7月 2016

出版系列

姓名ICSV 2016 - 23rd International Congress on Sound and Vibration: From Ancient to Modern Acoustics

会议

会议23rd International Congress on Sound and Vibration, ICSV 2016
国家/地区希腊
Athens
时期10/07/1614/07/16

指纹

探究 'Environmental sound classification using central auditory representations and deep neural networks' 的科研主题。它们共同构成独一无二的指纹。

引用此