Multi-scale feature based salient environmental sound recognition for machine awareness

Jingyu Wang; Ke Zhang; Kurash Madani; Christophe Sabourin

doi:10.1109/ICAwST.2014.6981837

Multi-scale feature based salient environmental sound recognition for machine awareness

Jingyu Wang, Ke Zhang, Kurash Madani, Christophe Sabourin

School of Astronautics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

Auditory perception of surrounding environment is important to machine awareness. To provide artificial awareness ability for machines, a bio-inspired salient environmental sound detection and recognition method is proposed. The salient sounds are detected by using the auditory saliency map which based on heterogeneous saliency features from visual and acoustic domain. Spectral and temporal saliency features from both power spectral density (PSD) and mel-frequency cepstral coefficients (MFCC) as well as the visual saliency from log-scale spectrogram are applied to yield the final auditory saliency for salient sound detection. To improve the detection accuracy, short-term Shannon entropy (SSE) and a computational inhibition of return (IOR) model are initially proposed to verify the temporal saliency characteristic. The detected salient sounds are classified by using the features which based on the fuzzy vector of spectral energy distribution and MFCC. A two-level classification is presented based on the support vector machine (SVM) for recognition task. Experiments are carried out on the real environmental sound examples. The results show that, over 83% recognition accuracy can be achieved by using proposed fuzzy vector based features, and the overall accuracy of 94.65%

Original language	English
Title of host publication	2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781479973736
DOIs	https://doi.org/10.1109/ICAwST.2014.6981837
State	Published - 9 Dec 2014
Event	6th IEEE International Conference on Awareness Science and Technology, iCAST 2014 - Paris, France Duration: 29 Oct 2014 → 31 Oct 2014

Publication series

Name	2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014

Conference

Conference	6th IEEE International Conference on Awareness Science and Technology, iCAST 2014
Country/Territory	France
City	Paris
Period	29/10/14 → 31/10/14

Keywords

artificial awareness
environment sound signal
fuzzy vector
heterogeneous information
MFCC
saliency feature fusion
SVM

Access to Document

10.1109/ICAwST.2014.6981837

Cite this

Wang, J., Zhang, K., Madani, K., & Sabourin, C. (2014). Multi-scale feature based salient environmental sound recognition for machine awareness. In 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014 Article 6981837 (2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICAwST.2014.6981837

Wang, Jingyu ; Zhang, Ke ; Madani, Kurash et al. / Multi-scale feature based salient environmental sound recognition for machine awareness. 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014. Institute of Electrical and Electronics Engineers Inc., 2014. (2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014).

@inproceedings{99abcb0aee014f218e19656459a57d08,

title = "Multi-scale feature based salient environmental sound recognition for machine awareness",

abstract = "Auditory perception of surrounding environment is important to machine awareness. To provide artificial awareness ability for machines, a bio-inspired salient environmental sound detection and recognition method is proposed. The salient sounds are detected by using the auditory saliency map which based on heterogeneous saliency features from visual and acoustic domain. Spectral and temporal saliency features from both power spectral density (PSD) and mel-frequency cepstral coefficients (MFCC) as well as the visual saliency from log-scale spectrogram are applied to yield the final auditory saliency for salient sound detection. To improve the detection accuracy, short-term Shannon entropy (SSE) and a computational inhibition of return (IOR) model are initially proposed to verify the temporal saliency characteristic. The detected salient sounds are classified by using the features which based on the fuzzy vector of spectral energy distribution and MFCC. A two-level classification is presented based on the support vector machine (SVM) for recognition task. Experiments are carried out on the real environmental sound examples. The results show that, over 83% recognition accuracy can be achieved by using proposed fuzzy vector based features, and the overall accuracy of 94.65%",

keywords = "artificial awareness, environment sound signal, fuzzy vector, heterogeneous information, MFCC, saliency feature fusion, SVM",

author = "Jingyu Wang and Ke Zhang and Kurash Madani and Christophe Sabourin",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.; 6th IEEE International Conference on Awareness Science and Technology, iCAST 2014 ; Conference date: 29-10-2014 Through 31-10-2014",

year = "2014",

month = dec,

day = "9",

doi = "10.1109/ICAwST.2014.6981837",

language = "英语",

series = "2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014",

}

Wang, J, Zhang, K, Madani, K & Sabourin, C 2014, Multi-scale feature based salient environmental sound recognition for machine awareness. in 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014., 6981837, 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014, Institute of Electrical and Electronics Engineers Inc., 6th IEEE International Conference on Awareness Science and Technology, iCAST 2014, Paris, France, 29/10/14. https://doi.org/10.1109/ICAwST.2014.6981837

Multi-scale feature based salient environmental sound recognition for machine awareness. / Wang, Jingyu; Zhang, Ke; Madani, Kurash et al.
2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014. Institute of Electrical and Electronics Engineers Inc., 2014. 6981837 (2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Multi-scale feature based salient environmental sound recognition for machine awareness

AU - Wang, Jingyu

AU - Zhang, Ke

AU - Madani, Kurash

AU - Sabourin, Christophe

PY - 2014/12/9

Y1 - 2014/12/9

N2 - Auditory perception of surrounding environment is important to machine awareness. To provide artificial awareness ability for machines, a bio-inspired salient environmental sound detection and recognition method is proposed. The salient sounds are detected by using the auditory saliency map which based on heterogeneous saliency features from visual and acoustic domain. Spectral and temporal saliency features from both power spectral density (PSD) and mel-frequency cepstral coefficients (MFCC) as well as the visual saliency from log-scale spectrogram are applied to yield the final auditory saliency for salient sound detection. To improve the detection accuracy, short-term Shannon entropy (SSE) and a computational inhibition of return (IOR) model are initially proposed to verify the temporal saliency characteristic. The detected salient sounds are classified by using the features which based on the fuzzy vector of spectral energy distribution and MFCC. A two-level classification is presented based on the support vector machine (SVM) for recognition task. Experiments are carried out on the real environmental sound examples. The results show that, over 83% recognition accuracy can be achieved by using proposed fuzzy vector based features, and the overall accuracy of 94.65%

AB - Auditory perception of surrounding environment is important to machine awareness. To provide artificial awareness ability for machines, a bio-inspired salient environmental sound detection and recognition method is proposed. The salient sounds are detected by using the auditory saliency map which based on heterogeneous saliency features from visual and acoustic domain. Spectral and temporal saliency features from both power spectral density (PSD) and mel-frequency cepstral coefficients (MFCC) as well as the visual saliency from log-scale spectrogram are applied to yield the final auditory saliency for salient sound detection. To improve the detection accuracy, short-term Shannon entropy (SSE) and a computational inhibition of return (IOR) model are initially proposed to verify the temporal saliency characteristic. The detected salient sounds are classified by using the features which based on the fuzzy vector of spectral energy distribution and MFCC. A two-level classification is presented based on the support vector machine (SVM) for recognition task. Experiments are carried out on the real environmental sound examples. The results show that, over 83% recognition accuracy can be achieved by using proposed fuzzy vector based features, and the overall accuracy of 94.65%

KW - artificial awareness

KW - environment sound signal

KW - fuzzy vector

KW - heterogeneous information

KW - MFCC

KW - saliency feature fusion

KW - SVM

UR - http://www.scopus.com/inward/record.url?scp=84920527868&partnerID=8YFLogxK

U2 - 10.1109/ICAwST.2014.6981837

DO - 10.1109/ICAwST.2014.6981837

M3 - 会议稿件

AN - SCOPUS:84920527868

T3 - 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014

BT - 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 6th IEEE International Conference on Awareness Science and Technology, iCAST 2014

Y2 - 29 October 2014 through 31 October 2014

ER -

Wang J, Zhang K, Madani K, Sabourin C. Multi-scale feature based salient environmental sound recognition for machine awareness. In 2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014. Institute of Electrical and Electronics Engineers Inc. 2014. 6981837. (2014 IEEE 6th International Conference on Awareness Science and Technology, iCAST 2014). doi: 10.1109/ICAwST.2014.6981837

Multi-scale feature based salient environmental sound recognition for machine awareness

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this