Heterogeneous information saliency features' fusion approach for machine's environment sounds based awareness

Jingyu Wang, Ke Zhang, Kurosh Madani, Christophe Sabourin

Research output: Contribution to conferencePaperpeer-review

Abstract

Human beings are more intelligent in dealing with sound which occurred in everyday life than robots or other kind of unmanned ground vehicles because of the instinct of sense or awareness, which is an ability to distinguish the most salient sound, object or events in the surrounding environment. Inspired by the biological acoustic awareness of human hearing system and the visual saliency talent of human vision, a heterogeneous information saliency feature fusion (HISFF) approach which simulates human awareness of environment sound for machine's awareness is proposed in this paper. The sound signal is visualized by using the Short-Time Fourier Transform (STFT) algorithm in order to convert the acoustic saliency into visual saliency, and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the human acoustic awareness. The proposed HISFF approach is tested by using the environment sound data which collected from the real world of both indoor and outdoor environment. The results show that this approach is able to extract the saliency signal from both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for machine's environment sounds based awareness.

Original languageEnglish
Pages197-204
Number of pages8
DOIs
StatePublished - 2013
Event2013 International Joint Conference on Awareness Science and Technology, iCAST 2013 and 6th International Conference on Ubi-Media Computing, UMEDIA 2013 - Aizuwakamatsu, Japan
Duration: 2 Nov 20134 Nov 2013

Conference

Conference2013 International Joint Conference on Awareness Science and Technology, iCAST 2013 and 6th International Conference on Ubi-Media Computing, UMEDIA 2013
Country/TerritoryJapan
CityAizuwakamatsu
Period2/11/134/11/13

Keywords

  • Environment sound signal
  • Heterogeneous information
  • Machine's awareness
  • MFCC
  • Saliency feature fusion
  • Spectrogram
  • STFT algorithm

Fingerprint

Dive into the research topics of 'Heterogeneous information saliency features' fusion approach for machine's environment sounds based awareness'. Together they form a unique fingerprint.

Cite this