Abstract
Humans handle everyday sounds more intelligently than robots or other unmanned ground vehicles because of an instinctive sense of awareness: the ability to distinguish the most salient sounds, objects, or events in the surrounding environment. Inspired by the biological acoustic awareness of the human hearing system and the visual saliency ability of human vision, this paper proposes a heterogeneous information saliency feature fusion (HISFF) approach that simulates human awareness of environmental sound to give machines a comparable awareness. The sound signal is visualized using the Short-Time Fourier Transform (STFT) so that acoustic saliency can be converted into visual saliency, and Mel-Frequency Cepstral Coefficients (MFCC) are used to represent human acoustic awareness. The proposed HISFF approach is tested on environmental sound data collected in real-world indoor and outdoor environments. The results show that the approach successfully and clearly extracts the salient signal from both long-term and short-term sound signals and yields highly distinguishable features for machine awareness based on environmental sounds.
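The visualization step mentioned in the abstract, turning a sound signal into a spectrogram with the STFT, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `stft_spectrogram`, the Hann window, the frame length of 256 samples, the hop of 128 samples, and the synthetic 1 kHz test tone are all assumptions chosen for illustration, and the saliency extraction and MFCC fusion stages of HISFF are not shown.

```python
import numpy as np


def stft_spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform.

    frame_len and hop are illustrative defaults, not values from the paper.
    Returns an array of shape (frame_len // 2 + 1, n_frames).
    """
    signal = np.asarray(signal, dtype=float)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")

    window = np.hanning(frame_len)
    n_frames = 1 + (signal.size - frame_len) // hop

    # Slice the signal into overlapping frames and apply the window to each.
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )

    # rfft keeps only the non-negative frequency bins of each real-valued frame.
    return np.abs(np.fft.rfft(frames, axis=1)).T


# Hypothetical input: one second of a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = stft_spectrogram(tone)
peak_bin = int(spec.mean(axis=1).argmax())
peak_hz = peak_bin * sample_rate / 256  # bin index to frequency in Hz
```

Each column of `spec` is the spectrum of one short frame, so the array can be rendered as an image whose bright regions mark dominant frequencies over time. That image view is what makes it possible to apply visual saliency methods to sound.
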
Original language | English |
---|---|
Pages | 197-204 |
Number of pages | 8 |
State | Published - 2013 |
Event | 2013 International Joint Conference on Awareness Science and Technology, iCAST 2013 and 6th International Conference on Ubi-Media Computing, UMEDIA 2013 - Aizuwakamatsu, Japan. Duration: 2 Nov 2013 → 4 Nov 2013 |
Conference
Conference | 2013 International Joint Conference on Awareness Science and Technology, iCAST 2013 and 6th International Conference on Ubi-Media Computing, UMEDIA 2013 |
---|---|
Country/Territory | Japan |
City | Aizuwakamatsu |
Period | 2/11/13 → 4/11/13 |
Keywords
- Environment sound signal
- Heterogeneous information
- Machine's awareness
- MFCC
- Saliency feature fusion
- Spectrogram
- STFT algorithm