A visualized acoustic saliency feature extraction method for environment sound signal processing

Jingyu Wang, Ke Zhang, Kurosh Madani, Christophe Sabourin

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Environment perception is an important research issue for both unmanned ground vehicles and robots. To improve the capacity of perception, a visualized acoustic saliency feature extraction (VASFE) method based on both the short-time Fourier transform (STFT) and the Mel-Frequency Cepstrum Coefficient (MFCC) for environment sound signal processing is proposed in this paper. Sound signal is visualized by using the STFT algorithm as local image feature and the Mel-Frequency Cepstrum Coefficient (MFCC) is used to represent the local acoustic feature of the signal. The proposed VASFE method is tested by the natural sound data which collected from real world of both indoor and outdoor environment. The results show that this method is able to extract the saliency features of both long-term and short-term sound signal successfully and clearly, and conducts to very distinguishable features for future processing of environment sound information.

源语言英语
主期刊名2013 IEEE International Conference of IEEE Region 10, IEEE TENCON 2013 - Conference Proceedings
DOI
出版状态已出版 - 2013
活动2013 IEEE International Conference of IEEE Region 10, IEEE TENCON 2013 - Xi'an, Shaanxi, 中国
期限: 22 10月 201325 10月 2013

出版系列

姓名IEEE Region 10 Annual International Conference, Proceedings/TENCON
ISSN(印刷版)2159-3442
ISSN(电子版)2159-3450

会议

会议2013 IEEE International Conference of IEEE Region 10, IEEE TENCON 2013
国家/地区中国
Xi'an, Shaanxi
时期22/10/1325/10/13

指纹

探究 'A visualized acoustic saliency feature extraction method for environment sound signal processing' 的科研主题。它们共同构成独一无二的指纹。

引用此