A two-stage multi-feature integration approach to unsupervised speaker change detection in real-time news broadcasting

Lei Xie, Guangsen Wang

科研成果: 书/报告/会议事项章节会议稿件同行评审

6 引用 (Scopus)

摘要

This paper presents a two-stage multi-feature integration approach for unsupervised speaker change detection in real-time news broadcasting. We integrate MFCC and LSP features (i.e. a perceptual feature plus a articulatory feature) in the metric-based potential speaker change detection stage to collect speaker boundary candidates as many as possible. We adopt a weighted Bayesian information criterion (BIC) to integrate boundary decisions from MFCC and LSP features in the speaker boundary confirmation stage. This multi-feature integration strategy makes use of the complementarity between perceptual features and articulatory features to achieve a performance gain. Speaker change detection experiments show that the multi-feature integration approach significantly outperforms the individual features with relative improvements of 26% over the LSP-only approach and 6% over the MFCC-only approach.

源语言英语
主期刊名Proceedings - 2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008
350-353
页数4
DOI
出版状态已出版 - 2008
活动2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008 - Kunming, 中国
期限: 16 12月 200819 12月 2008

出版系列

姓名Proceedings - 2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008

会议

会议2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008
国家/地区中国
Kunming
时期16/12/0819/12/08

指纹

探究 'A two-stage multi-feature integration approach to unsupervised speaker change detection in real-time news broadcasting' 的科研主题。它们共同构成独一无二的指纹。

引用此