TY - GEN
T1 - Prosody-based sentence boundary detection in Chinese broadcast news
AU - Xie, Lei
AU - Xu, Chenglin
AU - Wang, Xiaoxuan
PY - 2012
Y1 - 2012
N2 - In this paper, we explore the use of prosodic features in sentence boundary detection in Chinese broadcast news. The prosodic features include speaker turn, music, pause duration, pitch, energy and speaking rate. Specifically, considering the Chinese tonal effects in pitch trajectory, we propose to use tone-normalized pitch features. Experiments using decision trees demonstrate that the tone-normalized pitch features show superior performance in sentence boundary detection in Chinese broadcast news. Furthermore, feature combination is able to achieve apparent performance improvement by intuitive feature interactive rules formed in the decision tree. Pause duration and a tone-normalized pitch feature contribute the most part of the feature usage in the best-performing decision tree.
AB - In this paper, we explore the use of prosodic features in sentence boundary detection in Chinese broadcast news. The prosodic features include speaker turn, music, pause duration, pitch, energy and speaking rate. Specifically, considering the Chinese tonal effects in pitch trajectory, we propose to use tone-normalized pitch features. Experiments using decision trees demonstrate that the tone-normalized pitch features show superior performance in sentence boundary detection in Chinese broadcast news. Furthermore, feature combination is able to achieve apparent performance improvement by intuitive feature interactive rules formed in the decision tree. Pause duration and a tone-normalized pitch feature contribute the most part of the feature usage in the best-performing decision tree.
KW - rich transcription
KW - sentence boundary detection
KW - sentence segmentation
KW - speech prosody
UR - http://www.scopus.com/inward/record.url?scp=84874466791&partnerID=8YFLogxK
U2 - 10.1109/ISCSLP.2012.6423471
DO - 10.1109/ISCSLP.2012.6423471
M3 - 会议稿件
AN - SCOPUS:84874466791
SN - 9781467325059
T3 - 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
SP - 261
EP - 265
BT - 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
T2 - 2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012
Y2 - 5 December 2012 through 8 December 2012
ER -