Study of semi-supervised approaches to improving english-Mandarin code-switching speech recognition

Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng

科研成果: 期刊稿件会议文章同行评审

30 引用 (Scopus)

摘要

In this paper, we present our efforts to improve the performance of a code-switching speech recognition system using semi-supervised training methods from lexicon learning to acoustic modeling, on the South East Asian Mandarin-English (SEAME) data. We first investigate semi-supervised lexicon learning approach to adapt the canonical lexicon, which is meant to alleviate the heavily accented pronunciation issue within the code-switching conversation of the local area. As a result, the learned lexicon yields improved performance. Furthermore, we attempt to use semi-supervised training to deal with those transcriptions that are highly mismatched between the human transcribers and the ASR system. Specifically, we conduct semi-supervised training assuming those poorly transcribed data as unsupervised data. We found the semi-supervised acoustic modeling can lead to improved results. Finally, to make up for the limitation of the conventional n-gram language models due to the data sparsity issue, we perform lattice rescoring using neural network language models, and significant WER reduction is obtained.

源语言英语
页(从-至)1928-1932
页数5
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2018-September
DOI
出版状态已出版 - 2018
活动19th Annual Conference of the International Speech Communication, INTERSPEECH 2018 - Hyderabad, 印度
期限: 2 9月 20186 9月 2018

指纹

探究 'Study of semi-supervised approaches to improving english-Mandarin code-switching speech recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此