An asynchronous WFST-based decoder for automatic speech recognition

Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

We introduce asynchronous dynamic decoder, which adopts an efficient A* algorithm to incorporate big language models in the one-pass decoding for large vocabulary continuous speech recognition. Unlike standard one-pass decoding with on-the-fly composition decoder which might induce a significant computation overhead, the asynchronous dynamic decoder has a novel design where it has two fronts, with one performing “exploration” and the other “backfill”. The computation of the two fronts alternates in the decoding process, resulting in more effective pruning than the standard one-pass decoding with an on-the-fly composition decoder. Experiments show that the proposed decoder works notably faster than the standard one-pass decoding with on-the-fly composition decoder, while the acceleration will be more obvious with the increment of data complexity.

源语言英语
主期刊名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
6019-6023
页数5
ISBN(电子版)9781728176055
DOI
出版状态已出版 - 2021
活动2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021 - Virtual, Toronto, 加拿大
期限: 6 6月 202111 6月 2021

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2021-June
ISSN(印刷版)1520-6149

会议

会议2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021
国家/地区加拿大
Virtual, Toronto
时期6/06/2111/06/21

指纹

探究 'An asynchronous WFST-based decoder for automatic speech recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此