TY - GEN
T1 - Espresso
T2 - 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019
AU - Wang, Yiming
AU - Khudanpur, Sanjeev
AU - Chen, Tongfei
AU - Xu, Hainan
AU - Ding, Shuoyang
AU - Lv, Hang
AU - Shao, Yiwen
AU - Peng, Nanyun
AU - Xie, Lei
AU - Watanabe, Shinji
N1 - Publisher Copyright: © 2019 IEEE.
PY - 2019/12
Y1 - 2019/12
N2 - We present Espresso, an open-source, modular, extensible end-to-end neural automatic speech recognition (ASR) toolkit based on the deep learning library PyTorch and the popular neural machine translation toolkit FAIRSEQ. Espresso supports distributed training across GPUs and computing nodes, and features various decoding approaches commonly employed in ASR, including look-ahead word-based language model fusion, for which a fast, parallelized decoder is implemented. Espresso achieves state-of-the-art ASR performance on the WSJ, LibriSpeech, and Switchboard data sets among other end-to-end systems without data augmentation, and is 4-11x faster for decoding than similar systems (e.g. ESPNET).
AB - We present Espresso, an open-source, modular, extensible end-to-end neural automatic speech recognition (ASR) toolkit based on the deep learning library PyTorch and the popular neural machine translation toolkit FAIRSEQ. Espresso supports distributed training across GPUs and computing nodes, and features various decoding approaches commonly employed in ASR, including look-ahead word-based language model fusion, for which a fast, parallelized decoder is implemented. Espresso achieves state-of-the-art ASR performance on the WSJ, LibriSpeech, and Switchboard data sets among other end-to-end systems without data augmentation, and is 4-11x faster for decoding than similar systems (e.g. ESPNET).
KW - automatic speech recognition
KW - end-to-end
KW - language model fusion
KW - parallel decoding
UR - http://www.scopus.com/inward/record.url?scp=85081601429&partnerID=8YFLogxK
U2 - 10.1109/ASRU46091.2019.9003968
DO - 10.1109/ASRU46091.2019.9003968
M3 - Conference contribution
AN - SCOPUS:85081601429
T3 - 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings
SP - 136
EP - 143
BT - 2019 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 December 2019 through 18 December 2019
ER -