Optimizing Dysarthria Wake-Up Word Spotting: an End-to-End Approach For SLT 2024 LRDWWS Challenge

Shuiyun Liu, Yuxiang Kong, Pengcheng Guo, Weiji Zhuang, Peng Gao, Yujun Wang, Lei Xie

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Speech has emerged as a widely embraced user interface across diverse applications. However, for individuals with dysarthria, the inherent variability in their speech poses significant challenges. This paper presents an end-to-end Pretrain-based Dual-filter Dysarthria Wake-up word Spotting (PD-DWS) system for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge. Specifically, our system improves performance from two key perspectives: audio modeling and dual-filter strategy. For audio modeling, we propose an innovative 2 branch- d2v2 model based on the pre-trained data2vec 2d2v2), which can simultaneously model automatic speech recognition (ASR) and wake-up word spotting (WWS) tasks through a unified multi-task finetuning paradigm. Additionally, a dual-filter strategy is introduced to reduce the false accept rate (FAR) while maintaining the same false reject rate (FRR). Experimental results demonstrate that our PD-DWS system achieves an FAR of 0.00321 and an FRR of 0.005, with a total score of 0.00821 on the test-B eval set, securing first place in the challenge.

源语言英语
主期刊名Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024
出版商Institute of Electrical and Electronics Engineers Inc.
578-585
页数8
ISBN(电子版)9798350392258
DOI
出版状态已出版 - 2024
活动2024 IEEE Spoken Language Technology Workshop, SLT 2024 - Macao, 中国
期限: 2 12月 20245 12月 2024

出版系列

姓名Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024

会议

会议2024 IEEE Spoken Language Technology Workshop, SLT 2024
国家/地区中国
Macao
时期2/12/245/12/24

指纹

探究 'Optimizing Dysarthria Wake-Up Word Spotting: an End-to-End Approach For SLT 2024 LRDWWS Challenge' 的科研主题。它们共同构成独一无二的指纹。

引用此