TY - GEN
T1 - Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
AU - Xue, Hongfei
AU - Gong, Rong
AU - Shao, Mingchen
AU - Xu, Xin
AU - Wang, Lezhi
AU - Xie, Lei
AU - Bu, Hui
AU - Zhou, Jiaming
AU - Qin, Yong
AU - Du, Jun
AU - Li, Ming
AU - Zhang, Binbin
AU - Jia, Bin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The StutteringSpeech Challenge focuses on advancing speech technologies for people who stutter, specifically targeting Stuttering Event Detection (SED) and Automatic Speech Recognition (ASR) in Mandarin. The challenge comprises three tracks: (1) SED, which aims to develop systems for detection of stuttering events; (2) ASR, which focuses on creating robust systems for recognizing stuttered speech; and (3) Research track for innovative approaches utilizing the provided dataset. We utilizes an open-source Mandarin stuttering dataset AS-70, which has been split into new training and test sets for the challenge. This paper presents the dataset, details the challenge tracks, and analyzes the performance of the top systems, highlighting improvements in detection accuracy and reductions in recognition error rates. Our findings underscore the potential of specialized models and augmentation strategies in developing stuttered speech technologies.
AB - The StutteringSpeech Challenge focuses on advancing speech technologies for people who stutter, specifically targeting Stuttering Event Detection (SED) and Automatic Speech Recognition (ASR) in Mandarin. The challenge comprises three tracks: (1) SED, which aims to develop systems for detection of stuttering events; (2) ASR, which focuses on creating robust systems for recognizing stuttered speech; and (3) Research track for innovative approaches utilizing the provided dataset. We utilizes an open-source Mandarin stuttering dataset AS-70, which has been split into new training and test sets for the challenge. This paper presents the dataset, details the challenge tracks, and analyzes the performance of the top systems, highlighting improvements in detection accuracy and reductions in recognition error rates. Our findings underscore the potential of specialized models and augmentation strategies in developing stuttered speech technologies.
KW - Mandarin stuttered speech
KW - speech recognition
KW - stuttering event detection
UR - http://www.scopus.com/inward/record.url?scp=85217391996&partnerID=8YFLogxK
U2 - 10.1109/SLT61566.2024.10832208
DO - 10.1109/SLT61566.2024.10832208
M3 - 会议稿件
AN - SCOPUS:85217391996
T3 - Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024
SP - 385
EP - 392
BT - Proceedings of 2024 IEEE Spoken Language Technology Workshop, SLT 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE Spoken Language Technology Workshop, SLT 2024
Y2 - 2 December 2024 through 5 December 2024
ER -