TY - GEN
T1 - An Audio-Quality-Based Multi-Strategy Approach for Target Speaker Extraction in the Misp 2023 Challenge
AU - Han, Runduo
AU - Yan, Xiaopeng
AU - Xu, Weiming
AU - Guo, Pengcheng
AU - Sun, Jiayao
AU - Wang, He
AU - Lu, Quan
AU - Jiang, Ning
AU - Xie, Lei
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in the Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, our approach adopts different extraction strategies based on the audio quality, striking a balance between interference removal and speech preservation, which benifits the back-end automatic speech recognition (ASR) systems. Experiments show that our approach achieves a character error rate (CER) of 24.2% and 33.2% on the Dev and Eval set, respectively, obtaining the second place in the challenge.
AB - This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in the Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, our approach adopts different extraction strategies based on the audio quality, striking a balance between interference removal and speech preservation, which benifits the back-end automatic speech recognition (ASR) systems. Experiments show that our approach achieves a character error rate (CER) of 24.2% and 33.2% on the Dev and Eval set, respectively, obtaining the second place in the challenge.
KW - automatic speech recognition
KW - target speaker extraction
UR - http://www.scopus.com/inward/record.url?scp=85202431845&partnerID=8YFLogxK
U2 - 10.1109/ICASSPW62465.2024.10627638
DO - 10.1109/ICASSPW62465.2024.10627638
M3 - 会议稿件
AN - SCOPUS:85202431845
T3 - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
SP - 27
EP - 28
BT - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024
Y2 - 14 April 2024 through 19 April 2024
ER -