TY - JOUR
T1 - The NPU-Elevoc Personalized Speech Enhancement System for ICASSP2023 DNS Challenge
AU - Yan, Xiaopeng
AU - Yang, Yindi
AU - Guo, Zhihao
AU - Peng, Liangliang
AU - Xie, Lei
N1 - Publisher Copyright: © 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - This paper describes our NPU-Elevoc personalized speech enhancement system (NAPSE) for the 5th Deep Noise Suppression Challenge [1] at ICASSP 2023. Based on the superior two-stage model TEA-PSE 2.0 [2], our system explores a better strategy for speaker embedding fusion, optimizes the model training pipeline, and leverages adversarial training and a multi-scale loss. According to the challenge results, our system tied for 1st place in the headset track (track 1) and ranked 2nd in the speakerphone track (track 2).
AB - This paper describes our NPU-Elevoc personalized speech enhancement system (NAPSE) for the 5th Deep Noise Suppression Challenge [1] at ICASSP 2023. Based on the superior two-stage model TEA-PSE 2.0 [2], our system explores a better strategy for speaker embedding fusion, optimizes the model training pipeline, and leverages adversarial training and a multi-scale loss. According to the challenge results, our system tied for 1st place in the headset track (track 1) and ranked 2nd in the speakerphone track (track 2).
KW - deep learning
KW - generative adversarial network
KW - personalized speech enhancement
KW - real-time
UR - http://www.scopus.com/inward/record.url?scp=85174794590&partnerID=8YFLogxK
U2 - 10.1109/ICASSP49357.2023.10096362
DO - 10.1109/ICASSP49357.2023.10096362
M3 - Conference article
AN - SCOPUS:85174794590
SN - 1520-6149
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
T2 - 48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Y2 - 4 June 2023 through 10 June 2023
ER -