TY - JOUR
T1 - Learning Noise Adapters for Incremental Speech Enhancement
AU - Yang, Ziye
AU - Song, Xiang
AU - Chen, Jie
AU - Richard, Cedric
AU - Cohen, Israel
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Incremental speech enhancement (ISE), with the ability to incrementally adapt to new noise domains, represents a critical yet comparatively under-investigated topic. While the regularization-based method has been proposed to solve the ISE task, it usually suffers from the dilemma wherein the gain of one domain directly entails the loss of another. To solve this issue, we propose an effective paradigm, termed Learning Noise Adapters (LNA), which significantly mitigates the catastrophic domain forgetting phenomenon in the ISE task. In our methodology, we employ a frozen pre-trained model to train and retain a domain-specific adapter for each newly encountered domain, enabling the capture of variations in feature distributions within these domains. Subsequently, our approach involves the development of an unsupervised, training-free noise selector for the inference stage, which is responsible for identifying the domains of test speech samples. A comprehensive experimental validation has substantiated the effectiveness of our approach.
AB - Incremental speech enhancement (ISE), with the ability to incrementally adapt to new noise domains, represents a critical yet comparatively under-investigated topic. While the regularization-based method has been proposed to solve the ISE task, it usually suffers from the dilemma wherein the gain of one domain directly entails the loss of another. To solve this issue, we propose an effective paradigm, termed Learning Noise Adapters (LNA), which significantly mitigates the catastrophic domain forgetting phenomenon in the ISE task. In our methodology, we employ a frozen pre-trained model to train and retain a domain-specific adapter for each newly encountered domain, enabling the capture of variations in feature distributions within these domains. Subsequently, our approach involves the development of an unsupervised, training-free noise selector for the inference stage, which is responsible for identifying the domains of test speech samples. A comprehensive experimental validation has substantiated the effectiveness of our approach.
KW - Catastrophic forgetting problem
KW - incremental learning
KW - noise adapter
KW - speech enhancement
UR - http://www.scopus.com/inward/record.url?scp=85207327888&partnerID=8YFLogxK
U2 - 10.1109/LSP.2024.3482171
DO - 10.1109/LSP.2024.3482171
M3 - 文章
AN - SCOPUS:85207327888
SN - 1070-9908
VL - 31
SP - 2915
EP - 2919
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
ER -