TY - JOUR
T1 - A brief overview of speech enhancement with linear filtering
AU - Benesty, Jacob
AU - Christensen, Mads Græsbøll
AU - Jensen, Jesper Rindom
AU - Chen, Jingdong
N1 - Publisher Copyright:
© 2014, Benesty et al.; licensee Springer.
PY - 2014
Y1 - 2014
N2 - In this paper, we provide an overview of some recently introduced principles and ideas for speech enhancement with linear filtering and explore how these are related and how they can be used in various applications. This is done in a general framework where the speech enhancement problem is stated as a signal vector estimation problem, i.e., with a filter matrix, where the estimate is obtained by means of a matrix-vector product of the filter matrix and the noisy signal vector. In this framework, minimum distortion, minimum variance distortionless response (MVDR), tradeoff, maximum signal-to-noise ratio (SNR), and Wiener filters are derived from the conventional speech enhancement approach and the recently introduced orthogonal decomposition approach. For each of the filters, we derive their properties in terms of output SNR and speech distortion. We then demonstrate how the ideas can be applied to single- and multichannel noise reduction in both the time and frequency domains as well as binaural noise reduction.
AB - In this paper, we provide an overview of some recently introduced principles and ideas for speech enhancement with linear filtering and explore how these are related and how they can be used in various applications. This is done in a general framework where the speech enhancement problem is stated as a signal vector estimation problem, i.e., with a filter matrix, where the estimate is obtained by means of a matrix-vector product of the filter matrix and the noisy signal vector. In this framework, minimum distortion, minimum variance distortionless response (MVDR), tradeoff, maximum signal-to-noise ratio (SNR), and Wiener filters are derived from the conventional speech enhancement approach and the recently introduced orthogonal decomposition approach. For each of the filters, we derive their properties in terms of output SNR and speech distortion. We then demonstrate how the ideas can be applied to single- and multichannel noise reduction in both the time and frequency domains as well as binaural noise reduction.
KW - Binaural
KW - Frequency domain
KW - Multichannel
KW - Noise reduction
KW - Optimal linear filtering
KW - Orthogonal decomposition
KW - Performance measures
KW - Single-channel
KW - Speech enhancement
KW - Time domain
UR - http://www.scopus.com/inward/record.url?scp=84911944231&partnerID=8YFLogxK
U2 - 10.1186/1687-6180-2014-162
DO - 10.1186/1687-6180-2014-162
M3 - 文献综述
AN - SCOPUS:84911944231
SN - 1931-7573
VL - 2014
JO - Nanoscale Research Letters
JF - Nanoscale Research Letters
IS - 1
M1 - 162
ER -