TY - JOUR
T1 - On the compromise between noise reduction and speech/noise spatial information preservation in binaural speech enhancement
AU - Leng, Xin
AU - Chen, Jingdong
AU - Benesty, Jacob
N1 - Publisher Copyright:
© 2021 Acoustical Society of America.
PY - 2021/5/1
Y1 - 2021/5/1
N2 - Spatial information is important for human perception of speech and sound signals. However, this information is often either distorted or completely neglected in noise reduction because it is challenging, to say the least, to achieve optimal noise reduction and accurate spatial information preservation at the same time. This paper studies the problem of binaural speech enhancement. By jointly diagonalizing the speech and noise correlation matrices, we present a method to construct the noise reduction filter as a linear combination of different eigenvectors, which span a certain subspace of the entire space. A different dimension of the subspace gives a different trade-off between noise reduction and speech/noise spatial information preservation. On the one side, if the dimension is equal to 1, maximum noise reduction is achieved but at the price of significant spatial information distortion. On the other extreme, if the dimension of the subspace is equal to that of the entire space, spatial information is accurately preserved but at the cost of no noise reduction. Therefore, one can achieve different levels of compromises between the amount of noise reduction and the level of speech/noise spatial information preservation by adjusting the dimension of the used subspace.
AB - Spatial information is important for human perception of speech and sound signals. However, this information is often either distorted or completely neglected in noise reduction because it is challenging, to say the least, to achieve optimal noise reduction and accurate spatial information preservation at the same time. This paper studies the problem of binaural speech enhancement. By jointly diagonalizing the speech and noise correlation matrices, we present a method to construct the noise reduction filter as a linear combination of different eigenvectors, which span a certain subspace of the entire space. A different dimension of the subspace gives a different trade-off between noise reduction and speech/noise spatial information preservation. On the one side, if the dimension is equal to 1, maximum noise reduction is achieved but at the price of significant spatial information distortion. On the other extreme, if the dimension of the subspace is equal to that of the entire space, spatial information is accurately preserved but at the cost of no noise reduction. Therefore, one can achieve different levels of compromises between the amount of noise reduction and the level of speech/noise spatial information preservation by adjusting the dimension of the used subspace.
UR - http://www.scopus.com/inward/record.url?scp=85105851878&partnerID=8YFLogxK
U2 - 10.1121/10.0004854
DO - 10.1121/10.0004854
M3 - 文章
C2 - 34241094
AN - SCOPUS:85105851878
SN - 0001-4966
VL - 149
SP - 3151
EP - 3162
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 5
ER -