TY - JOUR
T1 - On Multichannel Coherent-to-Diffuse Power Ratio Estimation
AU - Xiang, Qian
AU - Lei, Tao
AU - Pan, Chao
AU - Chen, Jingdong
AU - Benesty, Jacob
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - The significance of the coherent-to-diffuse-power ratio (CDR) has grown in the fields of speech dereverberation and noise reduction. However, the existing CDR estimators are typically limited to applications with only two microphones. In this article, we investigate CDR estimation in multichannel acoustic systems with more than two microphones. We propose two estimation methods. The first approach involves decomposing the microphone array into several groups of subarrays, where each subarray consists of only two sensors. We estimate the CDR for each group and then fuse these group CDR estimates through weighted averaging to form the multichannel CDR estimate. This weighted-average CDR estimation can be seen as an extension of traditional two-channel CDR estimation methods to the multichannel scenario. The second method is based on array manifold estimation using a joint matrix diagonalization technique, eliminating the need for subarray decomposition. By integrating the CDR estimates with a parametric Wiener-type postfilter, we demonstrate, via simulations, the superior performance of the proposed techniques in terms of CDR estimation accuracy, signal-to-noise ratio (SNR) gain, log-spectral distortion (LSD), and direct-to-reverberant-energy ratio (DRR).
AB - The significance of the coherent-to-diffuse-power ratio (CDR) has grown in the fields of speech dereverberation and noise reduction. However, the existing CDR estimators are typically limited to applications with only two microphones. In this article, we investigate CDR estimation in multichannel acoustic systems with more than two microphones. We propose two estimation methods. The first approach involves decomposing the microphone array into several groups of subarrays, where each subarray consists of only two sensors. We estimate the CDR for each group and then fuse these group CDR estimates through weighted averaging to form the multichannel CDR estimate. This weighted-average CDR estimation can be seen as an extension of traditional two-channel CDR estimation methods to the multichannel scenario. The second method is based on array manifold estimation using a joint matrix diagonalization technique, eliminating the need for subarray decomposition. By integrating the CDR estimates with a parametric Wiener-type postfilter, we demonstrate, via simulations, the superior performance of the proposed techniques in terms of CDR estimation accuracy, signal-to-noise ratio (SNR) gain, log-spectral distortion (LSD), and direct-to-reverberant-energy ratio (DRR).
KW - CDR estimation
KW - coherent-to-diffuse-power ratio (CDR)
KW - dereverberation
KW - multichannel speech enhancement
UR - http://www.scopus.com/inward/record.url?scp=85206824041&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2024.3469548
DO - 10.1109/JSEN.2024.3469548
M3 - 文章
AN - SCOPUS:85206824041
SN - 1530-437X
VL - 24
SP - 37455
EP - 37462
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 22
ER -