On Multichannel Coherent-to-Diffuse Power Ratio Estimation

Qian Xiang; Tao Lei; Chao Pan; Jingdong Chen; Jacob Benesty

doi:10.1109/JSEN.2024.3469548

School of Marine Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

The significance of the coherent-to-diffuse-power ratio (CDR) has grown in the fields of speech dereverberation and noise reduction. However, the existing CDR estimators are typically limited to applications with only two microphones. In this article, we investigate CDR estimation in multichannel acoustic systems with more than two microphones. We propose two estimation methods. The first approach involves decomposing the microphone array into several groups of subarrays, where each subarray consists of only two sensors. We estimate the CDR for each group and then fuse these group CDR estimates through weighted averaging to form the multichannel CDR estimate. This weighted-average CDR estimation can be seen as an extension of traditional two-channel CDR estimation methods to the multichannel scenario. The second method is based on array manifold estimation using a joint matrix diagonalization technique, eliminating the need for subarray decomposition. By integrating the CDR estimates with a parametric Wiener-type postfilter, we demonstrate, via simulations, the superior performance of the proposed techniques in terms of CDR estimation accuracy, signal-to-noise ratio (SNR) gain, log-spectral distortion (LSD), and direct-to-reverberant-energy ratio (DRR).
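The first approach in the abstract (split the array into two-sensor subarrays, estimate a CDR per pair, then fuse by weighted averaging) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the uniform default weights, the gain floor, and the particular Wiener-type gain formula `cdr / (cdr + mu)` are all assumptions made for illustration, and the per-pair CDR values are taken as given rather than estimated from signals.

```python
from itertools import combinations


def microphone_pairs(num_mics):
    """Enumerate all two-sensor subarrays (i, j) with i < j."""
    return list(combinations(range(num_mics), 2))


def fuse_pairwise_cdr(pair_cdrs, weights=None):
    """Fuse per-pair CDR estimates (linear scale) by weighted averaging.

    Uniform weights are an assumption; the weighting actually used in
    the paper is not given in the abstract.
    """
    if not pair_cdrs:
        raise ValueError("need at least one pairwise CDR estimate")
    if weights is None:
        weights = [1.0] * len(pair_cdrs)
    if len(weights) != len(pair_cdrs):
        raise ValueError("weights and pair_cdrs must have equal length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(w * c for w, c in zip(weights, pair_cdrs)) / total


def wiener_type_gain(cdr, mu=1.0, gain_floor=0.1):
    """Generic parametric Wiener-type gain driven by a CDR estimate.

    Hypothetical form: treats the CDR like an SNR, with mu as the
    over-subtraction parameter and gain_floor limiting attenuation.
    """
    return max(gain_floor, cdr / (cdr + mu))


if __name__ == "__main__":
    pairs = microphone_pairs(4)  # 6 two-sensor subarrays for 4 microphones
    # Placeholder per-pair CDR values for one time-frequency bin.
    cdrs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
    fused = fuse_pairwise_cdr(cdrs)
    print(len(pairs), fused, wiener_type_gain(fused))
```

In practice the fusion would run independently in each time-frequency bin, and the weights could reflect how reliable each pair is (for example, its sensor spacing), which is a design choice this sketch leaves open.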

Original language	English
Pages (from-to)	37455-37462
Number of pages	8
Journal	IEEE Sensors Journal
Volume	24
Issue number	22
DOIs	https://doi.org/10.1109/JSEN.2024.3469548
State	Published - 2024

Keywords

CDR estimation
coherent-to-diffuse-power ratio (CDR)
dereverberation
multichannel speech enhancement

Access to Document

10.1109/JSEN.2024.3469548

Cite this

@article{aca79e8d99f2407ebf9c4abadd5cd40a,

title = "On Multichannel Coherent-to-Diffuse Power Ratio Estimation",

abstract = "The significance of the coherent-to-diffuse-power ratio (CDR) has grown in the fields of speech dereverberation and noise reduction. However, the existing CDR estimators are typically limited to applications with only two microphones. In this article, we investigate CDR estimation in multichannel acoustic systems with more than two microphones. We propose two estimation methods. The first approach involves decomposing the microphone array into several groups of subarrays, where each subarray consists of only two sensors. We estimate the CDR for each group and then fuse these group CDR estimates through weighted averaging to form the multichannel CDR estimate. This weighted-average CDR estimation can be seen as an extension of traditional two-channel CDR estimation methods to the multichannel scenario. The second method is based on array manifold estimation using a joint matrix diagonalization technique, eliminating the need for subarray decomposition. By integrating the CDR estimates with a parametric Wiener-type postfilter, we demonstrate, via simulations, the superior performance of the proposed techniques in terms of CDR estimation accuracy, signal-to-noise ratio (SNR) gain, log-spectral distortion (LSD), and direct-to-reverberant-energy ratio (DRR).",

keywords = "CDR estimation, coherent-to-diffuse-power ratio (CDR), dereverberation, multichannel speech enhancement",

author = "Qian Xiang and Tao Lei and Chao Pan and Jingdong Chen and Jacob Benesty",

note = "Publisher Copyright: {\textcopyright} 2001-2012 IEEE.",

year = "2024",

doi = "10.1109/JSEN.2024.3469548",

language = "English",

volume = "24",

pages = "37455--37462",

journal = "IEEE Sensors Journal",

issn = "1530-437X",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "22",

}

TY - JOUR

T1 - On Multichannel Coherent-to-Diffuse Power Ratio Estimation

AU - Xiang, Qian

AU - Lei, Tao

AU - Pan, Chao

AU - Chen, Jingdong

AU - Benesty, Jacob

PY - 2024

Y1 - 2024

N2 - The significance of the coherent-to-diffuse-power ratio (CDR) has grown in the fields of speech dereverberation and noise reduction. However, the existing CDR estimators are typically limited to applications with only two microphones. In this article, we investigate CDR estimation in multichannel acoustic systems with more than two microphones. We propose two estimation methods. The first approach involves decomposing the microphone array into several groups of subarrays, where each subarray consists of only two sensors. We estimate the CDR for each group and then fuse these group CDR estimates through weighted averaging to form the multichannel CDR estimate. This weighted-average CDR estimation can be seen as an extension of traditional two-channel CDR estimation methods to the multichannel scenario. The second method is based on array manifold estimation using a joint matrix diagonalization technique, eliminating the need for subarray decomposition. By integrating the CDR estimates with a parametric Wiener-type postfilter, we demonstrate, via simulations, the superior performance of the proposed techniques in terms of CDR estimation accuracy, signal-to-noise ratio (SNR) gain, log-spectral distortion (LSD), and direct-to-reverberant-energy ratio (DRR).

AB - The significance of the coherent-to-diffuse-power ratio (CDR) has grown in the fields of speech dereverberation and noise reduction. However, the existing CDR estimators are typically limited to applications with only two microphones. In this article, we investigate CDR estimation in multichannel acoustic systems with more than two microphones. We propose two estimation methods. The first approach involves decomposing the microphone array into several groups of subarrays, where each subarray consists of only two sensors. We estimate the CDR for each group and then fuse these group CDR estimates through weighted averaging to form the multichannel CDR estimate. This weighted-average CDR estimation can be seen as an extension of traditional two-channel CDR estimation methods to the multichannel scenario. The second method is based on array manifold estimation using a joint matrix diagonalization technique, eliminating the need for subarray decomposition. By integrating the CDR estimates with a parametric Wiener-type postfilter, we demonstrate, via simulations, the superior performance of the proposed techniques in terms of CDR estimation accuracy, signal-to-noise ratio (SNR) gain, log-spectral distortion (LSD), and direct-to-reverberant-energy ratio (DRR).

KW - CDR estimation

KW - coherent-to-diffuse-power ratio (CDR)

KW - dereverberation

KW - multichannel speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=85206824041&partnerID=8YFLogxK

U2 - 10.1109/JSEN.2024.3469548

DO - 10.1109/JSEN.2024.3469548

M3 - Article

AN - SCOPUS:85206824041

SN - 1530-437X

VL - 24

SP - 37455

EP - 37462

JO - IEEE Sensors Journal

JF - IEEE Sensors Journal

IS - 22

ER -
