Attend to Listen: A Single-Input/Binaural-Output Heterophasic MVDR Filter for Noise Reduction and Perceptual Rendering

Ningning Pan; Jilu Jin; Xianrui Wang; Jacob Benesty; Jingdong Chen

doi:10.1109/TASLP.2024.3519895

School of Marine Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

In this paper, we present a novel single-input/binaural-output (SIBO) minimum variance distortionless response (MVDR) noise reduction method. It formulates two MVDR sub-filters, one for the left ear and the other for the right ear, by minimizing the interaural coherence of the noise signal subject to the distortionless constraint, so that the desired speech signal passes through the filter without distortion. A unique heterophasic binaural presentation is then generated. The method reduces noise while directing the desired signal and the residual noise to different directions/zones in the perceptual space. Exploiting these properties of human binaural perception enhances speech intelligibility. A deep neural network (DNN) based noise covariance matrix estimation method facilitates the implementation of the binaural heterophasic filters in simulations and listening tests. The results demonstrate the superiority of the proposed SIBO MVDR method over the conventional single-input/single-output (SISO) MVDR filter in enhancing both speech quality and intelligibility.

Original language	English
Journal	IEEE/ACM Transactions on Audio Speech and Language Processing
DOIs	https://doi.org/10.1109/TASLP.2024.3519895
State	Accepted/In press - 2024

Keywords

Binaural noise reduction
heterophasic presentation
interaural coherence
MVDR filter
single-channel input
speech intelligibility

Access to Document

10.1109/TASLP.2024.3519895

Cite this

@article{7b99964681954cb4a644937f65f512cc,

title = "Attend to Listen: A Single-Input/Binaural-Output Heterophasic MVDR Filter for Noise Reduction and Perceptual Rendering",

abstract = "In this paper, we present a novel single-input/binaural-output (SIBO) minimum variance distortionless response (MVDR) noise reduction method, which involves formulating two MVDR sub-filters, one for the left ear and the other for the right ear, by minimizing the interaural coherence of the noise signal while ensuring the distortionless constraint, so that the desired speech signal can pass through the filter without distortion. Subsequently, a unique heterophasic binaural presentation is generated. The method effectively reduces noise while directing the desired signal and residual noise to different directions/zones in the perceptual space. This utilization of human binaural perception properties enhances speech intelligibility. A deep neural network (DNN) based noise covariance matrix estimation method facilitates the implementation of the binaural heterophasic filters in simulations and listening tests. The results demonstrate the superiority of the proposed SIBO MVDR method in enhancing both speech quality and intelligibility as compared to the conventional single-input/single-output (SISO) MVDR filter.",

keywords = "Binaural noise reduction, heterophasic presentation, interaural coherence, MVDR filter, single-channel input, speech intelligibility",

author = "Ningning Pan and Jilu Jin and Xianrui Wang and Jacob Benesty and Jingdong Chen",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2024",

doi = "10.1109/TASLP.2024.3519895",

language = "English",

journal = "IEEE/ACM Transactions on Audio Speech and Language Processing",

issn = "2329-9290",

publisher = "IEEE",

}

TY - JOUR

T1 - Attend to Listen

T2 - A Single-Input/Binaural-Output Heterophasic MVDR Filter for Noise Reduction and Perceptual Rendering

AU - Pan, Ningning

AU - Jin, Jilu

AU - Wang, Xianrui

AU - Benesty, Jacob

AU - Chen, Jingdong

PY - 2024

Y1 - 2024

N2 - In this paper, we present a novel single-input/binaural-output (SIBO) minimum variance distortionless response (MVDR) noise reduction method, which involves formulating two MVDR sub-filters, one for the left ear and the other for the right ear, by minimizing the interaural coherence of the noise signal while ensuring the distortionless constraint, so that the desired speech signal can pass through the filter without distortion. Subsequently, a unique heterophasic binaural presentation is generated. The method effectively reduces noise while directing the desired signal and residual noise to different directions/zones in the perceptual space. This utilization of human binaural perception properties enhances speech intelligibility. A deep neural network (DNN) based noise covariance matrix estimation method facilitates the implementation of the binaural heterophasic filters in simulations and listening tests. The results demonstrate the superiority of the proposed SIBO MVDR method in enhancing both speech quality and intelligibility as compared to the conventional single-input/single-output (SISO) MVDR filter.

AB - In this paper, we present a novel single-input/binaural-output (SIBO) minimum variance distortionless response (MVDR) noise reduction method, which involves formulating two MVDR sub-filters, one for the left ear and the other for the right ear, by minimizing the interaural coherence of the noise signal while ensuring the distortionless constraint, so that the desired speech signal can pass through the filter without distortion. Subsequently, a unique heterophasic binaural presentation is generated. The method effectively reduces noise while directing the desired signal and residual noise to different directions/zones in the perceptual space. This utilization of human binaural perception properties enhances speech intelligibility. A deep neural network (DNN) based noise covariance matrix estimation method facilitates the implementation of the binaural heterophasic filters in simulations and listening tests. The results demonstrate the superiority of the proposed SIBO MVDR method in enhancing both speech quality and intelligibility as compared to the conventional single-input/single-output (SISO) MVDR filter.

KW - Binaural noise reduction

KW - heterophasic presentation

KW - interaural coherence

KW - MVDR filter

KW - single-channel input

KW - speech intelligibility

UR - http://www.scopus.com/inward/record.url?scp=85212868588&partnerID=8YFLogxK

U2 - 10.1109/TASLP.2024.3519895

DO - 10.1109/TASLP.2024.3519895

M3 - Article

AN - SCOPUS:85212868588

SN - 2329-9290

JO - IEEE/ACM Transactions on Audio Speech and Language Processing

JF - IEEE/ACM Transactions on Audio Speech and Language Processing

ER -
