A binaural heterophasic adaptive beamformer and its deep learning assisted implementation

Jilu Jin; Ningning Pan; Jingdong Chen; Jacob Benesty; Yiqian Yang

doi:10.1016/j.patrec.2023.02.025

School of Marine Science and Technology

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Beamforming is one of the most effective approaches to distant sound acquisition in complex acoustic environments, where noise, reverberation, and interference coexist; as a result, considerable effort has been devoted to it over the last few decades. However, conventional beamformers produce a monaural output or colinear outputs, which are not optimal from a perceptual perspective. To take advantage of the properties of human binaural hearing, a new type of fixed beamforming method was developed recently, which attempts not only to attenuate noise but also to render the signal of interest and the residual noise into different perceptual regions, thereby achieving higher speech intelligibility. This work extends the principle of fixed binaural beamforming and develops a binaural heterophasic minimum variance distortionless response (MVDR) beamformer, whose implementation is assisted by a deep neural network (DNN) based noise estimation method. The heterophasic MVDR beamformer is advantageous over the traditional MVDR beamformer in that it renders the desired source signal and the residual noise into different perceptual regions, thereby yielding higher intelligibility. In comparison with fixed binaural heterophasic beamformers, it can exploit the statistics of the noise to achieve better array performance. Results of simulations and listening tests validate the properties of the proposed technique.
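For readers unfamiliar with the MVDR criterion named in the abstract, the following is a minimal sketch of the classical single-output MVDR weight computation, w = R^{-1} d / (d^H R^{-1} d), at one frequency bin. It is not the binaural heterophasic beamformer of the paper, whose formulation is not given on this page. The function names `solve` and `mvdr_weights`, the two-microphone steering vector `d`, and the noise covariance matrix `R` are all hypothetical values chosen for illustration.

```python
import cmath


def solve(A, b):
    """Solve A x = b for a small complex square system.

    Uses Gaussian elimination with partial pivoting on a copy of the inputs.
    """
    n = len(b)
    M = [list(A[i]) + [b[i]] for i in range(n)]  # augmented matrix [A | b]
    for col in range(n):
        # Swap in the row with the largest pivot magnitude for stability.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    # Back substitution on the upper-triangular system.
    x = [0j] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def mvdr_weights(noise_cov, steering):
    """Classical MVDR weights: w = R^{-1} d / (d^H R^{-1} d)."""
    r_inv_d = solve(noise_cov, steering)
    denom = sum(d.conjugate() * v for d, v in zip(steering, r_inv_d))
    return [v / denom for v in r_inv_d]


# Hypothetical two-microphone example (illustrative values only).
d = [1 + 0j, cmath.exp(-1j * 0.5)]        # assumed steering vector
R = [[1.0 + 0j, 0.3 + 0.1j],
     [0.3 - 0.1j, 1.0 + 0j]]              # assumed Hermitian noise covariance

w = mvdr_weights(R, d)

# Distortionless constraint: the response toward the source, w^H d, equals 1.
response = sum(wi.conjugate() * di for wi, di in zip(w, d))

# Residual noise power at the beamformer output, w^H R w.
Rw = [sum(R[i][j] * w[j] for j in range(2)) for i in range(2)]
noise_power = sum(w[i].conjugate() * Rw[i] for i in range(2)).real
```

In this sketch the noise covariance matrix is simply given; according to the abstract, the paper obtains the noise statistics with a DNN-based noise estimation method.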

Original language	English
Pages (from-to)	24-30
Number of pages	7
Journal	Pattern Recognition Letters
Volume	168
DOI	https://doi.org/10.1016/j.patrec.2023.02.025
Publication status	Published - Apr 2023

Cite this

@article{544f7f9f68fa45c192763e5f2d861afb,

title = "A binaural heterophasic adaptive beamformer and its deep learning assisted implementation",

abstract = "Beamforming is one of the most effective approaches to distant sound acquisition in complex acoustic environments, where noise, reverberation, and interference coexist; as a result, a significant number of efforts have been devoted to it over the last few decades. However, conventional beamformers produce a monaural output or colinear outputs, which are not optimal from the perception perspective. To take advantage of the human binaural hearing properties, a new type of fixed beamforming methods were developed recently, which attempt not only to attenuate noise but also render the signal of interest and residual noise into different perceptual regions, thereby achieving higher speech intelligibility. This work extends the principle of fixed binaural beamforming and develops a binaural heterophasic minimum variance distortionless response (MVDR) beamformer. A deep neural network (DNN) based noise estimation method is used to assist the implementation of this heterophasic MVDR beamformer, which is advantageous over the traditional one as it renders the desired source signal and residual noise to different perceptual regions, thereby yielding higher intelligibility. In comparison with the fixed binaural heterophasic beamformers, it can take advantage of the statistics of the noise to achieve better array performance. Results of simulations and listening tests validate the properties of the proposed technique.",

keywords = "Adaptive beamforming, Binaural beamforming, Deep neural network, heterophasic, Interaural coherence",

author = "Jilu Jin and Ningning Pan and Jingdong Chen and Jacob Benesty and Yiqian Yang",

note = "Publisher Copyright: {\textcopyright} 2023",

year = "2023",

month = apr,

doi = "10.1016/j.patrec.2023.02.025",

language = "English",

volume = "168",

pages = "24--30",

journal = "Pattern Recognition Letters",

issn = "0167-8655",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - A binaural heterophasic adaptive beamformer and its deep learning assisted implementation

AU - Jin, Jilu

AU - Pan, Ningning

AU - Chen, Jingdong

AU - Benesty, Jacob

AU - Yang, Yiqian

PY - 2023/4

Y1 - 2023/4

N2 - Beamforming is one of the most effective approaches to distant sound acquisition in complex acoustic environments, where noise, reverberation, and interference coexist; as a result, a significant number of efforts have been devoted to it over the last few decades. However, conventional beamformers produce a monaural output or colinear outputs, which are not optimal from the perception perspective. To take advantage of the human binaural hearing properties, a new type of fixed beamforming methods were developed recently, which attempt not only to attenuate noise but also render the signal of interest and residual noise into different perceptual regions, thereby achieving higher speech intelligibility. This work extends the principle of fixed binaural beamforming and develops a binaural heterophasic minimum variance distortionless response (MVDR) beamformer. A deep neural network (DNN) based noise estimation method is used to assist the implementation of this heterophasic MVDR beamformer, which is advantageous over the traditional one as it renders the desired source signal and residual noise to different perceptual regions, thereby yielding higher intelligibility. In comparison with the fixed binaural heterophasic beamformers, it can take advantage of the statistics of the noise to achieve better array performance. Results of simulations and listening tests validate the properties of the proposed technique.

AB - Beamforming is one of the most effective approaches to distant sound acquisition in complex acoustic environments, where noise, reverberation, and interference coexist; as a result, a significant number of efforts have been devoted to it over the last few decades. However, conventional beamformers produce a monaural output or colinear outputs, which are not optimal from the perception perspective. To take advantage of the human binaural hearing properties, a new type of fixed beamforming methods were developed recently, which attempt not only to attenuate noise but also render the signal of interest and residual noise into different perceptual regions, thereby achieving higher speech intelligibility. This work extends the principle of fixed binaural beamforming and develops a binaural heterophasic minimum variance distortionless response (MVDR) beamformer. A deep neural network (DNN) based noise estimation method is used to assist the implementation of this heterophasic MVDR beamformer, which is advantageous over the traditional one as it renders the desired source signal and residual noise to different perceptual regions, thereby yielding higher intelligibility. In comparison with the fixed binaural heterophasic beamformers, it can take advantage of the statistics of the noise to achieve better array performance. Results of simulations and listening tests validate the properties of the proposed technique.

KW - Adaptive beamforming

KW - Binaural beamforming

KW - Deep neural network

KW - heterophasic

KW - Interaural coherence

UR - http://www.scopus.com/inward/record.url?scp=85149183557&partnerID=8YFLogxK

U2 - 10.1016/j.patrec.2023.02.025

DO - 10.1016/j.patrec.2023.02.025

M3 - Article

AN - SCOPUS:85149183557

SN - 0167-8655

VL - 168

SP - 24

EP - 30

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

ER -
