On Multi-input Multi-frame MVDR Filter for Speech Enhancement with Heterophasic Presentation

  • Zixuan Chen
  • , Hanchen Pei
  • , Jilu Jin
  • , Xueqin Luo
  • , Ningning Pan
  • , Gongping Huang
  • , Jingdong Chen
  • , Jacob Benesty

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Multi-channel speech enhancement attempts to recover a target speech signal from noisy observations by exploiting spatial information captured by a microphone array. Conventional approaches typically produce a single output that contains both the desired speech and some residual noise, which neglects the benefits of human binaural hearing system. To overcome this limitation, we propose in this work a multi-input multi-frame binaural-output (MIMFBO) noise reduction method operating in the short-time-Fourier-transform (STFT) domain. This method utilizes both inter-channel and inter-frame correlations to design binaural filters that maximize the interaural coherence (IC) of the desired speech signal while minimizing the IC of the noise, all under distortionless constraints for the desired target speech. As a result, the perceived target signal and residual noise are spatially separated, substantially enhancing speech intelligibility. Simulation results demonstrate the proposed method’s superiority, showing significant improvements in PESQ scores over both the single-input binaural-output MVDR and multi-input binaural-output MVDR approaches. Moreover, subjective listening tests confirm its perceptual benefits.

Original languageEnglish
Title of host publicationMan-Machine Speech Communication - 20th National Conference, NCMMSC 2025, Proceedings
EditorsJia Jia, Zhiyong Wu, Lijian Gao, Gongping Huang, Ya Li
PublisherSpringer Science and Business Media Deutschland GmbH
Pages408-421
Number of pages14
ISBN (Print)9789819553815
DOIs
StatePublished - 2026
Event20th National Conference on Man-Machine Speech Communication, NCMMSC 2025 - Zhenjiang, China
Duration: 16 Oct 202519 Oct 2025

Publication series

NameCommunications in Computer and Information Science
Volume2662 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference20th National Conference on Man-Machine Speech Communication, NCMMSC 2025
Country/TerritoryChina
CityZhenjiang
Period16/10/2519/10/25

Keywords

  • heterophasic presentation
  • Multi-channel binaural-output speech enhancement
  • MVDR filter
  • noise reduction

Fingerprint

Dive into the research topics of 'On Multi-input Multi-frame MVDR Filter for Speech Enhancement with Heterophasic Presentation'. Together they form a unique fingerprint.

Cite this