LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION

Kaien Mo; Xianrui Wang; Yichen Yang; Shoji Makino; Jingdong Chen

doi:10.23919/eusipco63174.2024.10715020

School of Marine Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Blind audio source separation (BASS) techniques, particularly those with low latency, play an important role in a wide range of real-time systems, e.g., hearing aids, in-car hands-free voice communication, and real-time human-machine interaction. Most existing BASS algorithms are designed to run in batch mode, so large latency is unavoidable. Recently, some online algorithms have been developed that perform separation on a frame-by-frame basis in the short-time Fourier transform (STFT) domain, which significantly reduces latency compared with batch methods. However, the latency of these algorithms may still be too long for many real-time systems. To further reduce latency while achieving good separation performance, we propose in this work to integrate a weighted prediction error (WPE) module into non-causal sample-truncating-based independent vector analysis (NST-IVA). Provided that the delay introduced by WPE is appropriately controlled, the resulting algorithm maintains the same algorithmic delay as NST-IVA while achieving significantly better performance, as validated by simulations.
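
To make the latency terms in the abstract concrete, the short sketch below compares the algorithmic delay of frame-by-frame STFT processing with that of a time-domain filter whose non-causal part has been truncated. It rests on two assumptions that are not stated in the abstract: that the algorithmic delay of overlap-add STFT processing equals the analysis window length, and that a truncated filter incurs a delay equal to the number of non-causal taps it retains. The function names, sampling rate, window length, and tap count are all illustrative and are not taken from the paper.

```python
def stft_delay_ms(window_len: int, sample_rate: int) -> float:
    """Algorithmic delay of frame-wise STFT processing in milliseconds.

    Assumption: the delay equals the analysis window length, since a full
    frame must be buffered before it can be transformed and processed.
    """
    return 1000.0 * window_len / sample_rate


def truncated_delay_ms(noncausal_taps: int, sample_rate: int) -> float:
    """Algorithmic delay of a truncated time-domain filter in milliseconds.

    Assumption: only the retained non-causal taps require look-ahead, so the
    delay equals the number of those taps.
    """
    return 1000.0 * noncausal_taps / sample_rate


if __name__ == "__main__":
    fs = 16000  # hypothetical sampling rate in Hz
    print("STFT frame-wise delay (ms):", stft_delay_ms(1024, fs))
    print("Truncated-filter delay (ms):", truncated_delay_ms(64, fs))
```

Under these assumed values, shortening the look-ahead from a full 1024-sample window to 64 retained non-causal taps reduces the delay by a factor of 16. Any additional module, such as a dereverberation stage, would have to fit within the same look-ahead budget to leave the overall delay unchanged.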

Original language	English
Title of host publication	32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings
Publisher	European Signal Processing Conference, EUSIPCO
Pages	912-916
Number of pages	5
ISBN (Electronic)	9789464593617
DOIs	https://doi.org/10.23919/eusipco63174.2024.10715020
State	Published - 2024
Event	32nd European Signal Processing Conference, EUSIPCO 2024 - Lyon, France Duration: 26 Aug 2024 → 30 Aug 2024

Publication series

Name	European Signal Processing Conference
ISSN (Print)	2219-5491

Conference

Conference	32nd European Signal Processing Conference, EUSIPCO 2024
Country/Territory	France
City	Lyon
Period	26/08/24 → 30/08/24

Keywords

Independent vector analysis
algorithmic delay
non-causal sample truncating technique
weighted prediction error

Cite this

Mo, K., Wang, X., Yang, Y., Makino, S., & Chen, J. (2024). LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION. In 32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings (pp. 912-916). (European Signal Processing Conference). European Signal Processing Conference, EUSIPCO. https://doi.org/10.23919/eusipco63174.2024.10715020

@inproceedings{11f58fd3fcf74d24b195847692c9115c,

title = "LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION",

keywords = "Independent vector analysis, algorithmic delay, non-causal sample truncating technique, weighted prediction error",

author = "Kaien Mo and Xianrui Wang and Yichen Yang and Shoji Makino and Jingdong Chen",

note = "Publisher Copyright: {\textcopyright} 2024 European Signal Processing Conference, EUSIPCO. All rights reserved.; 32nd European Signal Processing Conference, EUSIPCO 2024 ; Conference date: 26-08-2024 Through 30-08-2024",

year = "2024",

doi = "10.23919/eusipco63174.2024.10715020",

language = "English",

series = "European Signal Processing Conference",

publisher = "European Signal Processing Conference, EUSIPCO",

pages = "912--916",

booktitle = "32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings",

}

Mo, K, Wang, X, Yang, Y, Makino, S & Chen, J 2024, LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION. in 32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings. European Signal Processing Conference, European Signal Processing Conference, EUSIPCO, pp. 912-916, 32nd European Signal Processing Conference, EUSIPCO 2024, Lyon, France, 26/08/24. https://doi.org/10.23919/eusipco63174.2024.10715020

LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION. / Mo, Kaien; Wang, Xianrui; Yang, Yichen et al.
32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings. European Signal Processing Conference, EUSIPCO, 2024. p. 912-916 (European Signal Processing Conference).

TY - GEN

T1 - LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION

AU - Mo, Kaien

AU - Wang, Xianrui

AU - Yang, Yichen

AU - Makino, Shoji

AU - Chen, Jingdong

PY - 2024

Y1 - 2024

KW - Independent vector analysis

KW - algorithmic delay

KW - non-causal sample truncating technique

KW - weighted prediction error

UR - http://www.scopus.com/inward/record.url?scp=85208418491&partnerID=8YFLogxK

U2 - 10.23919/eusipco63174.2024.10715020

DO - 10.23919/eusipco63174.2024.10715020

M3 - Conference contribution

AN - SCOPUS:85208418491

T3 - European Signal Processing Conference

SP - 912

EP - 916

BT - 32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings

PB - European Signal Processing Conference, EUSIPCO

T2 - 32nd European Signal Processing Conference, EUSIPCO 2024

Y2 - 26 August 2024 through 30 August 2024

ER -

Mo K, Wang X, Yang Y, Makino S, Chen J. LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION. In 32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings. European Signal Processing Conference, EUSIPCO. 2024. p. 912-916. (European Signal Processing Conference). doi: 10.23919/eusipco63174.2024.10715020
