MBTFNET: Multi-Band Temporal-Frequency Neural Network for Singing Voice Enhancement

Weiming Xu, Zhouxuan Chen, Zhili Tan, Shubo Lv, Runduo Han, Wenjiang Zhou, Weifeng Zhao, Lei Xie

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

A typical neural speech enhancement (SE) approach mainly handles speech and noise mixtures, which is not optimal for singing voice enhancement scenarios where singing is often mixed with vocal-correlated accompanies and singing has substantial differences from speaking. Music source separation (MSS) models treat vocals and various accompaniment components equally, which may reduce performance compared to the model that only considers vocal enhancement. In this paper, we propose a novel multi-band temporal-frequency neural network (MBTFNet) for singing voice enhancement, which particularly removes background music, noise and even backing vocals from singing recordings. MBTFNet combines inter and intra-band modeling for better processing of full-band signals. Dual-path modeling in the temporal and frequency axis and temporal dilation blocks are introduced to expand the receptive field of the model. Particularly for removing backing vocals, we propose an implicit personalized enhancement (IPE) stage based on signal-to-noise ratio (SNR) estimation, which further improves the performance of MBTFNet. Experiments show that our proposed model significantly outperforms several state-of-the-art SE and MSS models.

源语言英语
主期刊名2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350306897
DOI
出版状态已出版 - 2023
活动2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023 - Taipei, 中国台湾
期限: 16 12月 202320 12月 2023

出版系列

姓名2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023

会议

会议2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023
国家/地区中国台湾
Taipei
时期16/12/2320/12/23

指纹

探究 'MBTFNET: Multi-Band Temporal-Frequency Neural Network for Singing Voice Enhancement' 的科研主题。它们共同构成独一无二的指纹。

引用此