Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation

Jianyu Wang, Shanzheng Guan, Jingdong Chen, Jacob Benesty

科研成果: 期刊稿件会议文章同行评审

摘要

The so-called independent low-rank matrix analysis (ILRMA) has demonstrated a great potential for dealing with the problem of determined blind source separation (BSS) for audio and speech signals. This method assumes that the spectra from different frequency bands are independent and the spectral coefficients in any frequency band are Gaussian distributed. The Itakura-Saito divergence is then employed to estimate the source model related parameters. In reality, however, the spectral coefficients from different frequency bands may be dependent, which is not considered in the existing ILRMA algorithm. This paper presents an improved version of ILRMA, which considers the dependency between the spectral coefficients from different frequency bands. The Sinkhorn divergence is then exploited to optimize the source model parameters. As a result of using the cross-band information, the BSS performance is improved. But the number of parameters to be estimated also increases significantly, and so is the computational complexity. To reduce the algorithm complexity, we apply the Kronecker product to decompose the modeling matrix into the product of a number of matrices of much smaller dimensionality. An efficient algorithm is then developed to implement the Sinkhorn divergence based BSS algorithm and the complexity is reduced by an order of magnitude.

源语言英语
期刊Proceedings of the International Congress on Acoustics
出版状态已出版 - 2022
活动24th International Congress on Acoustics, ICA 2022 - Gyeongju, 韩国
期限: 24 10月 202228 10月 2022

指纹

探究 'Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation' 的科研主题。它们共同构成独一无二的指纹。

引用此