TY - JOUR
T1 - Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation
AU - Wang, Jianyu
AU - Guan, Shanzheng
AU - Chen, Jingdong
AU - Benesty, Jacob
N1 - Publisher Copyright:
© 2022 Proceedings of the International Congress on Acoustics. All rights reserved.
PY - 2022
Y1 - 2022
N2 - The so-called independent low-rank matrix analysis (ILRMA) has demonstrated a great potential for dealing with the problem of determined blind source separation (BSS) for audio and speech signals. This method assumes that the spectra from different frequency bands are independent and the spectral coefficients in any frequency band are Gaussian distributed. The Itakura-Saito divergence is then employed to estimate the source model related parameters. In reality, however, the spectral coefficients from different frequency bands may be dependent, which is not considered in the existing ILRMA algorithm. This paper presents an improved version of ILRMA, which considers the dependency between the spectral coefficients from different frequency bands. The Sinkhorn divergence is then exploited to optimize the source model parameters. As a result of using the cross-band information, the BSS performance is improved. But the number of parameters to be estimated also increases significantly, and so is the computational complexity. To reduce the algorithm complexity, we apply the Kronecker product to decompose the modeling matrix into the product of a number of matrices of much smaller dimensionality. An efficient algorithm is then developed to implement the Sinkhorn divergence based BSS algorithm and the complexity is reduced by an order of magnitude.
AB - The so-called independent low-rank matrix analysis (ILRMA) has demonstrated a great potential for dealing with the problem of determined blind source separation (BSS) for audio and speech signals. This method assumes that the spectra from different frequency bands are independent and the spectral coefficients in any frequency band are Gaussian distributed. The Itakura-Saito divergence is then employed to estimate the source model related parameters. In reality, however, the spectral coefficients from different frequency bands may be dependent, which is not considered in the existing ILRMA algorithm. This paper presents an improved version of ILRMA, which considers the dependency between the spectral coefficients from different frequency bands. The Sinkhorn divergence is then exploited to optimize the source model parameters. As a result of using the cross-band information, the BSS performance is improved. But the number of parameters to be estimated also increases significantly, and so is the computational complexity. To reduce the algorithm complexity, we apply the Kronecker product to decompose the modeling matrix into the product of a number of matrices of much smaller dimensionality. An efficient algorithm is then developed to implement the Sinkhorn divergence based BSS algorithm and the complexity is reduced by an order of magnitude.
KW - Blind source separation (BSS)
KW - Independent low-rank matrix analysis (ILRMA)
KW - Kronecker product
KW - Sinkhorn distance
UR - http://www.scopus.com/inward/record.url?scp=85192564197&partnerID=8YFLogxK
M3 - 会议文章
AN - SCOPUS:85192564197
SN - 2226-7808
JO - Proceedings of the International Congress on Acoustics
JF - Proceedings of the International Congress on Acoustics
T2 - 24th International Congress on Acoustics, ICA 2022
Y2 - 24 October 2022 through 28 October 2022
ER -