End-to-end sound field reproduction based on deep learning

Xi Hong, Bokai Du, Shuang Yang, Menghui Lei, Xiangyang Zeng

科研成果: 期刊稿件文章同行评审

8 引用 (Scopus)

摘要

Sound field reproduction, which attempts to create a virtual acoustic environment, is a fundamental technology in the achievement of virtual reality. In sound field reproduction, the driving signals of the loudspeakers are calculated by considering the signals collected by the microphones and working environment of the reproduction system. In this paper, an end-to-end reproduction method based on deep learning is proposed. The inputs and outputs of this system are the sound-pressure signals recorded by microphones and the driving signals of loudspeakers, respectively. A convolutional autoencoder network with skip connections in the frequency domain is used. Furthermore, sparse layers are applied to capture the sparse features of the sound field. Simulation results show that the reproduction errors of the proposed method are lower than those generated by the conventional pressure matching and least absolute shrinkage and selection operator methods, especially at high frequencies. Experiments were performed under conditions of single and multiple primary sources. The results in both cases demonstrate that the proposed method achieves better high-frequency performance than the conventional methods.

源语言英语
页(从-至)3055-3064
页数10
期刊Journal of the Acoustical Society of America
153
5
DOI
出版状态已出版 - 1 5月 2023

指纹

探究 'End-to-end sound field reproduction based on deep learning' 的科研主题。它们共同构成独一无二的指纹。

引用此