Sound field reconstruction using neural processes with dynamic kernels

Zining Liang, Wen Zhang, Thushara D. Abhayapala

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. In recent studies, there has been a notable emphasis on efficiently estimating sound fields from a limited number of discrete observations. In particular, kernel-based methods using Gaussian processes (GPs) with a covariance function to model spatial correlations have been proposed. However, the current methods rely on pre-defined kernels for modeling, requiring the manual identification of optimal kernels and their parameters for different sound fields. In this work, we propose a novel approach that parameterizes GPs using a deep neural network based on neural processes (NPs) to reconstruct the magnitude of the sound field. This method has the advantage of dynamically learning kernels from data using an attention mechanism, allowing for greater flexibility and adaptability to the acoustic properties of the sound field. Numerical experiments demonstrate that our proposed approach outperforms current methods in reconstruction accuracy, providing a promising alternative for sound field reconstruction.
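To make the contrast in the abstract concrete, the sketch below shows the kernel-based baseline it refers to: Gaussian-process interpolation of sound-field magnitude from a few microphone observations using a pre-defined RBF kernel. This is not the authors' neural-process method; the positions, toy magnitude field, kernel choice, and hyper-parameters are illustrative assumptions. The paper's proposal replaces the fixed kernel with spatial correlations learned from data via attention.

```python
import numpy as np

def rbf_kernel(X1, X2, length_scale=0.3):
    """Squared-exponential covariance between two sets of 2-D positions."""
    d2 = np.sum((X1[:, None, :] - X2[None, :, :]) ** 2, axis=-1)
    return np.exp(-0.5 * d2 / length_scale**2)

def gp_predict(X_obs, y_obs, X_new, noise=1e-3, length_scale=0.3):
    """GP posterior mean and variance of the field magnitude at X_new."""
    K = rbf_kernel(X_obs, X_obs, length_scale) + noise * np.eye(len(X_obs))
    K_s = rbf_kernel(X_new, X_obs, length_scale)
    K_ss = rbf_kernel(X_new, X_new, length_scale)
    alpha = np.linalg.solve(K, y_obs)
    mean = K_s @ alpha
    cov = K_ss - K_s @ np.linalg.solve(K, K_s.T)
    return mean, np.diag(cov)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Illustrative setup: 8 microphones in a 2 m x 2 m region, toy magnitude field.
    X_obs = rng.uniform(-1.0, 1.0, size=(8, 2))
    y_obs = np.cos(3.0 * X_obs[:, 0]) * np.cos(3.0 * X_obs[:, 1])
    X_new = rng.uniform(-1.0, 1.0, size=(5, 2))  # unobserved points to reconstruct
    mean, var = gp_predict(X_obs, y_obs, X_new)
    for p, m, v in zip(X_new, mean, var):
        print(f"at {p.round(2)}: magnitude ~ {m:+.3f} (var {v:.3e})")
```

The pre-defined `length_scale` here is exactly the kind of manually tuned kernel parameter the abstract argues against; in the NP formulation such correlations are produced dynamically by the network for each sound field.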

Original language: English
Article number: 13
Journal: EURASIP Journal on Audio, Speech, and Music Processing
Volume: 2024
Issue: 1
DOI
Publication status: Published - Dec 2024
