Abstract
Ambisonic room impulse responses (ARIRs) are recorded to capture the spatial acoustic characteristics of specific rooms, with widespread applications in virtual and augmented reality. While the first-order Ambisonics (FOA) microphone array is commonly employed for three-dimensional (3D) room acoustics recording due to its easy accessibility, higher spatial resolution necessitates using higher-order Ambisonics (HOA) in applications such as binaural rendering and sound field reconstruction. This paper introduces a novel approach, leveraging generative models to upmix ARIRs. The evaluation results validate the model’s effectiveness at upmixing first-order ARIRs to higher-order representations, surpassing the aliasing frequency limitations. Furthermore, the spectral errors observed in the Binaural Room Transfer Functions (BRTFs) indicate the potential benefits of using upmixed ARIRs for binaural rendering, significantly improving rendering accuracy.
| Original language | English |
|---|---|
| Article number | 11810 |
| Journal | Applied Sciences (Switzerland) |
| Volume | 13 |
| Issue number | 21 |
| DOIs | |
| State | Published - Nov 2023 |
Keywords
- Ambisonics
- generative model
- upmix
Fingerprint
Dive into the research topics of 'Upmix B-Format Ambisonic Room Impulse Responses Using a Generative Model'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver