TY - GEN
T1 - Magnitude Modelling of Individualized HRTFs Using DNN Based Spherical Harmonic Analysis
AU - Xi, Jingwei
AU - Zhang, Wen
AU - Abhayapala, Thushara D.
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Data-driven individualized HRTF models, such as DNN-based models, strongly depend on the data while most existing HRTF databases have a limited number of measurements on listening subjects. In addition, different HRTF databases have different measurement positions and conditions, making the fusion of these databases challenging. This paper proposes a method for magnitude modelling of an individualized HRTF using DNN-based spherical harmonic analysis of the HRTF data. The HRTF log-magnitude spectra are decomposed using spherical harmonics, based on which multiple HRTF databases can be combined to effectively increase the amount of data for the follow-up DNN modelling. Then, multiple sub-networks are trained to map between the anthropometric parameters and decomposed spherical harmonic coefficients, based on which the individualized HRTF magnitude spectra are generated. Through experimental validation and comparison with the existing data-driven approaches, we show that the proposed method has more accurate modelling performance and lower modelling complexity. In addition, with predicted spherical harmonic coefficients, an individualized HRTF of arbitrary direction can be generated.
AB - Data-driven individualized HRTF models, such as DNN-based models, strongly depend on the data while most existing HRTF databases have a limited number of measurements on listening subjects. In addition, different HRTF databases have different measurement positions and conditions, making the fusion of these databases challenging. This paper proposes a method for magnitude modelling of an individualized HRTF using DNN-based spherical harmonic analysis of the HRTF data. The HRTF log-magnitude spectra are decomposed using spherical harmonics, based on which multiple HRTF databases can be combined to effectively increase the amount of data for the follow-up DNN modelling. Then, multiple sub-networks are trained to map between the anthropometric parameters and decomposed spherical harmonic coefficients, based on which the individualized HRTF magnitude spectra are generated. Through experimental validation and comparison with the existing data-driven approaches, we show that the proposed method has more accurate modelling performance and lower modelling complexity. In addition, with predicted spherical harmonic coefficients, an individualized HRTF of arbitrary direction can be generated.
KW - DNN
KW - HRTF Magnitude Modelling
KW - Spatial Audio
KW - Spherical Harmonics
UR - http://www.scopus.com/inward/record.url?scp=85123444990&partnerID=8YFLogxK
U2 - 10.1109/WASPAA52581.2021.9632704
DO - 10.1109/WASPAA52581.2021.9632704
M3 - 会议稿件
AN - SCOPUS:85123444990
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
SP - 266
EP - 270
BT - 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021
Y2 - 17 October 2021 through 20 October 2021
ER -