TY - JOUR
T1 - Research on the Evaluation Model of Sound Quality in Vehicles Based on Dynamic Activated Mel-Spectrogram
AU - Yan, Xinlong
AU - Tang, Zhao
AU - Li, Shuang
AU - Li, Cheng
AU - Chen, Kean
N1 - Publisher Copyright:
© 2025 International Institute of Acoustics and Vibrations. All rights reserved.
PY - 2025
Y1 - 2025
N2 - To address the high experimental cost of the subjective evaluation of interior vehicle sound quality, this paper proposes an objective evaluation model of sound quality based on the annoyance level of interior noise. First, noise samples of different models under different working conditions were collected. Second, subjective experiments were carried out with annoyance as the evaluation index to construct the in-vehicle noise data set. In order to include both static and continuity features in the model input, we performed two differencing and activation of the Mel-Spectrogram to extract a new dynamic activated Mel-Spectrogram (DAM) by using the original Mel-Spectrogram to learn the dynamic weights obtained after activation. Then the DAM is fed into ResNet152 (Residual Networks 152) for sound quality prediction and the network is optimized using ECA (Efficient Channel Attention). After a large amount of data training, the model obtained an accuracy of 98.87% on the test set. Finally, according to the classification accuracy and time consumed, the proposed model is compared with other models, and the comparison results show that the proposed model has excellent performance and good sound quality evaluation ability, which can lay a practical foundation for sound quality improvement tasks.
AB - To address the high experimental cost of the subjective evaluation of interior vehicle sound quality, this paper proposes an objective evaluation model of sound quality based on the annoyance level of interior noise. First, noise samples of different models under different working conditions were collected. Second, subjective experiments were carried out with annoyance as the evaluation index to construct the in-vehicle noise data set. In order to include both static and continuity features in the model input, we performed two differencing and activation of the Mel-Spectrogram to extract a new dynamic activated Mel-Spectrogram (DAM) by using the original Mel-Spectrogram to learn the dynamic weights obtained after activation. Then the DAM is fed into ResNet152 (Residual Networks 152) for sound quality prediction and the network is optimized using ECA (Efficient Channel Attention). After a large amount of data training, the model obtained an accuracy of 98.87% on the test set. Finally, according to the classification accuracy and time consumed, the proposed model is compared with other models, and the comparison results show that the proposed model has excellent performance and good sound quality evaluation ability, which can lay a practical foundation for sound quality improvement tasks.
UR - http://www.scopus.com/inward/record.url?scp=105001151222&partnerID=8YFLogxK
U2 - 10.20855/ijav.2025.30.12088
DO - 10.20855/ijav.2025.30.12088
M3 - 文章
AN - SCOPUS:105001151222
SN - 1027-5851
VL - 30
SP - 12
EP - 21
JO - International Journal of Acoustics and Vibrations
JF - International Journal of Acoustics and Vibrations
IS - 1
ER -