Research on the Evaluation Model of Sound Quality in Vehicles Based on Dynamic Activated Mel-Spectrogram

Xinlong Yan; Zhao Tang; Shuang Li; Cheng Li; Kean Chen

doi:10.20855/ijav.2025.30.12088

Research on the Evaluation Model of Sound Quality in Vehicles Based on Dynamic Activated Mel-Spectrogram

Xinlong Yan, Zhao Tang, Shuang Li, Cheng Li, Kean Chen

School of Marine Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

To address the high experimental cost of the subjective evaluation of interior vehicle sound quality, this paper proposes an objective evaluation model of sound quality based on the annoyance level of interior noise. First, noise samples of different models under different working conditions were collected. Second, subjective experiments were carried out with annoyance as the evaluation index to construct the in-vehicle noise data set. In order to include both static and continuity features in the model input, we performed two differencing and activation of the Mel-Spectrogram to extract a new dynamic activated Mel-Spectrogram (DAM) by using the original Mel-Spectrogram to learn the dynamic weights obtained after activation. Then the DAM is fed into ResNet152 (Residual Networks 152) for sound quality prediction and the network is optimized using ECA (Efficient Channel Attention). After a large amount of data training, the model obtained an accuracy of 98.87% on the test set. Finally, according to the classification accuracy and time consumed, the proposed model is compared with other models, and the comparison results show that the proposed model has excellent performance and good sound quality evaluation ability, which can lay a practical foundation for sound quality improvement tasks.

Original language	English
Pages (from-to)	12-21
Number of pages	10
Journal	International Journal of Acoustics and Vibrations
Volume	30
Issue number	1
DOIs	https://doi.org/10.20855/ijav.2025.30.12088
State	Published - 2025

Access to Document

10.20855/ijav.2025.30.12088

Cite this

@article{378170c689c64adf8feb73a60db0f7a2,

title = "Research on the Evaluation Model of Sound Quality in Vehicles Based on Dynamic Activated Mel-Spectrogram",

abstract = "To address the high experimental cost of the subjective evaluation of interior vehicle sound quality, this paper proposes an objective evaluation model of sound quality based on the annoyance level of interior noise. First, noise samples of different models under different working conditions were collected. Second, subjective experiments were carried out with annoyance as the evaluation index to construct the in-vehicle noise data set. In order to include both static and continuity features in the model input, we performed two differencing and activation of the Mel-Spectrogram to extract a new dynamic activated Mel-Spectrogram (DAM) by using the original Mel-Spectrogram to learn the dynamic weights obtained after activation. Then the DAM is fed into ResNet152 (Residual Networks 152) for sound quality prediction and the network is optimized using ECA (Efficient Channel Attention). After a large amount of data training, the model obtained an accuracy of 98.87% on the test set. Finally, according to the classification accuracy and time consumed, the proposed model is compared with other models, and the comparison results show that the proposed model has excellent performance and good sound quality evaluation ability, which can lay a practical foundation for sound quality improvement tasks.",

author = "Xinlong Yan and Zhao Tang and Shuang Li and Cheng Li and Kean Chen",

year = "2025",

doi = "10.20855/ijav.2025.30.12088",

language = "英语",

volume = "30",

pages = "12--21",

journal = "International Journal of Acoustics and Vibrations",

issn = "1027-5851",

publisher = "International Institute of Acoustics and Vibrations",

number = "1",

}

TY - JOUR

T1 - Research on the Evaluation Model of Sound Quality in Vehicles Based on Dynamic Activated Mel-Spectrogram

AU - Yan, Xinlong

AU - Tang, Zhao

AU - Li, Shuang

AU - Li, Cheng

AU - Chen, Kean

PY - 2025

Y1 - 2025

N2 - To address the high experimental cost of the subjective evaluation of interior vehicle sound quality, this paper proposes an objective evaluation model of sound quality based on the annoyance level of interior noise. First, noise samples of different models under different working conditions were collected. Second, subjective experiments were carried out with annoyance as the evaluation index to construct the in-vehicle noise data set. In order to include both static and continuity features in the model input, we performed two differencing and activation of the Mel-Spectrogram to extract a new dynamic activated Mel-Spectrogram (DAM) by using the original Mel-Spectrogram to learn the dynamic weights obtained after activation. Then the DAM is fed into ResNet152 (Residual Networks 152) for sound quality prediction and the network is optimized using ECA (Efficient Channel Attention). After a large amount of data training, the model obtained an accuracy of 98.87% on the test set. Finally, according to the classification accuracy and time consumed, the proposed model is compared with other models, and the comparison results show that the proposed model has excellent performance and good sound quality evaluation ability, which can lay a practical foundation for sound quality improvement tasks.

AB - To address the high experimental cost of the subjective evaluation of interior vehicle sound quality, this paper proposes an objective evaluation model of sound quality based on the annoyance level of interior noise. First, noise samples of different models under different working conditions were collected. Second, subjective experiments were carried out with annoyance as the evaluation index to construct the in-vehicle noise data set. In order to include both static and continuity features in the model input, we performed two differencing and activation of the Mel-Spectrogram to extract a new dynamic activated Mel-Spectrogram (DAM) by using the original Mel-Spectrogram to learn the dynamic weights obtained after activation. Then the DAM is fed into ResNet152 (Residual Networks 152) for sound quality prediction and the network is optimized using ECA (Efficient Channel Attention). After a large amount of data training, the model obtained an accuracy of 98.87% on the test set. Finally, according to the classification accuracy and time consumed, the proposed model is compared with other models, and the comparison results show that the proposed model has excellent performance and good sound quality evaluation ability, which can lay a practical foundation for sound quality improvement tasks.

UR - http://www.scopus.com/inward/record.url?scp=105001151222&partnerID=8YFLogxK

U2 - 10.20855/ijav.2025.30.12088

DO - 10.20855/ijav.2025.30.12088

M3 - 文章

AN - SCOPUS:105001151222

SN - 1027-5851

VL - 30

SP - 12

EP - 21

JO - International Journal of Acoustics and Vibrations

JF - International Journal of Acoustics and Vibrations

IS - 1

ER -

Research on the Evaluation Model of Sound Quality in Vehicles Based on Dynamic Activated Mel-Spectrogram

Abstract

Access to Document

Other files and links

Fingerprint

Cite this