A reverberation compensation method for speaker recognition in rooms

Xiangyang Zeng; Qiang Wang

A reverberation compensation method for speaker recognition in rooms

Xiangyang Zeng, Qiang Wang

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

Abstract

To overcome the problem that the accuracy of speaker recognition systems in rooms descends rapidly as a result of the mismatch between training and testing environments, a differential feature extraction method based on reverberation compensation has been brought forward. Different from the recognition phase that uses traditional MFCCs, Schroeder inverse integration is applied to obtaining the energy decay curve in rooms, so that reverberation can be compensated for MFCC features of pure sound signals in training phase. Furthermore MFCCs are processed by CMN (Cepstral Mean Normalization) and RASTA to suppress the room channel effect. The experimental results in different real rooms with various reverberation degrees and their analysis have shown preliminarily that the method we presented can enhance the recognition rate and performs well in suppressing the influence of reverberation.

Original language	English
Pages (from-to)	420-425
Number of pages	6
Journal	Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
Volume	33
Issue number	3
State	Published - 1 Jun 2015

Keywords

Cepstral mean normalization (CMN)
Covariance matrix
Energy dissipation
Experiments
Feature extraction
Identification (control systems)
Identification of MFCC feature with reverberation compensation model
Integration
REMOS (reverberation models)
Reverberation
RIR (Room Impulse Response)
Schematic diagrams
Schroeder inverse integration
Speaker recognition
Stability
Testing

Cite this

@article{551737a8c34f401cacd66a91837324e5,

title = "A reverberation compensation method for speaker recognition in rooms",

abstract = "To overcome the problem that the accuracy of speaker recognition systems in rooms descends rapidly as a result of the mismatch between training and testing environments, a differential feature extraction method based on reverberation compensation has been brought forward. Different from the recognition phase that uses traditional MFCCs, Schroeder inverse integration is applied to obtaining the energy decay curve in rooms, so that reverberation can be compensated for MFCC features of pure sound signals in training phase. Furthermore MFCCs are processed by CMN (Cepstral Mean Normalization) and RASTA to suppress the room channel effect. The experimental results in different real rooms with various reverberation degrees and their analysis have shown preliminarily that the method we presented can enhance the recognition rate and performs well in suppressing the influence of reverberation.",

keywords = "Cepstral mean normalization (CMN), Covariance matrix, Energy dissipation, Experiments, Feature extraction, Identification (control systems), Identification of MFCC feature with reverberation compensation model, Integration, REMOS (reverberation models), Reverberation, RIR (Room Impulse Response), Schematic diagrams, Schroeder inverse integration, Speaker recognition, Stability, Testing",

author = "Xiangyang Zeng and Qiang Wang",

year = "2015",

month = jun,

day = "1",

language = "英语",

volume = "33",

pages = "420--425",

journal = "Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University",

issn = "1000-2758",

publisher = "Northwestern Polytechnical University",

number = "3",

}

TY - JOUR

T1 - A reverberation compensation method for speaker recognition in rooms

AU - Zeng, Xiangyang

AU - Wang, Qiang

PY - 2015/6/1

Y1 - 2015/6/1

N2 - To overcome the problem that the accuracy of speaker recognition systems in rooms descends rapidly as a result of the mismatch between training and testing environments, a differential feature extraction method based on reverberation compensation has been brought forward. Different from the recognition phase that uses traditional MFCCs, Schroeder inverse integration is applied to obtaining the energy decay curve in rooms, so that reverberation can be compensated for MFCC features of pure sound signals in training phase. Furthermore MFCCs are processed by CMN (Cepstral Mean Normalization) and RASTA to suppress the room channel effect. The experimental results in different real rooms with various reverberation degrees and their analysis have shown preliminarily that the method we presented can enhance the recognition rate and performs well in suppressing the influence of reverberation.

AB - To overcome the problem that the accuracy of speaker recognition systems in rooms descends rapidly as a result of the mismatch between training and testing environments, a differential feature extraction method based on reverberation compensation has been brought forward. Different from the recognition phase that uses traditional MFCCs, Schroeder inverse integration is applied to obtaining the energy decay curve in rooms, so that reverberation can be compensated for MFCC features of pure sound signals in training phase. Furthermore MFCCs are processed by CMN (Cepstral Mean Normalization) and RASTA to suppress the room channel effect. The experimental results in different real rooms with various reverberation degrees and their analysis have shown preliminarily that the method we presented can enhance the recognition rate and performs well in suppressing the influence of reverberation.

KW - Cepstral mean normalization (CMN)

KW - Covariance matrix

KW - Energy dissipation

KW - Experiments

KW - Feature extraction

KW - Identification (control systems)

KW - Identification of MFCC feature with reverberation compensation model

KW - Integration

KW - REMOS (reverberation models)

KW - Reverberation

KW - RIR (Room Impulse Response)

KW - Schematic diagrams

KW - Schroeder inverse integration

KW - Speaker recognition

KW - Stability

KW - Testing

UR - http://www.scopus.com/inward/record.url?scp=84940998412&partnerID=8YFLogxK

M3 - 文章

AN - SCOPUS:84940998412

SN - 1000-2758

VL - 33

SP - 420

EP - 425

JO - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University

JF - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University

IS - 3

ER -

A reverberation compensation method for speaker recognition in rooms

Abstract

Keywords

Other files and links

Fingerprint

Cite this