Abstract
To overcome the problem that the accuracy of speaker recognition systems in rooms descends rapidly as a result of the mismatch between training and testing environments, a differential feature extraction method based on reverberation compensation has been brought forward. Different from the recognition phase that uses traditional MFCCs, Schroeder inverse integration is applied to obtaining the energy decay curve in rooms, so that reverberation can be compensated for MFCC features of pure sound signals in training phase. Furthermore MFCCs are processed by CMN (Cepstral Mean Normalization) and RASTA to suppress the room channel effect. The experimental results in different real rooms with various reverberation degrees and their analysis have shown preliminarily that the method we presented can enhance the recognition rate and performs well in suppressing the influence of reverberation.
Original language | English |
---|---|
Pages (from-to) | 420-425 |
Number of pages | 6 |
Journal | Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University |
Volume | 33 |
Issue number | 3 |
State | Published - 1 Jun 2015 |
Keywords
- Cepstral mean normalization (CMN)
- Covariance matrix
- Energy dissipation
- Experiments
- Feature extraction
- Identification (control systems)
- Identification of MFCC feature with reverberation compensation model
- Integration
- REMOS (reverberation models)
- Reverberation
- RIR (Room Impulse Response)
- Schematic diagrams
- Schroeder inverse integration
- Speaker recognition
- Stability
- Testing