Abstract
Spoken term detection (STD) for low resource languages has drawn much interest. A partial matching strategy based on phoneme boundaries is presented here to solve the fuzzy matching problem in query-by-example spoken term detection with dynamic time warping. A variety of features were used to validate the strategy on the QUESST 2014 dataset. Tests show that this strategy is not only quite effective for fuzzy match tasks T2 and T3 but also effective for the exact match task T1. This strategy has significantly improved performance in fusion tests.
Original language | English |
---|---|
Pages (from-to) | 18-23 |
Number of pages | 6 |
Journal | Qinghua Daxue Xuebao/Journal of Tsinghua University |
Volume | 57 |
Issue number | 1 |
DOIs | |
State | Published - 1 Jan 2017 |
Keywords
- Dynamic time warping
- Low resource languages
- Partial matching
- Spoken term detection