Spoken term detection based on DTW

Jingyong Hou, Lei Xie, Peng Yang, Xiong Xiao, Cheung Chi Leung, Haihua Xu, Lei Wang, Hang Lü, Bin Ma, Engsiong Chng, Haizhou Li

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Spoken term detection (STD) for low resource languages has drawn much interest. A partial matching strategy based on phoneme boundaries is presented here to solve the fuzzy matching problem in query-by-example spoken term detection with dynamic time warping. A variety of features were used to validate the strategy on the QUESST 2014 dataset. Tests show that this strategy is not only quite effective for fuzzy match tasks T2 and T3 but also effective for the exact match task T1. This strategy has significantly improved performance in fusion tests.

Original languageEnglish
Pages (from-to)18-23
Number of pages6
JournalQinghua Daxue Xuebao/Journal of Tsinghua University
Volume57
Issue number1
DOIs
StatePublished - 1 Jan 2017

Keywords

  • Dynamic time warping
  • Low resource languages
  • Partial matching
  • Spoken term detection

Fingerprint

Dive into the research topics of 'Spoken term detection based on DTW'. Together they form a unique fingerprint.

Cite this