Learning neural network representations using cross-lingual bottleneck features withword-pair information

Yougen Yuan, Cheung Chi Leung, Lei Xie, Bin Ma, Haizhou Li

Research output: Contribution to journalConference articlepeer-review

17 Scopus citations

Abstract

We assume that only word pairs identified by human are available in a low-resource target language. The word pairs are parameterized by a bottleneck feature (BNF) extractor that is trained using transcribed data in a high-resource language. The cross-lingual BNFs of the word pairs are used for training another neural network to generate a new feature representation in the target language. Pairwise learning of frame-level and word-level feature representations are investigated. Our proposed feature representations were evaluated in a word discrimination task on the Switchboard telephone speech corpus. Our learned features could bring 27.5% relative improvement over the previously best reported result on the task.

Original languageEnglish
Pages (from-to)788-792
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume08-12-September-2016
DOIs
StatePublished - 2016
Event17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016 - San Francisco, United States
Duration: 8 Sep 201616 Sep 2016

Keywords

  • Bottleneck features (BNFs)
  • Feature representations
  • Low-resource speech processing
  • Pairwise learning
  • Siamese network

Fingerprint

Dive into the research topics of 'Learning neural network representations using cross-lingual bottleneck features withword-pair information'. Together they form a unique fingerprint.

Cite this