A density peak clustering approach to unsupervised acoustic subword units discovery

Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This paper studies unsupervised acoustic units discovery from unlabelled speech data. This task is usually approached by two steps, i.e., partitioning speech utterances into segments and clustering these segments into subword categories. In previous approaches, the clustering step usually assumes the number of subword units are known beforehand, which is unreasonable for zero-resource languages. Moreover, the previously-used clustering methods are not able to detect non-spherical clusters that are often present in real-world speech data. We address the two problems by a brand new clustering method, called density peak clustering (DPC), which is motivated by the observation that cluster centers are characterized by a higher density than their neighbors and by a relatively large distance from other points of a higher density in the space. Experiments on unsupervised acoustic units discovery demonstrate that our DPC approach can easily discover the number of subword units and it outperforms the recently proposed normalized cuts (NC) clustering approaches [1].

Original languageEnglish
Title of host publication2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages178-183
Number of pages6
ISBN (Electronic)9789881476807
DOIs
StatePublished - 19 Feb 2016
Event2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015 - Hong Kong, Hong Kong
Duration: 16 Dec 201519 Dec 2015

Publication series

Name2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015

Conference

Conference2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2015
Country/TerritoryHong Kong
CityHong Kong
Period16/12/1519/12/15

Fingerprint

Dive into the research topics of 'A density peak clustering approach to unsupervised acoustic subword units discovery'. Together they form a unique fingerprint.

Cite this