Phoneme lattice based texttiling towards multilingual story segmentation

Xiaoxuan Wang; Lei Xie; Bin Ma; Eng Siong Chng; Haizhou Li

Phoneme lattice based texttiling towards multilingual story segmentation

Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Scopus citations

Abstract

This paper proposes a phoneme lattice based TextTiling approach towards multilingual story segmentation. The phoneme is the smallest segmental unit in a language and the number of phonemes in a language is usually far smaller than the number of words. Furthermore, many phonemes are shared by different languages. These properties make phonemes particularly appropriate for representing multilingual speech. As phoneme recognition is far from perfect, phoneme lattices, which carry much richer statistics than the 1-best hypotheses, are adopted in this paper as the input to the TextTiling approach. The term frequencies used in traditional TextTiling are replaced by the expected counts of phoneme n-gram units calculated from phoneme lattices. Experiments on TDT2 English and Mandarin corpora show that the phoneme lattice based TextTiling outperforms the phoneme 1-best based TextTiling and word based TextTiling in broadcast news story segmentation.

Original language	English
Title of host publication	Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
Publisher	International Speech Communication Association
Pages	1305-1308
Number of pages	4
State	Published - 2010

Publication series

Name	Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

Keywords

Phoneme lattice
Speech processing
Spoken document retrieval
Story segmentation
Topic detection and tracking

Cite this

Wang, X., Xie, L., Ma, B., Chng, E. S., & Li, H. (2010). Phoneme lattice based texttiling towards multilingual story segmentation. In Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 (pp. 1305-1308). (Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010). International Speech Communication Association.

Wang, Xiaoxuan ; Xie, Lei ; Ma, Bin et al. / Phoneme lattice based texttiling towards multilingual story segmentation. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. International Speech Communication Association, 2010. pp. 1305-1308 (Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010).

@inproceedings{14a057891c7f432a962371a2212eb422,

title = "Phoneme lattice based texttiling towards multilingual story segmentation",

abstract = "This paper proposes a phoneme lattice based TextTiling approach towards multilingual story segmentation. The phoneme is the smallest segmental unit in a language and the number of phonemes in a language is usually far smaller than the number of words. Furthermore, many phonemes are shared by different languages. These properties make phonemes particularly appropriate for representing multilingual speech. As phoneme recognition is far from perfect, phoneme lattices, which carry much richer statistics than the 1-best hypotheses, are adopted in this paper as the input to the TextTiling approach. The term frequencies used in traditional TextTiling are replaced by the expected counts of phoneme n-gram units calculated from phoneme lattices. Experiments on TDT2 English and Mandarin corpora show that the phoneme lattice based TextTiling outperforms the phoneme 1-best based TextTiling and word based TextTiling in broadcast news story segmentation.",

keywords = "Phoneme lattice, Speech processing, Spoken document retrieval, Story segmentation, Topic detection and tracking",

author = "Xiaoxuan Wang and Lei Xie and Bin Ma and Chng, {Eng Siong} and Haizhou Li",

year = "2010",

language = "英语",

series = "Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010",

publisher = "International Speech Communication Association",

pages = "1305--1308",

booktitle = "Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010",

}

Wang, X, Xie, L, Ma, B, Chng, ES & Li, H 2010, Phoneme lattice based texttiling towards multilingual story segmentation. in Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, International Speech Communication Association, pp. 1305-1308.

Phoneme lattice based texttiling towards multilingual story segmentation. / Wang, Xiaoxuan; Xie, Lei; Ma, Bin et al.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. International Speech Communication Association, 2010. p. 1305-1308 (Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Phoneme lattice based texttiling towards multilingual story segmentation

AU - Wang, Xiaoxuan

AU - Xie, Lei

AU - Ma, Bin

AU - Chng, Eng Siong

AU - Li, Haizhou

PY - 2010

Y1 - 2010

N2 - This paper proposes a phoneme lattice based TextTiling approach towards multilingual story segmentation. The phoneme is the smallest segmental unit in a language and the number of phonemes in a language is usually far smaller than the number of words. Furthermore, many phonemes are shared by different languages. These properties make phonemes particularly appropriate for representing multilingual speech. As phoneme recognition is far from perfect, phoneme lattices, which carry much richer statistics than the 1-best hypotheses, are adopted in this paper as the input to the TextTiling approach. The term frequencies used in traditional TextTiling are replaced by the expected counts of phoneme n-gram units calculated from phoneme lattices. Experiments on TDT2 English and Mandarin corpora show that the phoneme lattice based TextTiling outperforms the phoneme 1-best based TextTiling and word based TextTiling in broadcast news story segmentation.

AB - This paper proposes a phoneme lattice based TextTiling approach towards multilingual story segmentation. The phoneme is the smallest segmental unit in a language and the number of phonemes in a language is usually far smaller than the number of words. Furthermore, many phonemes are shared by different languages. These properties make phonemes particularly appropriate for representing multilingual speech. As phoneme recognition is far from perfect, phoneme lattices, which carry much richer statistics than the 1-best hypotheses, are adopted in this paper as the input to the TextTiling approach. The term frequencies used in traditional TextTiling are replaced by the expected counts of phoneme n-gram units calculated from phoneme lattices. Experiments on TDT2 English and Mandarin corpora show that the phoneme lattice based TextTiling outperforms the phoneme 1-best based TextTiling and word based TextTiling in broadcast news story segmentation.

KW - Phoneme lattice

KW - Speech processing

KW - Spoken document retrieval

KW - Story segmentation

KW - Topic detection and tracking

UR - http://www.scopus.com/inward/record.url?scp=79959852680&partnerID=8YFLogxK

M3 - 会议稿件

AN - SCOPUS:79959852680

T3 - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

SP - 1305

EP - 1308

BT - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

PB - International Speech Communication Association

ER -

Wang X, Xie L, Ma B, Chng ES, Li H. Phoneme lattice based texttiling towards multilingual story segmentation. In Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. International Speech Communication Association. 2010. p. 1305-1308. (Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010).

Phoneme lattice based texttiling towards multilingual story segmentation

Abstract

Publication series

Keywords

Other files and links

Fingerprint

Cite this