Modelling and segmenting subunits for sign language recognition based on hand motion analysis

Junwei Han; George Awad; Alistair Sutherland

doi:10.1016/j.patrec.2008.12.010

Dublin City University

Research output: Contribution to journal › Article › peer-review

83 Citations (Scopus)

Abstract

Modelling and segmenting subunits is an important topic in sign language research. Many scholars have proposed functional definitions of subunits from a linguistic point of view, but implementing these definitions efficiently with computer vision techniques remains a challenge. Conversely, a number of subunit segmentation methods have been investigated for vision-based sign language recognition, but their subunits either somewhat lack linguistic support or are improper. In this paper, we attempt to define and segment subunits using computer vision techniques in a way that can also be basically explained by sign language linguistics. A subunit is first defined as one continuous visual hand action in time and space, comprising a series of interrelated consecutive frames. Then, a simple but efficient solution is developed to detect subunit boundaries using hand motion discontinuity. Finally, temporal clustering by dynamic time warping is adopted to merge similar segments and refine the results. The presented work does not need prior knowledge of the types of signs or the number of subunits and is more robust to signer behaviour variation. Furthermore, it correlates highly with the definition of syllables in sign language while sharing characteristics of syllables in spoken languages. A set of comprehensive experiments on real-world signing videos demonstrates the effectiveness of the proposed model.
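The abstract names two computational steps: boundary detection from hand motion discontinuity, and comparison of segments by dynamic time warping. The following is a minimal Python sketch of both ideas, not the authors' implementation. The abstract does not state the exact discontinuity criterion, so the rule used here (cut at a local minimum of frame-to-frame hand speed below a threshold) is an assumption, as are the function names `split_at_speed_minima` and `dtw_distance`, the `speed_threshold` parameter, and the representation of a hand track as a list of 2-D points.

```python
import math


def dtw_distance(a, b):
    """Standard dynamic time warping distance between two point sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean distance (Python 3.8+)
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b repeats
                                 cost[i][j - 1],      # b advances, a repeats
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


def split_at_speed_minima(track, speed_threshold):
    """Hypothetical boundary rule (an assumption, not taken from the paper):
    cut the track where the frame-to-frame hand speed reaches a local
    minimum that lies below speed_threshold."""
    # speeds[k] is the displacement between frame k and frame k + 1.
    speeds = [math.dist(track[k], track[k + 1]) for k in range(len(track) - 1)]
    cuts = [0]
    for k in range(1, len(speeds) - 1):
        is_slow = speeds[k] < speed_threshold
        # Strict on the left so a flat low-speed plateau yields a single cut.
        is_local_min = speeds[k] < speeds[k - 1] and speeds[k] <= speeds[k + 1]
        if is_slow and is_local_min:
            cuts.append(k + 1)
    cuts.append(len(track))
    # Keep only segments with at least two frames.
    return [track[s:e] for s, e in zip(cuts, cuts[1:]) if e - s > 1]


if __name__ == "__main__":
    # Toy hand track: steady motion, a brief near-pause, then steady motion.
    track = [(0, 0), (1, 0), (2, 0), (2.1, 0), (3, 0), (4, 0)]
    segments = split_at_speed_minima(track, speed_threshold=0.5)
    print(len(segments), "segments")
    print("DTW between them:", dtw_distance(segments[0], segments[1]))
```

In a pipeline of the kind the abstract describes, segments whose pairwise DTW distance is small would be grouped into the same subunit cluster. The clustering rule and any merge threshold are not given in the abstract and are left out of this sketch.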

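Original language	English
Pages (from-to)	623-633
Number of pages	11
Journal	Pattern Recognition Letters
Volume	30
Issue number	6
DOI	https://doi.org/10.1016/j.patrec.2008.12.010
Publication status	Published - 15 Apr 2009
Externally published	Yes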

Cite this

@article{5bd26ce3bc7a4ebf8075262d8d982cad,

title = "Modelling and segmenting subunits for sign language recognition based on hand motion analysis",

abstract = "Modelling and segmenting subunits is one of the important topics in sign language study. Many scholars have proposed the functional definition to subunits from the view of linguistics while the problem of efficiently implementing it using computer vision techniques is a challenge. On the other hand, a number of subunit segmentation work has been investigated for the task of vision-based sign language recognition whereas their subunits either somewhat lack the linguistic support or are improper. In this paper, we attempt to define and segment subunits using computer vision techniques, which also can be basically explained by sign language linguistics. A subunit is firstly defined as one continuous visual hand action in time and space, which comprises a series of interrelated consecutive frames. Then, a simple but efficient solution is developed to detect the subunit boundary using hand motion discontinuity. Finally, temporal clustering by dynamic time warping is adopted to merge similar segments and refine the results. The presented work does not need prior knowledge of the types of signs or number of subunits and is more robust to signer behaviour variation. Furthermore, it correlates highly with the definition of syllables in sign language while sharing characteristics of syllables in spoken languages. A set of comprehensive experiments on real-world signing videos demonstrates the effectiveness of the proposed model.",

keywords = "Dynamic time warping, Hand motion, Phoneme, Sign language recognition, Subunit",

author = "Junwei Han and George Awad and Alistair Sutherland",

year = "2009",

month = apr,

day = "15",

doi = "10.1016/j.patrec.2008.12.010",

language = "English",

volume = "30",

pages = "623--633",

journal = "Pattern Recognition Letters",

issn = "0167-8655",

publisher = "Elsevier B.V.",

number = "6",

}

TY - JOUR

T1 - Modelling and segmenting subunits for sign language recognition based on hand motion analysis

AU - Han, Junwei

AU - Awad, George

AU - Sutherland, Alistair

PY - 2009/4/15

Y1 - 2009/4/15

N2 - Modelling and segmenting subunits is one of the important topics in sign language study. Many scholars have proposed the functional definition to subunits from the view of linguistics while the problem of efficiently implementing it using computer vision techniques is a challenge. On the other hand, a number of subunit segmentation work has been investigated for the task of vision-based sign language recognition whereas their subunits either somewhat lack the linguistic support or are improper. In this paper, we attempt to define and segment subunits using computer vision techniques, which also can be basically explained by sign language linguistics. A subunit is firstly defined as one continuous visual hand action in time and space, which comprises a series of interrelated consecutive frames. Then, a simple but efficient solution is developed to detect the subunit boundary using hand motion discontinuity. Finally, temporal clustering by dynamic time warping is adopted to merge similar segments and refine the results. The presented work does not need prior knowledge of the types of signs or number of subunits and is more robust to signer behaviour variation. Furthermore, it correlates highly with the definition of syllables in sign language while sharing characteristics of syllables in spoken languages. A set of comprehensive experiments on real-world signing videos demonstrates the effectiveness of the proposed model.

AB - Modelling and segmenting subunits is one of the important topics in sign language study. Many scholars have proposed the functional definition to subunits from the view of linguistics while the problem of efficiently implementing it using computer vision techniques is a challenge. On the other hand, a number of subunit segmentation work has been investigated for the task of vision-based sign language recognition whereas their subunits either somewhat lack the linguistic support or are improper. In this paper, we attempt to define and segment subunits using computer vision techniques, which also can be basically explained by sign language linguistics. A subunit is firstly defined as one continuous visual hand action in time and space, which comprises a series of interrelated consecutive frames. Then, a simple but efficient solution is developed to detect the subunit boundary using hand motion discontinuity. Finally, temporal clustering by dynamic time warping is adopted to merge similar segments and refine the results. The presented work does not need prior knowledge of the types of signs or number of subunits and is more robust to signer behaviour variation. Furthermore, it correlates highly with the definition of syllables in sign language while sharing characteristics of syllables in spoken languages. A set of comprehensive experiments on real-world signing videos demonstrates the effectiveness of the proposed model.

KW - Dynamic time warping

KW - Hand motion

KW - Phoneme

KW - Sign language recognition

KW - Subunit

UR - http://www.scopus.com/inward/record.url?scp=61849122379&partnerID=8YFLogxK

U2 - 10.1016/j.patrec.2008.12.010

DO - 10.1016/j.patrec.2008.12.010

M3 - Article

AN - SCOPUS:61849122379

SN - 0167-8655

VL - 30

SP - 623

EP - 633

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

IS - 6

ER -
