Online object tracking based on BLSTM-RNN with contextual-sequential labeling

Xiangzeng Zhou; Lei Xie; Peng Zhang; Yanning Zhang

doi:10.1007/s12652-017-0514-4

School of Computer Science

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

The significance of object context for appearance modeling has been verified in a variety of tracking-by-detection approaches. Unfortunately, the restrictive representation of the target’s contextual relationship within the spatial domain has severely limited its utility with high-level classification strategies. By investigating the capability of learning long-term dependencies from sequential data, in this paper we propose a novel appearance model that transforms the target’s contextual dependency into a semantic sequential representation. This representation can be effectively utilized by a recurrent neural network embedded with bidirectional long short-term memory cells (BLSTM-RNN) for online tracking-by-learning. Based on the trained BLSTM-RNN model, a search mechanism based on labeling scores is proposed to improve tracking robustness. With the appearance variation implied by the labeling, the proposed tracking method, using a heuristic strategy for model updating, is shown to outperform most state-of-the-art trackers on challenging benchmark videos.

Original language	English
Pages (from-to)	861-870
Number of pages	10
Journal	Journal of Ambient Intelligence and Humanized Computing
Volume	8
Issue number	6
DOIs	https://doi.org/10.1007/s12652-017-0514-4
State	Published - 1 Nov 2017

Keywords

LSTM
RNN
Sequence labeling
Tracking-by-detection
Visual tracking

Access to Document

10.1007/s12652-017-0514-4

Cite this

@article{5b18e75b88894390b46787e4b707c760,

title = "Online object tracking based on BLSTM-RNN with contextual-sequential labeling",

abstract = "The significance of object context for appearance modeling has been verified in a variety of tracking-by-detection approaches. Unfortunately, the restrictive representation of the target{\textquoteright}s contextual relationship within the spatial domain has severely limited its utility with high-level classification strategies. By investigating the capability of learning long-term dependencies from sequential data, in this paper we propose a novel appearance model that transforms the target{\textquoteright}s contextual dependency into a semantic sequential representation. This representation can be effectively utilized by a recurrent neural network embedded with bidirectional long short-term memory cells (BLSTM-RNN) for online tracking-by-learning. Based on the trained BLSTM-RNN model, a search mechanism based on labeling scores is proposed to improve tracking robustness. With the appearance variation implied by the labeling, the proposed tracking method, using a heuristic strategy for model updating, is shown to outperform most state-of-the-art trackers on challenging benchmark videos.",

keywords = "LSTM, RNN, Sequence labeling, Tracking-by-detection, Visual tracking",

author = "Xiangzeng Zhou and Lei Xie and Peng Zhang and Yanning Zhang",

note = "Publisher Copyright: {\textcopyright} 2017, Springer-Verlag Berlin Heidelberg.",

year = "2017",

month = nov,

day = "1",

doi = "10.1007/s12652-017-0514-4",

language = "English",

volume = "8",

pages = "861--870",

journal = "Journal of Ambient Intelligence and Humanized Computing",

issn = "1868-5137",

publisher = "Springer Verlag",

number = "6",

}

TY - JOUR

T1 - Online object tracking based on BLSTM-RNN with contextual-sequential labeling

AU - Zhou, Xiangzeng

AU - Xie, Lei

AU - Zhang, Peng

AU - Zhang, Yanning

PY - 2017/11/1

Y1 - 2017/11/1

N2 - The significance of object context for appearance modeling has been verified in a variety of tracking-by-detection approaches. Unfortunately, the restrictive representation of the target’s contextual relationship within the spatial domain has severely limited its utility with high-level classification strategies. By investigating the capability of learning long-term dependencies from sequential data, in this paper we propose a novel appearance model that transforms the target’s contextual dependency into a semantic sequential representation. This representation can be effectively utilized by a recurrent neural network embedded with bidirectional long short-term memory cells (BLSTM-RNN) for online tracking-by-learning. Based on the trained BLSTM-RNN model, a search mechanism based on labeling scores is proposed to improve tracking robustness. With the appearance variation implied by the labeling, the proposed tracking method, using a heuristic strategy for model updating, is shown to outperform most state-of-the-art trackers on challenging benchmark videos.

AB - The significance of object context for appearance modeling has been verified in a variety of tracking-by-detection approaches. Unfortunately, the restrictive representation of the target’s contextual relationship within the spatial domain has severely limited its utility with high-level classification strategies. By investigating the capability of learning long-term dependencies from sequential data, in this paper we propose a novel appearance model that transforms the target’s contextual dependency into a semantic sequential representation. This representation can be effectively utilized by a recurrent neural network embedded with bidirectional long short-term memory cells (BLSTM-RNN) for online tracking-by-learning. Based on the trained BLSTM-RNN model, a search mechanism based on labeling scores is proposed to improve tracking robustness. With the appearance variation implied by the labeling, the proposed tracking method, using a heuristic strategy for model updating, is shown to outperform most state-of-the-art trackers on challenging benchmark videos.

KW - LSTM

KW - RNN

KW - Sequence labeling

KW - Tracking-by-detection

KW - Visual tracking

UR - http://www.scopus.com/inward/record.url?scp=85031772411&partnerID=8YFLogxK

U2 - 10.1007/s12652-017-0514-4

DO - 10.1007/s12652-017-0514-4

M3 - Article

AN - SCOPUS:85031772411

SN - 1868-5137

VL - 8

SP - 861

EP - 870

JO - Journal of Ambient Intelligence and Humanized Computing

JF - Journal of Ambient Intelligence and Humanized Computing

IS - 6

ER -
