Two-stage learning to robust visual track via CNNs

Dan Hu; Xingshe Zhou; Xiaohao Yu; Zhiqiang Hou

doi:10.1007/978-3-319-21969-1_44

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citation

Abstract

Convolutional Neural Networks (CNNs) are a type of deep neural network that can model local correlations and reduce translation variations. They have demonstrated strong performance in several computer vision areas, but not in visual tracking, owing to the lack of training data. In this paper, we explore a two-stage learning approach: a CNN is first pretrained offline on a large auxiliary dataset as a generic feature extractor, and its rich feature hierarchies are then transferred to the robust visual tracking task. Instead of the traditional neuron models used in CNNs, we use rectified linear units (ReLU) to accelerate training. Empirical comparisons show that our CNN-based tracker outperforms several state-of-the-art methods on an open tracking benchmark.
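
The abstract does not say which "traditional neuron models" ReLU replaces or how the speed-up was measured. As a minimal sketch, assuming the comparison is against saturating units such as tanh and the logistic sigmoid, the following compares the derivative of each activation at increasingly large inputs. The function names are illustrative and are not taken from the paper. ReLU keeps a constant gradient for positive inputs while the saturating units' gradients shrink toward zero, which is the usual explanation for faster gradient-based training.

```python
import math


def relu(x: float) -> float:
    """Rectified linear unit: passes positive inputs, zeroes the rest."""
    return max(0.0, x)


def relu_grad(x: float) -> float:
    """Derivative of ReLU: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0


def tanh_grad(x: float) -> float:
    """Derivative of tanh: 1 - tanh(x)^2, which vanishes for large |x|."""
    return 1.0 - math.tanh(x) ** 2


def sigmoid(x: float) -> float:
    """Logistic sigmoid."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_grad(x: float) -> float:
    """Derivative of the sigmoid: s(x) * (1 - s(x)), at most 0.25."""
    s = sigmoid(x)
    return s * (1.0 - s)


def gradient_table(inputs):
    """Return (x, relu', tanh', sigmoid') for each input value."""
    return [(x, relu_grad(x), tanh_grad(x), sigmoid_grad(x)) for x in inputs]


if __name__ == "__main__":
    for x, r, t, s in gradient_table([0.5, 2.0, 5.0, 10.0]):
        print(f"x={x:5.1f}  relu'={r:.1f}  tanh'={t:.2e}  sigmoid'={s:.2e}")
```

Running the script prints one row per input. The ReLU column stays at 1.0 while the tanh and sigmoid columns decay rapidly, so error signals passed back through many saturated units become very small.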

Original language	English
Title of host publication	Image and Graphics - 8th International Conference, ICIG 2015, Proceedings
Editors	Yu-Jin Zhang
Publisher	Springer Verlag
Pages	491-498
Number of pages	8
ISBN (Print)	9783319219684
DOIs	https://doi.org/10.1007/978-3-319-21969-1_44
State	Published - 2015
Event	8th International Conference on Image and Graphics, ICIG 2015 - Tianjin, China. Duration: 13 Aug 2015 → 16 Aug 2015

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	9219
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	8th International Conference on Image and Graphics, ICIG 2015
Country/Territory	China
City	Tianjin
Period	13/08/15 → 16/08/15

Keywords

Convolutional neural network
Deep learning
Visual tracking

Access to Document

10.1007/978-3-319-21969-1_44

Cite this

Hu, D., Zhou, X., Yu, X., & Hou, Z. (2015). Two-stage learning to robust visual track via CNNs. In Y.-J. Zhang (Ed.), Image and Graphics - 8th International Conference, ICIG 2015, Proceedings (pp. 491-498). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9219). Springer Verlag. https://doi.org/10.1007/978-3-319-21969-1_44

@inproceedings{896464666de64a50a4da5133e65fe6c3,

title = "Two-stage learning to robust visual track via CNNs",

abstract = "Convolutional Neural Networks (CNNs) are a type of deep neural network that can model local correlations and reduce translation variations. They have demonstrated strong performance in several computer vision areas, but not in visual tracking, owing to the lack of training data. In this paper, we explore a two-stage learning approach: a CNN is first pretrained offline on a large auxiliary dataset as a generic feature extractor, and its rich feature hierarchies are then transferred to the robust visual tracking task. Instead of the traditional neuron models used in CNNs, we use rectified linear units (ReLU) to accelerate training. Empirical comparisons show that our CNN-based tracker outperforms several state-of-the-art methods on an open tracking benchmark.",

keywords = "Convolutional neural network, Deep learning, Visual tracking",

author = "Dan Hu and Xingshe Zhou and Xiaohao Yu and Zhiqiang Hou",

note = "Publisher Copyright: {\textcopyright} Springer International Publishing Switzerland 2015.; 8th International Conference on Image and Graphics, ICIG 2015 ; Conference date: 13-08-2015 Through 16-08-2015",

year = "2015",

doi = "10.1007/978-3-319-21969-1_44",

language = "English",

isbn = "9783319219684",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "491--498",

editor = "Yu-Jin Zhang",

booktitle = "Image and Graphics - 8th International Conference, ICIG 2015, Proceedings",

}

Hu, D, Zhou, X, Yu, X & Hou, Z 2015, Two-stage learning to robust visual track via CNNs. in Y-J Zhang (ed.), Image and Graphics - 8th International Conference, ICIG 2015, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9219, Springer Verlag, pp. 491-498, 8th International Conference on Image and Graphics, ICIG 2015, Tianjin, China, 13/08/15. https://doi.org/10.1007/978-3-319-21969-1_44

Two-stage learning to robust visual track via CNNs. / Hu, Dan; Zhou, Xingshe; Yu, Xiaohao et al.
Image and Graphics - 8th International Conference, ICIG 2015, Proceedings. ed. / Yu-Jin Zhang. Springer Verlag, 2015. p. 491-498 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9219).

TY - GEN

T1 - Two-stage learning to robust visual track via CNNs

AU - Hu, Dan

AU - Zhou, Xingshe

AU - Yu, Xiaohao

AU - Hou, Zhiqiang

N1 - Publisher Copyright: © Springer International Publishing Switzerland 2015.

PY - 2015

Y1 - 2015

N2 - Convolutional Neural Networks (CNNs) are a type of deep neural network that can model local correlations and reduce translation variations. They have demonstrated strong performance in several computer vision areas, but not in visual tracking, owing to the lack of training data. In this paper, we explore a two-stage learning approach: a CNN is first pretrained offline on a large auxiliary dataset as a generic feature extractor, and its rich feature hierarchies are then transferred to the robust visual tracking task. Instead of the traditional neuron models used in CNNs, we use rectified linear units (ReLU) to accelerate training. Empirical comparisons show that our CNN-based tracker outperforms several state-of-the-art methods on an open tracking benchmark.

AB - Convolutional Neural Networks (CNNs) are a type of deep neural network that can model local correlations and reduce translation variations. They have demonstrated strong performance in several computer vision areas, but not in visual tracking, owing to the lack of training data. In this paper, we explore a two-stage learning approach: a CNN is first pretrained offline on a large auxiliary dataset as a generic feature extractor, and its rich feature hierarchies are then transferred to the robust visual tracking task. Instead of the traditional neuron models used in CNNs, we use rectified linear units (ReLU) to accelerate training. Empirical comparisons show that our CNN-based tracker outperforms several state-of-the-art methods on an open tracking benchmark.

KW - Convolutional neural network

KW - Deep learning

KW - Visual tracking

UR - http://www.scopus.com/inward/record.url?scp=84943613755&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-21969-1_44

DO - 10.1007/978-3-319-21969-1_44

M3 - Conference contribution

AN - SCOPUS:84943613755

SN - 9783319219684

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 491

EP - 498

BT - Image and Graphics - 8th International Conference, ICIG 2015, Proceedings

A2 - Zhang, Yu-Jin

PB - Springer Verlag

T2 - 8th International Conference on Image and Graphics, ICIG 2015

Y2 - 13 August 2015 through 16 August 2015

ER -

Hu D, Zhou X, Yu X, Hou Z. Two-stage learning to robust visual track via CNNs. In Zhang YJ, editor, Image and Graphics - 8th International Conference, ICIG 2015, Proceedings. Springer Verlag. 2015. p. 491-498. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-21969-1_44
