Robust visual track using an ensemble cascade of convolutional neural networks

Dan Hu; Xingshe Zhou; Xiaohao Yu; Zhiqiang Hou

doi:10.1117/12.2228001

Robust visual track using an ensemble cascade of convolutional neural networks

Dan Hu, Xingshe Zhou, Xiaohao Yu, Zhiqiang Hou

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Convolutional Neural Networks (CNN) have dramatically boosted the performance of various computer vision tasks except visual tracking due to the lack of training data. In this paper, we pre-train a deep CNN offline to classify the 1 million images from 256 classes with very leaky non-saturating neurons for training acceleration, which is transformed to a discriminative classifier by adding an additional classification layer. In addition, we propose a novel approach for combining increasingly our CNN classifiers in a "cascade" structure through a modification of the AdaBoost framework, and then transfer the selected discriminative features from the ensemble of CNN classifiers to the robust visual tracking task, by updating online to robustly discard the background regions from promising object-like region to cope with appearance changes of the target. Extensive experimental evaluations on an open tracker benchmark demonstrate outstanding performance of our tracker by improving tracking success rate and tracking precision on an average of 9.2% and 13.9% at least over other state-of-the-art trackers.

Original language	English
Title of host publication	Seventh International Conference on Graphic and Image Processing, ICGIP 2015
Editors	Xudong Jiang, Xudong Jiang, Yulin Wang, Xudong Jiang, Yulin Wang, Xudong Jiang, Yulin Wang, Yulin Wang
Publisher	SPIE
ISBN (Electronic)	9781510600584, 9781510600584, 9781510600584, 9781510600584
DOIs	https://doi.org/10.1117/12.2228001
State	Published - 2015
Event	7th International Conference on Graphic and Image Processing, ICGIP 2015 - Singapore, Singapore Duration: 23 Oct 2015 → 25 Oct 2015

Publication series

Name	Proceedings of SPIE - The International Society for Optical Engineering
Volume	9817
ISSN (Print)	0277-786X
ISSN (Electronic)	1996-756X

Conference

Conference	7th International Conference on Graphic and Image Processing, ICGIP 2015
Country/Territory	Singapore
City	Singapore
Period	23/10/15 → 25/10/15

Keywords

AdaBoost
Convolutional neural network
Deep learning
Visual tracking

Access to Document

10.1117/12.2228001

Cite this

Hu, D., Zhou, X., Yu, X., & Hou, Z. (2015). Robust visual track using an ensemble cascade of convolutional neural networks. In X. Jiang, X. Jiang, Y. Wang, X. Jiang, Y. Wang, X. Jiang, Y. Wang, & Y. Wang (Eds.), Seventh International Conference on Graphic and Image Processing, ICGIP 2015 Article 98170W (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 9817). SPIE. https://doi.org/10.1117/12.2228001

Hu, Dan ; Zhou, Xingshe ; Yu, Xiaohao et al. / Robust visual track using an ensemble cascade of convolutional neural networks. Seventh International Conference on Graphic and Image Processing, ICGIP 2015. editor / Xudong Jiang ; Xudong Jiang ; Yulin Wang ; Xudong Jiang ; Yulin Wang ; Xudong Jiang ; Yulin Wang ; Yulin Wang. SPIE, 2015. (Proceedings of SPIE - The International Society for Optical Engineering).

@inproceedings{ce377a948c5a4a969973787cafc8d7a2,

title = "Robust visual track using an ensemble cascade of convolutional neural networks",

abstract = "Convolutional Neural Networks (CNN) have dramatically boosted the performance of various computer vision tasks except visual tracking due to the lack of training data. In this paper, we pre-train a deep CNN offline to classify the 1 million images from 256 classes with very leaky non-saturating neurons for training acceleration, which is transformed to a discriminative classifier by adding an additional classification layer. In addition, we propose a novel approach for combining increasingly our CNN classifiers in a {"}cascade{"} structure through a modification of the AdaBoost framework, and then transfer the selected discriminative features from the ensemble of CNN classifiers to the robust visual tracking task, by updating online to robustly discard the background regions from promising object-like region to cope with appearance changes of the target. Extensive experimental evaluations on an open tracker benchmark demonstrate outstanding performance of our tracker by improving tracking success rate and tracking precision on an average of 9.2% and 13.9% at least over other state-of-the-art trackers.",

keywords = "AdaBoost, Convolutional neural network, Deep learning, Visual tracking",

author = "Dan Hu and Xingshe Zhou and Xiaohao Yu and Zhiqiang Hou",

note = "Publisher Copyright: {\textcopyright} 2015 SPIE.; 7th International Conference on Graphic and Image Processing, ICGIP 2015 ; Conference date: 23-10-2015 Through 25-10-2015",

year = "2015",

doi = "10.1117/12.2228001",

language = "英语",

series = "Proceedings of SPIE - The International Society for Optical Engineering",

publisher = "SPIE",

editor = "Xudong Jiang and Xudong Jiang and Yulin Wang and Xudong Jiang and Yulin Wang and Xudong Jiang and Yulin Wang and Yulin Wang",

booktitle = "Seventh International Conference on Graphic and Image Processing, ICGIP 2015",

}

Hu, D, Zhou, X, Yu, X & Hou, Z 2015, Robust visual track using an ensemble cascade of convolutional neural networks. in X Jiang, X Jiang, Y Wang, X Jiang, Y Wang, X Jiang, Y Wang & Y Wang (eds), Seventh International Conference on Graphic and Image Processing, ICGIP 2015., 98170W, Proceedings of SPIE - The International Society for Optical Engineering, vol. 9817, SPIE, 7th International Conference on Graphic and Image Processing, ICGIP 2015, Singapore, Singapore, 23/10/15. https://doi.org/10.1117/12.2228001

Robust visual track using an ensemble cascade of convolutional neural networks. / Hu, Dan; Zhou, Xingshe; Yu, Xiaohao et al.
Seventh International Conference on Graphic and Image Processing, ICGIP 2015. ed. / Xudong Jiang; Xudong Jiang; Yulin Wang; Xudong Jiang; Yulin Wang; Xudong Jiang; Yulin Wang; Yulin Wang. SPIE, 2015. 98170W (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 9817).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Robust visual track using an ensemble cascade of convolutional neural networks

AU - Hu, Dan

AU - Zhou, Xingshe

AU - Yu, Xiaohao

AU - Hou, Zhiqiang

PY - 2015

Y1 - 2015

N2 - Convolutional Neural Networks (CNN) have dramatically boosted the performance of various computer vision tasks except visual tracking due to the lack of training data. In this paper, we pre-train a deep CNN offline to classify the 1 million images from 256 classes with very leaky non-saturating neurons for training acceleration, which is transformed to a discriminative classifier by adding an additional classification layer. In addition, we propose a novel approach for combining increasingly our CNN classifiers in a "cascade" structure through a modification of the AdaBoost framework, and then transfer the selected discriminative features from the ensemble of CNN classifiers to the robust visual tracking task, by updating online to robustly discard the background regions from promising object-like region to cope with appearance changes of the target. Extensive experimental evaluations on an open tracker benchmark demonstrate outstanding performance of our tracker by improving tracking success rate and tracking precision on an average of 9.2% and 13.9% at least over other state-of-the-art trackers.

AB - Convolutional Neural Networks (CNN) have dramatically boosted the performance of various computer vision tasks except visual tracking due to the lack of training data. In this paper, we pre-train a deep CNN offline to classify the 1 million images from 256 classes with very leaky non-saturating neurons for training acceleration, which is transformed to a discriminative classifier by adding an additional classification layer. In addition, we propose a novel approach for combining increasingly our CNN classifiers in a "cascade" structure through a modification of the AdaBoost framework, and then transfer the selected discriminative features from the ensemble of CNN classifiers to the robust visual tracking task, by updating online to robustly discard the background regions from promising object-like region to cope with appearance changes of the target. Extensive experimental evaluations on an open tracker benchmark demonstrate outstanding performance of our tracker by improving tracking success rate and tracking precision on an average of 9.2% and 13.9% at least over other state-of-the-art trackers.

KW - AdaBoost

KW - Convolutional neural network

KW - Deep learning

KW - Visual tracking

UR - http://www.scopus.com/inward/record.url?scp=85028318514&partnerID=8YFLogxK

U2 - 10.1117/12.2228001

DO - 10.1117/12.2228001

M3 - 会议稿件

AN - SCOPUS:85028318514

T3 - Proceedings of SPIE - The International Society for Optical Engineering

BT - Seventh International Conference on Graphic and Image Processing, ICGIP 2015

A2 - Jiang, Xudong

A2 - Wang, Yulin

A2 - Jiang, Xudong

A2 - Wang, Yulin

A2 - Jiang, Xudong

A2 - Wang, Yulin

PB - SPIE

T2 - 7th International Conference on Graphic and Image Processing, ICGIP 2015

Y2 - 23 October 2015 through 25 October 2015

ER -

Hu D, Zhou X, Yu X, Hou Z. Robust visual track using an ensemble cascade of convolutional neural networks. In Jiang X, Jiang X, Wang Y, Jiang X, Wang Y, Jiang X, Wang Y, Wang Y, editors, Seventh International Conference on Graphic and Image Processing, ICGIP 2015. SPIE. 2015. 98170W. (Proceedings of SPIE - The International Society for Optical Engineering). doi: 10.1117/12.2228001

Robust visual track using an ensemble cascade of convolutional neural networks

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this