Robust visual track using an ensemble cascade of convolutional neural networks

Dan Hu, Xingshe Zhou, Xiaohao Yu, Zhiqiang Hou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Convolutional Neural Networks (CNN) have dramatically boosted the performance of various computer vision tasks except visual tracking due to the lack of training data. In this paper, we pre-train a deep CNN offline to classify the 1 million images from 256 classes with very leaky non-saturating neurons for training acceleration, which is transformed to a discriminative classifier by adding an additional classification layer. In addition, we propose a novel approach for combining increasingly our CNN classifiers in a "cascade" structure through a modification of the AdaBoost framework, and then transfer the selected discriminative features from the ensemble of CNN classifiers to the robust visual tracking task, by updating online to robustly discard the background regions from promising object-like region to cope with appearance changes of the target. Extensive experimental evaluations on an open tracker benchmark demonstrate outstanding performance of our tracker by improving tracking success rate and tracking precision on an average of 9.2% and 13.9% at least over other state-of-the-art trackers.

Original languageEnglish
Title of host publicationSeventh International Conference on Graphic and Image Processing, ICGIP 2015
EditorsXudong Jiang, Xudong Jiang, Yulin Wang, Xudong Jiang, Yulin Wang, Xudong Jiang, Yulin Wang, Yulin Wang
PublisherSPIE
ISBN (Electronic)9781510600584, 9781510600584, 9781510600584, 9781510600584
DOIs
StatePublished - 2015
Event7th International Conference on Graphic and Image Processing, ICGIP 2015 - Singapore, Singapore
Duration: 23 Oct 201525 Oct 2015

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume9817
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference7th International Conference on Graphic and Image Processing, ICGIP 2015
Country/TerritorySingapore
CitySingapore
Period23/10/1525/10/15

Keywords

  • AdaBoost
  • Convolutional neural network
  • Deep learning
  • Visual tracking

Fingerprint

Dive into the research topics of 'Robust visual track using an ensemble cascade of convolutional neural networks'. Together they form a unique fingerprint.

Cite this