Robust tracking based on H-CNN with low-resource sampling and scaling by frame-wise motion localization

Peng Zhang, Tao Zhuo, Hanqiao Huang, Kangli Chen, Bo Zhang, Mohan Kankanhalli

Research output: Contribution to journal › Article › peer-review

Abstract

In the age of big data, learning with deep models has proven highly effective in a variety of vision tasks. Unfortunately, the need for enormous numbers of training samples and the associated computational cost still limit its practicality in low-resource media computing applications such as online object tracking. More recently, CNN-based feature extraction has helped tracking-by-learning strategies make significant progress, although the coarse-resolution outputs from the last layer still substantially limit further improvement of tracking performance. When the hierarchy of convolutional layers is exploited as an image pyramid representation, the earlier convolutional layers of a hierarchical CNN offer some improvement in spatial localization but are less invariant to changes in target appearance, which inevitably leads to an inaccurate sampling region when non-rigid objects have intrinsic motion. To guarantee qualified sampling for tracking-by-learning with a hierarchical CNN, in this paper we incorporate inter-frame motion guidance with intra-frame appearance correlations by formulating different energy optimization processes in the spatial and temporal domains. With an optional function for combining the extracted regions, the proposed algorithm achieves more precise target localization for qualified sampling. Experiments on a challenging non-rigid tracking benchmark dataset demonstrate the superior performance of the proposed tracker compared with other state-of-the-art trackers.

Original language: English
Pages (from-to): 18781-18800
Number of pages: 20
Journal: Multimedia Tools and Applications
Volume: 77
Issue number: 14
Publication status: Published - 1 Jul 2018
