Deep object tracking with multi-modal data

Xuezhi Zhang, Yuan Yuan, Xiaoqiang Lu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Object tracking is a challenging topic in the field of computer vision since its performance is easily disturbed by occlusion, illumination change, background clutter, scale variation, etc. In this paper, we introduce a robust tracking algorithm that fuses information from both visible images and infrared (IR) images. The proposed tracking algorithm not only incorporates convolutional feature maps from the visible channel, but also employs a scale pyramid representation from IR channel. We estimate the target location by fusing multilayer convolutional feature maps, and predict the target scale from a scale pyramid. The pipeline of the proposed method is as follows. First, the hierarchical convolutional feature maps are obtained from visible images using VGG-Nets. Then, the accurate target location is predicted by the maximum response of correlation filters with the visible image feature maps. Finally, we obtain the precise object scale with a scale pyramid from infrared images where the difference between the target and the background is clear. In order to verify the performance of the proposed method, we capture six video sequences under different conditions. These sequences contain both visible channel and IR channel. Ten state-of-the-art tracking algorithms are compared with our method, and the experimental results show the effectiveness of the proposed tracker.

Original languageEnglish
Title of host publicationIEEE CITS 2016 - 2016 International Conference on Computer, Information and Telecommunication Systems
EditorsFei Gao, Zan Li, Daniel Cascado Caballero, Jing Fan, Mohammad S. Obaidat, Petros Nicoploitidis, Kuei Fang Hsiao
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509034406
DOIs
StatePublished - 16 Aug 2016
Externally publishedYes
Event2016 International Conference on Computer, Information and Telecommunication Systems, CITS 2016 - Kunming, China
Duration: 6 Jul 20168 Jul 2016

Publication series

NameIEEE CITS 2016 - 2016 International Conference on Computer, Information and Telecommunication Systems

Conference

Conference2016 International Conference on Computer, Information and Telecommunication Systems, CITS 2016
Country/TerritoryChina
CityKunming
Period6/07/168/07/16

Fingerprint

Dive into the research topics of 'Deep object tracking with multi-modal data'. Together they form a unique fingerprint.

Cite this