How does human interest modeling help in computer vision: Tracking-by-saliency in unconstrained social videos

Zhang Peng, Tao Zhuo, Kangli Chen, Huang Wei

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Sample quality plays an important role in tracking-by-learning strategies, but the reliable online samples are hard to be obtained due to challenges of variational environments. By modeling how human visual interest actively guiding the seek of salient regions and movements in video sequences, in this paper, a compositional tracking strategy is proposed based on an integrated saliency map, which is able to accurately guide the process of online samples generation. Meanwhile, a segmentation based refinement method is also proposed for effective model updating. With a high performance kernelized correlation filter, the proposed tracking can efficiently handle the complex intrinsic and extrinsic appearance changes. Experiments on challenging benchmark databases demonstrate that the robust accuracy of the proposed tracking against with the other state-of-the-art trackers.

Original languageEnglish
Title of host publication2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509015528
DOIs
StatePublished - 22 Sep 2016
Event2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016 - Seattle, United States
Duration: 11 Jul 201615 Jul 2016

Publication series

Name2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016

Conference

Conference2016 IEEE International Conference on Multimedia and Expo Workshop, ICMEW 2016
Country/TerritoryUnited States
CitySeattle
Period11/07/1615/07/16

Keywords

  • Detection
  • Human Interest
  • Online Tracking
  • Saliency
  • Segmentation

Fingerprint

Dive into the research topics of 'How does human interest modeling help in computer vision: Tracking-by-saliency in unconstrained social videos'. Together they form a unique fingerprint.

Cite this