Skip to main navigation Skip to search Skip to main content

Deep Adaptive Discriminate Siamese Network with Multi-Level Response for Visual Object Tracking

  • Yichen Wang
  • , Zhaoyong Mao
  • , Xin Wang
  • , Jing Ren
  • , Chenlin Meng
  • , Junge Shen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Visual object tracking has been intensively studied for its role in traffic surveillance, human action recognition, and autonomous driving. Siamese network-based methods have demonstrated a satisfactory trade-off between precision and efficiency for visual tracking. Nevertheless, the accuracy of Siamese trackers is limited when it comes to predicting the target's location in scenarios involving background clutter, changes in illumination, variations in scale, deformation, fast motion, among others. We suggest a novel approach in our manuscript, which involves a deep adaptive discriminative Siamese network equipped with an advanced fusion scheme for multiple level responses. To enhance the feature discriminability of the Siamese network, we introduce a novel residual channel attention clipping unit. This unit seamlessly integrates residual connections and channel attention, leading to significant optimization and improved representation in the network. Then, we introduce a multi-response adaptive fusion structure that takes the advantages of the low-level, mediate-level, and high-level features, yielding a comprehensive score map that reveals multiple levels of semantics. Our experiments demonstrate that our tracker performs exceptionally well compared to current leading trackers on widely-used public tracking datasets such as OTB-2015 and GOT10k. The method attains an AUC score of 0.655 on OTB2015, while maintaining a processing speed of 63 FPS.

Original languageEnglish
Title of host publicationProceedings - 2023 3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023
EditorsWeijian Liu, Zhuo Zheng Wang, Peng You
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages197-203
Number of pages7
ISBN (Electronic)9798350302356
DOIs
StatePublished - 2023
Event3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023 - Yangzhou, China
Duration: 26 May 202329 May 2023

Publication series

NameProceedings - 2023 3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023

Conference

Conference3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023
Country/TerritoryChina
CityYangzhou
Period26/05/2329/05/23

Keywords

  • Channel attention
  • Multi-response fusion structure
  • Siamese network
  • Visual tracking

Fingerprint

Dive into the research topics of 'Deep Adaptive Discriminate Siamese Network with Multi-Level Response for Visual Object Tracking'. Together they form a unique fingerprint.

Cite this