Robust Correlation Tracking in Unmanned Aerial Vehicle Videos via Deep Target-Specific Rectification Networks

Ke Zhang, Wuwei Wang, Jingyu Wang

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Unmanned aerial vehicle (UAV) videos have received active research attention in the remote sensing field by taking full advantage of the bird's eye view offered by UAVs. At the same time, visual tracking approaches based on discriminative correlation filters (DCFs) have recently achieved increasing popularity and success. Despite their success, the tracking robustness and accuracy of existing DCF-based trackers are hard to promote in challenging tracking scenarios of aerial videos due to excessive reliance on the response map and fixed linear model update strategy. To resolve these issues, we propose a robust DCF-based tracking framework via an effective pretrained rectification network for UAV-based remote sensing. Specifically, the target-specific rectification network is offline trained to discriminately classify the target and background. During the online tracking stage, the DCF module performs fast inference to obtain the potential locations of the target. After that, the deep rectification network evaluates the correlation-specific proposals offered by the DCF module and provides precise tracking results. Besides, to achieve a robust and adaptive model update strategy, we propose to finetune both the DCF module and rectification network according to the classification confidence of the estimated result. Extensive experimental results on recent UAV benchmarks demonstrate that our method achieves better performance than other competing algorithms.

Original languageEnglish
Article number6510605
JournalIEEE Geoscience and Remote Sensing Letters
Volume19
DOIs
StatePublished - 2022

Keywords

  • Correlation filter
  • object tracking
  • rectification network
  • unmanned aerial vehicle (UAV)-based remote sensing

Fingerprint

Dive into the research topics of 'Robust Correlation Tracking in Unmanned Aerial Vehicle Videos via Deep Target-Specific Rectification Networks'. Together they form a unique fingerprint.

Cite this