TY - JOUR
T1 - Semantic-Aware Real-Time Correlation Tracking Framework for UAV Videos
AU - Xue, Xizhe
AU - Li, Ying
AU - Yin, Xiaoyue
AU - Shang, Changjing
AU - Peng, Taoxin
AU - Shen, Qiang
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2022/4/1
Y1 - 2022/4/1
N2 - Discriminative correlation filter (DCF) has contributed tremendously to address the problem of object tracking benefitting from its high computational efficiency. However, it has suffered from performance degradation in unmanned aerial vehicle (UAV) tracking. This article presents a novel semantic-aware real-time correlation tracking framework (SARCT) for UAV videos to enhance the performance of DCF trackers without incurring excessive computing cost. Specifically, SARCT first constructs an additional detection module to generate ROI proposals and to filter any response regarding the target irrelevant area. Then, a novel semantic segmentation module based on semantic template generation and semantic coefficient prediction is further introduced to capture semantic information, which can provide precise ROI mask, thereby effectively suppressing background interference in the ROI proposals. By sharing features and specific network layers for object detection and semantic segmentation, SARCT reduces parameter redundancy to attain sufficient speed for real-time applications. Systematic experiments are conducted on three typical aerial datasets in order to evaluate the performance of the proposed SARCT. The results demonstrate that SARCT is able to improve the accuracy of conventional DCF-based trackers significantly, outperforming state-of-the-art deep trackers.
AB - Discriminative correlation filter (DCF) has contributed tremendously to address the problem of object tracking benefitting from its high computational efficiency. However, it has suffered from performance degradation in unmanned aerial vehicle (UAV) tracking. This article presents a novel semantic-aware real-time correlation tracking framework (SARCT) for UAV videos to enhance the performance of DCF trackers without incurring excessive computing cost. Specifically, SARCT first constructs an additional detection module to generate ROI proposals and to filter any response regarding the target irrelevant area. Then, a novel semantic segmentation module based on semantic template generation and semantic coefficient prediction is further introduced to capture semantic information, which can provide precise ROI mask, thereby effectively suppressing background interference in the ROI proposals. By sharing features and specific network layers for object detection and semantic segmentation, SARCT reduces parameter redundancy to attain sufficient speed for real-time applications. Systematic experiments are conducted on three typical aerial datasets in order to evaluate the performance of the proposed SARCT. The results demonstrate that SARCT is able to improve the accuracy of conventional DCF-based trackers significantly, outperforming state-of-the-art deep trackers.
KW - Detection proposals
KW - Discriminative correlation filter (DCF)
KW - Semantic information
KW - Unmanned aerial vehicle (UAV) tracking
UR - http://www.scopus.com/inward/record.url?scp=85128245453&partnerID=8YFLogxK
U2 - 10.1109/TCYB.2020.3005453
DO - 10.1109/TCYB.2020.3005453
M3 - 文章
C2 - 32701457
AN - SCOPUS:85128245453
SN - 2168-2267
VL - 52
SP - 2418
EP - 2429
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 4
ER -