TY - JOUR
T1 - Efficient correlation information mixer for visual object tracking
AU - Chen, Hang
AU - Zhang, Weiguo
AU - Yan, Danghui
AU - Huang, Lei
AU - Yu, Chao
N1 - Publisher Copyright: © 2024 Elsevier B.V.
PY - 2024/2/15
Y1 - 2024/2/15
N2 - Recently, the Siamese tracking framework has been widely used in the visual tracking community. Siamese trackers usually use Cross-Correlation to aggregate template and search information, thereby encoding the target information. However, previous Cross-Correlation methods ignore either the object's channel semantic information or its local information. This seriously limits the representation of the embedded correlation features and reduces tracker performance. To solve these problems, we propose an effective correlation information mixer for visual target tracking. We design an information fusion network to efficiently aggregate template features and search features. In this network, we use Depthwise Cross-Correlation and Pointwise Cross-Correlation to extract the channel semantic information and the local information of the object, respectively, and use the correlation information mixer to fully fuse the two correlation maps to achieve an optimal encoding of the target information. Extensive experimental results show that our tracker achieves competitive performance compared with other state-of-the-art trackers on four benchmarks: OTB, VOT, UAV123, and LaSOT.
KW - Correlation information mixer
KW - Depthwise cross-correlation
KW - Pointwise cross-correlation
KW - Visual object tracking
UR - http://www.scopus.com/inward/record.url?scp=85183751124&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2024.111368
DO - 10.1016/j.knosys.2024.111368
M3 - Article
AN - SCOPUS:85183751124
SN - 0950-7051
VL - 285
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 111368
ER -