跳到主要导航 跳到搜索 跳到主要内容

STMT: Spatio-temporal memory transformer for multi-object tracking

  • Songbo Gu
  • , Jianxin Ma
  • , Guancheng Hui
  • , Qiyang Xiao
  • , Wentao Shi
  • Henan University

科研成果: 期刊稿件文章同行评审

13 引用 (Scopus)

摘要

Typically, modern online Multi-Object Tracking (MOT) methods first obtain the detected objects in each frame and then establish associations between them in successive frames. However, it is difficult to obtain high-quality trajectories when camera motion, fast motion, and occlusion challenges occur. To address these problems, this paper proposes a transformer-based MOT system named Spatio-Temporal Memory Transformer (STMT), which focuses on time and history information. The proposed STMT consists of a Spatio-Temporal Enhancement Module (STEM) that uses 3D convolution to model the spatial and temporal interactions of objects and obtains rich features in spatio-temporal information. Moreover, a Dynamic Spatio-Temporal Memory (DSTM) is presented to associate detections with tracklets and contains three units: an Identity Aggregation Module (IAM), a Linear Dynamic Encoder (LD-Encoder) and a memory Decoder (Decoder). The IAM utilizes the geometric changes of objects to reduce the impact of deformation on tracking performance, the LD-Encoder is used to obtain the dependency between objects, and the Decoder generates appearance similarity scores. Furthermore, a Score Fusion Equilibrium Strategy (SFES) is employed to balance the similarity and position distance fusion scores. Extensive experiments demonstrate that the proposed STMT approach is generally superior to the state-of-the-art trackers on the MOT16 and MOT17 benchmarks.

源语言英语
页(从-至)23426-23441
页数16
期刊Applied Intelligence
53
20
DOI
出版状态已出版 - 10月 2023

指纹

探究 'STMT: Spatio-temporal memory transformer for multi-object tracking' 的科研主题。它们共同构成独一无二的指纹。

引用此