Multiple object tracking based on multi-task learning with strip attention

Yaoye Song; Peng Zhang; Wei Huang; Yufei Zha; Tao You; Yanning Zhang

doi:10.1049/ipr2.12327

Multiple object tracking based on multi-task learning with strip attention

Yaoye Song, Peng Zhang, Wei Huang, Yufei Zha, Tao You, Yanning Zhang

School of Computer Science

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Multiple object tracking (MOT) framework based on bifurcate strategy was usually challenged by data association of different model path, which work for object localisation and appearance embedding independently. By incorporating the re-identification (re-ID) as appearance embedding model, more recent studies on task combination of a single network have made a great progress in tracking performance. Unfortunately, the contributive improvement from re-ID model is hard to balance the accuracy and efficiency for the whole framework. For more effective enhancement of the overall tracking performance, a real-time detection needs to be taken into consideration with other auxiliary means for MOT modelling. Therefore, in this study, a one-shot multiple object tracking is proposed based on multi-task learning to obtain satisfactory performance in both speed and robustness. With updated re-training strategy for the backbone model of detection, a D2LA network is proposed to achieve more characteristic fine-grained feature extraction in branching task of pedestrian recognition. Additionally, a strip attention module is also introduced to further strengthen the feature discriminative capability of the tracking framework in occlusion. Experiments on the 2DMOT15, MOT16, MOT17, and MOT20 benchmark data sets have shown a superior performance in comparison to other state-of-the-art tracking approaches.

Original language	English
Pages (from-to)	3661-3673
Number of pages	13
Journal	IET Image Processing
Volume	15
Issue number	14
DOIs	https://doi.org/10.1049/ipr2.12327
State	Published - Dec 2021

Access to Document

10.1049/ipr2.12327

Cite this

@article{ad67cc0e801448598f86779c8463e0c4,

title = "Multiple object tracking based on multi-task learning with strip attention",

abstract = "Multiple object tracking (MOT) framework based on bifurcate strategy was usually challenged by data association of different model path, which work for object localisation and appearance embedding independently. By incorporating the re-identification (re-ID) as appearance embedding model, more recent studies on task combination of a single network have made a great progress in tracking performance. Unfortunately, the contributive improvement from re-ID model is hard to balance the accuracy and efficiency for the whole framework. For more effective enhancement of the overall tracking performance, a real-time detection needs to be taken into consideration with other auxiliary means for MOT modelling. Therefore, in this study, a one-shot multiple object tracking is proposed based on multi-task learning to obtain satisfactory performance in both speed and robustness. With updated re-training strategy for the backbone model of detection, a D2LA network is proposed to achieve more characteristic fine-grained feature extraction in branching task of pedestrian recognition. Additionally, a strip attention module is also introduced to further strengthen the feature discriminative capability of the tracking framework in occlusion. Experiments on the 2DMOT15, MOT16, MOT17, and MOT20 benchmark data sets have shown a superior performance in comparison to other state-of-the-art tracking approaches.",

author = "Yaoye Song and Peng Zhang and Wei Huang and Yufei Zha and Tao You and Yanning Zhang",

note = "Publisher Copyright: {\textcopyright} 2021 The Authors. IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology",

year = "2021",

month = dec,

doi = "10.1049/ipr2.12327",

language = "英语",

volume = "15",

pages = "3661--3673",

journal = "IET Image Processing",

issn = "1751-9659",

publisher = "John Wiley & Sons Inc.",

number = "14",

}

TY - JOUR

T1 - Multiple object tracking based on multi-task learning with strip attention

AU - Song, Yaoye

AU - Zhang, Peng

AU - Huang, Wei

AU - Zha, Yufei

AU - You, Tao

AU - Zhang, Yanning

PY - 2021/12

Y1 - 2021/12

N2 - Multiple object tracking (MOT) framework based on bifurcate strategy was usually challenged by data association of different model path, which work for object localisation and appearance embedding independently. By incorporating the re-identification (re-ID) as appearance embedding model, more recent studies on task combination of a single network have made a great progress in tracking performance. Unfortunately, the contributive improvement from re-ID model is hard to balance the accuracy and efficiency for the whole framework. For more effective enhancement of the overall tracking performance, a real-time detection needs to be taken into consideration with other auxiliary means for MOT modelling. Therefore, in this study, a one-shot multiple object tracking is proposed based on multi-task learning to obtain satisfactory performance in both speed and robustness. With updated re-training strategy for the backbone model of detection, a D2LA network is proposed to achieve more characteristic fine-grained feature extraction in branching task of pedestrian recognition. Additionally, a strip attention module is also introduced to further strengthen the feature discriminative capability of the tracking framework in occlusion. Experiments on the 2DMOT15, MOT16, MOT17, and MOT20 benchmark data sets have shown a superior performance in comparison to other state-of-the-art tracking approaches.

AB - Multiple object tracking (MOT) framework based on bifurcate strategy was usually challenged by data association of different model path, which work for object localisation and appearance embedding independently. By incorporating the re-identification (re-ID) as appearance embedding model, more recent studies on task combination of a single network have made a great progress in tracking performance. Unfortunately, the contributive improvement from re-ID model is hard to balance the accuracy and efficiency for the whole framework. For more effective enhancement of the overall tracking performance, a real-time detection needs to be taken into consideration with other auxiliary means for MOT modelling. Therefore, in this study, a one-shot multiple object tracking is proposed based on multi-task learning to obtain satisfactory performance in both speed and robustness. With updated re-training strategy for the backbone model of detection, a D2LA network is proposed to achieve more characteristic fine-grained feature extraction in branching task of pedestrian recognition. Additionally, a strip attention module is also introduced to further strengthen the feature discriminative capability of the tracking framework in occlusion. Experiments on the 2DMOT15, MOT16, MOT17, and MOT20 benchmark data sets have shown a superior performance in comparison to other state-of-the-art tracking approaches.

UR - http://www.scopus.com/inward/record.url?scp=85118712286&partnerID=8YFLogxK

U2 - 10.1049/ipr2.12327

DO - 10.1049/ipr2.12327

M3 - 文章

AN - SCOPUS:85118712286

SN - 1751-9659

VL - 15

SP - 3661

EP - 3673

JO - IET Image Processing

JF - IET Image Processing

IS - 14

ER -

Multiple object tracking based on multi-task learning with strip attention

Abstract

Access to Document

Other files and links

Fingerprint

Cite this