TY - GEN
T1 - Deformable object tracking with spatiotemporal segmentation in big vision surveillance
AU - Zhuo, Tao
AU - Zhang, Peng
AU - Zhang, Yanning
AU - Huang, Wei
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/12/11
Y1 - 2014/12/11
N2 - The rapid development of worldwide networks has shifted many challenging problems in vision-based surveillance from the video level to the big-video level. An important technique for big video processing is to extract salient information from the video data effectively. As a fundamental function for data analysis such as behavior understanding for social security, object tracking usually plays an essential role by separating the salient areas from the background scenarios in video. However, object tracking in realistic environments is not easy because the appearance of a realistic object may deform continually during its movement. In conventional online tracking-by-learning studies, fixed-shape appearance modeling is usually adopted for training sample generation because of its simplicity and convenience. Unfortunately, for generic deformable objects, this modeling approach may wrongly discriminate some background areas as part of the object, which deteriorates the model update during online learning. To resolve this problem, employing object segmentation to obtain more precise foreground areas for learning sample generation has been proposed recently, but a common limitation of these approaches is that the segmentation is performed only in the spatial domain rather than the spatiotemporal domain of the video. Consequently, when the background texture is similar to the target object, accurate segmentation is hard to achieve and tracking failure occurs. In this paper, a motion-appearance model for deformable object segmentation is proposed by incorporating pixel-based gradient flow in the spatiotemporal domain. With motion information between consecutive frames, the irregularly shaped object can be accurately segmented by energy function optimization and boundary convergence, and the proposed segmentation is then incorporated into a structural SVM tracking framework for online learning sample generation. We have evaluated the proposed tracker on benchmark videos as well as surveillance video datasets including heavy intrinsic variations and occlusions; the experimental results verify a significant improvement in tracking accuracy and robustness in comparison with other state-of-the-art trackers.
AB - The rapid development of worldwide networks has shifted many challenging problems in vision-based surveillance from the video level to the big-video level. An important technique for big video processing is to extract salient information from the video data effectively. As a fundamental function for data analysis such as behavior understanding for social security, object tracking usually plays an essential role by separating the salient areas from the background scenarios in video. However, object tracking in realistic environments is not easy because the appearance of a realistic object may deform continually during its movement. In conventional online tracking-by-learning studies, fixed-shape appearance modeling is usually adopted for training sample generation because of its simplicity and convenience. Unfortunately, for generic deformable objects, this modeling approach may wrongly discriminate some background areas as part of the object, which deteriorates the model update during online learning. To resolve this problem, employing object segmentation to obtain more precise foreground areas for learning sample generation has been proposed recently, but a common limitation of these approaches is that the segmentation is performed only in the spatial domain rather than the spatiotemporal domain of the video. Consequently, when the background texture is similar to the target object, accurate segmentation is hard to achieve and tracking failure occurs. In this paper, a motion-appearance model for deformable object segmentation is proposed by incorporating pixel-based gradient flow in the spatiotemporal domain. With motion information between consecutive frames, the irregularly shaped object can be accurately segmented by energy function optimization and boundary convergence, and the proposed segmentation is then incorporated into a structural SVM tracking framework for online learning sample generation. We have evaluated the proposed tracker on benchmark videos as well as surveillance video datasets including heavy intrinsic variations and occlusions; the experimental results verify a significant improvement in tracking accuracy and robustness in comparison with other state-of-the-art trackers.
UR - http://www.scopus.com/inward/record.url?scp=84920725926&partnerID=8YFLogxK
U2 - 10.1109/SPAC.2014.6982647
DO - 10.1109/SPAC.2014.6982647
M3 - Conference contribution
AN - SCOPUS:84920725926
T3 - Proceedings 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2014
SP - 1
EP - 6
BT - Proceedings 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 IEEE International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2014
Y2 - 18 October 2014 through 19 October 2014
ER -