Colar: Effective and Efficient Online Action Detection by Consulting Exemplars

Le Yang; Junwei Han; Dingwen Zhang

doi:10.1109/CVPR52688.2022.00316

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars

Le Yang, Junwei Han, Dingwen Zhang

自动化学院

Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

46 引用（Scopus）

摘要

Online action detection has attracted increasing research interests in recent years. Current works model historical dependencies and anticipate the future to perceive the action evolution within a video segment and improve the detection accuracy. However, the existing paradigm ignores category-level modeling and does not pay sufficient attention to efficiency. Considering a category, its representative frames exhibit various characteristics. Thus, the category-level modeling can provide complimentary guidance to the temporal dependencies modeling. This paper develops an effective exemplar-consultation mechanism that first measures the similarity between a frame and exemplary frames, and then aggregates exemplary features based on the similarity weights. This is also an efficient mechanism, as both similarity measurement and feature aggregation require limited computations. Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars. Due to the complementarity from the categorylevel modeling, our method employs a lightweight architecture but achieves new high performance on three benchmarks. In addition, using a spatio-temporal network to tackle video frames, our method makes a good trade-off between effectiveness and efficiency. Code is available at https://github.com/VividLe/Online-Action-Detection.

源语言	英语
主期刊名	Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
出版商	IEEE Computer Society
页	3150-3159
页数	10
ISBN（电子版）	9781665469463
DOI	https://doi.org/10.1109/CVPR52688.2022.00316
出版状态	已出版 - 2022
活动	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, 美国期限: 19 6月 2022 → 24 6月 2022

出版系列

姓名	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
卷	2022-June
ISSN（印刷版）	1063-6919

会议

会议	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
国家/地区	美国
市	New Orleans
时期	19/06/22 → 24/06/22

访问文件

10.1109/CVPR52688.2022.00316

其它文件与链接

链接到 Scopus 的出版物

引用此

Yang, L., Han, J., & Zhang, D. (2022). Colar: Effective and Efficient Online Action Detection by Consulting Exemplars. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 (页码 3150-3159). (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2022-June). IEEE Computer Society. https://doi.org/10.1109/CVPR52688.2022.00316

@inproceedings{177f842129ad43efa5635edec274cbf2,

title = "Colar: Effective and Efficient Online Action Detection by Consulting Exemplars",

abstract = "Online action detection has attracted increasing research interests in recent years. Current works model historical dependencies and anticipate the future to perceive the action evolution within a video segment and improve the detection accuracy. However, the existing paradigm ignores category-level modeling and does not pay sufficient attention to efficiency. Considering a category, its representative frames exhibit various characteristics. Thus, the category-level modeling can provide complimentary guidance to the temporal dependencies modeling. This paper develops an effective exemplar-consultation mechanism that first measures the similarity between a frame and exemplary frames, and then aggregates exemplary features based on the similarity weights. This is also an efficient mechanism, as both similarity measurement and feature aggregation require limited computations. Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars. Due to the complementarity from the categorylevel modeling, our method employs a lightweight architecture but achieves new high performance on three benchmarks. In addition, using a spatio-temporal network to tackle video frames, our method makes a good trade-off between effectiveness and efficiency. Code is available at https://github.com/VividLe/Online-Action-Detection.",

keywords = "Behavior analysis, Video analysis and understanding",

author = "Le Yang and Junwei Han and Dingwen Zhang",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 ; Conference date: 19-06-2022 Through 24-06-2022",

year = "2022",

doi = "10.1109/CVPR52688.2022.00316",

language = "英语",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "3150--3159",

booktitle = "Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022",

}

Yang, L, Han, J & Zhang, D 2022, Colar: Effective and Efficient Online Action Detection by Consulting Exemplars. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 卷 2022-June, IEEE Computer Society, 页码 3150-3159, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, 美国, 19/06/22. https://doi.org/10.1109/CVPR52688.2022.00316

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars. / Yang, Le; Han, Junwei ; Zhang, Dingwen.
Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society, 2022. 页码 3150-3159 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2022-June).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Colar

T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

AU - Yang, Le

AU - Han, Junwei

AU - Zhang, Dingwen

PY - 2022

Y1 - 2022

N2 - Online action detection has attracted increasing research interests in recent years. Current works model historical dependencies and anticipate the future to perceive the action evolution within a video segment and improve the detection accuracy. However, the existing paradigm ignores category-level modeling and does not pay sufficient attention to efficiency. Considering a category, its representative frames exhibit various characteristics. Thus, the category-level modeling can provide complimentary guidance to the temporal dependencies modeling. This paper develops an effective exemplar-consultation mechanism that first measures the similarity between a frame and exemplary frames, and then aggregates exemplary features based on the similarity weights. This is also an efficient mechanism, as both similarity measurement and feature aggregation require limited computations. Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars. Due to the complementarity from the categorylevel modeling, our method employs a lightweight architecture but achieves new high performance on three benchmarks. In addition, using a spatio-temporal network to tackle video frames, our method makes a good trade-off between effectiveness and efficiency. Code is available at https://github.com/VividLe/Online-Action-Detection.

AB - Online action detection has attracted increasing research interests in recent years. Current works model historical dependencies and anticipate the future to perceive the action evolution within a video segment and improve the detection accuracy. However, the existing paradigm ignores category-level modeling and does not pay sufficient attention to efficiency. Considering a category, its representative frames exhibit various characteristics. Thus, the category-level modeling can provide complimentary guidance to the temporal dependencies modeling. This paper develops an effective exemplar-consultation mechanism that first measures the similarity between a frame and exemplary frames, and then aggregates exemplary features based on the similarity weights. This is also an efficient mechanism, as both similarity measurement and feature aggregation require limited computations. Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars. Due to the complementarity from the categorylevel modeling, our method employs a lightweight architecture but achieves new high performance on three benchmarks. In addition, using a spatio-temporal network to tackle video frames, our method makes a good trade-off between effectiveness and efficiency. Code is available at https://github.com/VividLe/Online-Action-Detection.

KW - Behavior analysis

KW - Video analysis and understanding

UR - http://www.scopus.com/inward/record.url?scp=85131029501&partnerID=8YFLogxK

U2 - 10.1109/CVPR52688.2022.00316

DO - 10.1109/CVPR52688.2022.00316

M3 - 会议稿件

AN - SCOPUS:85131029501

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 3150

EP - 3159

BT - Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

PB - IEEE Computer Society

Y2 - 19 June 2022 through 24 June 2022

ER -

Yang L, Han J , Zhang D. Colar: Effective and Efficient Online Action Detection by Consulting Exemplars. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society. 2022. 页码 3150-3159. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR52688.2022.00316

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此