SCOAD: Single-Frame Click Supervision for Online Action Detection

Na Ye, Xing Zhang, Dawei Yan, Wei Dong, Qingsen Yan

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Online action detection based on supervised learning requires heavy manual annotation, which is difficult to obtain and may be impractical in real applications. Weakly supervised online action detection (WOAD) can effectively mitigate the problem of substantial labeling costs by using video-level labels. In this paper, we revisit WOAD and propose a weakly supervised online action detection using click-level labels for training, named Single-frame Click Supervision for Online Action Detection (SCOAD). Comparatively, click-level labels can effectively improve prediction accuracy by carrying a small amount of temporal information without massively increase the difficulty and cost of annotation. Specifically, SCOAD includes two joint training modules, i.e., Action Instance Miner (AIM) and Online Action Detector (OAD). To provide more guidance for training network as accuracy as possible, AIM mines pseudo-action instances under the supervision of click labels. Meanwhile, we generate video similarity instances offline by the similarity between video frames and use it to perform finer granularity filtering of error instances generated by AIM. OAD is trained jointly with AIM for online action detection by the pseudo frame-level labels converted from the filtered pseudo-action instances. We conduct extensive experiments on two benchmark datasets to demonstrate that SCOAD can effectively mine and utilize the small amount of temporal information in click-level labels. Code is available at https://github.com/zstarN70/SCOAD.git.

源语言英语
主期刊名Computer Vision – ACCV 2022 - 16th Asian Conference on Computer Vision, Proceedings
编辑Lei Wang, Juergen Gall, Tat-Jun Chin, Imari Sato, Rama Chellappa
出版商Springer Science and Business Media Deutschland GmbH
223-238
页数16
ISBN(印刷版)9783031263156
DOI
出版状态已出版 - 2023
活动16th Asian Conference on Computer Vision, ACCV 2022 - Macao, 中国
期限: 4 12月 20228 12月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13844 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议16th Asian Conference on Computer Vision, ACCV 2022
国家/地区中国
Macao
时期4/12/228/12/22

指纹

探究 'SCOAD: Single-Frame Click Supervision for Online Action Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此