Harnessing lab knowledge for real-world action recognition

Zhigang Ma; Yi Yang; Feiping Nie; Nicu Sebe; Shuicheng Yan; Alexander G. Hauptmann

doi:10.1007/s11263-014-0717-5


Research output: Contribution to journal › Article › peer-review

36 Citations (Scopus)

Abstract

Much research on human action recognition has been oriented toward the performance gain on lab-collected datasets. Yet real-world videos are more diverse, with more complicated actions and often only a few of them are precisely labeled. Thus, recognizing actions from these videos is a tough mission. The paucity of labeled real-world videos motivates us to "borrow" strength from other resources. Specifically, considering that many lab datasets are available, we propose to harness lab datasets to facilitate the action recognition in real-world videos given that the lab and real-world datasets are related. As their action categories are usually inconsistent, we design a multi-task learning framework to jointly optimize the classifiers for both sides. The general Schatten $p$-norm is exerted on the two classifiers to explore the shared knowledge between them. In this way, our framework is able to mine the shared knowledge between two datasets even if the two have different action categories, which is a major virtue of our method. The shared knowledge is further used to improve the action recognition in the real-world videos. Extensive experiments are performed on real-world datasets with promising results.
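
For reference, the Schatten $p$-norm named in the abstract has a standard definition in terms of singular values, stated below. The stacked matrix $W = [W_l, W_r]$ and the symbols $W_l$ (lab classifier) and $W_r$ (real-world classifier) are illustrative assumptions, not notation taken from the paper, and this is the textbook norm rather than the paper's full objective.

```latex
% Standard Schatten p-norm of a matrix W with singular values \sigma_i(W).
% W_l, W_r and the column-wise stacking are assumed here for illustration only.
\|W\|_{S_p} = \Bigl( \sum_{i=1}^{\min(d,\,c)} \sigma_i(W)^{p} \Bigr)^{1/p},
\qquad W = [\, W_l,\; W_r \,] \in \mathbb{R}^{d \times c}
```

With $p = 1$ this reduces to the trace (nuclear) norm and with $p = 2$ to the Frobenius norm; smaller values of $p$ are commonly used to favor a low-rank $W$, which is one way a shared subspace between two sets of classifiers can be encouraged.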

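Original language	English
Pages (from-to)	60-73
Number of pages	14
Journal	International Journal of Computer Vision
Volume	109
Issue number	1-2
DOI	https://doi.org/10.1007/s11263-014-0717-5
Publication status	Published - Aug 2014
Externally published	Yes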


Cite this

@article{513e79e9ee27427b8faa61c7f5123226,

title = "Harnessing lab knowledge for real-world action recognition",

abstract = "Much research on human action recognition has been oriented toward the performance gain on lab-collected datasets. Yet real-world videos are more diverse, with more complicated actions and often only a few of them are precisely labeled. Thus, recognizing actions from these videos is a tough mission. The paucity of labeled real-world videos motivates us to {"}borrow{"} strength from other resources. Specifically, considering that many lab datasets are available, we propose to harness lab datasets to facilitate the action recognition in real-world videos given that the lab and real-world datasets are related. As their action categories are usually inconsistent, we design a multi-task learning framework to jointly optimize the classifiers for both sides. The general Schatten $p$-norm is exerted on the two classifiers to explore the shared knowledge between them. In this way, our framework is able to mine the shared knowledge between two datasets even if the two have different action categories, which is a major virtue of our method. The shared knowledge is further used to improve the action recognition in the real-world videos. Extensive experiments are performed on real-world datasets with promising results.",

keywords = "Action recognition, General Schatten-p norm, Lab to real-world, Transfer learning",

author = "Zhigang Ma and Yi Yang and Feiping Nie and Nicu Sebe and Shuicheng Yan and Hauptmann, {Alexander G.}",

year = "2014",

month = aug,

doi = "10.1007/s11263-014-0717-5",

language = "English",

volume = "109",

pages = "60--73",

journal = "International Journal of Computer Vision",

issn = "0920-5691",

publisher = "Springer Netherlands",

number = "1-2",

}

TY - JOUR

T1 - Harnessing lab knowledge for real-world action recognition

AU - Ma, Zhigang

AU - Yang, Yi

AU - Nie, Feiping

AU - Sebe, Nicu

AU - Yan, Shuicheng

AU - Hauptmann, Alexander G.

PY - 2014/8

Y1 - 2014/8

N2 - Much research on human action recognition has been oriented toward the performance gain on lab-collected datasets. Yet real-world videos are more diverse, with more complicated actions and often only a few of them are precisely labeled. Thus, recognizing actions from these videos is a tough mission. The paucity of labeled real-world videos motivates us to "borrow" strength from other resources. Specifically, considering that many lab datasets are available, we propose to harness lab datasets to facilitate the action recognition in real-world videos given that the lab and real-world datasets are related. As their action categories are usually inconsistent, we design a multi-task learning framework to jointly optimize the classifiers for both sides. The general Schatten $p$-norm is exerted on the two classifiers to explore the shared knowledge between them. In this way, our framework is able to mine the shared knowledge between two datasets even if the two have different action categories, which is a major virtue of our method. The shared knowledge is further used to improve the action recognition in the real-world videos. Extensive experiments are performed on real-world datasets with promising results.

AB - Much research on human action recognition has been oriented toward the performance gain on lab-collected datasets. Yet real-world videos are more diverse, with more complicated actions and often only a few of them are precisely labeled. Thus, recognizing actions from these videos is a tough mission. The paucity of labeled real-world videos motivates us to "borrow" strength from other resources. Specifically, considering that many lab datasets are available, we propose to harness lab datasets to facilitate the action recognition in real-world videos given that the lab and real-world datasets are related. As their action categories are usually inconsistent, we design a multi-task learning framework to jointly optimize the classifiers for both sides. The general Schatten $p$-norm is exerted on the two classifiers to explore the shared knowledge between them. In this way, our framework is able to mine the shared knowledge between two datasets even if the two have different action categories, which is a major virtue of our method. The shared knowledge is further used to improve the action recognition in the real-world videos. Extensive experiments are performed on real-world datasets with promising results.

KW - Action recognition

KW - General Schatten-p norm

KW - Lab to real-world

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=84902258204&partnerID=8YFLogxK

U2 - 10.1007/s11263-014-0717-5

DO - 10.1007/s11263-014-0717-5

M3 - Article

AN - SCOPUS:84902258204

SN - 0920-5691

VL - 109

SP - 60

EP - 73

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 1-2

ER -
