TY - JOUR
T1 - Harnessing lab knowledge for real-world action recognition
AU - Ma, Zhigang
AU - Yang, Yi
AU - Nie, Feiping
AU - Sebe, Nicu
AU - Yan, Shuicheng
AU - Hauptmann, Alexander G.
PY - 2014/8
Y1 - 2014/8
N2 - Much research on human action recognition has been oriented toward the performance gain on lab-collected datasets. Yet real-world videos are more diverse, with more complicated actions and often only a few of them are precisely labeled. Thus, recognizing actions from these videos is a tough mission. The paucity of labeled real-world videos motivates us to "borrow" strength from other resources. Specifically, considering that many lab datasets are available, we propose to harness lab datasets to facilitate the action recognition in real-world videos given that the lab and real-world datasets are related. As their action categories are usually inconsistent, we design a multi-task learning framework to jointly optimize the classifiers for both sides. The general Schatten $$p$ $ p -norm is exerted on the two classifiers to explore the shared knowledge between them. In this way, our framework is able to mine the shared knowledge between two datasets even if the two have different action categories, which is a major virtue of our method. The shared knowledge is further used to improve the action recognition in the real-world videos. Extensive experiments are performed on real-world datasets with promising results.
AB - Much research on human action recognition has been oriented toward the performance gain on lab-collected datasets. Yet real-world videos are more diverse, with more complicated actions and often only a few of them are precisely labeled. Thus, recognizing actions from these videos is a tough mission. The paucity of labeled real-world videos motivates us to "borrow" strength from other resources. Specifically, considering that many lab datasets are available, we propose to harness lab datasets to facilitate the action recognition in real-world videos given that the lab and real-world datasets are related. As their action categories are usually inconsistent, we design a multi-task learning framework to jointly optimize the classifiers for both sides. The general Schatten $$p$ $ p -norm is exerted on the two classifiers to explore the shared knowledge between them. In this way, our framework is able to mine the shared knowledge between two datasets even if the two have different action categories, which is a major virtue of our method. The shared knowledge is further used to improve the action recognition in the real-world videos. Extensive experiments are performed on real-world datasets with promising results.
KW - Action recognition
KW - General Schatten-p norm
KW - Lab to real-world
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=84902258204&partnerID=8YFLogxK
U2 - 10.1007/s11263-014-0717-5
DO - 10.1007/s11263-014-0717-5
M3 - 文章
AN - SCOPUS:84902258204
SN - 0920-5691
VL - 109
SP - 60
EP - 73
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
IS - 1-2
ER -