Discriminative Multi-View Subspace Feature Learning for Action Recognition

Biyun Sheng; Jun Li; Fu Xiao; Qun Li; Wankou Yang; Junwei Han

doi:10.1109/TCSVT.2019.2918591

Discriminative Multi-View Subspace Feature Learning for Action Recognition

Biyun Sheng, Jun Li, Fu Xiao, Qun Li, Wankou Yang, Junwei Han

School of Automation

Research output: Contribution to journal › Article › peer-review

11 Scopus citations

Abstract

Although deep features have achieved the state-of-The-Art performance in action recognition recently, the hand-crafted shallow features still play a critical role in characterizing human actions for taking advantage of visual contents in an intuitive way such as edge features. Therefore, the shallow features can serve as auxiliary visual cues supplementary to deep representations. In this paper, we propose a discriminative subspace learning model (DSLM) to explore the complementary properties between the hand-crafted shallow feature representations and the deep features. As for the RGB action recognition, this is the first work attempting to mine multi-level feature complementaries by the multi-view subspace learning scheme. To sufficiently capture the complementary information among heterogeneous features, we construct the DSLM by integrating the multi-view reconstruction error and classification error into an unified objective function. To be specific, we first use Fisher Vector to encode improved dense trajectories (iDT+FV) for shallow representations and two-stream convolutional neural network models (T-CNN) for generating deep features. Moreover, the presented DSLM algorithm projects multi-level features onto a shared discriminative subspace with the complementary information and discriminating capacity simultaneously incorporated. Finally, the action types of test samples are identified by the margins from the learned compact representations to the decision boundary. The experimental results on three datasets demonstrate the effectiveness of the proposed method.

Original language	English
Article number	8721146
Pages (from-to)	4591-4600
Number of pages	10
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	30
Issue number	12
DOIs	https://doi.org/10.1109/TCSVT.2019.2918591
State	Published - Dec 2020

Keywords

Action recognition
multi-level feature fusion
multi-view subspace learning

Access to Document

10.1109/TCSVT.2019.2918591

Cite this

@article{f574fd3accfd478c9c34a554c2bb8d83,

title = "Discriminative Multi-View Subspace Feature Learning for Action Recognition",

abstract = "Although deep features have achieved the state-of-The-Art performance in action recognition recently, the hand-crafted shallow features still play a critical role in characterizing human actions for taking advantage of visual contents in an intuitive way such as edge features. Therefore, the shallow features can serve as auxiliary visual cues supplementary to deep representations. In this paper, we propose a discriminative subspace learning model (DSLM) to explore the complementary properties between the hand-crafted shallow feature representations and the deep features. As for the RGB action recognition, this is the first work attempting to mine multi-level feature complementaries by the multi-view subspace learning scheme. To sufficiently capture the complementary information among heterogeneous features, we construct the DSLM by integrating the multi-view reconstruction error and classification error into an unified objective function. To be specific, we first use Fisher Vector to encode improved dense trajectories (iDT+FV) for shallow representations and two-stream convolutional neural network models (T-CNN) for generating deep features. Moreover, the presented DSLM algorithm projects multi-level features onto a shared discriminative subspace with the complementary information and discriminating capacity simultaneously incorporated. Finally, the action types of test samples are identified by the margins from the learned compact representations to the decision boundary. The experimental results on three datasets demonstrate the effectiveness of the proposed method.",

keywords = "Action recognition, multi-level feature fusion, multi-view subspace learning",

author = "Biyun Sheng and Jun Li and Fu Xiao and Qun Li and Wankou Yang and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2020",

month = dec,

doi = "10.1109/TCSVT.2019.2918591",

language = "英语",

volume = "30",

pages = "4591--4600",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "12",

}

TY - JOUR

T1 - Discriminative Multi-View Subspace Feature Learning for Action Recognition

AU - Sheng, Biyun

AU - Li, Jun

AU - Xiao, Fu

AU - Li, Qun

AU - Yang, Wankou

AU - Han, Junwei

PY - 2020/12

Y1 - 2020/12

N2 - Although deep features have achieved the state-of-The-Art performance in action recognition recently, the hand-crafted shallow features still play a critical role in characterizing human actions for taking advantage of visual contents in an intuitive way such as edge features. Therefore, the shallow features can serve as auxiliary visual cues supplementary to deep representations. In this paper, we propose a discriminative subspace learning model (DSLM) to explore the complementary properties between the hand-crafted shallow feature representations and the deep features. As for the RGB action recognition, this is the first work attempting to mine multi-level feature complementaries by the multi-view subspace learning scheme. To sufficiently capture the complementary information among heterogeneous features, we construct the DSLM by integrating the multi-view reconstruction error and classification error into an unified objective function. To be specific, we first use Fisher Vector to encode improved dense trajectories (iDT+FV) for shallow representations and two-stream convolutional neural network models (T-CNN) for generating deep features. Moreover, the presented DSLM algorithm projects multi-level features onto a shared discriminative subspace with the complementary information and discriminating capacity simultaneously incorporated. Finally, the action types of test samples are identified by the margins from the learned compact representations to the decision boundary. The experimental results on three datasets demonstrate the effectiveness of the proposed method.

AB - Although deep features have achieved the state-of-The-Art performance in action recognition recently, the hand-crafted shallow features still play a critical role in characterizing human actions for taking advantage of visual contents in an intuitive way such as edge features. Therefore, the shallow features can serve as auxiliary visual cues supplementary to deep representations. In this paper, we propose a discriminative subspace learning model (DSLM) to explore the complementary properties between the hand-crafted shallow feature representations and the deep features. As for the RGB action recognition, this is the first work attempting to mine multi-level feature complementaries by the multi-view subspace learning scheme. To sufficiently capture the complementary information among heterogeneous features, we construct the DSLM by integrating the multi-view reconstruction error and classification error into an unified objective function. To be specific, we first use Fisher Vector to encode improved dense trajectories (iDT+FV) for shallow representations and two-stream convolutional neural network models (T-CNN) for generating deep features. Moreover, the presented DSLM algorithm projects multi-level features onto a shared discriminative subspace with the complementary information and discriminating capacity simultaneously incorporated. Finally, the action types of test samples are identified by the margins from the learned compact representations to the decision boundary. The experimental results on three datasets demonstrate the effectiveness of the proposed method.

KW - Action recognition

KW - multi-level feature fusion

KW - multi-view subspace learning

UR - http://www.scopus.com/inward/record.url?scp=85097767874&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2019.2918591

DO - 10.1109/TCSVT.2019.2918591

M3 - 文章

AN - SCOPUS:85097767874

SN - 1051-8215

VL - 30

SP - 4591

EP - 4600

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 12

M1 - 8721146

ER -

Discriminative Multi-View Subspace Feature Learning for Action Recognition

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this