TY - JOUR
T1 - Adversarial Multi-view Networks for Activity Recognition
AU - Bai, Lei
AU - Yao, Lina
AU - Wang, Xianzhi
AU - Kanhere, Salil S.
AU - Guo, Bin
AU - Yu, Zhiwen
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/6/15
Y1 - 2020/6/15
AB - Human activity recognition (HAR) plays an irreplaceable role in various applications and has been a prosperous research topic for years. Recent studies show significant progress in feature extraction (i.e., data representation) using deep learning techniques. However, they face significant challenges in capturing multi-modal spatio-temporal patterns from the sensory data, and they commonly overlook the variations between subjects. We propose a Discriminative Adversarial MUlti-view Network (DAMUN) to address the above issues in sensor-based HAR. We first design a multi-view feature extractor to obtain representations of sensory data streams from temporal, spatial, and spatio-temporal views using convolutional networks. Then, we fuse the multi-view representations into a robust joint representation through a trainable Hadamard fusion module, and finally employ a Siamese adversarial network architecture to reduce the variations between the representations of different subjects. We have conducted extensive experiments under an iterative leave-one-subject-out setting on three real-world datasets and demonstrated both the effectiveness and robustness of our approach.
KW - Activity Recognition
KW - Adversarial Training
KW - Deep Learning
KW - Multi-view Representation
UR - http://www.scopus.com/inward/record.url?scp=85089756468&partnerID=8YFLogxK
U2 - 10.1145/3397323
DO - 10.1145/3397323
M3 - Article
AN - SCOPUS:85089756468
SN - 2474-9567
VL - 4
JO - Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
JF - Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
IS - 2
M1 - 42
ER -