TY - GEN
T1 - 3D Feature Extraction Network Based on Self-supervision for Micro-expression Spotting
AU - Wang, Yuhan
AU - Guo, Xupeng
AU - Xia, Zhaoqiang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Micro-expression is an important tool to analyze real human emotions. As the upstream task of micro-expression analysis, video spotting needs to obtain accurate video frame position. At present, it mainly relies on manual calibration by experts, which is not suitable for processing massive videos in real scenes. Due to the short duration and weak intensity of micro-expression, traditional manual feature extraction methods are difficult to capture the weak change of micro-expression, while deep learning based methods are not robust enough. Therefore, this paper proposes a self-supervised facial feature extraction network to constructs a more robust facial feature extractor through self-supervised methods to capture the weak change in micro-expression. Concretely,we split raw long video into clips for model training and introduce a pixel-level-based mask operation to improve the effect of the model reconstruction. Then we reconstruct the optical flow sequence and original face sequence through two 3D feature extraction networks with identical structure and different parameters, and optimize the parameters by self-supervision.The results show that the proposed model captures robust subtle facial change features and improves the accuracy of micro-expression spotting on two datasets CAS(ME)2 and SAMM-LV.
AB - Micro-expression is an important tool to analyze real human emotions. As the upstream task of micro-expression analysis, video spotting needs to obtain accurate video frame position. At present, it mainly relies on manual calibration by experts, which is not suitable for processing massive videos in real scenes. Due to the short duration and weak intensity of micro-expression, traditional manual feature extraction methods are difficult to capture the weak change of micro-expression, while deep learning based methods are not robust enough. Therefore, this paper proposes a self-supervised facial feature extraction network to constructs a more robust facial feature extractor through self-supervised methods to capture the weak change in micro-expression. Concretely,we split raw long video into clips for model training and introduce a pixel-level-based mask operation to improve the effect of the model reconstruction. Then we reconstruct the optical flow sequence and original face sequence through two 3D feature extraction networks with identical structure and different parameters, and optimize the parameters by self-supervision.The results show that the proposed model captures robust subtle facial change features and improves the accuracy of micro-expression spotting on two datasets CAS(ME)2 and SAMM-LV.
KW - Feature extraction network
KW - Micro-expression spotting
KW - Self-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85199464039&partnerID=8YFLogxK
U2 - 10.1109/ICIPMC62364.2024.10586609
DO - 10.1109/ICIPMC62364.2024.10586609
M3 - 会议稿件
AN - SCOPUS:85199464039
T3 - 2024 3rd International Conference on Image Processing and Media Computing, ICIPMC 2024
SP - 371
EP - 377
BT - 2024 3rd International Conference on Image Processing and Media Computing, ICIPMC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Image Processing and Media Computing, ICIPMC 2024
Y2 - 17 May 2024 through 19 May 2024
ER -