TY - JOUR
T1 - SAST: a suppressing ambiguity self-training framework for facial expression recognition
T2 - Multimedia Tools and Applications
AU - Guo, Zhe
AU - Wei, Bingxin
AU - Liu, Xuewen
AU - Zhang, Zhibo
AU - Liu, Shiya
AU - Fan, Yangyu
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
PY - 2024/5
Y1 - 2024/5
N2 - Facial expression recognition (FER) suffers from insufficient label information: human expressions are complex and diverse, and many expressions are ambiguous. Training with low-quality or scarce labels aggravates the ambiguity of model predictions and reduces FER accuracy. How to improve the robustness of FER to ambiguous data under insufficient information remains challenging. To this end, we propose the Suppressing Ambiguity Self-Training (SAST) framework, the first attempt to simultaneously address insufficient information in both label quality and label quantity. Specifically, we design an Ambiguous Relative Label Usage (ARLU) strategy that mixes hard labels and soft labels to alleviate the information loss caused by hard labels. We also enhance the model's robustness to ambiguous data by means of Self-Training Resampling (STR). We further use facial landmarks and a Patch Branch (PB) to strengthen the ability to suppress ambiguity. Experiments on the RAF-DB, FERPlus, SFEW, and AffectNet datasets show that our SAST outperforms 6 semi-supervised methods with fewer annotations and achieves accuracy competitive with state-of-the-art (SOTA) FER methods. Our code is available at https://github.com/Liuxww/SAST.
AB - Facial expression recognition (FER) suffers from insufficient label information: human expressions are complex and diverse, and many expressions are ambiguous. Training with low-quality or scarce labels aggravates the ambiguity of model predictions and reduces FER accuracy. How to improve the robustness of FER to ambiguous data under insufficient information remains challenging. To this end, we propose the Suppressing Ambiguity Self-Training (SAST) framework, the first attempt to simultaneously address insufficient information in both label quality and label quantity. Specifically, we design an Ambiguous Relative Label Usage (ARLU) strategy that mixes hard labels and soft labels to alleviate the information loss caused by hard labels. We also enhance the model's robustness to ambiguous data by means of Self-Training Resampling (STR). We further use facial landmarks and a Patch Branch (PB) to strengthen the ability to suppress ambiguity. Experiments on the RAF-DB, FERPlus, SFEW, and AffectNet datasets show that our SAST outperforms 6 semi-supervised methods with fewer annotations and achieves accuracy competitive with state-of-the-art (SOTA) FER methods. Our code is available at https://github.com/Liuxww/SAST.
KW - Facial expression recognition
KW - Insufficient information
KW - Self-training
KW - Suppressing ambiguity
UR - http://www.scopus.com/inward/record.url?scp=85178909235&partnerID=8YFLogxK
U2 - 10.1007/s11042-023-17749-w
DO - 10.1007/s11042-023-17749-w
M3 - Article
AN - SCOPUS:85178909235
SN - 1380-7501
VL - 83
SP - 56059
EP - 56076
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 18
ER -