TY - GEN
T1 - Semi-Supervised Multimodal Emotion Recognition with Class-Balanced Pseudo-labeling
AU - Chen, Haifeng
AU - Guo, Chujia
AU - Li, Yan
AU - Zhang, Peng
AU - Jiang, Dongmei
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/10/26
Y1 - 2023/10/26
N2 - This paper presents our solution for the Semi-Supervised Multimodal Emotion Recognition Challenge (MER2023-SEMI), addressing the issue of limited annotated data in emotion recognition. Recently, the self-training-based Semi-Supervised Learning (SSL) method has demonstrated its effectiveness in various tasks, including emotion recognition. However, previous studies focused on reducing the confirmation bias of data without adequately considering the issue of data imbalance, which is of great importance in emotion recognition. Additionally, previous methods have primarily focused on unimodal tasks and have not considered the inherent multimodal information in emotion recognition tasks. We propose a simple yet effective semi-supervised multimodal emotion recognition method to address the above issues. We assume that the pseudo-labeled samples with consistent results across unimodal and multimodal classifiers have a more negligible confirmation bias. Based on this assumption, we suggest using a class-balanced strategy to select top-k high-confidence pseudo-labeled samples from each class. The proposed method is validated to be effective on the MER2023-SEMI Grand Challenge, with the weighted F1 score reaching 88.53% on the test set.
AB - This paper presents our solution for the Semi-Supervised Multimodal Emotion Recognition Challenge (MER2023-SEMI), addressing the issue of limited annotated data in emotion recognition. Recently, the self-training-based Semi-Supervised Learning (SSL) method has demonstrated its effectiveness in various tasks, including emotion recognition. However, previous studies focused on reducing the confirmation bias of data without adequately considering the issue of data imbalance, which is of great importance in emotion recognition. Additionally, previous methods have primarily focused on unimodal tasks and have not considered the inherent multimodal information in emotion recognition tasks. We propose a simple yet effective semi-supervised multimodal emotion recognition method to address the above issues. We assume that the pseudo-labeled samples with consistent results across unimodal and multimodal classifiers have a more negligible confirmation bias. Based on this assumption, we suggest using a class-balanced strategy to select top-k high-confidence pseudo-labeled samples from each class. The proposed method is validated to be effective on the MER2023-SEMI Grand Challenge, with the weighted F1 score reaching 88.53% on the test set.
KW - class imbalance
KW - multimodal emotion recognition
KW - pseudo-labeling
KW - self-training
KW - semi-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85179549784&partnerID=8YFLogxK
U2 - 10.1145/3581783.3612864
DO - 10.1145/3581783.3612864
M3 - Conference contribution
AN - SCOPUS:85179549784
T3 - MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia
SP - 9556
EP - 9560
BT - MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 31st ACM International Conference on Multimedia, MM 2023
Y2 - 29 October 2023 through 3 November 2023
ER -