TY - GEN
T1 - High-Quality Label Learning in Generalized Category Discovery
AU - Duan, Yu
AU - He, Junzhi
AU - Nie, Feiping
AU - Gao, Quanxue
AU - Deng, Cheng
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Generalized Category Discovery (GCD) is a recently proposed open-world problem that aims to automatically classify and discover new categories based on partially labeled data. For unlabeled data, previous research commonly considers using pseudo-labels to assist in model learning. These pseudo-labels, together with the true labels of labeled data, form the learning targets for the final classifier, leading to better predictive outcomes. However, low-quality labels can inevitably hinder the learning process of the model. To address this issue, inspired by previous methods, we propose the Calibrated Generalized Category Discovery (CGCD) framework, which incorporates a projection head, a classifier head, and a calibration head. The projection head is used for representation learning. The calibration head learns high-quality labels from the robust predictions of the classifier head, and the classifier head utilizes these high-quality labels for more efficient learning. Both heads mutually enhance each other during training, ultimately leading to a superior solution. In addition, leveraging the characteristics of both the classifier head and calibration head, we designed a classifier representation distribution regularization term to further ensure consistency in their learning processes. Extensive experimental results demonstrate that the proposed CGCD framework achieves state-of-the-art performance across five general and fine-grained visual recognition datasets by leveraging high-quality label learning.
AB - Generalized Category Discovery (GCD) is a recently proposed open-world problem that aims to automatically classify and discover new categories based on partially labeled data. For unlabeled data, previous research commonly considers using pseudo-labels to assist in model learning. These pseudo-labels, together with the true labels of labeled data, form the learning targets for the final classifier, leading to better predictive outcomes. However, low-quality labels can inevitably hinder the learning process of the model. To address this issue, inspired by previous methods, we propose the Calibrated Generalized Category Discovery (CGCD) framework, which incorporates a projection head, a classifier head, and a calibration head. The projection head is used for representation learning. The calibration head learns high-quality labels from the robust predictions of the classifier head, and the classifier head utilizes these high-quality labels for more efficient learning. Both heads mutually enhance each other during training, ultimately leading to a superior solution. In addition, leveraging the characteristics of both the classifier head and calibration head, we designed a classifier representation distribution regularization term to further ensure consistency in their learning processes. Extensive experimental results demonstrate that the proposed CGCD framework achieves state-of-the-art performance across five general and fine-grained visual recognition datasets by leveraging high-quality label learning.
KW - generalized category discovery
KW - high-quality label learning
KW - Open-world semi-supervised learning
UR - https://www.scopus.com/pages/publications/105035066435
U2 - 10.1109/ICDM65498.2025.00030
DO - 10.1109/ICDM65498.2025.00030
M3 - 会议稿件
AN - SCOPUS:105035066435
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 228
EP - 236
BT - Proceedings - 25th IEEE International Conference on Data Mining, ICDM 2025
A2 - Ding, Wei
A2 - Vreeken, Jilles
A2 - Lu, Chang-Tien
A2 - Gunopulos, Dimitrios
A2 - Wu, Xindong
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Data Mining, ICDM 2025
Y2 - 12 November 2025 through 15 November 2025
ER -