TY - GEN
T1 - Compound Batch Normalization for Long-Tailed Image Classification
AU - Cheng, Lechao
AU - Fang, Chaowei
AU - Zhang, Dingwen
AU - Li, Guanbin
AU - Huang, Gang
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/10/10
Y1 - 2022/10/10
AB - Significant progress has been made in learning image classification neural networks under long-tailed data distributions using robust training algorithms such as data re-sampling, re-weighting, and margin adjustment. These methods, however, ignore the impact of data imbalance on feature normalization: the dominance of majority (head) classes in estimating statistics and affine parameters causes the internal covariate shift within less-frequent categories to be overlooked. To alleviate this problem, we propose a compound batch normalization method based on a Gaussian mixture, which models the feature space more comprehensively and reduces the dominance of head classes. A moving-average-based expectation-maximization (EM) algorithm is employed to estimate the statistical parameters of the multiple Gaussian distributions. However, the EM algorithm is sensitive to initialization and can easily become stuck in local minima where the Gaussian components all continue to focus on the majority classes. To tackle this issue, we develop a dual-path learning framework that employs class-aware split feature normalization to diversify the estimated Gaussian distributions, allowing the components to fit the training samples of less-frequent classes more comprehensively. Extensive experiments on commonly used datasets demonstrate that the proposed method outperforms existing approaches on long-tailed image classification.
KW - compound batch normalization
KW - image classification
KW - long-tailed
UR - http://www.scopus.com/inward/record.url?scp=85148278285&partnerID=8YFLogxK
U2 - 10.1145/3503161.3547805
DO - 10.1145/3503161.3547805
M3 - Conference contribution
AN - SCOPUS:85148278285
T3 - MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia
SP - 1925
EP - 1934
BT - MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 30th ACM International Conference on Multimedia, MM 2022
Y2 - 10 October 2022 through 14 October 2022
ER -
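
The abstract above outlines the core mechanism: normalize features against a mixture of Gaussians rather than a single batch mean/variance, with the mixture statistics tracked by a moving-average EM step. Below is a minimal PyTorch-style sketch of such a layer. The class name CompoundBatchNorm2d, the hyperparameters num_components and momentum, and the diagonal-Gaussian responsibilities over per-channel statistics are illustrative assumptions, not the authors' released implementation; the paper's dual-path, class-aware split normalization is omitted.

import torch
import torch.nn as nn
import torch.nn.functional as F

class CompoundBatchNorm2d(nn.Module):
    """Batch normalization over a K-component Gaussian mixture (illustrative sketch)."""

    def __init__(self, num_features, num_components=3, momentum=0.1, eps=1e-5):
        super().__init__()
        self.K, self.momentum, self.eps = num_components, momentum, eps
        # Running statistics for each Gaussian component, shape (K, C).
        # Means are jittered at init so the EM step does not start collapsed.
        self.register_buffer("running_mean",
                             0.1 * torch.randn(num_components, num_features))
        self.register_buffer("running_var",
                             torch.ones(num_components, num_features))
        self.register_buffer("prior",
                             torch.full((num_components,), 1.0 / num_components))
        # A separate affine transform (gamma, beta) per component.
        self.weight = nn.Parameter(torch.ones(num_components, num_features))
        self.bias = nn.Parameter(torch.zeros(num_components, num_features))

    def forward(self, x):                            # x: (N, C, H, W)
        n = x.size(0)
        feat = x.mean(dim=(2, 3))                    # (N, C) per-sample channel descriptor
        # E-step: soft responsibility of each component for each sample,
        # from a diagonal-Gaussian log-likelihood plus the mixture prior.
        diff = feat.unsqueeze(1) - self.running_mean            # (N, K, C)
        log_lik = -0.5 * (diff.pow(2) / (self.running_var + self.eps)
                          + torch.log(self.running_var + self.eps)).sum(-1)
        resp = F.softmax(log_lik + torch.log(self.prior + self.eps), dim=1)  # (N, K)

        if self.training:
            # M-step as a moving average of batch statistics, detached from
            # autograd like the running stats of ordinary batch norm. The
            # variance is measured around the running means for simplicity.
            with torch.no_grad():
                nk = resp.sum(0) + self.eps                     # (K,)
                batch_mean = (resp.unsqueeze(-1) * feat.unsqueeze(1)).sum(0) / nk.unsqueeze(-1)
                batch_var = (resp.unsqueeze(-1) * diff.pow(2)).sum(0) / nk.unsqueeze(-1)
                m = self.momentum
                self.running_mean.mul_(1 - m).add_(m * batch_mean)
                self.running_var.mul_(1 - m).add_(m * batch_var)
                self.prior.mul_(1 - m).add_(m * nk / n)

        # Normalize with responsibility-weighted component statistics.
        mean = (resp.unsqueeze(-1) * self.running_mean).sum(1)  # (N, C)
        var = (resp.unsqueeze(-1) * self.running_var).sum(1)    # (N, C)
        gamma = (resp.unsqueeze(-1) * self.weight).sum(1)
        beta = (resp.unsqueeze(-1) * self.bias).sum(1)
        x_hat = (x - mean[:, :, None, None]) / torch.sqrt(var[:, :, None, None] + self.eps)
        return gamma[:, :, None, None] * x_hat + beta[:, :, None, None]

The soft responsibilities keep the normalization path differentiable end to end, while the no-grad moving-average M-step mirrors how standard batch normalization tracks its running statistics.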