Multi-scale and discriminative part detectors based features for multi-label image classification

Gong Cheng; Decheng Gao; Yang Liu; Junwei Han

doi:10.24963/ijcai.2018/90

Multi-scale and discriminative part detectors based features for multi-label image classification

Gong Cheng, Decheng Gao, Yang Liu, Junwei Han

School of Automation

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

12 Scopus citations

Abstract

Convolutional neural networks (CNNs) have shown their promise for image classification task. However, global CNN features still lack geometric invariance for addressing the problem of intra-class variations and so are not optimal for multi-label image classification. This paper proposes a new and effective framework built upon CNNs to learn Multi-scale and Discriminative Part Detectors (MsDPD)-based feature representations for multi-label image classification. Specifically, at each scale level, we (i) first present an entropy-rank based scheme to generate and select a set of discriminative part detectors (DPD), and then (ii) obtain a number of DPD-based convolutional feature maps with each feature map representing the occurrence probability of a particular part detector and learn DPD-based features by using a task-driven pooling scheme. The two steps are formulated into a unified framework by developing a new objective function, which jointly trains part detectors incrementally and integrates the learning of feature representations into the classification task. Finally, the multi-scale features are fused to produce the predictions. Experimental results on PASCAL VOC 2007 and VOC 2012 datasets demonstrate that the proposed method achieves better accuracy when compared with the existing state-of-the-art multi-label classification methods.

Original language	English
Title of host publication	Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018
Editors	Jerome Lang
Publisher	International Joint Conferences on Artificial Intelligence
Pages	649-655
Number of pages	7
ISBN (Electronic)	9780999241127
DOIs	https://doi.org/10.24963/ijcai.2018/90
State	Published - 2018
Event	27th International Joint Conference on Artificial Intelligence, IJCAI 2018 - Stockholm, Sweden Duration: 13 Jul 2018 → 19 Jul 2018

Publication series

Name	IJCAI International Joint Conference on Artificial Intelligence
Volume	2018-July
ISSN (Print)	1045-0823

Conference

Conference	27th International Joint Conference on Artificial Intelligence, IJCAI 2018
Country/Territory	Sweden
City	Stockholm
Period	13/07/18 → 19/07/18

Access to Document

10.24963/ijcai.2018/90

Cite this

Cheng, G., Gao, D., Liu, Y., & Han, J. (2018). Multi-scale and discriminative part detectors based features for multi-label image classification. In J. Lang (Ed.), Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018 (pp. 649-655). (IJCAI International Joint Conference on Artificial Intelligence; Vol. 2018-July). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/90

Cheng, Gong ; Gao, Decheng ; Liu, Yang et al. / Multi-scale and discriminative part detectors based features for multi-label image classification. Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. editor / Jerome Lang. International Joint Conferences on Artificial Intelligence, 2018. pp. 649-655 (IJCAI International Joint Conference on Artificial Intelligence).

@inproceedings{7da6e20529ac4c4485dcf625f56c495b,

title = "Multi-scale and discriminative part detectors based features for multi-label image classification",

abstract = "Convolutional neural networks (CNNs) have shown their promise for image classification task. However, global CNN features still lack geometric invariance for addressing the problem of intra-class variations and so are not optimal for multi-label image classification. This paper proposes a new and effective framework built upon CNNs to learn Multi-scale and Discriminative Part Detectors (MsDPD)-based feature representations for multi-label image classification. Specifically, at each scale level, we (i) first present an entropy-rank based scheme to generate and select a set of discriminative part detectors (DPD), and then (ii) obtain a number of DPD-based convolutional feature maps with each feature map representing the occurrence probability of a particular part detector and learn DPD-based features by using a task-driven pooling scheme. The two steps are formulated into a unified framework by developing a new objective function, which jointly trains part detectors incrementally and integrates the learning of feature representations into the classification task. Finally, the multi-scale features are fused to produce the predictions. Experimental results on PASCAL VOC 2007 and VOC 2012 datasets demonstrate that the proposed method achieves better accuracy when compared with the existing state-of-the-art multi-label classification methods.",

author = "Gong Cheng and Decheng Gao and Yang Liu and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 2018 International Joint Conferences on Artificial Intelligence. All right reserved.; 27th International Joint Conference on Artificial Intelligence, IJCAI 2018 ; Conference date: 13-07-2018 Through 19-07-2018",

year = "2018",

doi = "10.24963/ijcai.2018/90",

language = "英语",

series = "IJCAI International Joint Conference on Artificial Intelligence",

publisher = "International Joint Conferences on Artificial Intelligence",

pages = "649--655",

editor = "Jerome Lang",

booktitle = "Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018",

}

Cheng, G, Gao, D, Liu, Y & Han, J 2018, Multi-scale and discriminative part detectors based features for multi-label image classification. in J Lang (ed.), Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. IJCAI International Joint Conference on Artificial Intelligence, vol. 2018-July, International Joint Conferences on Artificial Intelligence, pp. 649-655, 27th International Joint Conference on Artificial Intelligence, IJCAI 2018, Stockholm, Sweden, 13/07/18. https://doi.org/10.24963/ijcai.2018/90

Multi-scale and discriminative part detectors based features for multi-label image classification. / Cheng, Gong; Gao, Decheng; Liu, Yang et al.
Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. ed. / Jerome Lang. International Joint Conferences on Artificial Intelligence, 2018. p. 649-655 (IJCAI International Joint Conference on Artificial Intelligence; Vol. 2018-July).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Multi-scale and discriminative part detectors based features for multi-label image classification

AU - Cheng, Gong

AU - Gao, Decheng

AU - Liu, Yang

AU - Han, Junwei

PY - 2018

Y1 - 2018

N2 - Convolutional neural networks (CNNs) have shown their promise for image classification task. However, global CNN features still lack geometric invariance for addressing the problem of intra-class variations and so are not optimal for multi-label image classification. This paper proposes a new and effective framework built upon CNNs to learn Multi-scale and Discriminative Part Detectors (MsDPD)-based feature representations for multi-label image classification. Specifically, at each scale level, we (i) first present an entropy-rank based scheme to generate and select a set of discriminative part detectors (DPD), and then (ii) obtain a number of DPD-based convolutional feature maps with each feature map representing the occurrence probability of a particular part detector and learn DPD-based features by using a task-driven pooling scheme. The two steps are formulated into a unified framework by developing a new objective function, which jointly trains part detectors incrementally and integrates the learning of feature representations into the classification task. Finally, the multi-scale features are fused to produce the predictions. Experimental results on PASCAL VOC 2007 and VOC 2012 datasets demonstrate that the proposed method achieves better accuracy when compared with the existing state-of-the-art multi-label classification methods.

AB - Convolutional neural networks (CNNs) have shown their promise for image classification task. However, global CNN features still lack geometric invariance for addressing the problem of intra-class variations and so are not optimal for multi-label image classification. This paper proposes a new and effective framework built upon CNNs to learn Multi-scale and Discriminative Part Detectors (MsDPD)-based feature representations for multi-label image classification. Specifically, at each scale level, we (i) first present an entropy-rank based scheme to generate and select a set of discriminative part detectors (DPD), and then (ii) obtain a number of DPD-based convolutional feature maps with each feature map representing the occurrence probability of a particular part detector and learn DPD-based features by using a task-driven pooling scheme. The two steps are formulated into a unified framework by developing a new objective function, which jointly trains part detectors incrementally and integrates the learning of feature representations into the classification task. Finally, the multi-scale features are fused to produce the predictions. Experimental results on PASCAL VOC 2007 and VOC 2012 datasets demonstrate that the proposed method achieves better accuracy when compared with the existing state-of-the-art multi-label classification methods.

UR - http://www.scopus.com/inward/record.url?scp=85055719845&partnerID=8YFLogxK

U2 - 10.24963/ijcai.2018/90

DO - 10.24963/ijcai.2018/90

M3 - 会议稿件

AN - SCOPUS:85055719845

T3 - IJCAI International Joint Conference on Artificial Intelligence

SP - 649

EP - 655

BT - Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018

A2 - Lang, Jerome

PB - International Joint Conferences on Artificial Intelligence

T2 - 27th International Joint Conference on Artificial Intelligence, IJCAI 2018

Y2 - 13 July 2018 through 19 July 2018

ER -

Cheng G, Gao D, Liu Y, Han J. Multi-scale and discriminative part detectors based features for multi-label image classification. In Lang J, editor, Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. International Joint Conferences on Artificial Intelligence. 2018. p. 649-655. (IJCAI International Joint Conference on Artificial Intelligence). doi: 10.24963/ijcai.2018/90

Multi-scale and discriminative part detectors based features for multi-label image classification

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this