跳到主要导航 跳到搜索 跳到主要内容

Class attention network for image recognition

  • Northwestern Polytechnical University Xian

科研成果: 期刊稿件文章同行评审

107 引用 (Scopus)

摘要

Visual attention has become a popular and widely used component for image recognition. Although various attention-based methods have been proposed and achieved relatively competitive results, it is observed that the semantic features of each class are likely to entangle with each other, and few studies focus on explicitly extracting category-aware features so far. To address this issue, this paper presents an attention-based image recognition method by using class-specific dictionary learning to disentangle the neural network’s outputs into class-dependent features, thus boosting their discrimination abilities. Specifically, we develop a class attention network (CANet) via integrating a simple yet effective class-specific attention encoding (CAE) module on the top of convolutional layers. Given the feature maps of the convolutional neural networks (CNNs), the CAE module learns a class-specific dictionary, which is leveraged to encode attention maps for each category. Then these attention maps are multiplied by the input features for class-wise adaptive feature refinement. Extensive experiments on the PASCAL VOC 2007, PASCAL VOC 2012, MS COCO, and CUB-200-2011 datasets demonstrate the fabulous performance of our method on multiple visual recognition tasks, including multi-label image classification and fine-grained visual classification. In addition, the visualization results testify that CNNs can explicitly learn class-wise feature representations by introducing class-specific dictionary learning.

源语言英语
文章编号132105
期刊Science China Information Sciences
66
3
DOI
出版状态已出版 - 3月 2023

指纹

探究 'Class attention network for image recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此