Scene classification with recurrent attention of VHR remote sensing images

Qi Wang; Shaoteng Liu; Jocelyn Chanussot; Xuelong Li

doi:10.1109/TGRS.2018.2864987

Institute of Optoelectronics and Intelligence (光电与智能研究院)

Research output: Contribution to journal › Article › peer-review

559 Citations (Scopus)

Abstract

Scene classification of remote sensing images has drawn great attention because of its wide applications. In this paper, with the guidance of the human visual system (HVS), we explore the attention mechanism and propose a novel end-to-end attention recurrent convolutional network (ARCNet) for scene classification. It can learn to focus selectively on some key regions or locations and just process them at high-level features, thereby discarding the noncritical information and promoting the classification performance. The contributions of this paper are threefold. First, we design a novel recurrent attention structure to squeeze high-level semantic and spatial features into several simplex vectors for the reduction of learning parameters. Second, an end-to-end network named ARCNet is proposed to adaptively select a series of attention regions and then to generate powerful predictions by learning to process them sequentially. Third, we construct a new data set named OPTIMAL-31, which contains more categories than popular data sets and gives researchers an extra platform to validate their algorithms. The experimental results demonstrate that our model makes great promotion in comparison with the state-of-the-art approaches.

Original language	English
Article number	8454883
Pages (from-to)	1155-1167
Number of pages	13
Journal	IEEE Transactions on Geoscience and Remote Sensing
Volume	57
Issue number	2
DOI	https://doi.org/10.1109/TGRS.2018.2864987
Publication status	Published - Feb 2019

Cite this

@article{189e2db7252d495aab339ff1bfd3429a,

title = "Scene classification with recurrent attention of VHR remote sensing images",

abstract = "Scene classification of remote sensing images has drawn great attention because of its wide applications. In this paper, with the guidance of the human visual system (HVS), we explore the attention mechanism and propose a novel end-to-end attention recurrent convolutional network (ARCNet) for scene classification. It can learn to focus selectively on some key regions or locations and just process them at high-level features, thereby discarding the noncritical information and promoting the classification performance. The contributions of this paper are threefold. First, we design a novel recurrent attention structure to squeeze high-level semantic and spatial features into several simplex vectors for the reduction of learning parameters. Second, an end-to-end network named ARCNet is proposed to adaptively select a series of attention regions and then to generate powerful predictions by learning to process them sequentially. Third, we construct a new data set named OPTIMAL-31, which contains more categories than popular data sets and gives researchers an extra platform to validate their algorithms. The experimental results demonstrate that our model makes great promotion in comparison with the state-of-the-art approaches.",

keywords = "Attention, convolutional neural network (CNN), deep learning, long short-term memory (LSTM), recurrent neural networks (RNN), remote sensing, scene classification",

author = "Qi Wang and Shaoteng Liu and Jocelyn Chanussot and Xuelong Li",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.",

year = "2019",

month = feb,

doi = "10.1109/TGRS.2018.2864987",

language = "English",

volume = "57",

pages = "1155--1167",

journal = "IEEE Transactions on Geoscience and Remote Sensing",

issn = "0196-2892",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "2",

}

TY - JOUR

T1 - Scene classification with recurrent attention of VHR remote sensing images

AU - Wang, Qi

AU - Liu, Shaoteng

AU - Chanussot, Jocelyn

AU - Li, Xuelong

PY - 2019/2

Y1 - 2019/2

N2 - Scene classification of remote sensing images has drawn great attention because of its wide applications. In this paper, with the guidance of the human visual system (HVS), we explore the attention mechanism and propose a novel end-to-end attention recurrent convolutional network (ARCNet) for scene classification. It can learn to focus selectively on some key regions or locations and just process them at high-level features, thereby discarding the noncritical information and promoting the classification performance. The contributions of this paper are threefold. First, we design a novel recurrent attention structure to squeeze high-level semantic and spatial features into several simplex vectors for the reduction of learning parameters. Second, an end-to-end network named ARCNet is proposed to adaptively select a series of attention regions and then to generate powerful predictions by learning to process them sequentially. Third, we construct a new data set named OPTIMAL-31, which contains more categories than popular data sets and gives researchers an extra platform to validate their algorithms. The experimental results demonstrate that our model makes great promotion in comparison with the state-of-the-art approaches.

AB - Scene classification of remote sensing images has drawn great attention because of its wide applications. In this paper, with the guidance of the human visual system (HVS), we explore the attention mechanism and propose a novel end-to-end attention recurrent convolutional network (ARCNet) for scene classification. It can learn to focus selectively on some key regions or locations and just process them at high-level features, thereby discarding the noncritical information and promoting the classification performance. The contributions of this paper are threefold. First, we design a novel recurrent attention structure to squeeze high-level semantic and spatial features into several simplex vectors for the reduction of learning parameters. Second, an end-to-end network named ARCNet is proposed to adaptively select a series of attention regions and then to generate powerful predictions by learning to process them sequentially. Third, we construct a new data set named OPTIMAL-31, which contains more categories than popular data sets and gives researchers an extra platform to validate their algorithms. The experimental results demonstrate that our model makes great promotion in comparison with the state-of-the-art approaches.

KW - Attention

KW - convolutional neural network (CNN)

KW - deep learning

KW - long short-term memory (LSTM)

KW - recurrent neural networks (RNN)

KW - remote sensing

KW - scene classification

UR - http://www.scopus.com/inward/record.url?scp=85052862263&partnerID=8YFLogxK

U2 - 10.1109/TGRS.2018.2864987

DO - 10.1109/TGRS.2018.2864987

M3 - Article

AN - SCOPUS:85052862263

SN - 0196-2892

VL - 57

SP - 1155

EP - 1167

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

IS - 2

M1 - 8454883

ER -
