Scene classification with recurrent attention of VHR remote sensing images

Qi Wang, Shaoteng Liu, Jocelyn Chanussot, Xuelong Li

Research output: Contribution to journal › Article › peer-review

558 Scopus citations

Abstract

Scene classification of remote sensing images has drawn great attention because of its wide range of applications. In this paper, guided by the human visual system (HVS), we explore the attention mechanism and propose a novel end-to-end attention recurrent convolutional network (ARCNet) for scene classification. It learns to focus selectively on a few key regions or locations and to process only their high-level features, thereby discarding noncritical information and improving classification performance. The contributions of this paper are threefold. First, we design a novel recurrent attention structure that squeezes high-level semantic and spatial features into several simplex vectors, reducing the number of learning parameters. Second, we propose an end-to-end network named ARCNet that adaptively selects a series of attention regions and then generates powerful predictions by learning to process them sequentially. Third, we construct a new data set named OPTIMAL-31, which contains more categories than popular data sets and gives researchers an additional platform on which to validate their algorithms. The experimental results demonstrate that our model achieves notable improvements over state-of-the-art approaches.
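To make the recurrent attention idea concrete, the sketch below outlines one plausible realization in PyTorch: a CNN backbone produces a spatial feature map, an LSTM repeatedly attends over its locations, and the final hidden state yields the scene label. The class name, glimpse count, backbone choice, and additive attention form are illustrative assumptions for this sketch, not the authors' exact ARCNet.

```python
import torch
import torch.nn as nn
import torchvision.models as models

class RecurrentAttentionSceneClassifier(nn.Module):
    """Minimal sketch (assumed design, not the published ARCNet) of an
    attention-recurrent CNN for scene classification: a CNN backbone extracts
    a spatial feature map, an LSTM attends over its locations for a few
    glimpses, and the final hidden state drives the class prediction."""

    def __init__(self, num_classes=31, num_glimpses=4, hidden_dim=512):
        super().__init__()
        self.backbone = models.vgg16(weights=None).features  # (B, 512, H, W) feature map
        self.feat_dim = 512
        self.num_glimpses = num_glimpses
        self.lstm = nn.LSTMCell(self.feat_dim, hidden_dim)
        self.attn = nn.Linear(hidden_dim + self.feat_dim, 1)  # additive attention score
        self.classifier = nn.Linear(hidden_dim, num_classes)

    def forward(self, images):
        feats = self.backbone(images)                      # (B, C, H, W)
        b, c, h, w = feats.shape
        feats = feats.view(b, c, h * w).permute(0, 2, 1)   # (B, HW, C)

        hx = feats.new_zeros(b, self.lstm.hidden_size)
        cx = feats.new_zeros(b, self.lstm.hidden_size)
        for _ in range(self.num_glimpses):
            # Score every spatial location against the current hidden state.
            query = hx.unsqueeze(1).expand(-1, h * w, -1)            # (B, HW, hidden)
            scores = self.attn(torch.cat([query, feats], dim=-1))    # (B, HW, 1)
            weights = torch.softmax(scores, dim=1)
            glimpse = (weights * feats).sum(dim=1)                   # (B, C) attended vector
            hx, cx = self.lstm(glimpse, (hx, cx))

        return self.classifier(hx)                         # (B, num_classes) logits

# Example: classify a batch of two 224x224 RGB scene images.
model = RecurrentAttentionSceneClassifier(num_classes=31)
logits = model(torch.randn(2, 3, 224, 224))
print(logits.shape)  # torch.Size([2, 31])
```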

Original language: English
Article number: 8454883
Pages (from-to): 1155-1167
Number of pages: 13
Journal: IEEE Transactions on Geoscience and Remote Sensing
Volume: 57
Issue number: 2
DOIs
State: Published - Feb 2019

Keywords

  • Attention
  • convolutional neural network (CNN)
  • deep learning
  • long short-term memory (LSTM)
  • recurrent neural networks (RNN)
  • remote sensing
  • scene classification
