Instance-Aware Remote Sensing Image Captioning with Cross-Hierarchy Attention

Chengze Wang, Zhiyu Jiang, Yuan Yuan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

15 Citations (Scopus)

Abstract

Spatial attention is a straightforward way to improve the performance of remote sensing image captioning. However, conventional spatial attention approaches consider only the attention distribution over one fixed coarse grid, so the semantics of tiny objects are easily ignored or disturbed during visual feature extraction. Worse still, the fixed semantic level of conventional spatial attention limits image understanding at different levels and from different perspectives, which is critical for handling the huge diversity of remote sensing images. To address these issues, we propose a remote sensing image caption generator with instance awareness and cross-hierarchy attention. 1) Instance awareness is achieved by introducing a multi-level feature architecture that contains the visual information of multi-level instance-possible regions and their surroundings. 2) Based on this multi-level feature extraction, a cross-hierarchy attention mechanism is proposed to prompt the decoder to dynamically focus on different semantic hierarchies and instances at each time step. Experimental results on public datasets demonstrate the superiority of the proposed approach over existing methods.
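
The abstract gives no equations, so the following is only a minimal sketch of one way a decoder could attend across feature hierarchies at a single time step. The two-stage structure (attention within each level, then across levels), the scaled dot-product scoring, and all names (`cross_hierarchy_attention`, `levels`, `hidden`) are assumptions made for illustration, not the authors' implementation.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def weighted_sum(weights, vectors):
    """Convex combination of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(w * vec[i] for w, vec in zip(weights, vectors)) for i in range(dim)]


def cross_hierarchy_attention(levels, hidden):
    """Hypothetical two-stage attention for one decoding step.

    levels: list of feature levels; each level is a list of region vectors
            (for example coarse grid cells or instance-possible regions).
    hidden: decoder hidden state, same dimension as the region vectors.
    """
    scale = math.sqrt(len(hidden))
    region_weights, level_contexts = [], []
    for regions in levels:
        # Stage 1 (assumed): attend over the regions inside one level.
        weights = softmax([dot(r, hidden) / scale for r in regions])
        region_weights.append(weights)
        level_contexts.append(weighted_sum(weights, regions))
    # Stage 2 (assumed): attend over the per-level context vectors.
    level_weights = softmax([dot(c, hidden) / scale for c in level_contexts])
    context = weighted_sum(level_weights, level_contexts)
    return context, level_weights, region_weights


random.seed(0)
dim = 4
# Three assumed levels with 2, 4 and 8 regions, from coarse to fine.
levels = [[[random.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
          for n in (2, 4, 8)]
hidden = [random.gauss(0, 1) for _ in range(dim)]
context, level_weights, region_weights = cross_hierarchy_attention(levels, hidden)
```

In this sketch `level_weights` indicates which hierarchy the decoder favors at the current step, while `region_weights` indicates which regions it favors inside each level. In a real captioning model the features would come from a trained encoder and the scoring function would be learned.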

Original language: English
Title of host publication: 2020 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2020 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 980-983
Number of pages: 4
ISBN (Electronic): 9781728163741
DOI
Publication status: Published - 26 Sep 2020
Event: 2020 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2020 - Virtual, Waikoloa, United States
Duration: 26 Sep 2020 to 2 Oct 2020

Publication series

Name: International Geoscience and Remote Sensing Symposium (IGARSS)

Conference

Conference: 2020 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2020
Country/Territory: United States
City: Virtual, Waikoloa
Period: 26/09/20 to 2/10/20
