Multi-scale cropping mechanism for remote sensing image captioning

Xueting Zhang, Qi Wang, Shangdong Chen, Xuelong Li

科研成果: 会议稿件论文同行评审

40 引用 (Scopus)

摘要

With the rapid development of artificial satellite, a large number of high resolution remote sensing images can be easily obtained now. Recently, remote sensing image captioning, which aims to generate accurate and concise descriptive sentences for remote sensing images, has been promoted by template-based model and encoder-decoder model with several related datasets released. Based on an encoder-decoder model, we propose a training mechanism of multi-scale cropping for remote sensing image captioning in this paper, which can extract more fine-grained information from remote sensing images and enhance the generalization performance of the base model. The experimental results on two datasets UCM-captions and Sydney-captions demonstrate that the proposed approach availably improves the performances in describing high resolution remote sensing images.

源语言英语
10039-10042
页数4
DOI
出版状态已出版 - 2019
活动39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 - Yokohama, 日本
期限: 28 7月 20192 8月 2019

会议

会议39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019
国家/地区日本
Yokohama
时期28/07/192/08/19

指纹

探究 'Multi-scale cropping mechanism for remote sensing image captioning' 的科研主题。它们共同构成独一无二的指纹。

引用此