RIFormer: Learning Rotation-Invariant Features Via Transformer

Chao Song, Shaohui Mei, Mingyang Ma

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recently, Transformers have been widely used in many computer vision tasks and have shown promising results. However, like convolutional neural networks (CNNs), Transformers cannot handle rotational variations well, thus hindering its further application in the field of remote sensing. In this paper, we design a rotation-invariant Transformer (RIFormer) to alleviate the abovementioned problem. Moreover, we propose a novel rotation-invariant position embedding (RIPE) to encode positional information of features, and this position-dependent features learned by RIPE is robust to rotations. The experimental results show that proposed RIFormer with RIPE can effectively learn rotation-invariant features compared to the state-of-the-art methods with limited parameters. We provide an open-source implementation of our method. It is publicly available at https://github.com/psychAo/RIFormer.

源语言英语
主期刊名IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
5399-5402
页数4
ISBN(电子版)9798350320107
DOI
出版状态已出版 - 2023
活动2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023 - Pasadena, 美国
期限: 16 7月 202321 7月 2023

出版系列

姓名International Geoscience and Remote Sensing Symposium (IGARSS)
2023-July

会议

会议2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023
国家/地区美国
Pasadena
时期16/07/2321/07/23

指纹

探究 'RIFormer: Learning Rotation-Invariant Features Via Transformer' 的科研主题。它们共同构成独一无二的指纹。

引用此