RIFormer: Learning Rotation-Invariant Features Via Transformer

Chao Song; Shaohui Mei; Mingyang Ma

doi:10.1109/IGARSS52108.2023.10282204

RIFormer: Learning Rotation-Invariant Features Via Transformer

Chao Song, Shaohui Mei, Mingyang Ma

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Recently, Transformers have been widely used in many computer vision tasks and have shown promising results. However, like convolutional neural networks (CNNs), Transformers cannot handle rotational variations well, thus hindering its further application in the field of remote sensing. In this paper, we design a rotation-invariant Transformer (RIFormer) to alleviate the abovementioned problem. Moreover, we propose a novel rotation-invariant position embedding (RIPE) to encode positional information of features, and this position-dependent features learned by RIPE is robust to rotations. The experimental results show that proposed RIFormer with RIPE can effectively learn rotation-invariant features compared to the state-of-the-art methods with limited parameters. We provide an open-source implementation of our method. It is publicly available at https://github.com/psychAo/RIFormer.

Original language	English
Title of host publication	IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	5399-5402
Number of pages	4
ISBN (Electronic)	9798350320107
DOIs	https://doi.org/10.1109/IGARSS52108.2023.10282204
State	Published - 2023
Event	2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023 - Pasadena, United States Duration: 16 Jul 2023 → 21 Jul 2023

Publication series

Name	International Geoscience and Remote Sensing Symposium (IGARSS)
Volume	2023-July

Conference

Conference	2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023
Country/Territory	United States
City	Pasadena
Period	16/07/23 → 21/07/23

Keywords

feature learning
position embedding
remote sensing
rotation-invariant
Transformer

Access to Document

10.1109/IGARSS52108.2023.10282204

Cite this

Song, C., Mei, S., & Ma, M. (2023). RIFormer: Learning Rotation-Invariant Features Via Transformer. In IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings (pp. 5399-5402). (International Geoscience and Remote Sensing Symposium (IGARSS); Vol. 2023-July). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IGARSS52108.2023.10282204

@inproceedings{c89c043284df45b3b2aa92a031b78762,

title = "RIFormer: Learning Rotation-Invariant Features Via Transformer",

abstract = "Recently, Transformers have been widely used in many computer vision tasks and have shown promising results. However, like convolutional neural networks (CNNs), Transformers cannot handle rotational variations well, thus hindering its further application in the field of remote sensing. In this paper, we design a rotation-invariant Transformer (RIFormer) to alleviate the abovementioned problem. Moreover, we propose a novel rotation-invariant position embedding (RIPE) to encode positional information of features, and this position-dependent features learned by RIPE is robust to rotations. The experimental results show that proposed RIFormer with RIPE can effectively learn rotation-invariant features compared to the state-of-the-art methods with limited parameters. We provide an open-source implementation of our method. It is publicly available at https://github.com/psychAo/RIFormer.",

keywords = "feature learning, position embedding, remote sensing, rotation-invariant, Transformer",

author = "Chao Song and Shaohui Mei and Mingyang Ma",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023 ; Conference date: 16-07-2023 Through 21-07-2023",

year = "2023",

doi = "10.1109/IGARSS52108.2023.10282204",

language = "英语",

series = "International Geoscience and Remote Sensing Symposium (IGARSS)",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "5399--5402",

booktitle = "IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings",

}

Song, C, Mei, S & Ma, M 2023, RIFormer: Learning Rotation-Invariant Features Via Transformer. in IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings. International Geoscience and Remote Sensing Symposium (IGARSS), vol. 2023-July, Institute of Electrical and Electronics Engineers Inc., pp. 5399-5402, 2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023, Pasadena, United States, 16/07/23. https://doi.org/10.1109/IGARSS52108.2023.10282204

RIFormer: Learning Rotation-Invariant Features Via Transformer. / Song, Chao; Mei, Shaohui; Ma, Mingyang.
IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings. Institute of Electrical and Electronics Engineers Inc., 2023. p. 5399-5402 (International Geoscience and Remote Sensing Symposium (IGARSS); Vol. 2023-July).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - RIFormer

T2 - 2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023

AU - Song, Chao

AU - Mei, Shaohui

AU - Ma, Mingyang

PY - 2023

Y1 - 2023

N2 - Recently, Transformers have been widely used in many computer vision tasks and have shown promising results. However, like convolutional neural networks (CNNs), Transformers cannot handle rotational variations well, thus hindering its further application in the field of remote sensing. In this paper, we design a rotation-invariant Transformer (RIFormer) to alleviate the abovementioned problem. Moreover, we propose a novel rotation-invariant position embedding (RIPE) to encode positional information of features, and this position-dependent features learned by RIPE is robust to rotations. The experimental results show that proposed RIFormer with RIPE can effectively learn rotation-invariant features compared to the state-of-the-art methods with limited parameters. We provide an open-source implementation of our method. It is publicly available at https://github.com/psychAo/RIFormer.

AB - Recently, Transformers have been widely used in many computer vision tasks and have shown promising results. However, like convolutional neural networks (CNNs), Transformers cannot handle rotational variations well, thus hindering its further application in the field of remote sensing. In this paper, we design a rotation-invariant Transformer (RIFormer) to alleviate the abovementioned problem. Moreover, we propose a novel rotation-invariant position embedding (RIPE) to encode positional information of features, and this position-dependent features learned by RIPE is robust to rotations. The experimental results show that proposed RIFormer with RIPE can effectively learn rotation-invariant features compared to the state-of-the-art methods with limited parameters. We provide an open-source implementation of our method. It is publicly available at https://github.com/psychAo/RIFormer.

KW - feature learning

KW - position embedding

KW - remote sensing

KW - rotation-invariant

KW - Transformer

UR - http://www.scopus.com/inward/record.url?scp=85178364735&partnerID=8YFLogxK

U2 - 10.1109/IGARSS52108.2023.10282204

DO - 10.1109/IGARSS52108.2023.10282204

M3 - 会议稿件

AN - SCOPUS:85178364735

T3 - International Geoscience and Remote Sensing Symposium (IGARSS)

SP - 5399

EP - 5402

BT - IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 16 July 2023 through 21 July 2023

ER -

RIFormer: Learning Rotation-Invariant Features Via Transformer

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this