Multi-modal Spatio-temporal Transformer for Defect Recognition of Substation Equipment

Yiyang Yao, Zexing Du, Xue Wang, Qing Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The utilization of multi-spectral imaging, such as infrared, visible light, and ultraviolet, for recognizing defects in electrical equipment mostly focuses on static measurements and lacks exploration into the dynamic process of defect development. To better exploit dynamic measurements, this paper proposes a novel defect recognition method using tri-spectral videos. Specifically, a multi-modal spatio-temporal Transformer is presented to effectively decompose spatio-temporal features present in various modalities. Besides, a spatio-temporal multi-modal contrastive loss is introduced for self-supervised learning. By aligning extracted features both spatially and temporally across modalities, this loss helps mitigate confusion between modalities and improve the discriminative capacity of learned representations. To evaluate the proposed method, we self-collect a tri-spectral dataset, TROPED, which covers a wide range of dynamic defects in operational substation equipment, and benchmark results on the dataset. Experimental results demonstrate the effectiveness and robustness of the proposed method against other state-of-the-art methods.

Original languageEnglish
Title of host publicationArtificial Intelligence and Robotics - 9th International Symposium, ISAIR 2024, Revised Selected Papers
EditorsHuimin Lu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages211-222
Number of pages12
ISBN (Print)9789819629138
DOIs
StatePublished - 2025
Event9th International Symposium on Artificial Intelligence and Robotics, ISAIR 2024 - Guilin, China
Duration: 27 Sep 202430 Sep 2024

Publication series

NameCommunications in Computer and Information Science
Volume2403 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference9th International Symposium on Artificial Intelligence and Robotics, ISAIR 2024
Country/TerritoryChina
CityGuilin
Period27/09/2430/09/24

Keywords

  • Defect Recognition
  • Dynamic Measurements
  • Electrical Equipment
  • Multi-modal Spatio-temporal Learning
  • Self-supervised Learning

Fingerprint

Dive into the research topics of 'Multi-modal Spatio-temporal Transformer for Defect Recognition of Substation Equipment'. Together they form a unique fingerprint.

Cite this