SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer

Wenqi Zhong, Linzhi Yu, Chen Xia, Junwei Han, Dingwen Zhang

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

5 Citations (Scopus)

Abstract

Saccadic scanpath, a data representation of human visual behavior, has received broad interest in multiple domains. Scanpath is a complex eye-tracking data modality that includes the sequences of fixation positions and fixation duration, coupled with image information. However, previous methods usually face the spatial misalignment problem of fixation features and loss of critical temporal data (including temporal correlation and fixation duration). In this study, we propose a Transformer-based scanpath model, SpFormer, to alleviate these problems. First, we propose a fixation-centric paradigm to extract the aligned spatial fixation features and tokenize the scanpaths. Then, according to the visual working memory mechanism, we design a local meta attention to reduce the semantic redundancy of fixations and guide the model to focus on the meta scanpath. Finally, we progressively integrate the duration information and fuse it with the fixation features to solve the problem of ambiguous location with the Transformer block increasing. We conduct extensive experiments on four databases under three tasks. The SpFormer establishes new state-of-the-art results in distinct settings, verifying its flexibility and versatility in practical applications. The code can be obtained from https://github.com/wenqizhong/SpFormer.

Original language: English
Title of host publication: Technical Tracks 14
Editors: Michael Wooldridge, Jennifer Dy, Sriraam Natarajan
Publisher: Association for the Advancement of Artificial Intelligence
Pages: 7605-7613
Number of pages: 9
Edition: 7
ISBN (electronic): 1577358872, 9781577358879
DOI
Publication status: Published - 25 Mar 2024
Event: 38th AAAI Conference on Artificial Intelligence, AAAI 2024 - Vancouver, Canada
Duration: 20 Feb 2024 to 27 Feb 2024

Publication series

Name: Proceedings of the AAAI Conference on Artificial Intelligence
Number: 7
Volume: 38
ISSN (print): 2159-5399
ISSN (electronic): 2374-3468

Conference

Conference: 38th AAAI Conference on Artificial Intelligence, AAAI 2024
Country/Territory: Canada
City: Vancouver
Period: 20/02/24 to 27/02/24
