跳到主要导航 跳到搜索 跳到主要内容

Top-k Self-Attention in Transformer for Video Inpainting

  • Guanxiao Li
  • , Ke Zhang
  • , Yu Su
  • , Jing Yu Wang
  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Video inpainting restores missing content using global dependencies and relevant non-local frame portions. Recent Transformer-based techniques utilize self-attention mechanisms to establish connections between global patch embeddings. However, due to the scarcity of relevant regions, existing methods end up allocating partial attention weights to a significant number of irrelevant areas. This situation results in a dispersion of dependencies, which negatively impacts modeling accuracy. To address this issue, we introduce a top-k self-attention mechanism specifically designed for Transformer-based video inpainting, which filters out the weights of less relevant regions. This proposed mechanism computes a top-k weight threshold for each missing patch and compels the Transformer to focus on the k most pertinent patch embeddings. As a result, the accuracy of dependency modeling is enhanced, leading to more effective content aggregation for filling in the missing regions. The top-k mechanism is easily integrated into any Transformer-based model, and experiments conducted on the YouTube-VOS and DAVIS datasets show that it significantly improves the model's performance while maintaining high efficiency.

源语言英语
主期刊名2024 5th International Conference on Computer Engineering and Application, ICCEA 2024
出版商Institute of Electrical and Electronics Engineers Inc.
1038-1042
页数5
ISBN(电子版)9798350386776
DOI
出版状态已出版 - 2024
活动5th International Conference on Computer Engineering and Application, ICCEA 2024 - Hybrid, Hangzhou, 中国
期限: 12 4月 202414 4月 2024

出版系列

姓名2024 5th International Conference on Computer Engineering and Application, ICCEA 2024

会议

会议5th International Conference on Computer Engineering and Application, ICCEA 2024
国家/地区中国
Hybrid, Hangzhou
时期12/04/2414/04/24

指纹

探究 'Top-k Self-Attention in Transformer for Video Inpainting' 的科研主题。它们共同构成独一无二的指纹。

引用此