
DR-IAL: Decoupling-to-recoupling guided interaction-aware learning for egocentric action recognition

  • Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

Abstract

In the domain of egocentric action recognition, current auxiliary-supervised static interaction-aware learning methodologies demonstrate considerable shortcomings in addressing inter-individual action variability, temporal dynamics, and the inherent long-tailed distribution characteristics of egocentric datasets, largely attributable to rigid feature aggregation mechanisms. This rigidity leads to challenges in generalization, primarily due to an insufficient range of visual experiences. To address these limitations, we propose the Decoupling-to-Recoupling Guided Interactive-Aware Learning framework with Motion-Prompted Adaptive Fusion (DR-IAL). This novel framework mimics the dynamic plasticity inherent in human visual systems through a cognitive learning paradigm characterized by “Perception – Decoupling – Recoupling”. It utilizes a dual-pathway motion perception approach to capture both temporal and spatial motion cues, thereby enabling the adaptive fusion of multi-level visual tempos. Furthermore, we integrate learnable Gaussian prior knowledge and differentiable thresholded binarization techniques to bolster feature robustness in critical interaction zones while minimizing background noise. Notably, we present a spatiotemporal decoupling-to-recoupling algorithm that separates orthogonal components using attention masks. This algorithm calculates cross-instance similarity matrices to identify challenging “interactive foreground – contextual background” pairs. Additionally, it implements stochastic channel-mixing recoupling in conjunction with spatiotemporal alignment, while maintaining interpretable attention distributions through the application of semantic-level label constraints. Empirical results demonstrate that our approach achieves state-of-the-art performance on established benchmarks, including EGTEA and EPIC-KITCHENS-100.
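The decoupling-to-recoupling steps named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give formulas, so every function name, the steep-sigmoid form of the differentiable binarization, the pooled cosine similarity, the "most similar background from another instance" pairing rule, and the per-channel swap are assumptions made only for illustration. The learnable Gaussian prior, the spatiotemporal alignment, and the label constraints are omitted.

```python
import numpy as np

def soft_binarize(attention, threshold=0.5, steepness=50.0):
    # Assumed form: a steep sigmoid as a differentiable stand-in for
    # the hard test (attention > threshold).
    return 1.0 / (1.0 + np.exp(-steepness * (attention - threshold)))

def decouple(features, mask):
    # Split features into mask-weighted foreground and its complement.
    # The mask must broadcast against features, e.g. (N, 1, H, W) vs (N, C, H, W).
    foreground = features * mask
    background = features * (1.0 - mask)
    return foreground, background

def cross_instance_similarity(fg_batch, bg_batch, eps=1e-8):
    # Cosine similarity between the spatially pooled foreground of instance i
    # and the pooled background of instance j, giving an (N, N) matrix.
    f = fg_batch.mean(axis=(2, 3))
    b = bg_batch.mean(axis=(2, 3))
    f = f / (np.linalg.norm(f, axis=1, keepdims=True) + eps)
    b = b / (np.linalg.norm(b, axis=1, keepdims=True) + eps)
    return f @ b.T

def hardest_partners(similarity):
    # Assumed pairing rule: for each foreground, pick the other instance
    # whose background is most similar to it (self-pairs excluded).
    s = similarity.copy()
    np.fill_diagonal(s, -np.inf)
    return s.argmax(axis=1)

def recouple(fg_i, bg_i, bg_j, mix_prob, rng):
    # Stochastic channel mixing: each channel's background comes from the
    # partner instance j with probability mix_prob, otherwise from instance i.
    swap = rng.random(fg_i.shape[0]) < mix_prob
    mixed_bg = np.where(swap[:, None, None], bg_j, bg_i)
    return fg_i + mixed_bg

# Toy usage on random data: 3 instances, 4 channels, 5x5 feature maps.
rng = np.random.default_rng(0)
features = rng.random((3, 4, 5, 5))
attention = rng.random((3, 5, 5))
mask = soft_binarize(attention)[:, None]   # shape (3, 1, 5, 5)
fg, bg = decouple(features, mask)
partners = hardest_partners(cross_instance_similarity(fg, bg))
augmented = np.stack([
    recouple(fg[i], bg[i], bg[partners[i]], mix_prob=0.5, rng=rng)
    for i in range(len(features))
])
```

In this sketch the foreground and background always sum back to the original features, so setting `mix_prob` to 0 reproduces the input, while larger values place the same interaction region in progressively more foreign context.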

Original language: English
Article number: 112731
Journal: Pattern Recognition
Volume: 172
DOI
Publication status: Published - Apr 2026
