TY - JOUR
T1 - Learning motion-guided salience features for weakly supervised group activity recognition
AU - Du, Zexing
AU - Wang, Qing
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/10/15
Y1 - 2025/10/15
N2 - This paper focuses on exploring motion-guided features for weakly supervised group activity recognition (GAR). Unlike existing GAR methods that simply squeeze extracted tokens or individual features into a single vector by global pooling, limiting their ability to sufficiently represent spatial and temporal salience features in videos, we propose a Motion-Guided Network (MGN) to capture crucial motion contextual information in videos. First, we embed local correlations between the feature maps of adjacent frames to extract motion features in activities. Then, unlike previous works that simply aggregate motion and appearance features by addition or concatenation, MGN uses motion representations to guide the extraction of temporal and spatial features. We have evaluated the proposed method on sports and group activity videos. Extensive experimental results verify the effectiveness of our method. Furthermore, our method has also outperformed some approaches trained with stronger supervision in the comparative evaluation.
AB - This paper focuses on exploring motion-guided features for weakly supervised group activity recognition (GAR). Unlike existing GAR methods that simply squeeze extracted tokens or individual features into a single vector by global pooling, limiting their ability to sufficiently represent spatial and temporal salience features in videos, we propose a Motion-Guided Network (MGN) to capture crucial motion contextual information in videos. First, we embed local correlations between the feature maps of adjacent frames to extract motion features in activities. Then, unlike previous works that simply aggregate motion and appearance features by addition or concatenation, MGN uses motion representations to guide the extraction of temporal and spatial features. We have evaluated the proposed method on sports and group activity videos. Extensive experimental results verify the effectiveness of our method. Furthermore, our method has also outperformed some approaches trained with stronger supervision in the comparative evaluation.
KW - Group activity recognition
KW - Motion-guided representations
KW - Spatial–temporal salience features
KW - Weakly supervised learning
UR - http://www.scopus.com/inward/record.url?scp=105008820197&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2025.111437
DO - 10.1016/j.engappai.2025.111437
M3 - 文章
AN - SCOPUS:105008820197
SN - 0952-1976
VL - 158
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 111437
ER -