Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Yating Yu, Congqi Cao, Yueran Zhang, Qinyi Lv, Lingtong Min, Yanning Zhang

Research output: Contribution to journal › Conference article › peer-review

Fingerprint

Dive into the research topics of 'Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP'. Together they form a unique fingerprint.

Computer Science