Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Yating Yu, Congqi Cao, Yueran Zhang, Qinyi Lv, Lingtong Min, Yanning Zhang

Research output: Contribution to journalConference articlepeer-review

Fingerprint

Dive into the research topics of 'Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP'. Together they form a unique fingerprint.

Computer Science