跳到主要导航 跳到搜索 跳到主要内容

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement

  • Xiangyang Zhu
  • , Renrui Zhang
  • , Bowei He
  • , Aojun Zhou
  • , Dong Wang
  • , Bin Zhao
  • , Peng Gao
  • City University of Hong Kong
  • Chinese University of Hong Kong
  • Shanghai Artificial Intelligence Laboratory

科研成果: 书/报告/会议事项章节会议稿件同行评审

75 引用 (Scopus)

摘要

The popularity of Contrastive Language-Image Pretraining (CLIP) has propelled its application to diverse downstream vision tasks. To improve its capacity on downstream tasks, few-shot learning has become a widelya-dopted technique. However, existing methods either exhibit limited performance or suffer from excessive learnable parameters. In this paper, we propose APE, an Adaptive Prior rEfinement method for CLIP's pre-trained knowledge, which achieves superior accuracy with high computational efficiency. Via a prior refinement module, we analyze the inter-class disparity in the downstream data and decouple the domain-specific knowledge from the CLIP-extracted cache model. On top of that, we introduce two model variants, a training-free APE and a training-required APE-T. We explore the trilateral affinities between the test image, prior cache model, and textual representations, and only enable a lightweight category-residual module to be trained. For the average accuracy over 11 benchmarks, both APE and APE-T attain state-of-the-art and respectively outperform the second-best by +1.59% and +1.99% under 16 shots with ×30 less learnable parameters. Code is available at https://github.com/yangyangyang127/APE.

源语言英语
主期刊名Proceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
出版商Institute of Electrical and Electronics Engineers Inc.
2605-2615
页数11
ISBN(电子版)9798350307184
DOI
出版状态已出版 - 2023
已对外发布
活动2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Paris, 法国
期限: 2 10月 20236 10月 2023

出版系列

姓名Proceedings of the IEEE International Conference on Computer Vision
ISSN(印刷版)1550-5499

会议

会议2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
国家/地区法国
Paris
时期2/10/236/10/23

指纹

探究 'Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement' 的科研主题。它们共同构成独一无二的指纹。

引用此