摘要
The popularity of pre-trained large models has revolutionized downstream tasks across diverse fields, such as language, vision, and multi-modality. To minimize the adaption cost for downstream tasks, many Parameter-Efficient Fine-Tuning (PEFT) techniques are proposed for language and 2D image pre-trained models. However, the specialized PEFT method for 3D pre-trained models is still under-explored. To this end, we introduce Point-PEFT, a novel framework for adapting point cloud pre-trained models with minimal learnable parameters. Specifically, for a pre-trained 3D model, we freeze most of its parameters, and only tune the newly added PEFT modules on downstream tasks, which consist of a Point-prior Prompt and a Geometry-aware Adapter. The Point-prior Prompt adopts a set of learnable prompt tokens, for which we propose to construct a memory bank with domain-specific knowledge, and utilize a parameter-free attention to enhance the prompt tokens. The Geometry-aware Adapter aims to aggregate point cloud features within spatial neighborhoods to capture fine-grained geometric information through local interactions. Extensive experiments indicate that our Point-PEFT can achieve better performance than the full fine-tuning on various downstream tasks, while using only 5% of the trainable parameters, demonstrating the efficiency and effectiveness of our approach. Code is released at https://github.com/Ivan-Tang-3D/Point-PEFT.
| 源语言 | 英语 |
|---|---|
| 页(从-至) | 5171-5179 |
| 页数 | 9 |
| 期刊 | Proceedings of the AAAI Conference on Artificial Intelligence |
| 卷 | 38 |
| 期 | 6 |
| DOI | |
| 出版状态 | 已出版 - 25 3月 2024 |
| 活动 | 38th AAAI Conference on Artificial Intelligence, AAAI 2024 - Vancouver, 加拿大 期限: 20 2月 2024 → 27 2月 2024 |
指纹
探究 'Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver