
Training Consistent Mixture-of-Experts-Based Prompt Generator for Continual Learning

  • Yue Lu
  • Shizhou Zhang
  • De Cheng
  • Guoqiang Liang
  • Yinghui Xing
  • Nannan Wang
  • Yanning Zhang
  • Northwestern Polytechnical University Xian
  • Xidian University

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

10 Citations (Scopus)

Abstract

Visual prompt tuning-based continual learning (CL) methods have shown promising performance in exemplar-free scenarios, where their key component can be viewed as a prompt generator. Existing approaches generally rely on freezing old prompts, slow updating, and task discrimination for prompt generators to preserve stability and minimize forgetting. In contrast, we introduce a novel approach that trains a consistent prompt generator to ensure stability during CL. Consistency means that for any instance from an old task, its corresponding instance-aware prompt generated by the prompt generator remains consistent even as the generator continually updates in a new task. This ensures that the representation of a specific instance remains stable across tasks and thereby prevents forgetting. We employ a mixture of experts (MoE) as the prompt generator, which contains a router and multiple experts. By deriving conditions sufficient to achieve consistency for the MoE prompt generator, we demonstrate that during training in a new task, if the router and experts update in the directions orthogonal to the subspaces spanned by old input features and gating vectors, respectively, consistency can be theoretically guaranteed. To implement this orthogonality, we project parameter gradients onto those orthogonal directions using orthogonal projection matrices computed via the null space method. Extensive experiments on four class-incremental learning benchmarks validate the effectiveness and superiority of our approach.
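The null-space projection mentioned in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes a single linear layer, random stand-in data, and a hypothetical helper `null_space_projector` with an arbitrary rank tolerance. It only shows that a gradient step projected onto the orthogonal complement of the old features leaves the layer's outputs on those features unchanged.

```python
import numpy as np


def null_space_projector(features, tol=1e-8):
    """Hypothetical helper: projector onto the orthogonal complement
    of the subspace spanned by the rows of `features` (n_samples, dim)."""
    # Rows of vt beyond the numerical rank span the null space of `features`.
    _, s, vt = np.linalg.svd(features, full_matrices=True)
    rank = int(np.sum(s > tol * s.max()))
    null_basis = vt[rank:].T            # shape (dim, dim - rank)
    return null_basis @ null_basis.T    # symmetric, shape (dim, dim)


rng = np.random.default_rng(0)
dim, n_old, n_out = 8, 3, 4

# Assumed stand-ins: old-task input features, layer weights, new-task gradient.
old_features = rng.normal(size=(n_old, dim))
weight = rng.normal(size=(n_out, dim))
grad = rng.normal(size=(n_out, dim))

P = null_space_projector(old_features)
projected_grad = grad @ P               # keep only directions orthogonal to old features
new_weight = weight - 0.1 * projected_grad

old_out_before = old_features @ weight.T
old_out_after = old_features @ new_weight.T
print(np.allclose(old_out_before, old_out_after))
```

In the setting the abstract describes, one would presumably build one such projector for the router from old input features and another for the experts from old gating vectors; how those statistics are collected and how the null space is approximated when the features are full rank are details this sketch leaves out.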

Original language: English
Title of host publication: Special Track on AI Alignment
Editors: Toby Walsh, Julie Shah, Zico Kolter
Publisher: Association for the Advancement of Artificial Intelligence
Pages: 18915-18923
Number of pages: 9
Edition: 18
ISBN (electronic): 157735897X, 9781577358978
DOI
Publication status: Published - 11 Apr 2025
Event: 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 - Philadelphia, United States
Duration: 25 Feb 2025 to 4 Mar 2025

Publication series

Name: Proceedings of the AAAI Conference on Artificial Intelligence
Number: 18
Volume: 39
ISSN (print): 2159-5399
ISSN (electronic): 2374-3468

Conference

Conference: 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
Country/Territory: United States
City: Philadelphia
Period: 25/02/25 to 4/03/25
