TY - GEN
T1 - Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning
AU - Yan, Qingsen
AU - Gong, Dong
AU - Liu, Yuhang
AU - van den Hengel, Anton
AU - Shi, Javen Qinfeng
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Continual Learning (CL) methods aim to enable machine learning models to learn new tasks without catastrophic forgetting of those that have been previously mastered. Existing CL approaches often keep a buffer of previously seen samples, perform knowledge distillation, or apply regularization techniques towards this goal. Despite their performance, they still suffer from interference across tasks, which leads to catastrophic forgetting. To ameliorate this problem, we propose to activate and select only sparse neurons for learning current and past tasks at any stage, so that more parameter space and model capacity can be reserved for future tasks. This minimizes the interference between parameters for different tasks. To this end, we propose a Sparse neural Network for Continual Learning (SNCL), which employs variational Bayesian sparsity priors on the activations of the neurons in all layers. Full Experience Replay (FER) provides effective supervision for learning the sparse activations of the neurons in different layers. A loss-aware reservoir-sampling strategy is developed to maintain the memory buffer. The proposed method is agnostic to the network structure and the task boundaries. Experiments on different datasets show that SNCL achieves state-of-the-art results in mitigating forgetting.
KW - Deep learning architectures and techniques
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85141793828&partnerID=8YFLogxK
U2 - 10.1109/CVPR52688.2022.00021
DO - 10.1109/CVPR52688.2022.00021
M3 - Conference contribution
AN - SCOPUS:85141793828
T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 109
EP - 118
BT - Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
PB - IEEE Computer Society
T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Y2 - 19 June 2022 through 24 June 2022
ER -