Prior-knowledge and attention based meta-learning for few-shot learning

Yunxiao Qin; Weiguo Zhang; Chenxu Zhao; Zezheng Wang; Xiangyu Zhu; Jingping Shi; Guojun Qi; Zhen Lei

doi:10.1016/j.knosys.2020.106609

School of Automation

Research output: Contribution to journal › Article › peer-review

25 Scopus citations

Abstract

Recently, meta-learning has been shown to be a promising way to solve few-shot learning. In this paper, inspired by the human cognition process, which utilizes both prior-knowledge and visual attention when learning new knowledge, we present a novel paradigm of meta-learning approach that capitalizes on three developments to introduce attention mechanism and prior-knowledge to meta-learning. In our approach, prior-knowledge is responsible for helping the meta-learner express the input data in a high-level representation space, and the attention mechanism enables the meta-learner to focus on key data features in the representation space. Compared with the existing meta-learning approaches that pay little attention to prior-knowledge and visual attention, our approach alleviates the meta-learner's few-shot cognition burden. Furthermore, we discover a Task-Over-Fitting (TOF) problem, which indicates that the meta-learner has poor generalization across different K-shot learning tasks. To model the TOF problem, we propose a novel Cross-Entropy across Tasks (CET) metric. Extensive experiments demonstrate that our techniques improve the meta-learner to state-of-the-art performance on several few-shot learning benchmarks while also substantially alleviating the TOF problem.

Original language	English
Article number	106609
Journal	Knowledge-Based Systems
Volume	213
DOIs	https://doi.org/10.1016/j.knosys.2020.106609
State	Published - 15 Feb 2021

Keywords

Attention mechanism
Few-shot learning
Meta-learning
Prior-knowledge
Representation

Cite this

@article{fe3b8ac9de4748c08c54030325018934,

title = "Prior-knowledge and attention based meta-learning for few-shot learning",

abstract = "Recently, meta-learning has been shown to be a promising way to solve few-shot learning. In this paper, inspired by the human cognition process, which utilizes both prior-knowledge and visual attention when learning new knowledge, we present a novel paradigm of meta-learning approach that capitalizes on three developments to introduce attention mechanism and prior-knowledge to meta-learning. In our approach, prior-knowledge is responsible for helping the meta-learner express the input data in a high-level representation space, and the attention mechanism enables the meta-learner to focus on key data features in the representation space. Compared with the existing meta-learning approaches that pay little attention to prior-knowledge and visual attention, our approach alleviates the meta-learner's few-shot cognition burden. Furthermore, we discover a Task-Over-Fitting (TOF) problem, which indicates that the meta-learner has poor generalization across different K-shot learning tasks. To model the TOF problem, we propose a novel Cross-Entropy across Tasks (CET) metric. Extensive experiments demonstrate that our techniques improve the meta-learner to state-of-the-art performance on several few-shot learning benchmarks while also substantially alleviating the TOF problem.",

keywords = "Attention mechanism, Few-shot learning, Meta-learning, Prior-knowledge, Representation",

author = "Yunxiao Qin and Weiguo Zhang and Chenxu Zhao and Zezheng Wang and Xiangyu Zhu and Jingping Shi and Guojun Qi and Zhen Lei",

note = "Publisher Copyright: {\textcopyright} 2020 Elsevier B.V.",

year = "2021",

month = feb,

day = "15",

doi = "10.1016/j.knosys.2020.106609",

language = "English",

volume = "213",

journal = "Knowledge-Based Systems",

issn = "0950-7051",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Prior-knowledge and attention based meta-learning for few-shot learning

AU - Qin, Yunxiao

AU - Zhang, Weiguo

AU - Zhao, Chenxu

AU - Wang, Zezheng

AU - Zhu, Xiangyu

AU - Shi, Jingping

AU - Qi, Guojun

AU - Lei, Zhen

PY - 2021/2/15

Y1 - 2021/2/15

N2 - Recently, meta-learning has been shown to be a promising way to solve few-shot learning. In this paper, inspired by the human cognition process, which utilizes both prior-knowledge and visual attention when learning new knowledge, we present a novel paradigm of meta-learning approach that capitalizes on three developments to introduce attention mechanism and prior-knowledge to meta-learning. In our approach, prior-knowledge is responsible for helping the meta-learner express the input data in a high-level representation space, and the attention mechanism enables the meta-learner to focus on key data features in the representation space. Compared with the existing meta-learning approaches that pay little attention to prior-knowledge and visual attention, our approach alleviates the meta-learner's few-shot cognition burden. Furthermore, we discover a Task-Over-Fitting (TOF) problem, which indicates that the meta-learner has poor generalization across different K-shot learning tasks. To model the TOF problem, we propose a novel Cross-Entropy across Tasks (CET) metric. Extensive experiments demonstrate that our techniques improve the meta-learner to state-of-the-art performance on several few-shot learning benchmarks while also substantially alleviating the TOF problem.

AB - Recently, meta-learning has been shown to be a promising way to solve few-shot learning. In this paper, inspired by the human cognition process, which utilizes both prior-knowledge and visual attention when learning new knowledge, we present a novel paradigm of meta-learning approach that capitalizes on three developments to introduce attention mechanism and prior-knowledge to meta-learning. In our approach, prior-knowledge is responsible for helping the meta-learner express the input data in a high-level representation space, and the attention mechanism enables the meta-learner to focus on key data features in the representation space. Compared with the existing meta-learning approaches that pay little attention to prior-knowledge and visual attention, our approach alleviates the meta-learner's few-shot cognition burden. Furthermore, we discover a Task-Over-Fitting (TOF) problem, which indicates that the meta-learner has poor generalization across different K-shot learning tasks. To model the TOF problem, we propose a novel Cross-Entropy across Tasks (CET) metric. Extensive experiments demonstrate that our techniques improve the meta-learner to state-of-the-art performance on several few-shot learning benchmarks while also substantially alleviating the TOF problem.

KW - Attention mechanism

KW - Few-shot learning

KW - Meta-learning

KW - Prior-knowledge

KW - Representation

UR - http://www.scopus.com/inward/record.url?scp=85096532366&partnerID=8YFLogxK

U2 - 10.1016/j.knosys.2020.106609

DO - 10.1016/j.knosys.2020.106609

M3 - Article

AN - SCOPUS:85096532366

SN - 0950-7051

VL - 213

JO - Knowledge-Based Systems

JF - Knowledge-Based Systems

M1 - 106609

ER -
