Multi-Scale Metric Learning for Few-Shot Learning

Research output: Contribution to journalArticlepeer-review

227 Scopus citations

Abstract

Few-shot learning in image classification is developed to learn a model that aims to identify unseen classes with only few training samples for each class. Fewer training samples and new tasks of classification make many traditional classification models no longer applicable. In this paper, a novel few-shot learning method named multi-scale metric learning (MSML) is proposed to extract multi-scale features and learn the multi-scale relations between samples for the classification of few-shot learning. In the proposed method, a feature pyramid structure is introduced for multi-scale feature embedding, which aims to combine high-level strong semantic features with low-level but abundant visual features. Then a multi-scale relation generation network (MRGN) is developed for hierarchical metric learning, in which high-level features are corresponding to deeper metric learning while low-level features are corresponding to lighter metric learning. Moreover, a novel loss function named intra-class and inter-class relation loss (IIRL) is proposed to optimize the proposed deep network, which aims to strengthen the correlation between homogeneous groups of samples and weaken the correlation between heterogeneous groups of samples. Experimental results on mini ImageNet and tiered ImageNet demonstrate that the proposed method achieves superior performance in few-shot learning problem.

Original languageEnglish
Article number9097252
Pages (from-to)1091-1102
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume31
Issue number3
DOIs
StatePublished - Mar 2021

Keywords

  • Few-shot learning
  • metric learning
  • multi-scale feature maps

Fingerprint

Dive into the research topics of 'Multi-Scale Metric Learning for Few-Shot Learning'. Together they form a unique fingerprint.

Cite this