Multi-Scale Metric Learning for Few-Shot Learning

Wen Jiang; Kai Huang; Jie Geng; Xinyang Deng

doi:10.1109/TCSVT.2020.2995754

School of Electronics and Information

Northwestern Polytechnical University, Xi'an

Research output: Contribution to journal › Article › peer-review

227 Scopus citations

Abstract

Few-shot learning in image classification aims to learn a model that can identify unseen classes from only a few training samples per class. The scarcity of training samples and the novelty of the classification tasks make many traditional classification models inapplicable. In this paper, a novel few-shot learning method named multi-scale metric learning (MSML) is proposed to extract multi-scale features and learn the multi-scale relations between samples for few-shot classification. In the proposed method, a feature pyramid structure is introduced for multi-scale feature embedding, which aims to combine high-level, semantically strong features with low-level but abundant visual features. Then a multi-scale relation generation network (MRGN) is developed for hierarchical metric learning, in which high-level features correspond to deeper metric learning while low-level features correspond to lighter metric learning. Moreover, a novel loss function named intra-class and inter-class relation loss (IIRL) is proposed to optimize the proposed deep network, which aims to strengthen the correlation between homogeneous groups of samples and weaken the correlation between heterogeneous groups of samples. Experimental results on miniImageNet and tieredImageNet demonstrate that the proposed method achieves superior performance on the few-shot learning problem.

Original language	English
Article number	9097252
Pages (from-to)	1091-1102
Number of pages	12
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	31
Issue number	3
DOIs	https://doi.org/10.1109/TCSVT.2020.2995754
State	Published - Mar 2021

Keywords

Few-shot learning
metric learning
multi-scale feature maps

Cite this

@article{ab684ea66cd44d758a77ec77af617e50,

title = "Multi-Scale Metric Learning for Few-Shot Learning",

abstract = "Few-shot learning in image classification aims to learn a model that can identify unseen classes from only a few training samples per class. The scarcity of training samples and the novelty of the classification tasks make many traditional classification models inapplicable. In this paper, a novel few-shot learning method named multi-scale metric learning (MSML) is proposed to extract multi-scale features and learn the multi-scale relations between samples for few-shot classification. In the proposed method, a feature pyramid structure is introduced for multi-scale feature embedding, which aims to combine high-level, semantically strong features with low-level but abundant visual features. Then a multi-scale relation generation network (MRGN) is developed for hierarchical metric learning, in which high-level features correspond to deeper metric learning while low-level features correspond to lighter metric learning. Moreover, a novel loss function named intra-class and inter-class relation loss (IIRL) is proposed to optimize the proposed deep network, which aims to strengthen the correlation between homogeneous groups of samples and weaken the correlation between heterogeneous groups of samples. Experimental results on miniImageNet and tieredImageNet demonstrate that the proposed method achieves superior performance on the few-shot learning problem.",

keywords = "Few-shot learning, metric learning, multi-scale feature maps",

author = "Wen Jiang and Kai Huang and Jie Geng and Xinyang Deng",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2021",

month = mar,

doi = "10.1109/TCSVT.2020.2995754",

language = "English",

volume = "31",

pages = "1091--1102",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "3",

}
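
As a minimal sketch of how the entry above might be used, the LaTeX document below cites it by its exported key. It assumes the entry has been saved to a file named references.bib (a hypothetical file name) next to the document, and it uses the standard plain bibliography style only as an example; any installed style should work.

```latex
\documentclass{article}

\begin{document}

% The citation key is the one exported in the BibTeX entry above.
Multi-scale metric learning (MSML) was proposed for few-shot
image classification~\cite{ab684ea66cd44d758a77ec77af617e50}.

% "plain" is an example style; "references" assumes the entry
% was saved as references.bib (hypothetical file name).
\bibliographystyle{plain}
\bibliography{references}

\end{document}
```

Renaming the key to something readable is a matter of taste; if it is changed in the .bib file, the argument of \cite must be changed to match.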

TY - JOUR

T1 - Multi-Scale Metric Learning for Few-Shot Learning

AU - Jiang, Wen

AU - Huang, Kai

AU - Geng, Jie

AU - Deng, Xinyang

PY - 2021/3

Y1 - 2021/3

AB - Few-shot learning in image classification aims to learn a model that can identify unseen classes from only a few training samples per class. The scarcity of training samples and the novelty of the classification tasks make many traditional classification models inapplicable. In this paper, a novel few-shot learning method named multi-scale metric learning (MSML) is proposed to extract multi-scale features and learn the multi-scale relations between samples for few-shot classification. In the proposed method, a feature pyramid structure is introduced for multi-scale feature embedding, which aims to combine high-level, semantically strong features with low-level but abundant visual features. Then a multi-scale relation generation network (MRGN) is developed for hierarchical metric learning, in which high-level features correspond to deeper metric learning while low-level features correspond to lighter metric learning. Moreover, a novel loss function named intra-class and inter-class relation loss (IIRL) is proposed to optimize the proposed deep network, which aims to strengthen the correlation between homogeneous groups of samples and weaken the correlation between heterogeneous groups of samples. Experimental results on miniImageNet and tieredImageNet demonstrate that the proposed method achieves superior performance on the few-shot learning problem.

KW - Few-shot learning

KW - metric learning

KW - multi-scale feature maps

UR - http://www.scopus.com/inward/record.url?scp=85102299961&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2020.2995754

DO - 10.1109/TCSVT.2020.2995754

M3 - Article

AN - SCOPUS:85102299961

SN - 1051-8215

VL - 31

SP - 1091

EP - 1102

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 3

M1 - 9097252

ER -
