跳到主要导航 跳到搜索 跳到主要内容

FVSF: Fusion of Multi-Scale Visual and Enhanced Semantic Features for Few-Shot Learning

  • Shuaitong Wang
  • , Kexin Zhang
  • , Zixuan Qin
  • , Baoguo Wei
  • , Xu Li
  • , Wensheng Lin
  • , Lixin Li
  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Few-Shot Learning (FSL) aims to recognize novel categories from sparse data but is fundamentally challenged by high intra-class variance and ambiguous feature representations. Existing methods are often limited by unreliable unimodal visual features or fail to bridge the modality gap due to inadequate semantic priors. To address these limitations, we introduce FVSF (Fusion of Visual and Semantic Features), a framework that constructs a highly discriminative embedding space by synergizing three key components. First, a Swin Transformerbased visual fusion module captures a rich hierarchy of visual features, from fine-grained textures to high-order semantics. Second, a Large Language Model (LLM)-driven pipeline generates descriptive, paragraph-level semantic representations for each category, resolving the ambiguity of conventional class labels. Third, a self-supervised contrastive learning strategy refines the embedding space to enhance intra-class compactness. Comprehensive experiments on standard FSL benchmarks, including MiniImageNet, CIFAR-FS, and FC100, demonstrate that FVSF significantly outperforms state-of-the-art methods, establishing a new performance benchmark. Our results confirm that the systematic integration of multimodal information provides a robust solution to the learning challenges posed by data scarcity in FSL.

源语言英语
主期刊名2026 International Conference on Electronics, Information, and Communication, ICEIC 2026
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331580773
DOI
出版状态已出版 - 2026
活动2026 International Conference on Electronics, Information, and Communication, ICEIC 2026 - Macau, 中国
期限: 18 1月 202621 1月 2026

出版系列

姓名2026 International Conference on Electronics, Information, and Communication, ICEIC 2026

会议

会议2026 International Conference on Electronics, Information, and Communication, ICEIC 2026
国家/地区中国
Macau
时期18/01/2621/01/26

指纹

探究 'FVSF: Fusion of Multi-Scale Visual and Enhanced Semantic Features for Few-Shot Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此