跳到主要导航 跳到搜索 跳到主要内容

From Patch to Pixel: A Transformer-Based Hierarchical Framework for Compressive Image Sensing

  • Hongping Gan
  • , Minghe Shen
  • , Yi Hua
  • , Chunyan Ma
  • , Tao Zhang
  • Northwestern Polytechnical University Xian
  • Shanghai Jiao Tong University

科研成果: 期刊稿件文章同行评审

64 引用 (Scopus)

摘要

The convolutional neural network (CNN)-based reconstruction methods have dominated the compressive sensing (CS) in recent years. However, existing CNN-based approaches show potential restrictions in capturing non-local similarity of images, because of the intrinsic characteristic of convolutional layers, $\mathit{i.e.}$, locality and weight sharing. In parallel, the emerging Transformer architecture shows fine capacity in modeling long-distance correlations onto embedded tokens for language and images. Yet vanilla Transformer does not exceed CNN-based networks considerably but shows roughly comparable performance, and the culprit can be the missing of sophisticated inductive bias regarding the local image structures. In this article, to eliminate the restrictions of the aforementioned paradigms, we propose a Transformer-based hierarchical framework, dubbed TCS-Net, for compressive image sensing (or image compressive sensing) with a $\mathit{patch-to-pixel}$ manner. Concretely, the proposed TCS-Net consists of an image acquisition module and a reconstruction module (includes two key decoding phases: a patch-wise decoding phase and a pixel-wise decoding phase). The acquisition module can implement data-driven image sampling by jointly learning with the decoding phases. By adjusting the Transformer architecture to the $\mathit{patch-to-pixel}$ multi-stage pattern, our reconstruction module can gradually decode the CS measurements from the patch-wise outlines to the pixel-wise textures, thereby building a high-precision mapping for image reconstruction. Extensive experiments on several datasets verify that the proposed TCS-Net outperforms existing state-of-the-art image CS methods by considerable margins.

源语言英语
页(从-至)133-146
页数14
期刊IEEE Transactions on Computational Imaging
9
DOI
出版状态已出版 - 2023

指纹

探究 'From Patch to Pixel: A Transformer-Based Hierarchical Framework for Compressive Image Sensing' 的科研主题。它们共同构成独一无二的指纹。

引用此