跳到主要导航 跳到搜索 跳到主要内容

Review of large vision models and visual prompt engineering

  • Jiaqi Wang
  • , Zhengliang Liu
  • , Lin Zhao
  • , Zihao Wu
  • , Chong Ma
  • , Sigang Yu
  • , Haixing Dai
  • , Qiushi Yang
  • , Yiheng Liu
  • , Songyao Zhang
  • , Enze Shi
  • , Yi Pan
  • , Tuo Zhang
  • , Dajiang Zhu
  • , Xiang Li
  • , Xi Jiang
  • , Bao Ge
  • , Yixuan Yuan
  • , Dinggang Shen
  • , Tianming Liu
  • Shu Zhang
  • Northwestern Polytechnical University Xian
  • University of Georgia
  • City University of Hong Kong
  • Shaanxi Normal University
  • University of Electronic Science and Technology of China
  • University of Texas at Arlington
  • Massachusetts General Hospital
  • Department of Electronic Engineering Chinese University of Hong Kong Hong
  • ShanghaiTech University
  • Shanghai Unite Imaging Intelligence Co.Ltd
  • Shanghai Clinical Research and Trial Center

科研成果: 期刊稿件文献综述同行评审

163 引用 (Scopus)

摘要

Visual prompt engineering is a fundamental methodology in the field of visual and image artificial general intelligence. As the development of large vision models progresses, the importance of prompt engineering becomes increasingly evident. Designing suitable prompts for specific visual tasks has emerged as a meaningful research direction. This review aims to summarize the methods employed in the computer vision domain for large vision models and visual prompt engineering, exploring the latest advancements in visual prompt engineering. We present influential large models in the visual domain and a range of prompt engineering methods employed on these models. It is our hope that this review provides a comprehensive and systematic description of prompt engineering methods based on large visual models, offering valuable insights for future researchers in their exploration of this field.

源语言英语
文章编号100047
期刊Meta-Radiology
1
3
DOI
出版状态已出版 - 11月 2023

指纹

探究 'Review of large vision models and visual prompt engineering' 的科研主题。它们共同构成独一无二的指纹。

引用此