Skip to main navigation Skip to search Skip to main content

Prompting Vision-Language Model for Nuclei Instance Segmentation and Classification

  • Northwestern Polytechnical University Xian
  • Hefei Comprehensive National Science Center
  • Air Force Medical University
  • University of Science and Technology of China
  • Beijing Technology and Business University
  • WiSOMNi Co
  • Chongqing University of Posts and Telecommunications

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Nuclei instance segmentation and classification are a fundamental and challenging task in whole slide Imaging (WSI) analysis. Most dense nuclei prediction studies rely heavily on crowd labelled data on high-resolution digital images, leading to a time-consuming and expertise-required paradigm. Recently, Vision-Language Models (VLMs) have been intensively investigated, which learn rich cross-modal correlation from large-scale image-text pairs without tedious annotations. Inspired by this, we build a novel framework, called PromptNu, aiming at infusing abundant nuclei knowledge into the training of the nuclei instance recognition model through vision-language contrastive learning and prompt engineering techniques. Specifically, our approach starts with the creation of multifaceted prompts that integrate comprehensive nuclear knowledge, including visual insights from the GPT-4V model, statistical analyses, and expert insights from the pathology field. Then, we propose a novel prompting methodology that consists of two pivotal vision-language contrastive learning components: the Prompting Nuclei Representation Learning (PNuRL) and the Prompting Nuclei Dense Prediction (PNuDP), which adeptly integrates the expertise embedded in pre-trained VLMs and multifaceted prompts into the feature extraction and prediction process, respectively. Comprehensive experiments on six datasets with extensive WSI scenarios demonstrate the effectiveness of our method for both nuclei instance segmentation and classification tasks.

Original languageEnglish
Pages (from-to)4567-4578
Number of pages12
JournalIEEE Transactions on Medical Imaging
Volume44
Issue number11
DOIs
StatePublished - 2025

Keywords

  • Digital pathology analysis
  • nuclei classification
  • nuclei instance segmentation
  • prompt learning
  • vision-language model

Fingerprint

Dive into the research topics of 'Prompting Vision-Language Model for Nuclei Instance Segmentation and Classification'. Together they form a unique fingerprint.

Cite this