
Reasoning via Implicit Self-supervised Emergence for Instruction Segmentation

  • Northwestern Polytechnical University Xian
  • Northwest Institute of Nuclear Technology

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

We challenge the assumption that complex instruction-guided segmentation tasks necessitate equally complex and explicit supervision. This paper introduces RISE (Reasoning via Implicit Self-supervised Emergence), a framework that learns intricate compositional reasoning, spanning spatial relations to world knowledge, without a single ground-truth mask. To achieve this, RISE employs reinforcement learning with GRPO guided by a single, strikingly simple reward: the semantic alignment score between the textual instruction and the predicted image region. Our primary discovery is the implicit emergence of a high-quality chain-of-thought process from this minimalist signal. Within a structured format, the model autonomously learns to understand instructions by accessing its latent knowledge and inferring spatial relationships, capabilities inherent in its architecture but unlocked by our simple objective. Remarkably, our emergent reasoning yields highly competitive results: RISE achieves 58.7 gIoU on the ReasonSeg benchmark, on par with methods using geometric rewards. Furthermore, we show extreme data efficiency: a variant trained on only 2,000 ImageNet-label pairs establishes a new state-of-the-art for annotation-free referring segmentation with 79.6 cIoU on RefCOCO.
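
The abstract says training uses GRPO driven by a single scalar reward per prediction. As a rough illustration only, the sketch below shows the group-relative advantage step that GRPO is generally described as using: several responses are sampled for one instruction, and each reward is normalized against the mean and standard deviation of its own group. The function name, the `eps` constant, the use of the population standard deviation, and the example scores are all assumptions made for illustration. None of them is taken from the paper, and the alignment scorer itself is not modeled here.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each reward against the statistics of its own sample group.

    `rewards` holds one scalar per sampled response to the same instruction.
    `eps` (an assumed constant) avoids division by zero when all rewards tie.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; the exact variant is an assumption
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical alignment scores for four sampled responses to one instruction.
alignment_scores = [0.21, 0.35, 0.28, 0.40]
advantages = group_relative_advantages(alignment_scores)
```

Responses that score above the group mean receive a positive advantage and are reinforced; those below the mean are discouraged. Because the baseline comes from the group itself, no learned value function is needed for this step.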

Original language: English
Title of host publication: Proceedings of the AAAI Conference on Artificial Intelligence
Editors: Sven Koenig, Chad Jenkins, Matthew E. Taylor
Publisher: Association for the Advancement of Artificial Intelligence
Pages: 13746-13754
Number of pages: 9
Edition: 16
ISBN (Print): 9781577359067
DOI
Publication status: Published - 2026
Event: 40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, Singapore
Duration: 20 Jan 2026 to 27 Jan 2026

Publication series

Name: Proceedings of the AAAI Conference on Artificial Intelligence
Number: 16
Volume: 40
ISSN (Print): 2159-5399
ISSN (Electronic): 2374-3468

Conference

Conference: 40th AAAI Conference on Artificial Intelligence, AAAI 2026
Country/Territory: Singapore
City: Singapore
Period: 20/01/26 to 27/01/26
