Large-Scale Unsupervised Semantic Segmentation

Shanghua Gao, Zhong Yu Li, Ming Hsuan Yang, Ming Ming Cheng, Junwei Han, Philip Torr

科研成果: 期刊稿件文章同行评审

30 引用 (Scopus)

摘要

Empowered by large datasets, e.g., ImageNet and MS COCO, unsupervised learning on large-scale data has enabled significant advances for classification tasks. However, whether the large-scale unsupervised semantic segmentation can be achieved remains unknown. There are two major challenges: i) we need a large-scale benchmark for assessing algorithms; ii) we need to develop methods to simultaneously learn category and shape representation in an unsupervised manner. In this work, we propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to help the research progress. Building on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 50k high-quality semantic segmentation annotations for evaluation. Our benchmark has a high data diversity and a clear task objective. We also present a simple yet effective method that works surprisingly well for LUSS. In addition, we benchmark related un/weakly/fully supervised methods accordingly, identifying the challenges and possible directions of LUSS. The benchmark and source code is publicly available at https://github.com/LUSSeg.

源语言英语
页(从-至)7457-7476
页数20
期刊IEEE Transactions on Pattern Analysis and Machine Intelligence
45
6
DOI
出版状态已出版 - 1 6月 2023

指纹

探究 'Large-Scale Unsupervised Semantic Segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此