SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos

Dingwen Zhang; Le Yang; Deyu Meng; Dong Xu; Junwei Han

doi:10.1109/CVPR.2017.567

SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos

Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han

自动化学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

39 引用（Scopus）

摘要

Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags. Existing works in this research area might still have some limitations, e.g., lack of effective DNN-based learning frameworks, under-exploring the context information, and requiring to leverage the unstable negative video collection, which prevent them from obtaining more promising performance. To this end, we propose a novel self-paced fine-tuning network (SPFTN)-based framework, which could learn to explore the context information within the video frames and capture adequate object semantics without using the negative videos. To perform weakly supervised learning based on the deep neural network, we make the earliest effort to integrate the self-paced learning regime and the deep neural network into a unified and compatible framework, leading to the self-paced fine-tuning network. Comprehensive experiments on the large-scale YouTube-Objects and DAVIS datasets demonstrate that the proposed approach achieves superior performance as compared with other state-of-the-art methods as well as the baseline networks and models.

源语言	英语
主期刊名	Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
出版商	Institute of Electrical and Electronics Engineers Inc.
页	5340-5348
页数	9
ISBN（电子版）	9781538604571
DOI	https://doi.org/10.1109/CVPR.2017.567
出版状态	已出版 - 6 11月 2017
活动	30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 - Honolulu, 美国期限: 21 7月 2017 → 26 7月 2017

出版系列

姓名	Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
卷	2017-January

会议

会议	30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
国家/地区	美国
市	Honolulu
时期	21/07/17 → 26/07/17

访问文件

10.1109/CVPR.2017.567

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, D., Yang, L., Meng, D., Xu, D., & Han, J. (2017). SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos. 在 Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (页码 5340-5348). (Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017; 卷 2017-January). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CVPR.2017.567

Zhang, Dingwen ; Yang, Le ; Meng, Deyu 等. / SPFTN : A self-paced fine-tuning network for segmenting objects in weakly labelled videos. Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. Institute of Electrical and Electronics Engineers Inc., 2017. 页码 5340-5348 (Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017).

@inproceedings{7eb755561c6b4fe88023b335c25c7d24,

title = "SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos",

abstract = "Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags. Existing works in this research area might still have some limitations, e.g., lack of effective DNN-based learning frameworks, under-exploring the context information, and requiring to leverage the unstable negative video collection, which prevent them from obtaining more promising performance. To this end, we propose a novel self-paced fine-tuning network (SPFTN)-based framework, which could learn to explore the context information within the video frames and capture adequate object semantics without using the negative videos. To perform weakly supervised learning based on the deep neural network, we make the earliest effort to integrate the self-paced learning regime and the deep neural network into a unified and compatible framework, leading to the self-paced fine-tuning network. Comprehensive experiments on the large-scale YouTube-Objects and DAVIS datasets demonstrate that the proposed approach achieves superior performance as compared with other state-of-the-art methods as well as the baseline networks and models.",

author = "Dingwen Zhang and Le Yang and Deyu Meng and Dong Xu and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 2017 IEEE.; 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 ; Conference date: 21-07-2017 Through 26-07-2017",

year = "2017",

month = nov,

day = "6",

doi = "10.1109/CVPR.2017.567",

language = "英语",

series = "Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "5340--5348",

booktitle = "Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017",

}

Zhang, D, Yang, L, Meng, D, Xu, D & Han, J 2017, SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos. 在 Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, 卷 2017-January, Institute of Electrical and Electronics Engineers Inc., 页码 5340-5348, 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, 美国, 21/07/17. https://doi.org/10.1109/CVPR.2017.567

SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos. / Zhang, Dingwen; Yang, Le; Meng, Deyu 等.
Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. Institute of Electrical and Electronics Engineers Inc., 2017. 页码 5340-5348 (Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017; 卷 2017-January).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - SPFTN

T2 - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

AU - Zhang, Dingwen

AU - Yang, Le

AU - Meng, Deyu

AU - Xu, Dong

AU - Han, Junwei

PY - 2017/11/6

Y1 - 2017/11/6

N2 - Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags. Existing works in this research area might still have some limitations, e.g., lack of effective DNN-based learning frameworks, under-exploring the context information, and requiring to leverage the unstable negative video collection, which prevent them from obtaining more promising performance. To this end, we propose a novel self-paced fine-tuning network (SPFTN)-based framework, which could learn to explore the context information within the video frames and capture adequate object semantics without using the negative videos. To perform weakly supervised learning based on the deep neural network, we make the earliest effort to integrate the self-paced learning regime and the deep neural network into a unified and compatible framework, leading to the self-paced fine-tuning network. Comprehensive experiments on the large-scale YouTube-Objects and DAVIS datasets demonstrate that the proposed approach achieves superior performance as compared with other state-of-the-art methods as well as the baseline networks and models.

AB - Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags. Existing works in this research area might still have some limitations, e.g., lack of effective DNN-based learning frameworks, under-exploring the context information, and requiring to leverage the unstable negative video collection, which prevent them from obtaining more promising performance. To this end, we propose a novel self-paced fine-tuning network (SPFTN)-based framework, which could learn to explore the context information within the video frames and capture adequate object semantics without using the negative videos. To perform weakly supervised learning based on the deep neural network, we make the earliest effort to integrate the self-paced learning regime and the deep neural network into a unified and compatible framework, leading to the self-paced fine-tuning network. Comprehensive experiments on the large-scale YouTube-Objects and DAVIS datasets demonstrate that the proposed approach achieves superior performance as compared with other state-of-the-art methods as well as the baseline networks and models.

UR - http://www.scopus.com/inward/record.url?scp=85040645052&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2017.567

DO - 10.1109/CVPR.2017.567

M3 - 会议稿件

AN - SCOPUS:85040645052

T3 - Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

SP - 5340

EP - 5348

BT - Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 21 July 2017 through 26 July 2017

ER -

Zhang D, Yang L, Meng D, Xu D, Han J. SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos. 在 Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. Institute of Electrical and Electronics Engineers Inc. 2017. 页码 5340-5348. (Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017). doi: 10.1109/CVPR.2017.567

SPFTN: A self-paced fine-tuning network for segmenting objects in weakly labelled videos

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此