TY - GEN
T1 - Enhancing boundary for video object segmentation
AU - Zhang, Qi
AU - Lu, Xiaoqiang
AU - Yuan, Yuan
N1 - Publisher Copyright:
© 2018 ACM.
PY - 2018/8/27
Y1 - 2018/8/27
N2 - Video object segmentation aims to separate objects from background in successive video sequence accurately. It is a challenging task as the huge variance in object regions and similarity between object and background. Among previous methods, inner region of an object can be easily separated from background while the region around object boundary is often classified improperly. To address this problem, a novel video object segmentation method is proposed to enhance the object boundary by integrating video supervoxel into Convolutional Neural Network (CNN) model. Supervoxel is exploited in our method for its ability of preserving spatial details. The proposed method can be divided into four steps: 1) convolutional feature of video is extracted with CNN model; 2) supervoxel feature is constructed through averaging the convolutional features within each supervoxel to preserve spatial details of video; 3) the supervoxel feature and original convolutional feature are fused to construct video representation; 4) a softmax classifier is trained based on video representation to classify each pixel in video. The proposed method is evaluated both on DAVIS and Youtube-Objects datasets. Experimental results show that by considering supervoxel with spatial details, the proposed method can achieve impressive performance for video object segmentation through enhancing object boundary.
AB - Video object segmentation aims to separate objects from background in successive video sequence accurately. It is a challenging task as the huge variance in object regions and similarity between object and background. Among previous methods, inner region of an object can be easily separated from background while the region around object boundary is often classified improperly. To address this problem, a novel video object segmentation method is proposed to enhance the object boundary by integrating video supervoxel into Convolutional Neural Network (CNN) model. Supervoxel is exploited in our method for its ability of preserving spatial details. The proposed method can be divided into four steps: 1) convolutional feature of video is extracted with CNN model; 2) supervoxel feature is constructed through averaging the convolutional features within each supervoxel to preserve spatial details of video; 3) the supervoxel feature and original convolutional feature are fused to construct video representation; 4) a softmax classifier is trained based on video representation to classify each pixel in video. The proposed method is evaluated both on DAVIS and Youtube-Objects datasets. Experimental results show that by considering supervoxel with spatial details, the proposed method can achieve impressive performance for video object segmentation through enhancing object boundary.
KW - Convolutional neural network
KW - Supervoxel
KW - Video object segmentation
UR - http://www.scopus.com/inward/record.url?scp=85058662655&partnerID=8YFLogxK
U2 - 10.1145/3271553.3271581
DO - 10.1145/3271553.3271581
M3 - 会议稿件
AN - SCOPUS:85058662655
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 2nd International Conference on Vision, Image and Signal Processing, ICVISP 2018
PB - Association for Computing Machinery
T2 - 2nd International Conference on Vision, Image and Signal Processing, ICVISP 2018
Y2 - 27 August 2018 through 29 August 2018
ER -