Predicting eye fixations using convolutional neural networks

Nian Liu, Junwei Han, Dingwen Zhang, Shifeng Wen, Tianming Liu

科研成果: 书/报告/会议事项章节会议稿件同行评审

220 引用 (Scopus)

摘要

It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors. In this paper, we propose a novel computational framework to simultaneously learn these two types of visual features from raw image data using a multiresolution convolutional neural network (Mr-CNN) for predicting eye fixations. The Mr-CNN is directly trained from image regions centered on fixation and non-fixation locations over multiple resolutions, using raw image pixels as inputs and eye fixation attributes as labels. Diverse top-down visual features can be learned in higher layers. Meanwhile bottom-up visual saliency can also be inferred via combining information over multiple resolutions. Finally, optimal integration of bottom-up and top-down cues can be learned in the last logistic regression layer to predict eye fixations. The proposed approach achieves state-of-the-art results over four publically available benchmark datasets, demonstrating the superiority of our work.

源语言英语
主期刊名IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
出版商IEEE Computer Society
362-370
页数9
ISBN(电子版)9781467369640
DOI
出版状态已出版 - 14 10月 2015
活动IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 - Boston, 美国
期限: 7 6月 201512 6月 2015

出版系列

姓名Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
07-12-June-2015
ISSN(印刷版)1063-6919

会议

会议IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
国家/地区美国
Boston
时期7/06/1512/06/15

指纹

探究 'Predicting eye fixations using convolutional neural networks' 的科研主题。它们共同构成独一无二的指纹。

引用此