Predicting eye fixations using convolutional neural networks

Nian Liu; Junwei Han; Dingwen Zhang; Shifeng Wen; Tianming Liu

doi:10.1109/CVPR.2015.7298633

Predicting eye fixations using convolutional neural networks

Nian Liu, Junwei Han, Dingwen Zhang, Shifeng Wen, Tianming Liu

自动化学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

224 引用（Scopus）

摘要

It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors. In this paper, we propose a novel computational framework to simultaneously learn these two types of visual features from raw image data using a multiresolution convolutional neural network (Mr-CNN) for predicting eye fixations. The Mr-CNN is directly trained from image regions centered on fixation and non-fixation locations over multiple resolutions, using raw image pixels as inputs and eye fixation attributes as labels. Diverse top-down visual features can be learned in higher layers. Meanwhile bottom-up visual saliency can also be inferred via combining information over multiple resolutions. Finally, optimal integration of bottom-up and top-down cues can be learned in the last logistic regression layer to predict eye fixations. The proposed approach achieves state-of-the-art results over four publically available benchmark datasets, demonstrating the superiority of our work.

源语言	英语
主期刊名	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
出版商	IEEE Computer Society
页	362-370
页数	9
ISBN（电子版）	9781467369640
DOI	https://doi.org/10.1109/CVPR.2015.7298633
出版状态	已出版 - 14 10月 2015
活动	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 - Boston, 美国期限: 7 6月 2015 → 12 6月 2015

出版系列

姓名	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
卷	07-12-June-2015
ISSN（印刷版）	1063-6919

会议

会议	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
国家/地区	美国
市	Boston
时期	7/06/15 → 12/06/15

访问文件

10.1109/CVPR.2015.7298633

其它文件与链接

链接到 Scopus 的出版物

引用此

Liu, N., Han, J., Zhang, D., Wen, S., & Liu, T. (2015). Predicting eye fixations using convolutional neural networks. 在 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 (页码 362-370). 文章 7298633 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 07-12-June-2015). IEEE Computer Society. https://doi.org/10.1109/CVPR.2015.7298633

@inproceedings{c1716873576242a584f809e2d2f6570b,

title = "Predicting eye fixations using convolutional neural networks",

abstract = "It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors. In this paper, we propose a novel computational framework to simultaneously learn these two types of visual features from raw image data using a multiresolution convolutional neural network (Mr-CNN) for predicting eye fixations. The Mr-CNN is directly trained from image regions centered on fixation and non-fixation locations over multiple resolutions, using raw image pixels as inputs and eye fixation attributes as labels. Diverse top-down visual features can be learned in higher layers. Meanwhile bottom-up visual saliency can also be inferred via combining information over multiple resolutions. Finally, optimal integration of bottom-up and top-down cues can be learned in the last logistic regression layer to predict eye fixations. The proposed approach achieves state-of-the-art results over four publically available benchmark datasets, demonstrating the superiority of our work.",

author = "Nian Liu and Junwei Han and Dingwen Zhang and Shifeng Wen and Tianming Liu",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.; IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 ; Conference date: 07-06-2015 Through 12-06-2015",

year = "2015",

month = oct,

day = "14",

doi = "10.1109/CVPR.2015.7298633",

language = "英语",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "362--370",

booktitle = "IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015",

}

Liu, N, Han, J , Zhang, D, Wen, S & Liu, T 2015, Predicting eye fixations using convolutional neural networks. 在 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015., 7298633, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 卷 07-12-June-2015, IEEE Computer Society, 页码 362-370, IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, 美国, 7/06/15. https://doi.org/10.1109/CVPR.2015.7298633

Predicting eye fixations using convolutional neural networks. / Liu, Nian; Han, Junwei ; Zhang, Dingwen 等.
IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015. IEEE Computer Society, 2015. 页码 362-370 7298633 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 07-12-June-2015).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Predicting eye fixations using convolutional neural networks

AU - Liu, Nian

AU - Han, Junwei

AU - Zhang, Dingwen

AU - Wen, Shifeng

AU - Liu, Tianming

PY - 2015/10/14

Y1 - 2015/10/14

N2 - It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors. In this paper, we propose a novel computational framework to simultaneously learn these two types of visual features from raw image data using a multiresolution convolutional neural network (Mr-CNN) for predicting eye fixations. The Mr-CNN is directly trained from image regions centered on fixation and non-fixation locations over multiple resolutions, using raw image pixels as inputs and eye fixation attributes as labels. Diverse top-down visual features can be learned in higher layers. Meanwhile bottom-up visual saliency can also be inferred via combining information over multiple resolutions. Finally, optimal integration of bottom-up and top-down cues can be learned in the last logistic regression layer to predict eye fixations. The proposed approach achieves state-of-the-art results over four publically available benchmark datasets, demonstrating the superiority of our work.

AB - It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors. In this paper, we propose a novel computational framework to simultaneously learn these two types of visual features from raw image data using a multiresolution convolutional neural network (Mr-CNN) for predicting eye fixations. The Mr-CNN is directly trained from image regions centered on fixation and non-fixation locations over multiple resolutions, using raw image pixels as inputs and eye fixation attributes as labels. Diverse top-down visual features can be learned in higher layers. Meanwhile bottom-up visual saliency can also be inferred via combining information over multiple resolutions. Finally, optimal integration of bottom-up and top-down cues can be learned in the last logistic regression layer to predict eye fixations. The proposed approach achieves state-of-the-art results over four publically available benchmark datasets, demonstrating the superiority of our work.

UR - http://www.scopus.com/inward/record.url?scp=84946554818&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2015.7298633

DO - 10.1109/CVPR.2015.7298633

M3 - 会议稿件

AN - SCOPUS:84946554818

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 362

EP - 370

BT - IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015

PB - IEEE Computer Society

T2 - IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015

Y2 - 7 June 2015 through 12 June 2015

ER -

Predicting eye fixations using convolutional neural networks

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此