Tag-Saliency: Combining bottom-up and top-down information for saliency detection

Guokang Zhu; Qi Wang; Yuan Yuan

doi:10.1016/j.cviu.2013.07.011

Tag-Saliency: Combining bottom-up and top-down information for saliency detection

Guokang Zhu, Qi Wang, Yuan Yuan

Research output: Contribution to journal › Article › peer-review

32 Scopus citations

Abstract

In the real world, people often have a habit tending to pay more attention to some things usually noteworthy, while ignore others. This phenomenon is associated with the top-down attention. Modeling this kind of attention has recently raised many interests in computer vision due to a wide range of practical applications. Majority of the existing models are based on eye-tracking or object detection. However, these methods may not apply to practical situations, because the eye movement data cannot be always recorded or there may be inscrutable objects to be handled in large-scale data sets. This paper proposes a Tag-Saliency model based on hierarchical image over-segmentation and auto-tagging, which can efficiently extract semantic information from large scale visual media data. Experimental results on a very challenging data set show that, the proposed Tag-Saliency model has the ability to locate the truly salient regions in a greater probability than other competitors.

Original language	English
Pages (from-to)	40-49
Number of pages	10
Journal	Computer Vision and Image Understanding
Volume	118
DOIs	https://doi.org/10.1016/j.cviu.2013.07.011
State	Published - Jan 2014
Externally published	Yes

Keywords

Computer vision
Image tagging
Saliency detection
Semantic
Visual attention
Visual media

Access to Document

10.1016/j.cviu.2013.07.011

Cite this

@article{0d1d3629675c4f1eb6d4ee0225f0520e,

title = "Tag-Saliency: Combining bottom-up and top-down information for saliency detection",

abstract = "In the real world, people often have a habit tending to pay more attention to some things usually noteworthy, while ignore others. This phenomenon is associated with the top-down attention. Modeling this kind of attention has recently raised many interests in computer vision due to a wide range of practical applications. Majority of the existing models are based on eye-tracking or object detection. However, these methods may not apply to practical situations, because the eye movement data cannot be always recorded or there may be inscrutable objects to be handled in large-scale data sets. This paper proposes a Tag-Saliency model based on hierarchical image over-segmentation and auto-tagging, which can efficiently extract semantic information from large scale visual media data. Experimental results on a very challenging data set show that, the proposed Tag-Saliency model has the ability to locate the truly salient regions in a greater probability than other competitors.",

keywords = "Computer vision, Image tagging, Saliency detection, Semantic, Visual attention, Visual media",

author = "Guokang Zhu and Qi Wang and Yuan Yuan",

year = "2014",

month = jan,

doi = "10.1016/j.cviu.2013.07.011",

language = "英语",

volume = "118",

pages = "40--49",

journal = "Computer Vision and Image Understanding",

issn = "1077-3142",

publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - Tag-Saliency

T2 - Combining bottom-up and top-down information for saliency detection

AU - Zhu, Guokang

AU - Wang, Qi

AU - Yuan, Yuan

PY - 2014/1

Y1 - 2014/1

N2 - In the real world, people often have a habit tending to pay more attention to some things usually noteworthy, while ignore others. This phenomenon is associated with the top-down attention. Modeling this kind of attention has recently raised many interests in computer vision due to a wide range of practical applications. Majority of the existing models are based on eye-tracking or object detection. However, these methods may not apply to practical situations, because the eye movement data cannot be always recorded or there may be inscrutable objects to be handled in large-scale data sets. This paper proposes a Tag-Saliency model based on hierarchical image over-segmentation and auto-tagging, which can efficiently extract semantic information from large scale visual media data. Experimental results on a very challenging data set show that, the proposed Tag-Saliency model has the ability to locate the truly salient regions in a greater probability than other competitors.

AB - In the real world, people often have a habit tending to pay more attention to some things usually noteworthy, while ignore others. This phenomenon is associated with the top-down attention. Modeling this kind of attention has recently raised many interests in computer vision due to a wide range of practical applications. Majority of the existing models are based on eye-tracking or object detection. However, these methods may not apply to practical situations, because the eye movement data cannot be always recorded or there may be inscrutable objects to be handled in large-scale data sets. This paper proposes a Tag-Saliency model based on hierarchical image over-segmentation and auto-tagging, which can efficiently extract semantic information from large scale visual media data. Experimental results on a very challenging data set show that, the proposed Tag-Saliency model has the ability to locate the truly salient regions in a greater probability than other competitors.

KW - Computer vision

KW - Image tagging

KW - Saliency detection

KW - Semantic

KW - Visual attention

KW - Visual media

UR - http://www.scopus.com/inward/record.url?scp=84890570132&partnerID=8YFLogxK

U2 - 10.1016/j.cviu.2013.07.011

DO - 10.1016/j.cviu.2013.07.011

M3 - 文章

AN - SCOPUS:84890570132

SN - 1077-3142

VL - 118

SP - 40

EP - 49

JO - Computer Vision and Image Understanding

JF - Computer Vision and Image Understanding

ER -

Tag-Saliency: Combining bottom-up and top-down information for saliency detection

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this