Evaluation of Saccadic Scanpath Prediction: Subjective Assessment Database and Recurrent Neural Network Based Metric

Chen Xia; Junwei Han; Dingwen Zhang

doi:10.1109/TPAMI.2020.3002168

Evaluation of Saccadic Scanpath Prediction: Subjective Assessment Database and Recurrent Neural Network Based Metric

Chen Xia, Junwei Han, Dingwen Zhang

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

15 引用（Scopus）

摘要

In recent years, predicting the saccadic scanpaths of humans has become a new trend in the field of visual attention modeling. Given various saccadic algorithms, determining how to evaluate their ability to model a dynamic saccade has become an important yet understudied issue. To our best knowledge, existing metrics for evaluating saccadic prediction models are often heuristically designed, which may produce results that are inconsistent with human subjective assessment. To this end, we first construct a subjective database by collecting the assessments on 5,000 pairs of scanpaths from ten subjects. Based on this database, we can compare different metrics according to their consistency with human visual perception. In addition, we also propose a data-driven metric to measure scanpath similarity based on the human subjective comparison. To achieve this goal, we employ a long short-term memory (LSTM) network to learn the inference from the relationship of encoded scanpaths to a binary measurement. Experimental results have demonstrated that the LSTM-based metric outperforms other existing metrics. Moreover, we believe the constructed database can be used as a benchmark to inspire more insights for future metric selection.

源语言	英语
页（从-至）	4378-4395
页数	18
期刊	IEEE Transactions on Pattern Analysis and Machine Intelligence
卷	43
期	12
DOI	https://doi.org/10.1109/TPAMI.2020.3002168
出版状态	已出版 - 1 12月 2021

访问文件

10.1109/TPAMI.2020.3002168

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{acfa33dfae624a9f903ff4ff2bc8c216,

title = "Evaluation of Saccadic Scanpath Prediction: Subjective Assessment Database and Recurrent Neural Network Based Metric",

abstract = "In recent years, predicting the saccadic scanpaths of humans has become a new trend in the field of visual attention modeling. Given various saccadic algorithms, determining how to evaluate their ability to model a dynamic saccade has become an important yet understudied issue. To our best knowledge, existing metrics for evaluating saccadic prediction models are often heuristically designed, which may produce results that are inconsistent with human subjective assessment. To this end, we first construct a subjective database by collecting the assessments on 5,000 pairs of scanpaths from ten subjects. Based on this database, we can compare different metrics according to their consistency with human visual perception. In addition, we also propose a data-driven metric to measure scanpath similarity based on the human subjective comparison. To achieve this goal, we employ a long short-term memory (LSTM) network to learn the inference from the relationship of encoded scanpaths to a binary measurement. Experimental results have demonstrated that the LSTM-based metric outperforms other existing metrics. Moreover, we believe the constructed database can be used as a benchmark to inspire more insights for future metric selection.",

keywords = "evaluation metrics, Long Short-Term Memory (LSTM) network, saccadic models, scanpath comparison, semantic hashing, Visual attention",

author = "Chen Xia and Junwei Han and Dingwen Zhang",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2021",

month = dec,

day = "1",

doi = "10.1109/TPAMI.2020.3002168",

language = "英语",

volume = "43",

pages = "4378--4395",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "12",

}

TY - JOUR

T1 - Evaluation of Saccadic Scanpath Prediction

T2 - Subjective Assessment Database and Recurrent Neural Network Based Metric

AU - Xia, Chen

AU - Han, Junwei

AU - Zhang, Dingwen

PY - 2021/12/1

Y1 - 2021/12/1

N2 - In recent years, predicting the saccadic scanpaths of humans has become a new trend in the field of visual attention modeling. Given various saccadic algorithms, determining how to evaluate their ability to model a dynamic saccade has become an important yet understudied issue. To our best knowledge, existing metrics for evaluating saccadic prediction models are often heuristically designed, which may produce results that are inconsistent with human subjective assessment. To this end, we first construct a subjective database by collecting the assessments on 5,000 pairs of scanpaths from ten subjects. Based on this database, we can compare different metrics according to their consistency with human visual perception. In addition, we also propose a data-driven metric to measure scanpath similarity based on the human subjective comparison. To achieve this goal, we employ a long short-term memory (LSTM) network to learn the inference from the relationship of encoded scanpaths to a binary measurement. Experimental results have demonstrated that the LSTM-based metric outperforms other existing metrics. Moreover, we believe the constructed database can be used as a benchmark to inspire more insights for future metric selection.

AB - In recent years, predicting the saccadic scanpaths of humans has become a new trend in the field of visual attention modeling. Given various saccadic algorithms, determining how to evaluate their ability to model a dynamic saccade has become an important yet understudied issue. To our best knowledge, existing metrics for evaluating saccadic prediction models are often heuristically designed, which may produce results that are inconsistent with human subjective assessment. To this end, we first construct a subjective database by collecting the assessments on 5,000 pairs of scanpaths from ten subjects. Based on this database, we can compare different metrics according to their consistency with human visual perception. In addition, we also propose a data-driven metric to measure scanpath similarity based on the human subjective comparison. To achieve this goal, we employ a long short-term memory (LSTM) network to learn the inference from the relationship of encoded scanpaths to a binary measurement. Experimental results have demonstrated that the LSTM-based metric outperforms other existing metrics. Moreover, we believe the constructed database can be used as a benchmark to inspire more insights for future metric selection.

KW - evaluation metrics

KW - Long Short-Term Memory (LSTM) network

KW - saccadic models

KW - scanpath comparison

KW - semantic hashing

KW - Visual attention

UR - http://www.scopus.com/inward/record.url?scp=85116967030&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2020.3002168

DO - 10.1109/TPAMI.2020.3002168

M3 - 文章

C2 - 32750785

AN - SCOPUS:85116967030

SN - 0162-8828

VL - 43

SP - 4378

EP - 4395

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 12

ER -

Evaluation of Saccadic Scanpath Prediction: Subjective Assessment Database and Recurrent Neural Network Based Metric

摘要

访问文件

其它文件与链接

指纹

引用此