TY - JOUR
T1 - Evaluation of Saccadic Scanpath Prediction
T2 - Subjective Assessment Database and Recurrent Neural Network Based Metric
AU - Xia, Chen
AU - Han, Junwei
AU - Zhang, Dingwen
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2021/12/1
Y1 - 2021/12/1
N2 - In recent years, predicting the saccadic scanpaths of humans has become a new trend in the field of visual attention modeling. Given various saccadic algorithms, determining how to evaluate their ability to model a dynamic saccade has become an important yet understudied issue. To our best knowledge, existing metrics for evaluating saccadic prediction models are often heuristically designed, which may produce results that are inconsistent with human subjective assessment. To this end, we first construct a subjective database by collecting the assessments on 5,000 pairs of scanpaths from ten subjects. Based on this database, we can compare different metrics according to their consistency with human visual perception. In addition, we also propose a data-driven metric to measure scanpath similarity based on the human subjective comparison. To achieve this goal, we employ a long short-term memory (LSTM) network to learn the inference from the relationship of encoded scanpaths to a binary measurement. Experimental results have demonstrated that the LSTM-based metric outperforms other existing metrics. Moreover, we believe the constructed database can be used as a benchmark to inspire more insights for future metric selection.
AB - In recent years, predicting the saccadic scanpaths of humans has become a new trend in the field of visual attention modeling. Given various saccadic algorithms, determining how to evaluate their ability to model a dynamic saccade has become an important yet understudied issue. To our best knowledge, existing metrics for evaluating saccadic prediction models are often heuristically designed, which may produce results that are inconsistent with human subjective assessment. To this end, we first construct a subjective database by collecting the assessments on 5,000 pairs of scanpaths from ten subjects. Based on this database, we can compare different metrics according to their consistency with human visual perception. In addition, we also propose a data-driven metric to measure scanpath similarity based on the human subjective comparison. To achieve this goal, we employ a long short-term memory (LSTM) network to learn the inference from the relationship of encoded scanpaths to a binary measurement. Experimental results have demonstrated that the LSTM-based metric outperforms other existing metrics. Moreover, we believe the constructed database can be used as a benchmark to inspire more insights for future metric selection.
KW - evaluation metrics
KW - Long Short-Term Memory (LSTM) network
KW - saccadic models
KW - scanpath comparison
KW - semantic hashing
KW - Visual attention
UR - http://www.scopus.com/inward/record.url?scp=85116967030&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2020.3002168
DO - 10.1109/TPAMI.2020.3002168
M3 - 文章
C2 - 32750785
AN - SCOPUS:85116967030
SN - 0162-8828
VL - 43
SP - 4378
EP - 4395
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 12
ER -