TY - GEN
T1 - MAM-RNN
T2 - 26th International Joint Conference on Artificial Intelligence, IJCAI 2017
AU - Li, Xuelong
AU - Zhao, Bin
AU - Lu, Xiaoqiang
PY - 2017
Y1 - 2017
N2 - Visual information is important for the task of video captioning. However, a video contains a lot of uncorrelated content, which may interfere with generating a correct caption. Based on this point, we attempt to exploit the visual features that are most correlated to the caption. In this paper, a Multi-level Attention Model based Recurrent Neural Network (MAM-RNN) is proposed, where MAM is utilized to encode the visual feature and RNN works as the decoder to generate the video caption. During generation, the proposed approach is able to adaptively attend to the salient regions in the frame and the frames correlated to the caption. Experimental results on two benchmark datasets, MSVD and Charades, show the excellent performance of the proposed approach.
AB - Visual information is important for the task of video captioning. However, a video contains a lot of uncorrelated content, which may interfere with generating a correct caption. Based on this point, we attempt to exploit the visual features that are most correlated to the caption. In this paper, a Multi-level Attention Model based Recurrent Neural Network (MAM-RNN) is proposed, where MAM is utilized to encode the visual feature and RNN works as the decoder to generate the video caption. During generation, the proposed approach is able to adaptively attend to the salient regions in the frame and the frames correlated to the caption. Experimental results on two benchmark datasets, MSVD and Charades, show the excellent performance of the proposed approach.
UR - http://www.scopus.com/inward/record.url?scp=85031914301&partnerID=8YFLogxK
U2 - 10.24963/ijcai.2017/307
DO - 10.24963/ijcai.2017/307
M3 - Conference contribution
AN - SCOPUS:85031914301
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 2208
EP - 2214
BT - 26th International Joint Conference on Artificial Intelligence, IJCAI 2017
A2 - Sierra, Carles
PB - International Joint Conferences on Artificial Intelligence
Y2 - 19 August 2017 through 25 August 2017
ER -