Spatiotemporal fusion personality prediction based on visual information

Jia Xu, Weijian Tian, Guoyun Lv, Yangyu Fan

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

The previous studies have demonstrated that the use of deep learning algorithms can make personality prediction based on two-dimensional image information, and the emergence of video provides more possibilities for exploring personality prediction. Compared to image-based personality prediction, using video can provide more information than static images. But videos contain hundreds of frames, not all of which are useful, and processing these images requires a lot of computation. This paper proposes to apply video analysis algorithms to the task of personality prediction and propose the use of LSTM to fuse image feature information. The best prediction effect is confirmed by experiments when the fusion frame number is 16 frames. This paper is based on 3D-ConvNet to build an end-to-end video analysis network and solve the network over fitting problem by pre-training and data augmentation. Experiments show that the accuracy of character prediction can be improved by using 3D-ConvNet to fuse the spatio-temporal information of videos.

源语言英语
页(从-至)44227-44244
页数18
期刊Multimedia Tools and Applications
82
28
DOI
出版状态已出版 - 11月 2023

指纹

探究 'Spatiotemporal fusion personality prediction based on visual information' 的科研主题。它们共同构成独一无二的指纹。

引用此