TY - JOUR
T1 - 视频萃取
AU - Li, Xuelong
AU - Zhao, Bin
N1 - Publisher Copyright:
© 2021, Science China Press. All right reserved.
PY - 2021/5
Y1 - 2021/5
N2 - Video has become one of the most important data forms. Video distillation explores more compact data forms and information modalities by analyzing the spatial-temporal and semantic features of video data, which is an important task in computer vision and a key technique in artificial intelligence. With the rapid development of video capturing devices and the increasing human requirements, video analysis tasks are facing numbers of opportunities and challenges. In recent years, large amounts of video distillation approaches are proposed. This paper creatively unifies the theoretical basis of video distillation by analyzing the relationship among data, information and knowledge from the perspective of information theory, and argues that the principle of video distillation is to improve the information capacity of video data. Then, we overview existing approaches in the aspects of video data representation, key content summarization, moving object synopsis and text description generation, etc., and relate the development of video summarization, synopsis and captioning, which are typical tasks in video distillation. More importantly, this paper discusses the advantages and drawbacks of existing approaches, and then points out several key scientific problems that have not yet been addressed, and simultaneously analyzes the potential future development in video distillation.
AB - Video has become one of the most important data forms. Video distillation explores more compact data forms and information modalities by analyzing the spatial-temporal and semantic features of video data, which is an important task in computer vision and a key technique in artificial intelligence. With the rapid development of video capturing devices and the increasing human requirements, video analysis tasks are facing numbers of opportunities and challenges. In recent years, large amounts of video distillation approaches are proposed. This paper creatively unifies the theoretical basis of video distillation by analyzing the relationship among data, information and knowledge from the perspective of information theory, and argues that the principle of video distillation is to improve the information capacity of video data. Then, we overview existing approaches in the aspects of video data representation, key content summarization, moving object synopsis and text description generation, etc., and relate the development of video summarization, synopsis and captioning, which are typical tasks in video distillation. More importantly, this paper discusses the advantages and drawbacks of existing approaches, and then points out several key scientific problems that have not yet been addressed, and simultaneously analyzes the potential future development in video distillation.
KW - Artificial intelligence
KW - Computer vision
KW - Video captioning
KW - Video distillation
KW - Video summarization
KW - Video synopsis
KW - Visual representation
UR - http://www.scopus.com/inward/record.url?scp=85106358067&partnerID=8YFLogxK
U2 - 10.1360/SSI-2020-0165
DO - 10.1360/SSI-2020-0165
M3 - 文章
AN - SCOPUS:85106358067
SN - 1674-7267
VL - 51
SP - 695
EP - 734
JO - Scientia Sinica Informationis
JF - Scientia Sinica Informationis
IS - 5
ER -