TY - GEN
T1 - Context-Adaptive Online Reinforcement Learning for Multi-view Video Summarization on Mobile Devices
AU - Hao, Jingyi
AU - Liu, Sicong
AU - Guo, Bin
AU - Ding, Yasan
AU - Yu, Zhiwen
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The huge amount of video data produced by ubiquitous cameras imposes significant challenges for users to efficiently obtain useful video information. Multi-view video summarization (MVS) aggregates multi-view videos into information-rich video summaries by considering content correlations within each view and between multiple views. Existing MVS methods fail to concentrate on performance across scenarios and usually achieve satisfactory performance only on specific training datasets. However, when faced with unseen video scenarios, the quality of the summaries generated by existing methods may degrade. Moreover, they usually only use cameras for data acquisition, which requires a large amount of network bandwidth to transfer the data to the server for processing. To bridge this gap, we propose a context-adaptive online reinforcement learning multi-view video summarization framework (COORS) that meets the low response latency requirements of context adaptation while ensuring camera hardware compatibility. Specifically, COORS enables retraining in new contexts by extracting context-independent rewards, while improving model convergence speed based on representation learning and replica playback. Extensive experiments show that COORS achieves better performance than the state-of-the-art baselines.
AB - The huge amount of video data produced by ubiquitous cameras imposes significant challenges for users to efficiently obtain useful video information. Multi-view video summarization (MVS) aggregates multi-view videos into information-rich video summaries by considering content correlations within each view and between multiple views. Existing MVS methods fail to concentrate on performance across scenarios and usually achieve satisfactory performance only on specific training datasets. However, when faced with unseen video scenarios, the quality of the summaries generated by existing methods may degrade. Moreover, they usually only use cameras for data acquisition, which requires a large amount of network bandwidth to transfer the data to the server for processing. To bridge this gap, we propose a context-adaptive online reinforcement learning multi-view video summarization framework (COORS) that meets the low response latency requirements of context adaptation while ensuring camera hardware compatibility. Specifically, COORS enables retraining in new contexts by extracting context-independent rewards, while improving model convergence speed based on representation learning and replica playback. Extensive experiments show that COORS achieves better performance than the state-of-the-art baselines.
KW - context-adaptive
KW - multi-view video summarization
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85152948388&partnerID=8YFLogxK
U2 - 10.1109/ICPADS56603.2022.00060
DO - 10.1109/ICPADS56603.2022.00060
M3 - Conference contribution
AN - SCOPUS:85152948388
T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
SP - 411
EP - 418
BT - Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022
PB - IEEE Computer Society
T2 - 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022
Y2 - 10 January 2023 through 12 January 2023
ER -