LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING

Mingyi Yang; Xionghui Mao; Yujie Yin; Zhiwei Zhu; Defa Wang; Shuai Wan; Fuzheng Yang

doi:10.1109/ICIP51287.2024.10647741

LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING

Mingyi Yang, Xionghui Mao, Yujie Yin, Zhiwei Zhu, Defa Wang, Shuai Wan, Fuzheng Yang

School of Electronics and Information

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In this paper, we propose a learning-based video compression which can perform continuously variable bitrate coding. The proposed method generates feature transformation parameters through a conditional network according to the input spatial quality map. These parameters are then used to adaptively transform the intermediate features of the encoder, decoder, and spatiotemporal entropy model in the codec, thus enabling variable bitrate coding. Additionally, to improve the compression efficiency of the codec, we propose incorporating the quality map of the preceding frame into the hyperprior encoder and leveraging the temporal prior encoder. A multi-stage training strategy is employed to jointly train the codec with a multi-frame rate-distortion loss function. The experimental results demonstrate that the proposed method can achieve continuously variable bitrate adaptation while maintaining rate-distortion performance comparable to the fixed bitrate model. Furthermore, the proposed method also supports ROI-based compression.

Original language	English
Title of host publication	2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
Publisher	IEEE Computer Society
Pages	3723-3729
Number of pages	7
ISBN (Electronic)	9798350349399
DOIs	https://doi.org/10.1109/ICIP51287.2024.10647741
State	Published - 2024
Event	31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates Duration: 27 Oct 2024 → 30 Oct 2024

Publication series

Name	Proceedings - International Conference on Image Processing, ICIP
ISSN (Print)	1522-4880

Conference

Conference	31st IEEE International Conference on Image Processing, ICIP 2024
Country/Territory	United Arab Emirates
City	Abu Dhabi
Period	27/10/24 → 30/10/24

Keywords

Deep learning
ROI-based compression
Variable-rate compression
Video compression

Access to Document

10.1109/ICIP51287.2024.10647741

Cite this

Yang, M., Mao, X., Yin, Y., Zhu, Z., Wang, D., Wan, S., & Yang, F. (2024). LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING. In 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings (pp. 3723-3729). (Proceedings - International Conference on Image Processing, ICIP). IEEE Computer Society. https://doi.org/10.1109/ICIP51287.2024.10647741

@inproceedings{e86fba3fa6f54593a974c4d156bc7c1a,

title = "LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING",

abstract = "In this paper, we propose a learning-based video compression which can perform continuously variable bitrate coding. The proposed method generates feature transformation parameters through a conditional network according to the input spatial quality map. These parameters are then used to adaptively transform the intermediate features of the encoder, decoder, and spatiotemporal entropy model in the codec, thus enabling variable bitrate coding. Additionally, to improve the compression efficiency of the codec, we propose incorporating the quality map of the preceding frame into the hyperprior encoder and leveraging the temporal prior encoder. A multi-stage training strategy is employed to jointly train the codec with a multi-frame rate-distortion loss function. The experimental results demonstrate that the proposed method can achieve continuously variable bitrate adaptation while maintaining rate-distortion performance comparable to the fixed bitrate model. Furthermore, the proposed method also supports ROI-based compression.",

keywords = "Deep learning, ROI-based compression, Variable-rate compression, Video compression",

author = "Mingyi Yang and Xionghui Mao and Yujie Yin and Zhiwei Zhu and Defa Wang and Shuai Wan and Fuzheng Yang",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 31st IEEE International Conference on Image Processing, ICIP 2024 ; Conference date: 27-10-2024 Through 30-10-2024",

year = "2024",

doi = "10.1109/ICIP51287.2024.10647741",

language = "英语",

series = "Proceedings - International Conference on Image Processing, ICIP",

publisher = "IEEE Computer Society",

pages = "3723--3729",

booktitle = "2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings",

}

Yang, M, Mao, X, Yin, Y, Zhu, Z, Wang, D, Wan, S & Yang, F 2024, LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING. in 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings. Proceedings - International Conference on Image Processing, ICIP, IEEE Computer Society, pp. 3723-3729, 31st IEEE International Conference on Image Processing, ICIP 2024, Abu Dhabi, United Arab Emirates, 27/10/24. https://doi.org/10.1109/ICIP51287.2024.10647741

LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING. / Yang, Mingyi; Mao, Xionghui; Yin, Yujie et al.
2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings. IEEE Computer Society, 2024. p. 3723-3729 (Proceedings - International Conference on Image Processing, ICIP).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING

AU - Yang, Mingyi

AU - Mao, Xionghui

AU - Yin, Yujie

AU - Zhu, Zhiwei

AU - Wang, Defa

AU - Wan, Shuai

AU - Yang, Fuzheng

PY - 2024

Y1 - 2024

N2 - In this paper, we propose a learning-based video compression which can perform continuously variable bitrate coding. The proposed method generates feature transformation parameters through a conditional network according to the input spatial quality map. These parameters are then used to adaptively transform the intermediate features of the encoder, decoder, and spatiotemporal entropy model in the codec, thus enabling variable bitrate coding. Additionally, to improve the compression efficiency of the codec, we propose incorporating the quality map of the preceding frame into the hyperprior encoder and leveraging the temporal prior encoder. A multi-stage training strategy is employed to jointly train the codec with a multi-frame rate-distortion loss function. The experimental results demonstrate that the proposed method can achieve continuously variable bitrate adaptation while maintaining rate-distortion performance comparable to the fixed bitrate model. Furthermore, the proposed method also supports ROI-based compression.

AB - In this paper, we propose a learning-based video compression which can perform continuously variable bitrate coding. The proposed method generates feature transformation parameters through a conditional network according to the input spatial quality map. These parameters are then used to adaptively transform the intermediate features of the encoder, decoder, and spatiotemporal entropy model in the codec, thus enabling variable bitrate coding. Additionally, to improve the compression efficiency of the codec, we propose incorporating the quality map of the preceding frame into the hyperprior encoder and leveraging the temporal prior encoder. A multi-stage training strategy is employed to jointly train the codec with a multi-frame rate-distortion loss function. The experimental results demonstrate that the proposed method can achieve continuously variable bitrate adaptation while maintaining rate-distortion performance comparable to the fixed bitrate model. Furthermore, the proposed method also supports ROI-based compression.

KW - Deep learning

KW - ROI-based compression

KW - Variable-rate compression

KW - Video compression

UR - http://www.scopus.com/inward/record.url?scp=85216868942&partnerID=8YFLogxK

U2 - 10.1109/ICIP51287.2024.10647741

DO - 10.1109/ICIP51287.2024.10647741

M3 - 会议稿件

AN - SCOPUS:85216868942

T3 - Proceedings - International Conference on Image Processing, ICIP

SP - 3723

EP - 3729

BT - 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings

PB - IEEE Computer Society

T2 - 31st IEEE International Conference on Image Processing, ICIP 2024

Y2 - 27 October 2024 through 30 October 2024

ER -

LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this