DVC-P: Deep Video Compression with Perceptual Optimizations

Saiping Zhang; Marta Mrak; Luis Herranz; Marc Gorriz Blanch; Shuai Wan; Fuzheng Yang

doi:10.1109/VCIP53242.2021.9675350

DVC-P: Deep Video Compression with Perceptual Optimizations

Saiping Zhang, Marta Mrak, Luis Herranz, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang

电子信息学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

7 引用（Scopus）

摘要

Recent years have witnessed the significant development of learning-based video compression methods, which aim at optimizing objective or perceptual quality and bit rates. In this paper, we introduce deep video compression with perceptual op-timizations (DVC-P), which aims at increasing perceptual quality of decoded videos. Our proposed DVC-P is based on Deep Video Compression (DVC) network, but improves it with perceptual optimizations. Specifically, a discriminator network and a mixed loss are employed to help our network trade off among distortion, perception and rate. Furthermore, nearest-neighbor interpolation is used to eliminate checkerboard artifacts which can appear in sequences encoded with DVC frameworks. Thanks to these two improvements, the perceptual quality of decoded sequences is improved. Experimental results demonstrate that, compared with the baseline DVC, our proposed method can generate videos with higher perceptual quality achieving 12.27% reduction in a perceptual BD- rate equivalent, on average.

源语言	英语
主期刊名	2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings
出版商	Institute of Electrical and Electronics Engineers Inc.
ISBN（电子版）	9781728185514
DOI	https://doi.org/10.1109/VCIP53242.2021.9675350
出版状态	已出版 - 2021
活动	2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Munich, 德国期限: 5 12月 2021 → 8 12月 2021

出版系列

姓名	2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings

会议

会议	2021 International Conference on Visual Communications and Image Processing, VCIP 2021
国家/地区	德国
市	Munich
时期	5/12/21 → 8/12/21

访问文件

10.1109/VCIP53242.2021.9675350

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, S., Mrak, M., Herranz, L., Blanch, M. G., Wan, S., & Yang, F. (2021). DVC-P: Deep Video Compression with Perceptual Optimizations. 在 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings (2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/VCIP53242.2021.9675350

Zhang, Saiping ; Mrak, Marta ; Herranz, Luis 等. / DVC-P : Deep Video Compression with Perceptual Optimizations. 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2021. (2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings).

@inproceedings{452c396fd4d942528169f7543c40896e,

title = "DVC-P: Deep Video Compression with Perceptual Optimizations",

abstract = "Recent years have witnessed the significant development of learning-based video compression methods, which aim at optimizing objective or perceptual quality and bit rates. In this paper, we introduce deep video compression with perceptual op-timizations (DVC-P), which aims at increasing perceptual quality of decoded videos. Our proposed DVC-P is based on Deep Video Compression (DVC) network, but improves it with perceptual optimizations. Specifically, a discriminator network and a mixed loss are employed to help our network trade off among distortion, perception and rate. Furthermore, nearest-neighbor interpolation is used to eliminate checkerboard artifacts which can appear in sequences encoded with DVC frameworks. Thanks to these two improvements, the perceptual quality of decoded sequences is improved. Experimental results demonstrate that, compared with the baseline DVC, our proposed method can generate videos with higher perceptual quality achieving 12.27% reduction in a perceptual BD- rate equivalent, on average.",

keywords = "Generative adversarial network, Spatial interpolation, Video compression",

author = "Saiping Zhang and Marta Mrak and Luis Herranz and Blanch, {Marc Gorriz} and Shuai Wan and Fuzheng Yang",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 ; Conference date: 05-12-2021 Through 08-12-2021",

year = "2021",

doi = "10.1109/VCIP53242.2021.9675350",

language = "英语",

series = "2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings",

}

Zhang, S, Mrak, M, Herranz, L, Blanch, MG, Wan, S & Yang, F 2021, DVC-P: Deep Video Compression with Perceptual Optimizations. 在 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings. 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings, Institute of Electrical and Electronics Engineers Inc., 2021 International Conference on Visual Communications and Image Processing, VCIP 2021, Munich, 德国, 5/12/21. https://doi.org/10.1109/VCIP53242.2021.9675350

DVC-P: Deep Video Compression with Perceptual Optimizations. / Zhang, Saiping; Mrak, Marta; Herranz, Luis 等.
2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2021. (2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - DVC-P

T2 - 2021 International Conference on Visual Communications and Image Processing, VCIP 2021

AU - Zhang, Saiping

AU - Mrak, Marta

AU - Herranz, Luis

AU - Blanch, Marc Gorriz

AU - Wan, Shuai

AU - Yang, Fuzheng

PY - 2021

Y1 - 2021

N2 - Recent years have witnessed the significant development of learning-based video compression methods, which aim at optimizing objective or perceptual quality and bit rates. In this paper, we introduce deep video compression with perceptual op-timizations (DVC-P), which aims at increasing perceptual quality of decoded videos. Our proposed DVC-P is based on Deep Video Compression (DVC) network, but improves it with perceptual optimizations. Specifically, a discriminator network and a mixed loss are employed to help our network trade off among distortion, perception and rate. Furthermore, nearest-neighbor interpolation is used to eliminate checkerboard artifacts which can appear in sequences encoded with DVC frameworks. Thanks to these two improvements, the perceptual quality of decoded sequences is improved. Experimental results demonstrate that, compared with the baseline DVC, our proposed method can generate videos with higher perceptual quality achieving 12.27% reduction in a perceptual BD- rate equivalent, on average.

AB - Recent years have witnessed the significant development of learning-based video compression methods, which aim at optimizing objective or perceptual quality and bit rates. In this paper, we introduce deep video compression with perceptual op-timizations (DVC-P), which aims at increasing perceptual quality of decoded videos. Our proposed DVC-P is based on Deep Video Compression (DVC) network, but improves it with perceptual optimizations. Specifically, a discriminator network and a mixed loss are employed to help our network trade off among distortion, perception and rate. Furthermore, nearest-neighbor interpolation is used to eliminate checkerboard artifacts which can appear in sequences encoded with DVC frameworks. Thanks to these two improvements, the perceptual quality of decoded sequences is improved. Experimental results demonstrate that, compared with the baseline DVC, our proposed method can generate videos with higher perceptual quality achieving 12.27% reduction in a perceptual BD- rate equivalent, on average.

KW - Generative adversarial network

KW - Spatial interpolation

KW - Video compression

UR - http://www.scopus.com/inward/record.url?scp=85125252475&partnerID=8YFLogxK

U2 - 10.1109/VCIP53242.2021.9675350

DO - 10.1109/VCIP53242.2021.9675350

M3 - 会议稿件

AN - SCOPUS:85125252475

T3 - 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings

BT - 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 5 December 2021 through 8 December 2021

ER -

Zhang S, Mrak M, Herranz L, Blanch MG, Wan S, Yang F. DVC-P: Deep Video Compression with Perceptual Optimizations. 在 2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2021. (2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings). doi: 10.1109/VCIP53242.2021.9675350

DVC-P: Deep Video Compression with Perceptual Optimizations

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此