DVC-P: Deep Video Compression with Perceptual Optimizations

Saiping Zhang, Marta Mrak, Luis Herranz, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Recent years have witnessed the significant development of learning-based video compression methods, which aim at optimizing objective or perceptual quality and bit rates. In this paper, we introduce deep video compression with perceptual op-timizations (DVC-P), which aims at increasing perceptual quality of decoded videos. Our proposed DVC-P is based on Deep Video Compression (DVC) network, but improves it with perceptual optimizations. Specifically, a discriminator network and a mixed loss are employed to help our network trade off among distortion, perception and rate. Furthermore, nearest-neighbor interpolation is used to eliminate checkerboard artifacts which can appear in sequences encoded with DVC frameworks. Thanks to these two improvements, the perceptual quality of decoded sequences is improved. Experimental results demonstrate that, compared with the baseline DVC, our proposed method can generate videos with higher perceptual quality achieving 12.27% reduction in a perceptual BD- rate equivalent, on average.

Original languageEnglish
Title of host publication2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728185514
DOIs
StatePublished - 2021
Event2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Munich, Germany
Duration: 5 Dec 20218 Dec 2021

Publication series

Name2021 International Conference on Visual Communications and Image Processing, VCIP 2021 - Proceedings

Conference

Conference2021 International Conference on Visual Communications and Image Processing, VCIP 2021
Country/TerritoryGermany
CityMunich
Period5/12/218/12/21

Keywords

  • Generative adversarial network
  • Spatial interpolation
  • Video compression

Fingerprint

Dive into the research topics of 'DVC-P: Deep Video Compression with Perceptual Optimizations'. Together they form a unique fingerprint.

Cite this