Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

Chengyi Zou, Shuai Wan, Tiannan Ji, Marc Gorriz Blanch, Marta Mrak, Luis Herranz

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Neural networks can be successfully used for cross-component prediction in video coding. In particular, attention-based architectures are suitable for chroma intra prediction using luma information because of their capability to model relations between difierent channels. However, the complexity of such methods is still very high and should be further reduced, especially for decoding. In this paper, a cost-effective attention-based neural network is designed for chroma intra prediction. Moreover, with the goal of further improving coding performance, a novel approach is introduced to utilize more boundary information effectively. In addition to improving prediction, a simplification methodology is also proposed to reduce inference complexity by simplifying convolutions. The proposed schemes are integrated into H.266/Versatile Video Coding (VVC) pipeline, and only one additional binary block-level syntax flag is introduced to indicate whether a given block makes use of the proposed method. Experimental results demonstrate that the proposed scheme achieves up to -0.46%/-2.29%/-2.17% BD-rate reduction on Y/Cb/Cr components, respectively, compared with H.266/VVC anchor. Reductions in the encoding and decoding complexity of up to 22% and 61%, respectively, are achieved by the proposed scheme with respect to the previous attention-based chroma intra prediction method while maintaining coding performance.

Original languageEnglish
Pages (from-to)549-560
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume34
Issue number1
DOIs
StatePublished - 1 Jan 2024

Keywords

  • attention-based neural networks
  • Chroma intra prediction
  • complexity reduction

Fingerprint

Dive into the research topics of 'Chroma Intra Prediction With Lightweight Attention-Based Neural Networks'. Together they form a unique fingerprint.

Cite this