Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

Chengyi Zou; Shuai Wan; Tiannan Ji; Marc Gorriz Blanch; Marta Mrak; Luis Herranz

doi:10.1109/TCSVT.2023.3282980

Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

Chengyi Zou, Shuai Wan, Tiannan Ji, Marc Gorriz Blanch, Marta Mrak, Luis Herranz

School of Electronics and Information

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Neural networks can be successfully used for cross-component prediction in video coding. In particular, attention-based architectures are suitable for chroma intra prediction using luma information because of their capability to model relations between difierent channels. However, the complexity of such methods is still very high and should be further reduced, especially for decoding. In this paper, a cost-effective attention-based neural network is designed for chroma intra prediction. Moreover, with the goal of further improving coding performance, a novel approach is introduced to utilize more boundary information effectively. In addition to improving prediction, a simplification methodology is also proposed to reduce inference complexity by simplifying convolutions. The proposed schemes are integrated into H.266/Versatile Video Coding (VVC) pipeline, and only one additional binary block-level syntax flag is introduced to indicate whether a given block makes use of the proposed method. Experimental results demonstrate that the proposed scheme achieves up to -0.46%/-2.29%/-2.17% BD-rate reduction on Y/Cb/Cr components, respectively, compared with H.266/VVC anchor. Reductions in the encoding and decoding complexity of up to 22% and 61%, respectively, are achieved by the proposed scheme with respect to the previous attention-based chroma intra prediction method while maintaining coding performance.

Original language	English
Pages (from-to)	549-560
Number of pages	12
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	34
Issue number	1
DOIs	https://doi.org/10.1109/TCSVT.2023.3282980
State	Published - 1 Jan 2024

Keywords

attention-based neural networks
Chroma intra prediction
complexity reduction

Access to Document

10.1109/TCSVT.2023.3282980

Cite this

@article{cc172d1c0bbc4698ba869b1adcc21b9f,

title = "Chroma Intra Prediction With Lightweight Attention-Based Neural Networks",

abstract = "Neural networks can be successfully used for cross-component prediction in video coding. In particular, attention-based architectures are suitable for chroma intra prediction using luma information because of their capability to model relations between difierent channels. However, the complexity of such methods is still very high and should be further reduced, especially for decoding. In this paper, a cost-effective attention-based neural network is designed for chroma intra prediction. Moreover, with the goal of further improving coding performance, a novel approach is introduced to utilize more boundary information effectively. In addition to improving prediction, a simplification methodology is also proposed to reduce inference complexity by simplifying convolutions. The proposed schemes are integrated into H.266/Versatile Video Coding (VVC) pipeline, and only one additional binary block-level syntax flag is introduced to indicate whether a given block makes use of the proposed method. Experimental results demonstrate that the proposed scheme achieves up to -0.46%/-2.29%/-2.17% BD-rate reduction on Y/Cb/Cr components, respectively, compared with H.266/VVC anchor. Reductions in the encoding and decoding complexity of up to 22% and 61%, respectively, are achieved by the proposed scheme with respect to the previous attention-based chroma intra prediction method while maintaining coding performance.",

keywords = "attention-based neural networks, Chroma intra prediction, complexity reduction",

author = "Chengyi Zou and Shuai Wan and Tiannan Ji and Blanch, {Marc Gorriz} and Marta Mrak and Luis Herranz",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2024",

month = jan,

day = "1",

doi = "10.1109/TCSVT.2023.3282980",

language = "英语",

volume = "34",

pages = "549--560",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "1",

}

TY - JOUR

T1 - Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

AU - Zou, Chengyi

AU - Wan, Shuai

AU - Ji, Tiannan

AU - Blanch, Marc Gorriz

AU - Mrak, Marta

AU - Herranz, Luis

PY - 2024/1/1

Y1 - 2024/1/1

N2 - Neural networks can be successfully used for cross-component prediction in video coding. In particular, attention-based architectures are suitable for chroma intra prediction using luma information because of their capability to model relations between difierent channels. However, the complexity of such methods is still very high and should be further reduced, especially for decoding. In this paper, a cost-effective attention-based neural network is designed for chroma intra prediction. Moreover, with the goal of further improving coding performance, a novel approach is introduced to utilize more boundary information effectively. In addition to improving prediction, a simplification methodology is also proposed to reduce inference complexity by simplifying convolutions. The proposed schemes are integrated into H.266/Versatile Video Coding (VVC) pipeline, and only one additional binary block-level syntax flag is introduced to indicate whether a given block makes use of the proposed method. Experimental results demonstrate that the proposed scheme achieves up to -0.46%/-2.29%/-2.17% BD-rate reduction on Y/Cb/Cr components, respectively, compared with H.266/VVC anchor. Reductions in the encoding and decoding complexity of up to 22% and 61%, respectively, are achieved by the proposed scheme with respect to the previous attention-based chroma intra prediction method while maintaining coding performance.

AB - Neural networks can be successfully used for cross-component prediction in video coding. In particular, attention-based architectures are suitable for chroma intra prediction using luma information because of their capability to model relations between difierent channels. However, the complexity of such methods is still very high and should be further reduced, especially for decoding. In this paper, a cost-effective attention-based neural network is designed for chroma intra prediction. Moreover, with the goal of further improving coding performance, a novel approach is introduced to utilize more boundary information effectively. In addition to improving prediction, a simplification methodology is also proposed to reduce inference complexity by simplifying convolutions. The proposed schemes are integrated into H.266/Versatile Video Coding (VVC) pipeline, and only one additional binary block-level syntax flag is introduced to indicate whether a given block makes use of the proposed method. Experimental results demonstrate that the proposed scheme achieves up to -0.46%/-2.29%/-2.17% BD-rate reduction on Y/Cb/Cr components, respectively, compared with H.266/VVC anchor. Reductions in the encoding and decoding complexity of up to 22% and 61%, respectively, are achieved by the proposed scheme with respect to the previous attention-based chroma intra prediction method while maintaining coding performance.

KW - attention-based neural networks

KW - Chroma intra prediction

KW - complexity reduction

UR - http://www.scopus.com/inward/record.url?scp=85161519110&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2023.3282980

DO - 10.1109/TCSVT.2023.3282980

M3 - 文章

AN - SCOPUS:85161519110

SN - 1051-8215

VL - 34

SP - 549

EP - 560

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 1

ER -

Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this