TY - JOUR
T1 - Adaptive Chroma Prediction Based on Luma Difference for H.266/VVC
AU - Huo, Junyan
AU - Wang, Danni
AU - Yuan, Hui
AU - Wan, Shuai
AU - Yang, Fuzheng
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Cross-component chroma prediction plays an important role in improving the coding efficiency of H.266/VVC. We use the luma differences between reference samples and the sample to be predicted to design an attention model for chroma prediction, termed luma difference-based chroma prediction (LDCP). Specifically, the luma differences (LDs) between the reference samples and the sample to be predicted serve as the input to the attention model, which is designed as a softmax function that maps the LDs to chroma weights nonlinearly. A weighted chroma prediction is then conducted based on these weights and the chroma reference samples. To provide adaptive weights, the model parameter of the softmax function can be determined either from the template (T-LDCP) or by offline learning (L-LDCP). Experimental results show that T-LDCP achieves BD-rate reductions of 0.34%, 2.02%, and 2.34% for the Y, Cb, and Cr components, while L-LDCP brings BD-rate savings of 0.32%, 2.06%, and 2.21%, respectively. L-LDCP introduces only slight encoding and decoding time increments of 2% and 1% when integrated into VVC test model version 18.0. Moreover, LDCP can be implemented with pixel-level parallelization, making it hardware-friendly.
AB - Cross-component chroma prediction plays an important role in improving the coding efficiency of H.266/VVC. We use the luma differences between reference samples and the sample to be predicted to design an attention model for chroma prediction, termed luma difference-based chroma prediction (LDCP). Specifically, the luma differences (LDs) between the reference samples and the sample to be predicted serve as the input to the attention model, which is designed as a softmax function that maps the LDs to chroma weights nonlinearly. A weighted chroma prediction is then conducted based on these weights and the chroma reference samples. To provide adaptive weights, the model parameter of the softmax function can be determined either from the template (T-LDCP) or by offline learning (L-LDCP). Experimental results show that T-LDCP achieves BD-rate reductions of 0.34%, 2.02%, and 2.34% for the Y, Cb, and Cr components, while L-LDCP brings BD-rate savings of 0.32%, 2.06%, and 2.21%, respectively. L-LDCP introduces only slight encoding and decoding time increments of 2% and 1% when integrated into VVC test model version 18.0. Moreover, LDCP can be implemented with pixel-level parallelization, making it hardware-friendly.
KW - cross-component prediction
KW - softmax function
KW - versatile video coding
KW - video coding
KW - weighted chroma prediction
UR - http://www.scopus.com/inward/record.url?scp=85177068004&partnerID=8YFLogxK
U2 - 10.1109/TIP.2023.3330607
DO - 10.1109/TIP.2023.3330607
M3 - Article
C2 - 37956019
AN - SCOPUS:85177068004
SN - 1057-7149
VL - 32
SP - 6318
EP - 6331
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
ER -