TY - JOUR
T1 - Adaptive Enhanced Global Intra Prediction for Efficient Video Coding in Beyond VVC
AU - Huo, Junyan
AU - Ma, Yanzhuo
AU - Zhang, Zhenyao
AU - Zhang, Hongli
AU - Yuan, Hui
AU - Wan, Shuai
AU - Yang, Fuzheng
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Global intra prediction (GIP), including intra-block copy and template matching prediction (TMP), exploits the global correlation within the same image to improve coding efficiency. In Beyond VVC, TMP uses template matching to determine the reference blocks for efficient prediction. There usually exists an error between the coding block and the reference blocks, caused by content mismatch or coding distortion of the reference blocks. We propose an enhancement of the reference blocks, namely enhanced GIP (EGIP). Specifically, we design an enhanced filter according to the templates of the coding block and the reference blocks, with the reconstructed template of the coding block serving as the label for supervised learning. To support different enhancements, we design two types of inputs, i.e., EGIP based on neighboring samples (N-EGIP) and EGIP based on multiple hypothesis references (M-EGIP). Experimental results show that, based on the enhanced compression model (ECM) version 8.0, N-EGIP achieves BD-rate reductions of 0.37%, 0.42%, and 0.40%, and M-EGIP brings 0.34%, 0.37%, and 0.34% BD-rate savings for the Y, Cb, and Cr components, respectively. A higher coding gain, with BD-rate savings of 0.46%, 0.54%, and 0.52%, can be achieved by integrating N-EGIP and M-EGIP. Owing to its coding gain and small complexity increase, the proposed EGIP has been adopted in the exploration of Beyond VVC and integrated into its reference software.
AB - Global intra prediction (GIP), including intra-block copy and template matching prediction (TMP), exploits the global correlation within the same image to improve coding efficiency. In Beyond VVC, TMP uses template matching to determine the reference blocks for efficient prediction. There usually exists an error between the coding block and the reference blocks, caused by content mismatch or coding distortion of the reference blocks. We propose an enhancement of the reference blocks, namely enhanced GIP (EGIP). Specifically, we design an enhanced filter according to the templates of the coding block and the reference blocks, with the reconstructed template of the coding block serving as the label for supervised learning. To support different enhancements, we design two types of inputs, i.e., EGIP based on neighboring samples (N-EGIP) and EGIP based on multiple hypothesis references (M-EGIP). Experimental results show that, based on the enhanced compression model (ECM) version 8.0, N-EGIP achieves BD-rate reductions of 0.37%, 0.42%, and 0.40%, and M-EGIP brings 0.34%, 0.37%, and 0.34% BD-rate savings for the Y, Cb, and Cr components, respectively. A higher coding gain, with BD-rate savings of 0.46%, 0.54%, and 0.52%, can be achieved by integrating N-EGIP and M-EGIP. Owing to its coding gain and small complexity increase, the proposed EGIP has been adopted in the exploration of Beyond VVC and integrated into its reference software.
KW - Beyond VVC
KW - enhanced filter
KW - intra template matching
KW - reference templates
KW - Video coding
UR - http://www.scopus.com/inward/record.url?scp=85216815686&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2025.3535951
DO - 10.1109/TCSVT.2025.3535951
M3 - Article
AN - SCOPUS:85216815686
SN - 1051-8215
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
ER -