From Distortion Manifold to Perceptual Quality: a Data Efficient Blind ImageÂ Quality Assessment Approach

Shaolin Su; Qingsen Yan; Yu Zhu; Jinqiu Sun; Yanning Zhang

doi:10.1016/j.patcog.2022.109047

From Distortion Manifold to Perceptual Quality: a Data Efficient Blind ImageÂ Quality Assessment Approach

Shaolin Su, Qingsen Yan, Yu Zhu, Jinqiu Sun, Yanning Zhang

School of Computer Science

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

16 Scopus citations

Abstract

Though current no-reference image quality assessment (NR-IQA) approaches have achieved impressive performance gain thanks to deep learning techniques, it is claimed that the risk of over-fitting exists. To improve model generalization ability, most of the current researches incorporate mass data to train or tune the data-driven models. However, the process of image data collection and quality label annotation is quite time-consuming and labour-intensive. Therefore, in this paper, we explore an alternative solution to promote model generalizability but with relatively small fractions of training data. Compared with previous approaches which make effort to approximate the whole complex image distribution, we propose to explicitly learn an image distortion manifold first, which lies in a much lower dimension space and also representative in capturing general degradation patterns. We then project the images to their perceived quality from the learned manifold to obtain quality predictions. Since the manifold embeds general distortion features despite of varying image contents, it can be learned with relatively small amount of samples. In order to learn the manifold and quality projection, we introduce a two-branched network to learn both low level distortions and high level semantics. We also propose a simple but efficient training framework, composing of a masked labelling strategy and a gradual weighting curriculum to fulfill the task. Thanks to the learned distortion manifold, the proposed model achieves superior generalizability compared with previous models. Extensive experiments demonstrate its effectiveness in terms of training with limited data, testing on large scale images, and with unseen types of distorted images.

Original language	English
Article number	109047
Journal	Pattern Recognition
Volume	133
DOIs	https://doi.org/10.1016/j.patcog.2022.109047
State	Published - Jan 2023

Keywords

Distortion manifold
Generalizability
Image quality assessment
No-Reference

Access to Document

10.1016/j.patcog.2022.109047

Cite this

@article{5160c4488f0843fca6402c467884b0be,

title = "From Distortion Manifold to Perceptual Quality: a Data Efficient Blind Image{\^A} Quality Assessment Approach",

abstract = "Though current no-reference image quality assessment (NR-IQA) approaches have achieved impressive performance gain thanks to deep learning techniques, it is claimed that the risk of over-fitting exists. To improve model generalization ability, most of the current researches incorporate mass data to train or tune the data-driven models. However, the process of image data collection and quality label annotation is quite time-consuming and labour-intensive. Therefore, in this paper, we explore an alternative solution to promote model generalizability but with relatively small fractions of training data. Compared with previous approaches which make effort to approximate the whole complex image distribution, we propose to explicitly learn an image distortion manifold first, which lies in a much lower dimension space and also representative in capturing general degradation patterns. We then project the images to their perceived quality from the learned manifold to obtain quality predictions. Since the manifold embeds general distortion features despite of varying image contents, it can be learned with relatively small amount of samples. In order to learn the manifold and quality projection, we introduce a two-branched network to learn both low level distortions and high level semantics. We also propose a simple but efficient training framework, composing of a masked labelling strategy and a gradual weighting curriculum to fulfill the task. Thanks to the learned distortion manifold, the proposed model achieves superior generalizability compared with previous models. Extensive experiments demonstrate its effectiveness in terms of training with limited data, testing on large scale images, and with unseen types of distorted images.",

keywords = "Distortion manifold, Generalizability, Image quality assessment, No-Reference",

author = "Shaolin Su and Qingsen Yan and Yu Zhu and Jinqiu Sun and Yanning Zhang",

note = "Publisher Copyright: {\textcopyright} 2022 Elsevier Ltd",

year = "2023",

month = jan,

doi = "10.1016/j.patcog.2022.109047",

language = "英语",

volume = "133",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd",

}

TY - JOUR

T1 - From Distortion Manifold to Perceptual Quality

T2 - a Data Efficient Blind ImageÂ Quality Assessment Approach

AU - Su, Shaolin

AU - Yan, Qingsen

AU - Zhu, Yu

AU - Sun, Jinqiu

AU - Zhang, Yanning

PY - 2023/1

Y1 - 2023/1

N2 - Though current no-reference image quality assessment (NR-IQA) approaches have achieved impressive performance gain thanks to deep learning techniques, it is claimed that the risk of over-fitting exists. To improve model generalization ability, most of the current researches incorporate mass data to train or tune the data-driven models. However, the process of image data collection and quality label annotation is quite time-consuming and labour-intensive. Therefore, in this paper, we explore an alternative solution to promote model generalizability but with relatively small fractions of training data. Compared with previous approaches which make effort to approximate the whole complex image distribution, we propose to explicitly learn an image distortion manifold first, which lies in a much lower dimension space and also representative in capturing general degradation patterns. We then project the images to their perceived quality from the learned manifold to obtain quality predictions. Since the manifold embeds general distortion features despite of varying image contents, it can be learned with relatively small amount of samples. In order to learn the manifold and quality projection, we introduce a two-branched network to learn both low level distortions and high level semantics. We also propose a simple but efficient training framework, composing of a masked labelling strategy and a gradual weighting curriculum to fulfill the task. Thanks to the learned distortion manifold, the proposed model achieves superior generalizability compared with previous models. Extensive experiments demonstrate its effectiveness in terms of training with limited data, testing on large scale images, and with unseen types of distorted images.

AB - Though current no-reference image quality assessment (NR-IQA) approaches have achieved impressive performance gain thanks to deep learning techniques, it is claimed that the risk of over-fitting exists. To improve model generalization ability, most of the current researches incorporate mass data to train or tune the data-driven models. However, the process of image data collection and quality label annotation is quite time-consuming and labour-intensive. Therefore, in this paper, we explore an alternative solution to promote model generalizability but with relatively small fractions of training data. Compared with previous approaches which make effort to approximate the whole complex image distribution, we propose to explicitly learn an image distortion manifold first, which lies in a much lower dimension space and also representative in capturing general degradation patterns. We then project the images to their perceived quality from the learned manifold to obtain quality predictions. Since the manifold embeds general distortion features despite of varying image contents, it can be learned with relatively small amount of samples. In order to learn the manifold and quality projection, we introduce a two-branched network to learn both low level distortions and high level semantics. We also propose a simple but efficient training framework, composing of a masked labelling strategy and a gradual weighting curriculum to fulfill the task. Thanks to the learned distortion manifold, the proposed model achieves superior generalizability compared with previous models. Extensive experiments demonstrate its effectiveness in terms of training with limited data, testing on large scale images, and with unseen types of distorted images.

KW - Distortion manifold

KW - Generalizability

KW - Image quality assessment

KW - No-Reference

UR - http://www.scopus.com/inward/record.url?scp=85138442822&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2022.109047

DO - 10.1016/j.patcog.2022.109047

M3 - 文章

AN - SCOPUS:85138442822

SN - 0031-3203

VL - 133

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 109047

ER -

From Distortion Manifold to Perceptual Quality: a Data Efficient Blind ImageÂ Quality Assessment Approach

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this