TY - JOUR
T1 - Dual Discriminator Weighted Mixture Generative Adversarial Network for image generation
AU - Liu, Bao
AU - Wang, Liang
AU - Wang, Jingting
AU - Zhang, Jinyu
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2023/8
Y1 - 2023/8
AB - Image generation is an active research topic in machine learning and computer vision. The Generative Adversarial Network (GAN), a representative image generation algorithm, suffers from mode collapse in practice. The proposed Dual Discriminator Weighted Mixture Generative Adversarial Network (D2WMGAN) addresses this problem. On the one hand, D2WMGAN uses a mixture of distributions from multiple generators to approximate the real distribution; to prevent the degenerate case in which all generators learn the same distribution and produce the same class of samples, a classifier plays a game against the generators so that different generators learn different distributions. On the other hand, the objective function of D2WMGAN weights the Kullback–Leibler (KL) divergence and the reverse KL divergence, exploiting their complementary characteristics to improve both the quality and the diversity of generated samples. The conditional optimality of D2WMGAN is then proved theoretically, showing that the multiple generators can learn the real data distribution when the discriminator and classifier are optimal. Finally, extensive experiments are conducted on synthetic data and real-world large-scale datasets (such as CIFAR-10 and MNIST), and commonly used GAN evaluation metrics (Wasserstein distance, JS divergence, Inception Score, and Fréchet Inception Distance) are used for comparative analysis. Experimental results show that the proposed D2WMGAN approach better learns multi-mode data, generates rich and realistic samples, and effectively solves the problem of mode collapse.
KW - Dual Discriminator Generative Adversarial Network
KW - Generative Adversarial Network
KW - Mixture Generative Adversarial Network
KW - Mode collapse
UR - http://www.scopus.com/inward/record.url?scp=85124083060&partnerID=8YFLogxK
DO - 10.1007/s12652-021-03667-y
M3 - Article
AN - SCOPUS:85124083060
SN - 1868-5137
VL - 14
SP - 10013
EP - 10025
JO - Journal of Ambient Intelligence and Humanized Computing
JF - Journal of Ambient Intelligence and Humanized Computing
IS - 8
ER -