SS-GANs: Text-to-image via stage by stage generative adversarial networks

Ming Tian, Yuting Xue, Chunna Tian, Lei Wang, Donghu Deng, Wei Wei

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Realistic text-to-image synthesis has achieved great improvements in recent years. However, most work ignores the relationship between low and high resolution and prefers to adopt identical module in different stages. It is obviously inappropriate because the differences in various generation stages are huge. Therefore, we propose a novel structure of network named SS-GANs, in which specific modules are added in different stages to satisfy the unique requirements. In addition, we also explore an effective training way named coordinated train and a simple negative sample selection mechanism. Lastly, we train our model on Oxford-102 dataset, which outperforms the state-of-the-art models.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision 2nd Chinese Conference, PRCV 2019, Proceedings, Part II
EditorsZhouchen Lin, Liang Wang, Tieniu Tan, Jian Yang, Guangming Shi, Nanning Zheng, Xilin Chen, Yanning Zhang
PublisherSpringer
Pages475-486
Number of pages12
ISBN (Print)9783030317225
DOIs
StatePublished - 2019
Event2nd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2019 - Xi'an, China
Duration: 8 Nov 201911 Nov 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11858 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2nd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2019
Country/TerritoryChina
CityXi'an
Period8/11/1911/11/19

Keywords

  • Coordinated train
  • Different stages
  • Negative samples
  • Text-to-image

Fingerprint

Dive into the research topics of 'SS-GANs: Text-to-image via stage by stage generative adversarial networks'. Together they form a unique fingerprint.

Cite this