Title
Semantics-enhanced adversarial nets for text-to-image synthesis
Document Type
Conference Proceeding
Publication Date
10-1-2019
Abstract
This paper presents a new model, the Semantics-enhanced Generative Adversarial Network (SEGAN), for fine-grained text-to-image generation. We introduce two modules into SEGAN: a Semantic Consistency Module (SCM) and an Attention Competition Module (ACM). The SCM incorporates image-level semantic consistency into the training of the Generative Adversarial Network (GAN), diversifying the generated images and improving their structural coherence. A Siamese network and two types of semantic similarity are designed to map the synthesized image and the ground-truth image to nearby points in the latent semantic feature space. The ACM constructs adaptive attention weights to differentiate keywords from unimportant words, improving the stability and accuracy of SEGAN. Extensive experiments demonstrate that SEGAN significantly outperforms existing state-of-the-art methods in generating photo-realistic images. All source code and models will be released for comparative study.
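Illustrative Code Sketches
The abstract describes the SCM as a Siamese network that maps the synthesized image and its ground-truth image to nearby points in a latent semantic feature space. A minimal contrastive-style sketch of that idea in PyTorch follows; the shared encoder, the margin value, and the use of a mismatched image as the negative example are assumptions made for illustration, not details confirmed by the paper.

    import torch
    import torch.nn.functional as F

    def semantic_consistency_loss(encoder, fake_img, real_img, mismatched_img, margin=1.0):
        # Shared-weight (Siamese) encoder maps all images into the latent semantic space.
        z_fake = encoder(fake_img)        # synthesized image
        z_real = encoder(real_img)        # ground-truth image for the same caption
        z_neg = encoder(mismatched_img)   # image drawn from a different caption

        # Pull the synthesized image toward its ground-truth image ...
        d_pos = F.pairwise_distance(z_fake, z_real)
        # ... and push it away from the mismatched image, up to the margin.
        d_neg = F.pairwise_distance(z_fake, z_neg)
        return d_pos.pow(2).mean() + F.relu(margin - d_neg).pow(2).mean()

The ACM is described as constructing adaptive attention weights that separate keywords from unimportant words. One plausible, assumed realization is to threshold a softmax attention distribution against a fraction of its own maximum, so that weak words drop out of the competition and the surviving weights are renormalized:

    def adaptive_word_weights(word_feats, img_feat, tau=0.5):
        # word_feats: (batch, num_words, dim); img_feat: (batch, dim).
        scores = torch.einsum('bld,bd->bl', word_feats, img_feat)
        attn = torch.softmax(scores, dim=-1)
        # Keep only words whose weight survives an adaptive threshold;
        # tau is a hypothetical hyperparameter, not taken from the paper.
        keep = attn >= tau * attn.max(dim=-1, keepdim=True).values
        attn = torch.where(keep, attn, torch.zeros_like(attn))
        return attn / attn.sum(dim=-1, keepdim=True).clamp(min=1e-8)

Both sketches only make the abstract's mechanisms concrete under the stated assumptions; the source code the authors plan to release is the authoritative reference.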
Publication Source (Journal or Book title)
Proceedings of the IEEE International Conference on Computer Vision
First Page
10500
Last Page
10509
Recommended Citation
Tan, H., Liu, X., Li, X., Zhang, Y., & Yin, B. (2019). Semantics-enhanced adversarial nets for text-to-image synthesis. Proceedings of the IEEE International Conference on Computer Vision, 2019-October, 10500-10509. https://doi.org/10.1109/ICCV.2019.01060