Exploring the Benefits of Mixture of Generators and Discriminators in GAN Training | Summaries Painting

Lessons Learned from the Training of GANs on

Artificial Datasets

Shichang Tang1, 2, 3

1School of Information Science & Technology, ShanghaiTech University

2Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences

3University of Chinese Academy of Sciences

Abstract —Generative Adversarial Networks (GANs)

have made great progress in synthesizing realistic im-

ages in recent years. However, they are often trained on

image datasets with either too few samples or too many

classes belonging to different data distributions. Conse-

quently, GANs are prone to underfitting or overfitting,

making the analysis of them difficult and constrained.

Therefore, in order to conduct a thorough study on

GANs while obviating unnecessary interferences in-

troduced by the datasets, we train them on artificial

datasets where there are infinitely many samples and

the real data distributions are simple, high-dimensional

and have structured manifolds. Moreover, the genera-

tors are designed such that optimal sets of parameters

exist. Empirically, we find that under various distance

measures, the generator fails to learn such parameters

with the GAN training procedure. We also find that

training mixtures of GANs leads to more performance

gain compared to increasing the network depth or

width when the model complexity is high enough. Our

experimental results demonstrate that a mixture of

generators can discover different modes or different

classes automatically in an unsupervised setting, which

we attribute to the distribution of the generation and

discrimination tasks across multiple generators and

discriminators. As an example of the generalizability

of our conclusions to realistic datasets, we train a

mixture of GANs on the CIFAR-10 dataset and our

method significantly outperforms the state-of-the-art

in terms of popular metrics, i.e., Inception Score (IS)

and Fréchet Inception Distance (FID).

I. Introduction

The past few years have witnessed the arising popularity

of generative models. As can be seen, image processing

(e.g., image super-resolution and editing) and machine

learning (e.g., reinforcement learning and semi-supervised

learning) tasks are infused strong energy by generative

models [1]. Typically, a generative model learns a distri-

bution Pgto approximate the true distribution Pr, given

a set of observed samples.

Generative Adversarial Network [2], with no doubt, is

the most prevailing generative model. It is composed of a

generator Gthat maps random noise to synthesized data

points, and a discriminator Dwhich aims to tell whether

its input comes from the real data distribution Pror

generative distribution Pg. During training, Dand Gare

updated simultaneously or alternatingly. In a vanilla GAN,

Dgives an estimate of the Jensen–Shannon divergence

between Prand Pgwhile Gtries to minimize it [2].

Unfortunately, the objective of Gcan get saturated

when Pgand Prdo not have an non-negligible overlapping

manifold, causing vanishing gradients to the generator

[3]. Let Zand Xbe the domain and codomain of G

respectively. G(Z)is contained in a countable union of

manifolds of dimension at most dim Z. Then, according to

[3], if the dimension of Zis less than that of X,G(Z)will

be a set of measure 0in X,Prand Pgcan be distinguished

with accuracy 1by Dand thus no gradient is provided

to G. Besides, GANs suffer from mode col lapse. Mode

collapse refers to the phenomenon that the samples of the

generator lacks the diversity exhibited in Pr. [4] prove that

the generator can fool the discriminator by generating a

limited number of images from the training set. In other

cases of mode collapse, the generated samples are even

meaningless as Gneeds only to fool Din the current

iteration. When mode collapse happens, the model fails

to generate diverse and realistic data.

To cope with these challenges, variants of GAN were

proposed (e.g., [5]–[10]). Limited by the fact that these

methods are applied to high-dimensional realistic datasets

with inadequate samples from each class, the behavior

of GANs remains not completely understood. Another

problem with realistic datasets is that the performance

of GANs can degrade simply due to data scarcity or

insufficient model complexity [4], [11].

Considering that we aim to study the behavior of GANs,

conventional image datasets might not be good choices.

Hence we train GANs on artificially constructed datasets

(e.g., mixtures of Gaussians in high dimensional space),

applying neural networks with sufficiently high capacity.

In this way, we can avoid the influence of the aforemen-

tioned factors and focus on the inherent problems of GAN

training.

The contributions of this work can be summarized as

follows:

•We propose a set of metrics for evaluating GANs

trained on the artificial datasets.

•We designed controlled experiments where we can

adjust the network width/depth, the mixture of net-

works, and the training set size, and then relate them

to the performance of GANs.

arXiv:2007.06418v2 [cs.LG] 14 Jul 2020

Exploring the Benefits of Mixture of Generators and Discriminators in GAN Training, Summaries of Painting

Related documents

Partial preview of the text

Download Exploring the Benefits of Mixture of Generators and Discriminators in GAN Training and more Summaries Painting in PDF only on Docsity!

Lessons Learned from the Training of GANs on

Artificial Datasets

Shichang Tang1, 2, 3

arXiv:2007.06418v2 [cs.LG] 14 Jul 2020

A. WGAN-GP

x

y

x’

y’

z’