Demystifying MMD GANs

Mikolaj Binkowski,Danica J. Sutherland,M. Arbel,A. Gretton

Published 2018 in International Conference on Learning Representations

ABSTRACT

We investigate the training and performance of generative adversarial networks using the Maximum Mean Discrepancy (MMD) as critic, termed MMD GANs. As our main theoretical contribution, we clarify the situation with bias in GAN loss functions raised by recent work: we show that gradient estimators used in the optimization process for both MMD GANs and Wasserstein GANs are unbiased, but learning a discriminator based on samples leads to biased gradients for the generator parameters. We also discuss the issue of kernel choice for the MMD critic, and characterize the kernel corresponding to the energy distance used for the Cramer GAN critic. Being an integral probability metric, the MMD benefits from training strategies recently developed for Wasserstein GANs. In experiments, the MMD GAN is able to employ a smaller critic network than the Wasserstein GAN, resulting in a simpler and faster-training algorithm with matching performance. We also propose an improved measure of GAN convergence, the Kernel Inception Distance, and show how to use it to dynamically adapt learning rates during GAN training.

PUBLICATION RECORD

Publication year
2018
Venue
International Conference on Learning Representations
Publication date
2018-01-04
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1801.01401
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

GENERATIVE ADVERSARIAL NETS
2018influential reference
Towards the Automatic Anime Characters Creation with Generative Adversarial Networks
2017cited by this paper
GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium
2017cited by this paper
Comparison of Maximum Likelihood and GAN-based training of Real NVPs
2017cited by this paper
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks
2017cited by this paper
Wasserstein Generative Adversarial Networks
2017influential reference
Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step
2017cited by this paper
MMD GAN: Towards Deeper Understanding of Moment Matching Network
2017influential reference
Learning Generative Models with Sinkhorn Divergences
2017cited by this paper
Sinkhorn-AutoDiff: Tractable Wasserstein Learning of Generative Models
2017cited by this paper
Fisher GAN
2017cited by this paper
Distance Covariance in Metric Spaces
2017cited by this paper
BEGAN: Boundary Equilibrium Generative Adversarial Networks
2017influential reference
The Cramer Distance as a Solution to Biased Wasserstein Gradients
2017influential reference
Improved Training of Wasserstein GANs
2017influential reference
McGan: Mean and Covariance Feature Matching GAN
2017cited by this paper
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis
2017cited by this paper
Generalization and Equilibrium in Generative Adversarial Nets (GANs)
2017influential reference
Approximation and Convergence Properties of Generative Adversarial Learning
2017influential reference
Distributional Adversarial Networks
2017influential reference
Towards Principled Methods for Training Generative Adversarial Networks
2017influential reference
Do GANs actually learn the distribution? An empirical study
2017cited by this paper
Revisiting Classifier Two-Sample Tests
2016cited by this paper
DISCO Nets : DISsimilarity COefficients Networks
2016influential reference
Improved Techniques for Training GANs
2016cited by this paper
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization
2016cited by this paper
Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy
2016cited by this paper
Stacked Generative Adversarial Networks
2016influential reference
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
2015cited by this paper
Generative Moment Matching Networks
2015cited by this paper
Training generative neural networks via Maximum Mean Discrepancy optimization
2015cited by this paper
The Zero Set of a Real Analytic Function
2015cited by this paper
A note on the evaluation of generative models
2015cited by this paper
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
2015cited by this paper
Rethinking the Inception Architecture for Computer Vision
2015cited by this paper
A Test of Relative Similarity For Model Selection in Generative Models
2015cited by this paper
LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop
2015cited by this paper
Deep Learning Face Attributes in the Wild
2014cited by this paper
Generative adversarial networks
2014cited by this paper
Adam: A Method for Stochastic Optimization
2014cited by this paper
B-tests: Low Variance Kernel Two-Sample Tests
2013cited by this paper
Intriguing properties of neural networks
2013cited by this paper
A Kernel Two-Sample Test
2012influential reference
Equivalence of distance-based and RKHS-based statistics in hypothesis testing
2012influential reference
On the empirical estimation of integral probability metrics
2012cited by this paper
Better Mixing via Deep Representations
2012cited by this paper
Scikit-learn: Machine Learning in Python
2011influential reference
Universality, Characteristic Kernels and RKHS Embedding of Measures
2010cited by this paper
On integral probability metrics, φ-divergences and binary classification
2009cited by this paper
Learning Multiple Layers of Features from Tiny Images
2009cited by this paper
Hilbert Space Embeddings and Metrics on Probability Measures
2009cited by this paper
Kernel Choice and Classifiability for RKHS Embeddings of Probability Distributions
2009cited by this paper
Support vector machines
2008cited by this paper
Probability Theory: A Comprehensive Course
2008cited by this paper
Strictly Proper Scoring Rules, Prediction, and Estimation
2007influential reference
TESTING FOR EQUAL DISTRIBUTIONS IN HIGH DIMENSION
2004cited by this paper
Gradient-based learning applied to document recognition
1998cited by this paper
Integral Probability Metrics and Their Generating Classes of Functions
1997cited by this paper
Unbiased Estimation in Convex Families
1969cited by this paper
The Set of Nondifferentiability of a Continuous Function
1966cited by this paper
Moments of a Truncated Bivariate Normal Distribution
1961cited by this paper
Sur l'ensemble des points de non-dérivabilité d'une fonction continue
1946cited by this paper

CITED BY

MedVAR: Towards Scalable and Efficient Medical Image Generation via Next-scale Autoregressive Prediction
2026cites this paper
VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on
2026cites this paper
HP-GAN: Harnessing pretrained networks for GAN improvement with FakeTwins and discriminator consistency.
2026cites this paper
Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes
2026cites this paper
Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior Refinement
2026cites this paper
DreamLoop: Controllable Cinemagraph Generation from a Single Photograph
2026cites this paper
Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance
2026cites this paper
Understanding Frechet Speech Distance for Synthetic Speech Quality Evaluation
2026influential citation
HDR Reconstruction Boosting with Training-Free and Exposure-Consistent Diffusion
2026cites this paper
SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation
2026cites this paper
Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
2026cites this paper
Self-supervised restoration of singing voice degraded by pitch shifting using shallow diffusion
2026cites this paper
A Simulation-to-Real Transformation for Small-Sample Fault Diagnosis in Aeroengine Dual-Rotor Systems
2026cites this paper
Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment
2026influential citation
AlignVTOFF: Texture-Spatial Feature Alignment for High-Fidelity Virtual Try-Off
2026cites this paper
Improving StyleGAN-ADA with efficient channel attention and multidimensional enhancement heuristic for solder joint defect detection
2026cites this paper
Dynamic Differential Linear Attention: Enhancing Linear Diffusion Transformer for High-Quality Image Generation
2026cites this paper
Enhancing computer vision-based bridge traffic identification at nighttime through CycleGAN-enabled data augmentation
2026cites this paper
Dual-Manifold Gradient Rectification: Reconciling Topology and Texture in Industrial Anomaly Synthesis
2026cites this paper
InfScene-SR: Spatially Continuous Inference for Arbitrary-Size Image Super-Resolution
2026cites this paper
SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment
2026cites this paper
Structure-Consistent Contrastive Learning for Unpaired Image Translation With Gradient-Domain Constraints
2026cites this paper
Flow-Based Conformal Predictive Distributions
2026cites this paper
The Illusion of Forgetting: Attack Unlearned Diffusion via Initial Latent Variable Optimization
2026influential citation
360Anything: Geometry-Free Lifting of Images and Videos to 360°
2026cites this paper
DR-DDPM: Synthetic image generation for diabetic retinopathy using denoising diffusion probabilistic models for enhanced detection
2026cites this paper
Free-VTON: Cost-free acceleration and quality enhancement for diffusion-based virtual try-on
2026cites this paper
Reusing source diffusion model for domain perception: Towards few-shot image generation via fine-tuning
2026cites this paper
MEF-GD: Multimodal Enhancement and Fusion Network for Garment Designer
2026cites this paper
A Hybrid Optical Neural Network With Diverse Parameterized Layers for Multifunctional Intelligent Processing
2026cites this paper
WGAN-GP augmented hyperspectral framework for pest infestation grading in stored Astragalus membranaceus
2026cites this paper
GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation
2026cites this paper
HVTC-GAN: A High-Level Vision Task Cooperative GAN for SAR-to-Optical Translation via Semantic Segmentation
2026cites this paper
A Ceramic Rare Defect Amplification Method Based on TC-CycleGAN
2026cites this paper
Information Modeling of Asymmetric Aesthetics Using DCGAN: A Data-Driven Approach to the Generation of Marbling Art
2026cites this paper
When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models
2026influential citation
EVolSplat4D: Efficient Volume-based Gaussian Splatting for 4D Urban Scene Synthesis
2026cites this paper
FreeFix: Boosting 3D Gaussian Splatting via Fine-Tuning-Free Diffusion Models
2026cites this paper
SLIM-Diff: Shared Latent Image-Mask Diffusion with Lp loss for Data-Scarce Epilepsy FLAIR MRI
2026cites this paper
BearGen: LLM-guided signal generation framework for bearing fault diagnosis
2026cites this paper
VAR-3D: View-aware Auto-Regressive Model for Text-to-3D Generation via a 3D Tokenizer
2026cites this paper
Real-world face super-resolution based on generative adversarial and face alignment networks
2026influential citation
Human-operational 3D indoor layout generation with LLM-driven anthropometric simulation
2026cites this paper
Initialization-Aware Score-Based Diffusion Sampling
2026cites this paper
Fusion of deep and manual features for improved generation of large-size, low-resolution functional medical images
2026cites this paper
SkeleGuide: Explicit Skeleton Reasoning for Context-Aware Human-in-Place Image Synthesis
2026cites this paper
Exposing Diversity Bias in Deep Generative Models: Statistical Origins and Correction of Diversity Error
2026cites this paper
Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers
2026cites this paper
Steering Large Reasoning Models towards Concise Reasoning via Flow Matching
2026cites this paper
PromptSplit: Revealing Prompt-Level Disagreement in Generative Models
2026cites this paper
Sketch2Avatar: Geometry-Guided 3D Full-Body Human Generation in 360° From Hand-Drawn Sketches
2026cites this paper
Advancing GAN Evaluation: The Advanced Mahalanobis Distance Learning Metric for Realistic Car Damage Image Assessment
2026influential citation
GO-MLVTON: Garment Occlusion-Aware Multi-Layer Virtual Try-On with Diffusion Models
2026cites this paper
Generation of Chest CT pulmonary Nodule Images by Latent Diffusion Models using the LIDC-IDRI Dataset
2026cites this paper
Style Quantization for Data-Efficient GAN Training
2025cites this paper
HierRelTriple: Guiding Indoor Layout Generation with Hierarchical Relationship Triplet Losses
2025influential citation
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
2025influential citation
A Diffusion-Based Framework for Occluded Object Movement
2025cites this paper
GenRAN: GenFusion-guided Reversible Anonymization Network for face privacy preserving
2025cites this paper
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
2025cites this paper
ViTon-GUN: Person-to-Person Virtual Try-on via Garment Unwrapping
2025cites this paper
Dual discriminator GAN-based synthetic crop disease image generation for precise crop disease identification
2025cites this paper
k-NN as a Simple and Effective Estimator of Transferability
2025influential citation
CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation
2025cites this paper
MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing
2025cites this paper
Guided Diffusion for the Extension of Machine Vision to Human Visual Perception
2025cites this paper
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models
2025cites this paper
Unpaired Translation of Chest X-ray Images for Lung Opacity Diagnosis via Adaptive Activation Masks and Cross-Domain Alignment
2025cites this paper
InvFusion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems
2025cites this paper
Interactive High-Quality Skin Lesion Generation using Diffusion Models for VR-based Dermatological Education
2025cites this paper
SynCity: Training-Free Generation of 3D Worlds
2025cites this paper
3D Synthesis for Architectural Design
2025cites this paper
Scale-wise Distillation of Diffusion Models
2025cites this paper
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis
2025cites this paper
Single Image Iterative Subject-driven Generation and Editing
2025cites this paper
Zero-Shot Styled Text Image Generation, but Make It Autoregressive
2025cites this paper
Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images
2025cites this paper
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
2025cites this paper
DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection
2025cites this paper
AS-Net: Adaptive Style-aware Network for Handwritten Text Generation
2025cites this paper
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
2025cites this paper
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
2025cites this paper
Free-Lunch Color-Texture Disentanglement for Stylized Image Generation
2025cites this paper
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
2025cites this paper
MAD: Makeup All-in-One with Cross-Domain Diffusion Model
2025cites this paper
Efficient Training-Free High-Resolution Synthesis with Energy Rectification in Diffusion Models
2025cites this paper
Φ-GAN: Physics-Inspired GAN for Generating SAR Images Under Limited Data
2025cites this paper
Quality Measures for Dynamic Graph Generative Models
2025cites this paper
Train on classical, deploy on quantum: scaling generative quantum machine learning to a thousand qubits
2025cites this paper
Extremely low-bitrate Image Compression Semantically Disentangled by LMMs from a Human Perception Perspective
2025cites this paper
AI-Augmented Thyroid Scintigraphy for Robust Classification
2025cites this paper
Two-step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real
2025cites this paper
Personalized Generation In Large Model Era: A Survey
2025cites this paper
DLF: Extreme Image Compression with Dual-generative Latent Fusion
2025cites this paper
A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation
2025cites this paper
PerCoV2: Improved Ultra-Low Bit-Rate Perceptual Image Compression with Implicit Hierarchical Masked Image Modeling
2025cites this paper
MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment
2025cites this paper
Simplified one-sided Image-to-Image Translation with Reconstruction-Constrained Generative Adversarial Networks
2025cites this paper
CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts
2025cites this paper
Parametric MMD Estimation with Missing Values: Robustness to Missingness and Data Model Misspecification
2025cites this paper