LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop

F. Yu,Yinda Zhang,Shuran Song,Ari Seff,Jianxiong Xiao

Published 2015 in arXiv.org

ABSTRACT

While there has been remarkable progress in the performance of visual recognition algorithms, the state-of-the-art models tend to be exceptionally data-hungry. Large labeled training datasets, expensive and tedious to produce, are required to optimize millions of parameters in deep network models. Lagging behind the growth in model capacity, the available datasets are quickly becoming outdated in terms of size and density. To circumvent this bottleneck, we propose to amplify human effort through a partially automated labeling scheme, leveraging deep learning with humans in the loop. Starting from a large set of candidate images for each category, we iteratively sample a subset, ask people to label them, classify the others with a trained model, split the set into positives, negatives, and unlabeled based on the classification confidence, and then iterate with the unlabeled set. To assess the effectiveness of this cascading procedure and enable further progress in visual recognition research, we construct a new image dataset, LSUN. It contains around one million labeled images for each of 10 scene categories and 20 object categories. We experiment with training popular convolutional networks and find that they achieve substantial performance gains when trained on this dataset.

PUBLICATION RECORD

Publication year
2015
Venue
arXiv.org
Publication date
2015-06-10
Fields of study
Computer Science, Engineering
Identifiers
arXiv 1506.03365
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
2015cited by this paper
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2015cited by this paper
Best of both worlds: Human-machine collaboration for object annotation
2015cited by this paper
Deep Image: Scaling up Image Recognition
2015cited by this paper
Learning Deep Features for Scene Recognition using Places Database
2014influential reference
Deep neural networks are easily fooled: High confidence predictions for unrecognizable images
2014cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014cited by this paper
Explaining and Harnessing Adversarial Examples
2014cited by this paper
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
Going deeper with convolutions
2014cited by this paper
Scalable multi-label annotation
2014cited by this paper
Intriguing properties of neural networks
2013cited by this paper
ImageNet classification with deep convolutional neural networks
2012influential reference
Multiclass recognition and part localization with humans in the loop
2011cited by this paper
Unbiased look at dataset bias
2011cited by this paper
Literature
2010cited by this paper
SUN database: Large-scale scene recognition from abbey to zoo
2010cited by this paper
Visual Recognition with Humans in the Loop
2010cited by this paper
Active Learning Literature Survey
2009cited by this paper
ImageNet: A large-scale hierarchical image database
2009cited by this paper
Towards Scalable Dataset Construction: An Active Learning Approach
2008cited by this paper
Multi-Level Active Prediction of Useful Image Annotations for Recognition
2008cited by this paper
Et al
2008cited by this paper
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories
2004cited by this paper
Support vector machine active learning with applications to text classification
2002cited by this paper
Support Vector Machine Active Learning with Applications to Text Classification
2000cited by this paper
Gradient-based learning applied to document recognition
1998cited by this paper

CITED BY

S^2F-Net:A Robust Spatial-Spectral Fusion Framework for Cross-Model AIGC Detection
2026cites this paper
Halt the Hallucination: Decoupling Signal and Semantic OOD Detection Based on Cascaded Early Rejection
2026cites this paper
Exposing Diversity Bias in Deep Generative Models: Statistical Origins and Correction of Diversity Error
2026cites this paper
Implicit Neural Representation Facilitates Unified Universal Vision Encoding
2026cites this paper
Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
2026cites this paper
C-WOE: Clustering for Out-of-Distribution Detection Learning With Wild Outlier Exposure
2026cites this paper
Optimal Transport-Induced Samples against Out-of-Distribution Overconfidence
2026cites this paper
Synthetic Image Detection with CLIP: Understanding and Assessing Predictive Cues
2026cites this paper
Modeling Score Approximation Errors in Diffusion Models via Forward SPDEs
2026cites this paper
CASL: Concept-Aligned Sparse Latents for Interpreting Diffusion Models
2026cites this paper
Brain-Machine Enhanced Intelligence for Semi-Supervised Facial Emotion Recognition
2026cites this paper
DAVIS: OOD Detection via Dominant Activations and Variance for Increased Separation
2026cites this paper
De-Decay: Defusing Computer Vision Model Degradation through Scalable and Actionable Human-Data Alignment
2026cites this paper
Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes
2026cites this paper
Is this real? Susceptibility to deepfakes in machines and humans
2026cites this paper
RealStats: A Rigorous Real-Only Statistical Framework for Fake Image Detection
2026cites this paper
Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models
2026influential citation
GAROD: Delve into Gradient-Based Attribution Reliability for Out-of-Distribution Detection
2026cites this paper
Provably Safe Generative Sampling with Constricting Barrier Functions
2026cites this paper
Blind denoising diffusion models and the blessings of dimensionality
2026cites this paper
DTAMS: High-Capacity Generative Steganography via Dynamic Multi-Timestep Selection and Adaptive Deviation Mapping in Latent Diffusion
2026cites this paper
HP-GAN: Harnessing pretrained networks for GAN improvement with FakeTwins and discriminator consistency.
2026cites this paper
MoEE: Mixture of Edge Experts for Collaborative Inference of Heterogeneous Models Based on Out-of-Distribution Detection
2026cites this paper
Annealing Genetic Slicing Adversarial Networks Based Feedback for Imbalanced Visual Classification
2026cites this paper
LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization
2026cites this paper
ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research
2026cites this paper
Breaking Semantic Hegemony: Decoupling Principal and Residual Subspaces for Generalized OOD Detection
2026cites this paper
Learning with Adaptive Prototype Manifolds for Out-of-Distribution Detection
2026cites this paper
Relational Feature Caching for Accelerating Diffusion Transformers
2026cites this paper
A Difference-in-Difference Approach to Detecting AI-Generated Images
2026cites this paper
Deep Neural Networks Internal Representation via Neuron Community Exploration
2026cites this paper
GRRE: Leveraging G-Channel Removed Reconstruction Error for Robust Detection of AI-Generated Images
2026cites this paper
Feature-Aware Test Generation for Deep Learning Models
2026influential citation
RemEdit: Efficient Diffusion Editing with Riemannian Geometry
2026cites this paper
Catalyst: Out-of-Distribution Detection via Elastic Scaling
2026influential citation
Your AI-Generated Image Detector Can Secretly Achieve SOTA Accuracy, If Calibrated
2026cites this paper
Towards Explainable Fake Image Detection with Multi-Modal Large Language Models
2025cites this paper
Security and Privacy Challenges of AIGC in Metaverse: A Comprehensive Survey
2025cites this paper
Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images
2025cites this paper
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
2025cites this paper
InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection
2025cites this paper
Let the Void Be Void: Robust Open-Set Semi-Supervised Learning via Selective Non-Alignment
2025cites this paper
OMS: One More Step Noise Searching to Enhance Membership Inference Attacks for Diffusion Models
2025cites this paper
The deficit of new information in diffusion models: a focus on diverse samples
2025influential citation
CKGAN: Training Generative Adversarial Networks Using Characteristic Kernel Integral Probability Metrics
2025cites this paper
Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling
2025cites this paper
MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection
2025cites this paper
Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions
2025cites this paper
Frequency-Quantized Variational Autoencoder Based on 2D-FFT for Enhanced Image Reconstruction and Generation
2025cites this paper
Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism
2025cites this paper
Efficient and Adaptive Diffusion Model Inference Through Lookup Table on Mobile Devices
2025cites this paper
EOOD: Entropy-based Out-of-distribution Detection
2025cites this paper
Exploiting Inter-Sample Information for Long-Tailed Out-of-Distribution Detection
2025cites this paper
Simple open-set recognition combining metric learning and anomaly detection
2025cites this paper
Provable Discriminative Hyperspherical Embedding for Out-of-Distribution Detection
2025cites this paper
Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training
2025cites this paper
Adaptive few-shot image augmentation for fine-grained industrial defects based on region-level modeling
2025cites this paper
Keep and Extent: Unified Knowledge Embedding for Few-Shot Image Generation
2025cites this paper
Ownership Infringement Detection for Generative Adversarial Networks Against Model Stealing
2025cites this paper
Linear Multistep Solver Distillation for Fast Sampling of Diffusion Models
2025influential citation
MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection
2025cites this paper
Sparse-to-Sparse Training of Diffusion Models
2025influential citation
Optimal Stepsize for Diffusion Sampling
2025cites this paper
Consistent Subject Generation via Contrastive Instantiated Concepts
2025cites this paper
ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
2025cites this paper
Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
2025cites this paper
ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting
2025cites this paper
CMSL: Cross-modal Style Learning for Few-shot Image Generation
2025cites this paper
Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking
2025cites this paper
Inverting the Generation Process of Denoising Diffusion Implicit Models: Empirical Evaluation and a Novel Method
2025cites this paper
BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors
2025cites this paper
SSDM: Generated image interaction method based on spatial sparsity for diffusion models
2025cites this paper
Extended multi-scale feature fusion and balanced generative adversarial network for image inpainting under limited data
2025cites this paper
Predicting the Strength of Composites with Computer Vision Using Small Experimental Datasets
2025cites this paper
Adaptive Out-of-Distribution Detection with Coarse-to-Fine Grained Representation
2025cites this paper
KeyBoxGAN: enhancing 2D object detection through annotated and editable image synthesis
2025cites this paper
CacheQuant: Comprehensively Accelerated Diffusion Models
2025cites this paper
Reducing the Content Bias for AI-generated Image Detection
2025cites this paper
USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images
2025cites this paper
Hierarchical Semantic Compression for Consistent Image Semantic Restoration
2025cites this paper
Noisy Test-Time Adaptation in Vision-Language Models
2025cites this paper
Personalized Image Generation with Deep Generative Models: A Decade Survey
2025cites this paper
Detecting computer-generated images by using only real images
2025cites this paper
DNN Layers Features Reduction for Out-of-Distribution Detection
2025cites this paper
Parameter Expanded Stochastic Gradient Markov Chain Monte Carlo
2025cites this paper
CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging
2025cites this paper
Optimizing for the Shortest Path in Denoising Diffusion Model
2025cites this paper
FLaNS: Feature-Label Negative Sampling for Out-of-Distribution Detection
2025cites this paper
F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting
2025cites this paper
Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking
2025cites this paper
Improving GAN Performance Using Confidence-Aware Discrimination
2025cites this paper
Out-of-Distribution Detectors: Not Yet Primed for Practical Deployment
2025cites this paper
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
2025cites this paper
uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
2025influential citation
What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images
2025cites this paper
FLODA: Harnessing Vision-Language Models for Deepfake Assessment
2025cites this paper
Visual Jenga: Discovering Object Dependencies via Counterfactual Inpainting
2025cites this paper
Improving out-of-distribution detection by enforcing confidence margin
2025cites this paper
Exploring the Collaborative Advantage of Low-level Information on Generalizable AI-Generated Image Detection
2025cites this paper
Methods and trends in detecting AI-generated images: A comprehensive review
2025influential citation