Places: A 10 Million Image Database for Scene Recognition

Bolei Zhou,Àgata Lapedriza,A. Khosla,A. Oliva,A. Torralba

Published 2018 in IEEE Transactions on Pattern Analysis and Machine Intelligence

ABSTRACT

The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world. Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate representation of scene classification. With its high-coverage and high-diversity of exemplars, the Places Database along with the Places-CNNs offer a novel resource to guide future progress on scene recognition problems.

PUBLICATION RECORD

Publication year
2018
Venue
IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication date
2018-06-01
Fields of study
Medicine, Computer Science, Engineering, Environmental Science
Identifiers
DOI 10.1109/TPAMI.2017.2723009 PMID 28692961
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Scene Parsing through ADE20K Dataset
2017cited by this paper
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
2016cited by this paper
The Cityscapes Dataset for Semantic Urban Scene Understanding
2016cited by this paper
Mastering the game of Go with deep neural networks and tree search
2016cited by this paper
Synthesizing the preferred inputs for neurons in neural networks via deep generator networks
2016cited by this paper
Deep Residual Learning for Image Recognition
2015cited by this paper
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
2015cited by this paper
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
2014cited by this paper
Going deeper with convolutions
2014influential reference
Learning Deep Features for Scene Recognition using Places Database
2014influential reference
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
Microsoft COCO: Common Objects in Context
2014cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014cited by this paper
Object Detectors Emerge in Deep Scene CNNs
2014cited by this paper
Watson: Beyond Jeopardy!
2013cited by this paper
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition
2013cited by this paper
ImageNet classification with deep convolutional neural networks
2012influential reference
SUN attribute database: Discovering, annotating, and recognizing scene attributes
2012cited by this paper
Human action recognition by learning bases of action attributes and parts
2011cited by this paper
Unbiased look at dataset bias
2011influential reference
SUN database: Large-scale scene recognition from abbey to zoo
2010influential reference
The Pascal Visual Object Classes (VOC) Challenge
2010cited by this paper
Recognizing indoor scenes
2009cited by this paper
ImageNet: A large-scale hierarchical image database
2009influential reference
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
2008cited by this paper
LIBLINEAR: A Library for Large Linear Classification
2008cited by this paper
Caltech-256 Object Category Dataset
2007influential reference
What, where and who? Classifying events by scene and object recognition
2007cited by this paper
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
2006influential reference
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories
2004influential reference
Deep Blue
2002cited by this paper
The Measurement of Diversity
2001cited by this paper
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
2001cited by this paper
Book Reviews: Foundations of Statistical Natural Language Processing
1999cited by this paper
Gradient-based learning applied to document recognition
1998cited by this paper
Indices of diversity and evenness
1998cited by this paper
Long Short-Term Memory
1997cited by this paper
WordNet: A Lexical Database for English
1995influential reference
Pictures and names: making the connection.
1984cited by this paper

CITED BY

Vendi Novelty Scores for Out-of-Distribution Detection
2026cites this paper
Diffbias: Harnessing diffusion models' prediction bias for adversarial patch defense
2026cites this paper
DCAC: Dynamic Class-Aware Cache Creates Stronger Out-of-Distribution Detectors
2026cites this paper
VersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization
2026cites this paper
Decoding the continuum of architectural intention and public perception in industrial heritage regeneration: A multimodal social media data analysis
2026cites this paper
Self-supervised learning yields representational signatures of category-selective cortex
2026cites this paper
Dilated Superpixel Aggregation for Visual Place Recognition
2026cites this paper
IPEC: Test-Time Incremental Prototype Enhancement Classifier for Few-Shot Learning
2026cites this paper
DVLA-RL: Dual-Level Vision-Language Alignment with Reinforcement Learning Gating for Few-Shot Learning
2026cites this paper
Exploring SAIG Methods for an Objective Evaluation of XAI
2026cites this paper
Architectural Insights for Post-Tornado Damage Recognition
2026cites this paper
GSNR: Graph Smooth Null-Space Representation for Inverse Problems
2026cites this paper
From Misclassifications to Outliers: Joint Reliability Assessment in Classification
2026cites this paper
MSGC-NetVLAD: a lightweight visual place recognition neural network for autonomous robots
2026cites this paper
Bottom-up building exposure modeling with multimodal earth vision
2026cites this paper
AFFS: Adaptive Fast Frequency Selection Algorithm for Deep Learning Feature Extraction
2026cites this paper
Dual-backbone fusion network for damage segmentation in cultural heritage buildings
2026cites this paper
Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model
2026cites this paper
Distributionally Robust Classification for Multi-source Unsupervised Domain Adaptation
2026cites this paper
UV-M3TL: A Unified and Versatile Multimodal Multi-Task Learning Framework for Assistive Driving Perception
2026cites this paper
Seeing the whole in the parts with self-supervised representation learning
2026cites this paper
Learning Credal Ensembles via Distributionally Robust Optimization
2026cites this paper
Enhancing out-of-distribution detection with bilateral distribution score.
2026cites this paper
SWING: Unlocking Implicit Graph Representations for Graph Random Features
2026influential citation
Image-Based Analysis of Tourist Destination Perceptions: A Deep Learning and Spatial–Temporal Study in Slovenia
2026cites this paper
StreetTree: A Large-Scale Global Benchmark for Fine-Grained Tree Species Classification
2026cites this paper
Cdmm: learning a conditional diffusion model from multi-single images
2026influential citation
Real Eyes Realize Faster: Gaze Stability and Pupil Novelty for Efficient Egocentric Learning
2026cites this paper
Context-Dependent Affordance Computation in Vision-Language Models
2026cites this paper
MATdiff: Mask-aware transformer with diffusion model for large-mask image inpainting
2026cites this paper
DiffInpaint: line drawing guided murals restoration with diffusion model
2026cites this paper
The Human Brain as a Dynamic Mixture of Expert Models in Video Understanding
2026cites this paper
MDT-FI: Mask-Guided Dual-Branch Transformer With Texture and Structure Feature Interaction for Image Inpainting
2026cites this paper
G-LFFN: A Global-Local Feature Fusion Network Leveraging Transformer-Encoder and Contrastive Learning for Multimodal Sentiment Analysis
2026cites this paper
Image deconvolution using adapted cauchy method
2026cites this paper
TCSMAF: twin cascade spatial multi-scale attention filtering inpainting of traditional Chinese painting
2026cites this paper
Relational Integrated Cross-modal Scene Fusion with CLIP for Fine-grained Indoor Scene Recognition in Intelligent Manufacturing Systems
2026cites this paper
Validity-aware context modeling for gradient-guided image inpainting
2026cites this paper
Aligned Stable Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency
2026influential citation
Household robot utilizing location information for human activity and habit understanding
2026cites this paper
Dual-domain perception and cross-domain collaboration guidance network for image inpainting
2026cites this paper
Understanding vision transformer robustness through the lens of out-of-distribution detection
2026cites this paper
Breaking Semantic Hegemony: Decoupling Principal and Residual Subspaces for Generalized OOD Detection
2026cites this paper
Plasmonic artificial inspector for herbal medicines via surface-enhanced Raman spectroscopy and deep learning
2026cites this paper
Analyzing Diffusion and Autoregressive Vision Language Models in Multimodal Embedding Space
2026cites this paper
Dynamic transformer architecture for continual learning of multimodal tasks
2026cites this paper
G2LFormer: Global-to-Local Token Mixing Transformer for Blind Image Inpainting and Beyond
2026cites this paper
Unpacking visual perception of urban public space using social media images and street view imagery: Insights from Singapore riverfront
2026cites this paper
C-WOE: Clustering for Out-of-Distribution Detection Learning With Wild Outlier Exposure
2026cites this paper
GAROD: Delve into Gradient-Based Attribution Reliability for Out-of-Distribution Detection
2026cites this paper
Modeling e-scooter route choices: Infrastructural preferences and intervention scenario analysis
2026cites this paper
Deep transfer learning based image colorization using VGG19 and CLAHE.
2026cites this paper
GAFD-CC: Global-aware feature decoupling with confidence calibration for out-of-distribution detection.
2026cites this paper
Softmax is not Enough (for Adaptive Conformal Classification)
2026cites this paper
A Dataset is Worth 1 MB
2026cites this paper
GarNet: Geometry-Aware Rectification Network for Generic Image Distortions
2026cites this paper
Diagnosing Generalization Failures from Representational Geometry Markers
2026cites this paper
Mind the Way You Select Negative Texts: Pursuing the Distance Consistency in OOD Detection with VLMs
2026cites this paper
VAR-Turbo: Unlocking the Potential of Visual Autoregressive Models Through Dual Redundancy
2026cites this paper
Incentive Aware AI Regulations: A Credal Characterisation
2026cites this paper
SRasP: Self-Reorientation Adversarial Style Perturbation for Cross-Domain Few-Shot Learning
2026cites this paper
MSDFF-RCNet: A Combined Multi-Structure Data Fusion Framework and Recurrent Attention for Remote Sensing Scene Classification
2026cites this paper
High-fidelity mural inpainting via progressive reconstruction and damage-aware adaptation
2026cites this paper
Prompts Libra: Enhanced Image Outpainting Diffusion Model With Balanced Bimodal Guidance
2026cites this paper
CNN-Based 360$^{\circ }$ Scene Recognition for Automatic Generation of Omnidirectional Scent Effects
2026influential citation
Innovative multimodal fusion framework for human-scale humid-heat parameters prediction
2026cites this paper
City identity recognition: how representation bias influences model predictability and replicability?
2026cites this paper
ASK-HOI: Affordance-Scene Knowledge Prompting for Human-Object Interaction Detection
2026cites this paper
Edge-guided interactive fusion of texture and geometric features for Dunhuang mural image inpainting
2026cites this paper
Enhancing Out-of-Distribution Detection in Transfer Learning Through Intuitionistic Fuzzy Set-Based Prediction
2026cites this paper
The complexity of urban form’s impacts on residents’ neighborhood perception: Machine learning evidence from Singapore
2026cites this paper
GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation
2026cites this paper
Image Inpainting Methods: A Review of Deep Learning Approaches
2026cites this paper
Rapid ensemble encoding of average scene features
2026cites this paper
Cross-modal Proxy Evolving for OOD Detection with Vision-Language Models
2026cites this paper
EfficientFSL: Enhancing Few-Shot Classification via Query-Only Tuning in Vision Transformers
2026cites this paper
Reliable and Fast Humans Removed Visual Scene Representation
2026cites this paper
ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research
2026influential citation
Insight: Interpretable Semantic Hierarchies in Vision-Language Encoders
2026cites this paper
Enhancing Few-Shot Out-of-Distribution Detection via the Refinement of Foreground and Background
2026cites this paper
SeNeDiF-OOD: Semantic Nested Dichotomy Fusion for Out-of-Distribution Detection Methodology in Open-World Classification. A Case Study on Monument Style Classification
2026cites this paper
Let's Roll a BiFTA: Bi-refinement for Fine-grained Text-visual Alignment in Vision-Language Models
2026cites this paper
DAVIS: OOD Detection via Dominant Activations and Variance for Increased Separation
2026influential citation
Diff-GDAformer: A diffusion-guided dynamic attention transformer for image inpainting
2026cites this paper
Catalyst: Out-of-Distribution Detection via Elastic Scaling
2026influential citation
Learning Sparse Visual Representations via Spatial-Semantic Factorization
2026cites this paper
Temporal Slowness in Central Vision Drives Semantic Object Learning
2026cites this paper
Magic-MM-Embedding: Towards Visual-Token-Efficient Universal Multimodal Embedding with MLLMs
2026cites this paper
VMF-GOS: Geometry-guided virtual Outlier Synthesis for Long-Tailed OOD Detection
2026cites this paper
Learning with Adaptive Prototype Manifolds for Out-of-Distribution Detection
2026cites this paper
Intrinsic dimensionality as a model-free measure of class imbalance
2026cites this paper
MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model
2026cites this paper
Halt the Hallucination: Decoupling Signal and Semantic OOD Detection Based on Cascaded Early Rejection
2026cites this paper
AICE: Three domain conversion network applied to All-in-One Image Inpainting and Color Enhancement Task
2026cites this paper
Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows
2026cites this paper
Detecting OOD Samples via Optimal Transport Scoring Function
2025cites this paper
Advancing Out-of-Distribution Detection via Local Neuroplasticity
2025cites this paper
QPM: Discrete Optimization for Globally Interpretable Image Classification
2025cites this paper
Logit Disagreement: OoD Detection with Bayesian Neural Networks
2025cites this paper
Visual emotion analysis using skill-based multi-teacher knowledge distillation
2025cites this paper