Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning

Published 2015 in International Conference on Machine Learning

ABSTRACT

Deep learning tools have gained tremendous attention in applied machine learning. However, such tools for regression and classification do not capture model uncertainty. In comparison, Bayesian models offer a mathematically grounded framework to reason about model uncertainty, but usually come with a prohibitive computational cost. In this paper we develop a new theoretical framework casting dropout training in deep neural networks (NNs) as approximate Bayesian inference in deep Gaussian processes. A direct result of this theory gives us tools to model uncertainty with dropout NNs, extracting information from existing models that has been thrown away so far. This mitigates the problem of representing uncertainty in deep learning without sacrificing either computational complexity or test accuracy. We perform an extensive study of the properties of dropout's uncertainty. Various network architectures and non-linearities are assessed on tasks of regression and classification, using MNIST as an example. We show a considerable improvement in predictive log-likelihood and RMSE compared to existing state-of-the-art methods, and finish by using dropout's uncertainty in deep reinforcement learning.
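
The idea of reading uncertainty off an existing dropout network can be illustrated with a minimal sketch. It is not the authors' code: the helper name `mc_dropout_predict`, the ReLU layers, the placement of dropout on each layer's inputs, and the randomly initialised (untrained) weights are all assumptions made for illustration. Dropout is kept active at prediction time, several stochastic forward passes are run, and their sample mean and sample variance serve as the prediction and a rough uncertainty estimate. The sample variance here omits any observation-noise term.

```python
import numpy as np


def mc_dropout_predict(x, weights, biases, p_drop=0.5, n_samples=100, rng=None):
    """Hypothetical helper: run n_samples stochastic forward passes
    with dropout left on, and return their sample mean and variance."""
    rng = np.random.default_rng() if rng is None else rng
    keep = 1.0 - p_drop
    outputs = []
    for _ in range(n_samples):
        h = x
        for i, (W, b) in enumerate(zip(weights, biases)):
            # Fresh Bernoulli mask on this layer's inputs for every pass,
            # rescaled so the expected activation is unchanged.
            mask = rng.binomial(1, keep, size=h.shape) / keep
            h = (h * mask) @ W + b
            if i < len(weights) - 1:
                h = np.maximum(h, 0.0)  # ReLU on hidden layers only
        outputs.append(h)
    outputs = np.stack(outputs)  # shape: (n_samples, batch, out_dim)
    return outputs.mean(axis=0), outputs.var(axis=0)


# Illustrative usage with untrained random weights (an assumption; a real
# application would use the weights of a network trained with dropout).
demo_rng = np.random.default_rng(0)
demo_weights = [demo_rng.normal(size=(3, 16)), demo_rng.normal(size=(16, 1))]
demo_biases = [np.zeros(16), np.zeros(1)]
demo_x = demo_rng.normal(size=(5, 3))

mean, var = mc_dropout_predict(demo_x, demo_weights, demo_biases, rng=demo_rng)
print("predictive mean shape:", mean.shape)
print("predictive variance shape:", var.shape)
```

Setting `p_drop=0.0` makes every pass identical, so the variance collapses to zero; the spread across passes comes entirely from the sampled dropout masks.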

PUBLICATION RECORD

Publication year
2015
Venue
International Conference on Machine Learning
Publication date
2015-06-06
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1506.02142
Source metadata
Semantic Scholar


