On Markov chain Monte Carlo methods for tall data

Published 2015 in Journal of machine learning research

ABSTRACT

Markov chain Monte Carlo methods are often deemed too computationally intensive to be of any practical use for big data applications, and in particular for inference on datasets containing a large number $n$ of individual data points, also known as tall datasets. In scenarios where data are assumed independent, various approaches to scale up the Metropolis-Hastings algorithm in a Bayesian inference context have been recently proposed in machine learning and computational statistics. These approaches can be grouped into two categories: divide-and-conquer approaches and, subsampling-based algorithms. The aims of this article are as follows. First, we present a comprehensive review of the existing literature, commenting on the underlying assumptions and theoretical guarantees of each method. Second, by leveraging our understanding of these limitations, we propose an original subsampling-based approach which samples from a distribution provably close to the posterior distribution of interest, yet can require less than $O(n)$ data point likelihood evaluations at each iteration for certain statistical models in favourable scenarios. Finally, we have only been able so far to propose subsampling-based methods which display good performance in scenarios where the Bernstein-von Mises approximation of the target posterior distribution is excellent. It remains an open challenge to develop such methods in scenarios where the Bernstein-von Mises approximation is poor.

PUBLICATION RECORD

Publication year
2015
Venue
Journal of machine learning research
Publication date
2015-05-11
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1505.02827
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Weak convergence and empirical processes
2019cited by this paper
Asymptotic statistics
2018influential reference
Stochastic Bouncy Particle Sampler
2016cited by this paper
The Zig-Zag process and super-efficient sampling for Bayesian analysis of big data
2016influential reference
On Event-Chain Monte Carlo Methods
2016cited by this paper
Unbiased Bayes for Big Data: Paths of Partial Posteriors
2015cited by this paper
Unbiased Estimation with Square Root Convergence for SDE Models
2015influential reference
Accelerating Metropolis-Hastings algorithms by Delayed Acceptance
2015influential reference
The Bouncy Particle Sampler: A Nonreversible Rejection-Free Markov Chain Monte Carlo Method
2015cited by this paper
Perturbation theory for Markov chains via Wasserstein distance
2015cited by this paper
The Fundamental Incompatibility of Hamiltonian Monte Carlo and Data Subsampling
2015cited by this paper
WASP: Scalable Bayes via barycenters of subset posteriors
2015influential reference
Distributed Bayesian Posterior Sampling via Moment Sharing
2014cited by this paper
CELEBRATING 50 YEARS OF THE APPLIED PROBABILITY TRUST
2014cited by this paper
Firefly Monte Carlo: Exact MCMC with Subsets of Data
2014influential reference
Ergodicity of Approximate MCMC Chains with Applications to Large Data Sets
2014cited by this paper
Stochastic Gradient Hamiltonian Monte Carlo
2014cited by this paper
Expectation propagation as a way of life ∗
2014cited by this paper
Genetics: Unravelling complexity
2014cited by this paper
Consistency and Fluctuations For Stochastic Gradient Langevin Dynamics
2014cited by this paper
Noisy Monte Carlo: convergence of Markov chains with approximate transition kernels
2014cited by this paper
Scalable and Robust Bayesian Inference via the Median Posterior
2014influential reference
Speeding Up MCMC by Efficient Data Subsampling
2014influential reference
Concentration inequalities for sampling without replacement
2013cited by this paper
Parallelizing MCMC via Weierstrass Sampler
2013cited by this paper
Austerity in MCMC Land: Cutting the Metropolis-Hastings Budget
2013cited by this paper
Fast Computation of Wasserstein Barycenters
2013cited by this paper
Efficient implementation of Markov chain Monte Carlo when using an unbiased likelihood estimator
2012influential reference
Approximate Bayesian computation methods
2012influential reference
Monte Carlo MCMC: Efficient Inference by Approximate Sampling
2012cited by this paper
Coupled MCMC with a randomized acceptance probability
2012cited by this paper
Convergence properties of pseudo-marginal Markov chain Monte Carlo algorithms
2012cited by this paper
Bayesian Learning via Stochastic Gradient Langevin Dynamics
2011influential reference
Rejection-free Monte Carlo sampling for general potentials.
2011cited by this paper
Particle Markov chain Monte Carlo methods
2010cited by this paper
The pseudo-marginal approach for efficient Monte Carlo computations
2009influential reference
A Logistic Approximation to The Cumulative Normal Distribution
2009influential reference
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
2009cited by this paper
Bayes in the sky: Bayesian inference and model selection in cosmology
2008cited by this paper
Simulated annealing in the presence of noise
2008cited by this paper
A weakly informative default prior distribution for logistic and other regression models
2008cited by this paper
Prediction, learning, and games
2006cited by this paper
Stochastic optimization using simulated annealing with hypothesis test
2006cited by this paper
Sensitivity and convergence of uniformly ergodic Markov chains
2005cited by this paper
Sampling for Bayesian Computation with Large Datasets
2005influential reference
Stochastic potential switching algorithm for Monte Carlo simulations of complex systems.
2005influential reference
Monte Carlo Statistical Methods
2004influential reference
An MCMC approach to classical estimation
2003influential reference
Estimation of population growth or decline in genetically monitored populations.
2003influential reference
Nonlinear Time Series
2003influential reference
A Parallel Mixture of SVMs for Very Large Scale Problems
2001influential reference
Optimal scaling for various Metropolis-Hastings algorithms
2001cited by this paper
Simulated annealing for discrete optimization with estimation
1999influential reference
A noisy Monte Carlo algorithm
1999cited by this paper
The penalty method for random walks with uncertain energies
1998cited by this paper
Weak Convergence and Empirical Processes: With Applications to Statistics
1996cited by this paper
Hybrid Monte Carlo
1988cited by this paper
Integrating a modified simulated annealing algorithm with the simulation of a manufacturing system to optimize buffer sizes in automatic assembly systems
1988influential reference
Bosonic lattice gauge theory with noise
1985influential reference

CITED BY

Bayesian Interpolating Neural Network (B-INN): a scalable and reliable Bayesian model for large-scale physical systems
2026cites this paper
Fast Gibbs Sampling on Bayesian Hidden Markov Model with Missing Observations
2026cites this paper
Three-dimensional bedrock implicit modeling and uncertainty quantification from sparse geological map data
2026cites this paper
Energy distance-based subsampling Markov chain Monte Carlo
2025cites this paper
Robust estimation of a Markov chain transition matrix from multiple sample paths
2025cites this paper
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
2025cites this paper
Towards trustworthy civil aviation hazards identification: An uncertainty-aware deep learning framework
2025cites this paper
Bayesian Data Sketching for Varying Coefficient Regression Models
2025cites this paper
Nonparametric Variational Infinite Libby-Novick Beta Mixture Model for Medical Data Clustering
2025cites this paper
A Review of Uncertainty Representation and Quantification in Neural Networks
2025cites this paper
FPGA-Based Acceleration of MCMC Algorithm through Self-Shrinking for Big Data
2025cites this paper
Bayesian Neural Network Surrogates for Bayesian Optimization of Carbon Capture and Storage Operations
2025cites this paper
A practical guide to estimation and uncertainty quantification of aerodynamic flows
2025cites this paper
Provably convergent stochastic fixed-point algorithm for free-support Wasserstein barycenter of continuous non-parametric measures
2025cites this paper
Scalable inference of transcriptional variability with BASiCS.
2025cites this paper
Addressing ecological challenges from a quantum computing perspective
2025cites this paper
Approximate Bayesian Computation with Statistical Distances for Model Selection
2025cites this paper
Client-only Distributed Markov Chain Monte Carlo Sampling over a Network
2025cites this paper
Asymptotic Analysis of the Bias–Variance Trade-Off in Subsampling Metropolis–Hastings
2025cites this paper
Efficient MCMC Sampling with Expensive-to-Compute and Irregular Likelihoods
2025influential citation
Primed Priors for Simulation-Based Validation of Bayesian Models
2024cites this paper
Metropolis--Hastings with Scalable Subsampling
2024cites this paper
Perturbations of Markov Chains
2024influential citation
Ensemble Kalman inversion approximate Bayesian computation
2024cites this paper
Sparse Bayesian Neural Networks: Bridging Model and Parameter Uncertainty through Scalable Variational Inference
2024cites this paper
General bounds on the quality of Bayesian coresets
2024cites this paper
Large sample scaling analysis of the Zig-Zag algorithm for Bayesian inference
2024cites this paper
Diffusion posterior sampling for simulation-based inference in tall data settings
2024cites this paper
Analysing symbolic data by pseudo-marginal methods
2024cites this paper
PT-HMC: Optimization-based Pre-Training with Hamiltonian Monte-Carlo Sampling for Driver Intention Recognition
2024cites this paper
Uncertainty-Aware Hand Gesture Recognition for Safety-Critical and Emergency Human-Robot Interaction
2024cites this paper
Running Markov Chain Monte Carlo on Modern Hardware and Software
2024cites this paper
Diffusion Generative Modelling for Divide-and-Conquer MCMC
2024cites this paper
Gaussian mixture models for training Bayesian convolutional neural networks
2024cites this paper
The OX Optimizer: A Novel Optimization Algorithm and Its Application in Enhancing Support Vector Machine Performance for Attack Detection
2024cites this paper
A Deep Dive into the Trophic Ecology of Engraulis ringens: Assessing Diet Through Stomach Content and Stable Isotope Analysis
2024cites this paper
Distribution-Aware Mean Estimation under User-level Local Differential Privacy
2024cites this paper
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
2024cites this paper
Data-Driven Discovery of Nonlinear Dynamical Systems from Noisy and Sparse Observations
2024cites this paper
Weighting non-IID batches for out-of-distribution detection
2024cites this paper
Real-Time Prediction of Multiple Output States in Diesel Engines using a Deep Neural Operator Framework
2023cites this paper
Variational Inference for Bayesian Neural Networks under Model and Parameter Uncertainty
2023cites this paper
Machine Learning and the Future of Bayesian Computation
2023cites this paper
Investigating the effect of fused deposition modelling on the tribology of PETG thermoplastic
2023cites this paper
Bayesian Quantification with Black-Box Estimators
2023cites this paper
Bayesian Pseudo-Coresets via Contrastive Divergence
2023cites this paper
Lévy Langevin Monte Carlo
2023cites this paper
Introducing Variational Inference in Undergraduate Statistics and Data Science Curriculum
2023cites this paper
Minibatch Markov Chain Monte Carlo Algorithms for Fitting Gaussian Processes
2023cites this paper
Parameter estimation from an Ornstein-Uhlenbeck process with measurement noise
2023influential citation
Coreset Markov chain Monte Carlo
2023cites this paper
The surrogate Gibbs-posterior of a corrected stochastic MALA: Towards uncertainty quantification for neural networks
2023influential citation
Information Bound and Its Applications in Bayesian Neural Networks
2023cites this paper
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors
2023cites this paper
Enhancing Sample Quality through Minimum Energy Importance Weights
2023cites this paper
Neural Likelihood Approximation for Integer Valued Time Series Data
2023influential citation
Pigeons.jl: Distributed Sampling From Intractable Distributions
2023cites this paper
Predicting fine-scale taxonomic variation in landscape vegetation using large satellite imagery data sets
2023cites this paper
Markov chain Monte Carlo approach to the analysis of response patterns in data collection process
2023cites this paper
Minibatch training of neural network ensembles via trajectory sampling
2023cites this paper
Modeling Tennis Matches Using Monte Carlo Simulations Incorporating Dynamic Parameters
2023cites this paper
Real-time prediction of gas flow dynamics in diesel engines using a deep neural operator framework
2023cites this paper
Introducing Variational Inference in Statistics and Data Science Curriculum
2023influential citation
Perturbation analysis of Markov chain Monte Carlo for graphical models
2023influential citation
Bayesian inference and neural estimation of acoustic wave propagation
2023cites this paper
A Bayesian deep learning approach for rheological properties prediction of asphalt binders considering uncertainty of output
2023cites this paper
Inference and uncertainty quantification of stochastic gene expression via synthetic models
2022cites this paper
Pigeonhole Stochastic Gradient Langevin Dynamics for Large Crossed Mixed Effects Models
2022cites this paper
Discovering Inductive Bias with Gibbs Priors: A Diagnostic Tool for Approximate Bayesian Inference
2022influential citation
Data Subsampling for Bayesian Neural Networks
2022cites this paper
Non-reversible Parallel Tempering for Deep Posterior Approximation
2022cites this paper
Optimality in Noisy Importance Sampling
2022cites this paper
Slope failures and safety index assessment of waste rock dumps in Nigeria’s major mines
2022cites this paper
Learning Groundwater Contaminant Diffusion‐Sorption Processes With a Finite Volume Neural Network
2022cites this paper
Deep Variational Free Energy Approach to Dense Hydrogen.
2022cites this paper
PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison
2022cites this paper
Unbiased Time-Average Estimators for Markov Chains
2022cites this paper
The convergent Indian buffet process
2022cites this paper
How to Combine Variational Bayesian Networks in Federated Learning
2022cites this paper
Semi-Complete Data Augmentation for Efficient State Space Model Fitting
2022cites this paper
An Optimal Transport Approach for Selecting a Representative Subsample with Application in Efficient Kernel Density Estimation
2022cites this paper
Improving Bayesian Neural Networks by Adversarial Sampling
2022cites this paper
Bayesian forecasting in economics and finance: A modern review
2022cites this paper
Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement
2022cites this paper
Centered plug-in estimation of Wasserstein distances
2022cites this paper
Approximating solutions of the Chemical Master equation using neural networks
2022cites this paper
Bayesian inference via sparse Hamiltonian flows
2022cites this paper
Ecological Models: When Worlds Collide
2022cites this paper
Computing Bayes: From Then ‘Til Now
2022cites this paper
A deep variational free energy approach to dense hydrogen
2022cites this paper
Bayesian Forecasting in the 21st Century: A Modern Review
2022cites this paper
Variational Inference with Locally Enhanced Bounds for Hierarchical Models
2022cites this paper
Approximate Methods for Bayesian Computation
2022cites this paper
Importance Sampling Methods for Bayesian Inference with Partitioned Data
2022influential citation
On the Convergence of Hamiltonian Monte Carlo with Stochastic Gradients
2021cites this paper
An Approach to Incorporate Subsampling Into a Generic Bayesian Hierarchical Model
2021cites this paper
Variational Bayes in State Space Models: Inferential and Predictive Accuracy
2021cites this paper
Bayesian Inference in Common Microeconometric Models With Massive Datasets by Double Marginalized Subsampling
2021cites this paper
Federated Functional Variational Inference
2021cites this paper
How To Train Your Program
2021cites this paper