The Zig-Zag process and super-efficient sampling for Bayesian analysis of big data

Published 2016 in Annals of Statistics

ABSTRACT

Standard MCMC methods can scale poorly to big data settings due to the need to evaluate the likelihood at each iteration. There have been a number of approximate MCMC algorithms that use sub-sampling ideas to reduce this computational burden, but with the drawback that these algorithms no longer target the true posterior distribution. We introduce a new family of Monte Carlo methods based upon a multi-dimensional version of the Zig-Zag process of (Bierkens, Roberts, 2017), a continuous time piecewise deterministic Markov process. While traditional MCMC methods are reversible by construction (a property which is known to inhibit rapid convergence) the Zig-Zag process offers a flexible non-reversible alternative which we observe to often have favourable convergence properties. We show how the Zig-Zag process can be simulated without discretisation error, and give conditions for the process to be ergodic. Most importantly, we introduce a sub-sampling version of the Zig-Zag process that is an example of an {\em exact approximate scheme}, i.e. the resulting approximate process still has the posterior as its stationary distribution. Furthermore, if we use a control-variate idea to reduce the variance of our unbiased estimator, then the Zig-Zag process can be super-efficient: after an initial pre-processing step, essentially independent samples from the posterior distribution are obtained at a computational cost which does not depend on the size of the data.

PUBLICATION RECORD

Publication year
2016
Venue
Annals of Statistics
Publication date
2016-07-11
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1214/18-AOS1715 arXiv 1607.03188
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Ergodicity of the zigzag process
2017cited by this paper
Exponential ergodicity of the bouncy particle sampler
2017cited by this paper
Simple, scalable and accurate posterior interval estimation
2016cited by this paper
Quantifying the accuracy of approximate diffusions and Markov chains
2016cited by this paper
The Scalable Langevin Exact Algorithm : Bayesian Inference for Big Data
2016cited by this paper
Stochastic Bouncy Particle Sampler
2016cited by this paper
Variance Reduction in Stochastic Gradient Langevin Dynamics
2016cited by this paper
Piecewise Deterministic Markov Processes for Continuous-Time Monte Carlo
2016cited by this paper
Exploration of the (Non-)Asymptotic Bias and Variance of Stochastic Gradient Langevin Dynamics
2016cited by this paper
WASP: Scalable Bayes via barycenters of subset posteriors
2015cited by this paper
Long time behavior of telegraph processes under convex potentials
2015cited by this paper
Variance Reduction Using Nonreversible Langevin Samplers
2015cited by this paper
A Complete Recipe for Stochastic Gradient MCMC
2015cited by this paper
On Markov chain Monte Carlo methods for tall data
2015cited by this paper
The Bouncy Particle Sampler: A Nonreversible Rejection-Free Markov Chain Monte Carlo Method
2015influential reference
A piecewise deterministic scaling limit of Lifted Metropolis-Hastings in the Curie-Weiss model
2015influential reference
(Non-) asymptotic properties of Stochastic Gradient Langevin Dynamics
2015cited by this paper
Robust and Scalable Bayes via a Median of Subset Posterior Measures
2014cited by this paper
Non-reversible Metropolis-Hastings
2014influential reference
Probability in High Dimension
2014influential reference
Firefly Monte Carlo: Exact MCMC with Subsets of Data
2014cited by this paper
Big Data
2014cited by this paper
Speeding Up MCMC by Efficient Data Subsampling
2014cited by this paper
Irreversible Langevin samplers and variance reduction: a large deviations approach
2014cited by this paper
Consistency and Fluctuations For Stochastic Gradient Langevin Dynamics
2014cited by this paper
Hypocoercive relaxation to equilibrium for some kinetic models via a third order differential inequality
2013cited by this paper
Asymptotically Exact, Embarrassingly Parallel MCMC
2013influential reference
Parallelizing MCMC via Weierstrass Sampler
2013cited by this paper
On nonnegative unbiased estimators
2013cited by this paper
Accelerating reversible Markov chains
2013cited by this paper
Rejection-free Monte Carlo sampling for general potentials.
2011cited by this paper
Bayesian Learning via Stochastic Gradient Langevin Dynamics
2011cited by this paper
Handbook of Markov Chain Monte Carlo
2011cited by this paper
Improving the Asymptotic Performance of Markov Chain Monte-Carlo by Inserting Vortices
2010cited by this paper
Quantitative Estimates for the Long-Time Behavior of an Ergodic Variant of the Telegraph Process
2010cited by this paper
The pseudo-marginal approach for efficient Monte Carlo computations
2009cited by this paper
Irreversible Monte Carlo Algorithms for Efficient Sampling
2008cited by this paper
A modified next reaction method for simulating chemical systems with time dependent propensities and delays.
2007cited by this paper
Efficient Exact Stochastic Simulation of Chemical Systems with Many Species and Many Channels
2000cited by this paper
ANALYSIS OF A NONREVERSIBLE MARKOV CHAIN SAMPLER
2000cited by this paper
Exponential convergence of Langevin distributions and their discrete approximations
1996cited by this paper
Suppressing Random Walks in Markov Chain Monte Carlo Using Ordered Overrelaxation
1995cited by this paper
Accelerating Gaussian Diffusions
1993cited by this paper
Hybrid Monte Carlo
1988influential reference
Mathematical Statistics and Data Analysis
1988cited by this paper
Simulation of Nonhomogeneous Poisson Processes by Thinning
1979influential reference
A stochastic model related to the telegrapher's equation
1974cited by this paper
Asymptotic Expansions Associated with Posterior Distributions
1970cited by this paper
Monte Carlo Sampling Methods Using Markov Chains and Their Applications
1970influential reference

CITED BY

Diffusive Scaling Limits of Forward Event-Chain Monte Carlo: Provably Efficient Exploration with Partial Refreshment
2026cites this paper
Piecewise Deterministic Markov Processes for Bayesian Inference of PDE Coefficients
2026cites this paper
Event-Chain Monte Carlo: The global-balance breakthrough
2026cites this paper
On micromodes in Bayesian posterior distributions and their implications for MCMC
2026influential citation
Convergence of non-reversible Markov processes via lifting and flow Poincar{\'e} inequality
2025cites this paper
Sampling with time-changed Markov processes
2025influential citation
Boosting Statistic Learning with Synthetic Data from Pretrained Large Models
2025cites this paper
Piecewise Deterministic Sampling for Constrained Distributions
2025influential citation
Transient regime of piecewise deterministic Monte Carlo algorithms
2025cites this paper
Foundations of locally-balanced Markov processes
2025cites this paper
A coupling-based approach to f-divergences diagnostics for Markov chain Monte Carlo
2025cites this paper
Smoothing Out Sticking Points: Sampling from Discrete-Continuous Mixtures with Dynamical Monte Carlo by Mapping Discrete Mass into a Latent Universe
2025cites this paper
Speedups in nonequilibrium thermal relaxation: Mpemba and related effects
2025cites this paper
Quantitative Hypocoercivity and Lifting of Classical and Quantum Dynamics
2025cites this paper
Numerical Generalized Randomized Hamiltonian Monte Carlo for piecewise smooth target densities
2025cites this paper
Towards practical PDMP sampling: Metropolis adjustments, locally adaptive step-sizes, and NUTS-based time lengths
2025cites this paper
Non-reversible lifts of reversible diffusion processes and relaxation times
2024cites this paper
Graph-accelerated Markov Chain Monte Carlo using Approximate Samples
2024cites this paper
Theoretical guarantees for lifted samplers
2024cites this paper
Hypocoercivity meets lifts
2024influential citation
Numerical Approximations and Convergence Analysis of Piecewise Diffusion Markov Processes, with Application to Glioma Cell Migration
2024cites this paper
Averaging polyhazard models using Piecewise deterministic Monte Carlo with applications to data with long-term survivors
2024cites this paper
Velocity Jumps for Molecular Dynamics.
2024cites this paper
The velocity jump Langevin process and its splitting scheme: long time convergence and numerical accuracy
2024cites this paper
Large sample scaling analysis of the Zig-Zag algorithm for Bayesian inference
2024influential citation
Integration of active learning and MCMC sampling for efficient Bayesian calibration of mechanical properties
2024cites this paper
Ensemble-Based Annealed Importance Sampling
2024cites this paper
Fused $L_{1/2}$ prior for large scale linear inverse problem with Gibbs bouncy particle sampler
2024cites this paper
Liouville Flow Importance Sampler
2024cites this paper
Stochastic Gradient Piecewise Deterministic Monte Carlo Samplers
2024influential citation
Piecewise deterministic generative models
2024influential citation
Metropolis--Hastings with Scalable Subsampling
2024cites this paper
The random timestep Euler method and its continuous dynamics
2024cites this paper
Adaptive Stereographic MCMC
2024cites this paper
Non-Log-Concave and Nonsmooth Sampling via Langevin Monte Carlo Algorithms
2023cites this paper
Accelerating Bayesian inference of dependency between mixed-type biological traits
2023cites this paper
Extending JumpProcess.jl for fast point process simulation with time-varying intensities
2023cites this paper
Super-efficient exact Hamiltonian Monte Carlo for the von Mises distribution
2023cites this paper
An Introduction to the Calibration of Computer Models
2023cites this paper
Contraction Rate Estimates of Stochastic Gradient Kinetic Langevin Integrators
2023cites this paper
Debiasing piecewise deterministic Markov process samplers using couplings
2023influential citation
Non-reversible guided Metropolis kernel
2023cites this paper
Bayes Hilbert Spaces for Posterior Approximation
2023cites this paper
Piecewise Deterministic Markov Processes for Bayesian Neural Networks
2023influential citation
Methods and applications of PDMP samplers with boundary conditions
2023cites this paper
Speeding up Langevin Dynamics by Mixing
2023cites this paper
Bayesian Pseudo-Coresets via Contrastive Divergence
2023cites this paper
Faster high-accuracy log-concave sampling via algorithmic warm starts
2023cites this paper
Polynomial Convergence Rates of Piecewise Deterministic Markov Processes
2023cites this paper
Scaling of piecewise deterministic Monte Carlo for anisotropic targets
2023cites this paper
Concepts in Monte Carlo sampling
2023cites this paper
Feature Selection Techniques for Big Data Analytics
2022cites this paper
Sampling using adaptive regenerative processes
2022influential citation
PDMP Characterisation of Event-Chain Monte Carlo Algorithms for Particle Systems
2022cites this paper
Federated Bayesian Computation via Piecewise Deterministic Markov Processes
2022influential citation
Sampling constrained continuous probability distributions: A review
2022cites this paper
Adaptive Importance Sampling Based on Fault Tree Analysis for Piecewise Deterministic Markov Process
2022cites this paper
Sampling Algorithms in Statistical Physics: A Guide for Statistics and Machine Learning
2022cites this paper
Automatic Zig-Zag sampling in practice
2022cites this paper
Randomized time Riemannian Manifold Hamiltonian Monte Carlo
2022cites this paper
microscopic derivation of coupled SPDE’s with a
2022cites this paper
Infinite dimensional Piecewise Deterministic Markov Processes
2022influential citation
Accelerating Bayesian inference of dependency between complex biological traits
2022cites this paper
Stereographic Markov chain Monte Carlo
2022cites this paper
Bregman Proximal Langevin Monte Carlo via Bregman-Moreau Envelopes
2022cites this paper
Statistical Methods with Applications in Data Mining: A Review of the Most Recent Works
2022cites this paper
Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement
2022cites this paper
Inferring epidemics from multiple dependent data via pseudo-marginal methods
2022cites this paper
Hamiltonian zigzag accelerates large-scale inference for conditional dependencies between complex biological traits
2022cites this paper
Geometric Methods for Sampling, Optimisation, Inference and Adaptive Agents
2022cites this paper
Nonlinear MCMC for Bayesian Machine Learning
2022cites this paper
Birth–death dynamics for sampling: global convergence, approximations and their asymptotics
2022cites this paper
Eyring–Kramers type formulas for some piecewise deterministic Markov processes
2022cites this paper
Hamiltonian zigzag speeds up large-scale learning of direct eﬀects among mixed-type biological traits
2022cites this paper
Pigeonhole Stochastic Gradient Langevin Dynamics for Large Crossed Mixed Effects Models
2022cites this paper
Conservative random walk
2022cites this paper
Continuously Tempered PDMP samplers
2022cites this paper
Gradient flows and randomised thresholding: sparse inversion and classification
2022cites this paper
Computing Bayes: From Then ‘Til Now
2022cites this paper
A benchmark for the Bayesian inversion of coefficients in partial differential equations
2021cites this paper
Sticky PDMP samplers for sparse and local inference problems
2021influential citation
Strong invariance principles for ergodic Markov processes
2021cites this paper
Gradient-Based Markov Chain Monte Carlo for Bayesian Inference With Non-differentiable Priors
2021influential citation
Speed Up Zig-Zag
2021influential citation
Spatiotemporal blocking of the bouncy particle sampler for efficient inference in state-space models
2021cites this paper
Zigzag Path Connects Two Monte Carlo Samplers: Hamiltonian Counterpart to a Piecewise Deterministic Markov Process
2021influential citation
Optimal friction matrix for underdamped Langevin sampling
2021cites this paper
Perturbation theory for killed Markov processes and quasi-stationary distributions
2021cites this paper
PDMP Monte Carlo methods for piecewise smooth densities
2021cites this paper
The Application of Zig-Zag Sampler in Sequential Markov Chain Monte Carlo
2021cites this paper
Some Results on Generalized Accelerated Motions Driven by the Telegraph Process
2021cites this paper
Concave-Convex PDMP-based Sampling
2021cites this paper
Approximations of Piecewise Deterministic Markov Processes and their convergence properties
2021cites this paper
Stochastic gradient Langevin dynamics with adaptive drifts
2021cites this paper
Bayesian Computational Methods of the Logistic Regression Model
2021cites this paper
Hard-disk dipoles and non-reversible Markov chains.
2021cites this paper
Bayesian mechanics for stationary processes
2021cites this paper
Convergence Analysis of Schr{ö}dinger-F{ö}llmer Sampler without Convexity
2021cites this paper
Accelerating numerical simulation of continuous-time Boolean satisfiability solver using discrete gradient
2021cites this paper
Bayesian Likelihood-Free
2021cites this paper