Random projections for Bayesian regression

Leo N. Geppert, K. Ickstadt, Alexander Munteanu, Jens Quedenfeld, C. Sohler

Published 2015 in Statistics and Computing

ABSTRACT

This article deals with random projections applied as a data reduction technique for Bayesian regression analysis. We show sufficient conditions under which the entire d-dimensional distribution is approximately preserved under random projections by reducing the number of data points from n to k ∈ O(poly(d/ε)) in the case n ≫ d. Under mild assumptions, we prove that evaluating a Gaussian likelihood function based on the projected data instead of the original data yields a (1+O(ε))-approximation in terms of the ℓ₂ Wasserstein distance.
Our main result shows that the posterior distribution of Bayesian linear regression is approximated up to a small error depending on only an ε-fraction of its defining parameters. This holds when using arbitrary Gaussian priors or the degenerate case of uniform distributions over ℝ^d for β. Our empirical evaluations involve different simulated settings of Bayesian linear regression. Our experiments underline that the proposed method is able to recover the regression model up to small error while considerably reducing the total running time.
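
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which projection the article uses, so the dense Gaussian projection below is an assumption chosen for simplicity. The function names (`gaussian_sketch`, `posterior`), the noise variance `sigma2`, the prior variance `tau2`, and the sizes n, d, k are likewise hypothetical choices for the example. The sketch compares the conjugate Gaussian posterior of β computed from the full data with the one computed from k projected rows.

```python
import numpy as np


def gaussian_sketch(X, y, k, rng):
    """Reduce the n rows of X and y to k rows with a dense Gaussian map.

    Assumption: entries of the projection are N(0, 1/k); the article may
    use a different (e.g. sparser or structured) construction.
    """
    n = X.shape[0]
    Pi = rng.standard_normal((k, n)) / np.sqrt(k)
    return Pi @ X, Pi @ y


def posterior(X, y, sigma2=1.0, tau2=100.0):
    """Posterior mean and covariance of beta for the conjugate model
    y ~ N(X beta, sigma2 * I), beta ~ N(0, tau2 * I)."""
    d = X.shape[1]
    precision = X.T @ X / sigma2 + np.eye(d) / tau2
    cov = np.linalg.inv(precision)
    mean = cov @ (X.T @ y) / sigma2
    return mean, cov


rng = np.random.default_rng(0)
n, d, k = 5000, 5, 400                      # n >> d, and k much smaller than n
X = rng.standard_normal((n, d))
beta_true = np.arange(1.0, d + 1.0)         # simulated coefficients 1..d
y = X @ beta_true + rng.standard_normal(n)  # unit-variance Gaussian noise

# Posterior from the original data and from the projected data.
mean_full, cov_full = posterior(X, y)
X_s, y_s = gaussian_sketch(X, y, k, rng)
mean_sketch, cov_sketch = posterior(X_s, y_s)

# Relative deviation of the sketched posterior mean from the full one.
rel_err = np.linalg.norm(mean_sketch - mean_full) / np.linalg.norm(mean_full)
print(f"rows: {n} -> {X_s.shape[0]}, relative error of posterior mean: {rel_err:.4f}")
```

Only the likelihood term depends on the data, so the prior enters both computations unchanged. A dense Gaussian map costs O(nkd) time to apply, which is why a practical implementation would more likely use a faster projection; that choice is outside what this sketch shows.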

PUBLICATION RECORD

Publication year
2015
Venue
Statistics and Computing
Publication date
2015-04-23
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1007/s11222-015-9608-z; arXiv 1504.06122
Source metadata
Semantic Scholar

REFERENCES

The elements of statistical learning: data mining, inference, and prediction, 2nd Edition
2020cited by this paper
Dimensionality Reduction
2018influential reference
Pattern Recognition And Machine Learning
2016cited by this paper
Sketching for M-Estimators: A Unified Approach to Robust Regression
2015cited by this paper
Implementing Randomized Matrix Algorithms in Parallel and Distributed Environments
2015cited by this paper
Statistical and Algorithmic Perspectives on Randomized Sketching for Ordinary Least-Squares
2015cited by this paper
Approximation and Streaming Algorithms for Projective Clustering via Random Projections
2014cited by this paper
Faster SVD-truncated regularized least-squares
2014cited by this paper
Towards scaling up Markov chain Monte Carlo: an adaptive subsampling approach
2014cited by this paper
R: A language and environment for statistical computing.
2014influential reference
Event labeling combining ensemble detectors and background knowledge
2014influential reference
Dimensionality Reduction for k-Means Clustering and Low Rank Approximation
2014influential reference
Bayesian Inference with Big Data : A Snapshot from a Workshop
2014influential reference
Faster SVD-Truncated Least-Squares Regression
2014cited by this paper
Principal Component Analysis and Higher Correlations for Distributed Data
2013cited by this paper
Subspace Embeddings and $\ell_p$-Regression Using Exponential Random Variables
2013cited by this paper
Bayesian Compressed Regression
2013cited by this paper
Lower Bounds for Oblivious Subspace Embeddings
2013cited by this paper
Direct QR factorizations for tall-and-skinny matrices in MapReduce architectures
2013cited by this paper
Turning big data into tiny data: Constant-size coresets for k-means, PCA and projective clustering
2013cited by this paper
A statistical perspective on algorithmic leveraging
2013cited by this paper
Compressed Sensing
2012cited by this paper
Near-Optimal Coresets for Least-Squares Regression
2012cited by this paper
OSNAP: Faster Numerical Linear Algebra Algorithms via Sparser Subspace Embeddings
2012influential reference
The Fast Cauchy Transform: with Applications to Basis Construction, Regression, and Subspace Approximation in L1
2012cited by this paper
Random Projections for Linear Support Vector Machines
2012cited by this paper
The Fast Cauchy Transform and Faster Robust Linear Regression
2012cited by this paper
Improved Matrix Algorithms via the Subsampled Randomized Hadamard Transform
2012cited by this paper
Bayesian computing with INLA: New features
2012influential reference
Low-Rank Approximation and Regression in Input Sparsity Time
2012cited by this paper
Sparsity lower bounds for dimensionality reducing maps
2012influential reference
Tall and skinny QR factorizations in MapReduce architectures
2011cited by this paper
The Johnson-Lindenstrauss Transform: An Empirical Study
2011cited by this paper
Efficient Gaussian process regression for large datasets.
2011cited by this paper
The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo
2011cited by this paper
Bayesian data analysis.
2010cited by this paper
Random Projections for $k$-means Clustering
2010cited by this paper
Approximate Bayesian Computation (ABC) in practice.
2010influential reference
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman
2009cited by this paper
Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions
2009cited by this paper
Numerical linear algebra in the streaming model
2009influential reference
Spectral Algorithms
2009cited by this paper
Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions
2009cited by this paper
Twice-ramanujan sparsifiers
2008influential reference
A Simple Proof of the Restricted Isometry Property for Random Matrices
2008cited by this paper
Communication-optimal Parallel and Sequential QR and LU Factorizations
2008cited by this paper
Optimal Transport: Old and New
2008cited by this paper
Bayesian compressive sensing and projection optimization
2007cited by this paper
Sampling algorithms and coresets for ℓp regression
2007cited by this paper
Faster least squares approximation
2007cited by this paper
Approximate Bayesian Inference for Latent Gaussian Models
2007influential reference
Pseudo-random number generation for sketch-based estimations
2007influential reference
A one-pass sequential Monte Carlo method for Bayesian analysis of massive datasets
2006influential reference
Conservative prior distributions for variance parameters in hierarchical models
2006cited by this paper
Sampling algorithms for l2 regression and applications
2006cited by this paper
Pattern Recognition and Machine Learning
2006influential reference
Improved Approximation Algorithms for Large Matrices via Random Projections
2006influential reference
Data streams: algorithms and applications
2005influential reference
Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information
2004influential reference
Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper)
2004cited by this paper
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
2004cited by this paper
Principal Component Analysis
2003cited by this paper
Economic inequality and burden-sharing in the provision of local environmental quality
2002influential reference
Likelihood-Based Data Squashing: A Modeling Approach to Instance Construction
2002cited by this paper
Approximate Bayesian computation in population genetics.
2002cited by this paper
Abstract The
2002cited by this paper
Squashing flat files flatter
1999cited by this paper
A Reliable Randomized Algorithm for the Closest-Pair Problem
1997cited by this paper
A class of Wasserstein metrics for probability distributions.
1984cited by this paper
Matrix computations
1983cited by this paper
Solving least squares problems
1976influential reference
Numerical methods for solving linear least squares problems
1965cited by this paper

CITED BY

A Trimodal 2D Metasurface Biosensor with Bayesian Regression for Ultra-Sensitive Cancer Biomarker Detection
2025cites this paper
Compressed Bayesian Tensor Regression
2025cites this paper
System-on-Chip Based Accelerator Design for Real-Time Cardiac MRI: Balancing Speed and Accuracy
2025cites this paper
Scalable Bayesian p-generalized probit and logistic regression
2024cites this paper
A Bayesian Hierarchical Model for Orthogonal Tucker Decomposition with Oblivious Tensor Compression
2024cites this paper
Detecting Interactions in High‐Dimensional Data Using Cross Leverage Scores
2024cites this paper
Bayesian quantile regression for streaming data
2024cites this paper
Sparse data-driven random projection in regression for high-dimensional data
2023cites this paper
Impact of Statistics on Data Science
2023cites this paper
Sketching in High-Dimensional Regression With Big Data Using Gaussian Scale Mixture Priors
2022cites this paper
Evaluating dimensionality reduction for genomic prediction
2022cites this paper
On randomized sketching algorithms and the Tracy–Widom law
2022cites this paper
Cross-Leverage Scores for Selecting Subsets of Explanatory Variables
2021influential citation
Reduction for Efficient Probit Regression
2021cites this paper
Yarn Strength CV Prediction Using Principal Component Analysis and Automatic Relevance Determination on Bayesian Platform
2021cites this paper
Oblivious Sketching for Logistic Regression
2021cites this paper
Streaming statistical models via Merge & Reduce
2020cites this paper
Sketching for Two-Stage Least Squares Estimation
2020cites this paper
Improving Random Projections With Extra Vectors to Approximate Inner Products
2020cites this paper
A Framework for Bayesian Optimization in Embedded Subspaces
2019cites this paper
LR-GLM: High-Dimensional Bayesian Inference Using Low-Rank Data Approximations
2019influential citation
Sparse Variational Inference: Bayesian Coresets from Scratch
2019cites this paper
An Econometric View of Algorithmic Subsampling
2019cites this paper
An Econometric Perspective on Algorithmic Subsampling
2019cites this paper
Remaining Useful Life Prediction for Lithium-Ion Battery: A Deep Learning Approach
2018cites this paper
Sketching for Latent Dirichlet-Categorical Models
2018cites this paper
Subspace Embedding and Linear Regression with Orlicz Norm
2018cites this paper
On Coresets for Logistic Regression
2018cites this paper
Data Science: the impact of statistics
2018cites this paper
Fast Bayesian Inference in GLMs with Low Rank Data Approximations
2018cites this paper
A Conceptual Framework for Lithium-ion Battery RUL Prediction Using Deep Learning
2018cites this paper
Core Dependency Networks
2018influential citation
Coresets for Dependency Networks
2017influential citation
Coreset based Dependency Networks
2017cites this paper
Automated Scalable Bayesian Inference via Hilbert Coresets
2017cites this paper
Statistical properties of sketching algorithms
2017cites this paper
Big Data Science
2017cites this paper
Coresets-Methods and History: A Theoreticians Design Pattern for Approximation and Streaming Algorithms
2017cites this paper
Optimal projection of observations in a Bayesian setting
2017cites this paper
Distributed Source Detection With Dimension Reduction in Multiple-Antenna Wireless Networks
2017cites this paper
Acceleration of MCMC-based algorithms using reconfigurable logic
2017cites this paper
A random version of principal component analysis in data clustering
2016cites this paper
Statistik, Data Science und Big Data
2016cites this paper
Comparison among dimensionality reduction techniques based on Random Projection for cancer classification
2016cites this paper
Warm starting Bayesian optimization
2016cites this paper
A note on replacing uniform subsampling by random projections in MCMC for linear regression of tall datasets
2015influential citation
Hierarchische Bayes-Regression bei Einbettung großer Datensätze
2015influential citation
Technical Report for Collaborative Research Center SFB 876 Providing Information by Resource-Constrained Data Analysis Subproject A1 Data Mining for Ubiquitous System Software Machine Learning on FPGAs
year unknowncites this paper