Robustly Learning a Gaussian: Getting Optimal Error, Efficiently

Ilias Diakonikolas,Gautam Kamath,D. Kane,Jerry Li,Ankur Moitra,Alistair Stewart

Published 2017 in ACM-SIAM Symposium on Discrete Algorithms

ABSTRACT

We study the fundamental problem of learning the parameters of a high-dimensional Gaussian in the presence of noise -- where an $\varepsilon$-fraction of our samples were chosen by an adversary. We give robust estimators that achieve estimation error $O(\varepsilon)$ in the total variation distance, which is optimal up to a universal constant that is independent of the dimension. In the case where just the mean is unknown, our robustness guarantee is optimal up to a factor of $\sqrt{2}$ and the running time is polynomial in $d$ and $1/\epsilon$. When both the mean and covariance are unknown, the running time is polynomial in $d$ and quasipolynomial in $1/\varepsilon$. Moreover all of our algorithms require only a polynomial number of samples. Our work shows that the same sorts of error guarantees that were established over fifty years ago in the one-dimensional setting can also be achieved by efficient algorithms in high-dimensional settings.

PUBLICATION RECORD

Publication year
2017
Venue
ACM-SIAM Symposium on Discrete Algorithms
Publication date
2017-04-12
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1137/1.9781611975031.171 arXiv 1704.03866
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Robust Statistics
2018cited by this paper
Robust Sparse Estimation Tasks in High Dimensions
2017cited by this paper
Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers
2017cited by this paper
Being Robust (in High Dimensions) Can Be Practical
2017cited by this paper
Computationally Efficient Robust Estimation of Sparse Functionals
2017cited by this paper
Computationally Efficient Robust Sparse Estimation in High Dimensions
2017cited by this paper
Robust Learning of Fixed-Structure Bayesian Networks
2016cited by this paper
Agnostic Estimation of Mean and Covariance
2016cited by this paper
Robust Estimators in High Dimensions without the Computational Intractability
2016influential reference
Learning from untrusted data
2016cited by this paper
Geometric Algorithms and Combinatorial Optimization
2016cited by this paper
Statistical Query Lower Bounds for Robust Estimation of High-Dimensional Gaussians and Gaussian Mixtures
2016cited by this paper
Faster and Sample Near-Optimal Algorithms for Proper Learning Mixtures of Gaussians
2013cited by this paper
Analysis of Boolean Functions
2012cited by this paper
Algorithms and Hardness for Robust Subspace Recovery
2012cited by this paper
A Structure Theorem for Poorly Anticoncentrated Gaussian Chaoses and Applications to the Study of Polynomial Threshold Functions
2012cited by this paper
Learning Poisson Binomial Distributions
2011cited by this paper
Introduction to the non-asymptotic analysis of random matrices
2010cited by this paper
Bayesian data analysis.
2010cited by this paper
Latent Dirichlet Allocation
2009cited by this paper
Et al
2008cited by this paper
Genes mirror geography within Europe
2008cited by this paper
A central limit theorem for convex sets
2006cited by this paper
Robust Estimators are Hard to Compute
2006cited by this paper
Robust Regression and Outlier Detection
2005cited by this paper
ADAPTIVE ESTIMATION OF A QUADRATIC FUNCTIONAL BY MODEL SELECTION
2001cited by this paper
Robust statistics: a brief introduction and overview
2001cited by this paper
Distributional and L-q norm inequalities for polynomials over convex bodies in R-n
2001cited by this paper
Combinatorial methods in density estimation
2001cited by this paper
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
1996cited by this paper
Robust Statistics—The Approach Based on Influence Functions
1986cited by this paper
Multivariate estimation with high breakdown point
1985cited by this paper
On robust estimation of the location parameter
1980influential reference
Mathematics and the Picturing of Data
1975cited by this paper
Robust Estimation of a Location Parameter
1964cited by this paper
On the spherical surface of smallest radius enclosing a bounded subset of $n$-dimensional euclidean space
1941cited by this paper
Ueber die kleinste Kugel, die eine räumliche Figur einschliesst.
year unknowncited by this paper
Edinburgh Research Explorer Learning from satisfying assignments
year unknowncited by this paper

CITED BY

Efficient Multivariate Robust Mean Estimation Under Mean-Shift Contamination
2025cites this paper
Linear Regression under Missing or Corrupted Coordinates
2025cites this paper
On the Learnability of Distribution Classes with Adaptive Adversaries
2025cites this paper
Cross-modality material embedding loss for transferring knowledge between heterogeneous material descriptors
2025cites this paper
Learning High-dimensional Gaussians from Censored Data
2025cites this paper
Improved Robust Estimation for Erdős-Rényi Graphs: The Sparse Regime and Optimal Breakdown Point
2025cites this paper
Byzantine Game Theory: Sun Tzus Boxes
2025cites this paper
Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination
2024influential citation
A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion *
2024cites this paper
Distribution Learnability and Robustness
2024cites this paper
The Broader Landscape of Robustness in Algorithmic Statistics
2024cites this paper
Efficient Statistics With Unknown Truncation, Polynomial Time Algorithms, Beyond Gaussians
2024cites this paper
Robust Mixture Learning when Outliers Overwhelm Small Groups
2024cites this paper
Robust Distribution Learning with Local and Global Adversarial Corruptions
2024cites this paper
Multigroup Robustness
2024cites this paper
Beyond Catoni: Sharper Rates for Heavy-Tailed and Robust Mean Estimation
2023cites this paper
Near-Optimal Algorithms for Gaussians with Huber Contamination: Mean Estimation and Linear Regression
2023cites this paper
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
2023cites this paper
The Full Landscape of Robust Mean Testing: Sharp Separations between Oblivious and Adaptive Contamination
2023cites this paper
Bayesian Strategy-Proof Facility Location via Robust Estimation
2023influential citation
Learning Mixtures of Gaussians with Censored Data
2023cites this paper
Spectral clustering in the Gaussian mixture block model
2023cites this paper
Privately Estimating a Gaussian: Efficient, Robust, and Optimal
2023cites this paper
Robust and Sparse Estimation of Linear Regression Coefficients with Heavy-tailed Noises and Covariates
2022cites this paper
Efficient List-Decodable Regression using Batches
2022cites this paper
Nearly minimax robust estimator of the mean vector by iterative spectral dimension reduction
2022cites this paper
Privately Estimating a Gaussian: Efficient, Robust and Optimal
2022cites this paper
Adversarial Robust and Sparse Estimation of Linear Regression Coeﬃcient
2022cites this paper
Robust estimation algorithms don't need to know the corruption level
2022cites this paper
Optimal Sub-Gaussian Mean Estimation in Very High Dimensions
2022cites this paper
Outlier Robust and Sparse Estimation of Linear Regression Coefficients
2022cites this paper
Estimation Contracts for Outlier-Robust Geometric Perception
2022cites this paper
FITNESS: (Fine Tune on New and Similar Samples) to detect anomalies in streams with drift and outliers
2022cites this paper
Exponential Weights Algorithms for Selective Learning
2021cites this paper
Buying Data Over Time: Approximately Optimal Strategies for Dynamic Data-Driven Decisions
2021cites this paper
Adversarial robust weighted Huber regression
2021influential citation
Covariance-Aware Private Mean Estimation Without Private Covariance Estimation
2021cites this paper
Efficiently learning halfspaces with Tsybakov noise
2021cites this paper
Learning GMMs with Nearly Optimal Robustness Guarantees
2021cites this paper
Robustness meets algorithms
2021cites this paper
Stochastic Dueling Bandits with Adversarial Corruption
2021cites this paper
Robust and Differentially Private Mean Estimation
2021cites this paper
SoS Degree Reduction with Applications to Clustering and Robust Moment Estimation
2021cites this paper
Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error
2021cites this paper
Private Robust Estimation by Stabilizing Convex Relaxations
2021cites this paper
Kalman filtering with adversarial corruptions
2021cites this paper
Robust Estimation for Random Graphs
2021cites this paper
Efficient Algorithms for Learning from Coarse Labels
2021influential citation
Robust Online Convex Optimization in the Presence of Outliers
2021cites this paper
List-Decodable Mean Estimation via Iterative Multi-Filtering
2020cites this paper
Computationally and Statistically Efficient Truncated Regression
2020cites this paper
Prophet Inequalities with Linear Correlations and Augmentations
2020cites this paper
All-in-one robust estimator of the Gaussian mean
2020cites this paper
List-Decodable Subspace Recovery via Sum-of-Squares
2020cites this paper
Submitted to the Annals of Statistics ROBUST MULTIVARIATE MEAN ESTIMATION : THE OPTIMALITY OF TRIMMED MEAN By
2020cites this paper
Robust estimation with Lasso when outputs are adversarially contaminated
2020cites this paper
Learning Entangled Single-Sample Distributions via Iterative Trimming
2020cites this paper
Interactive Proofs for Verifying Machine Learning
2020cites this paper
Outlier-Robust Clustering of Non-Spherical Mixtures
2020cites this paper
Robustly Learning any Clusterable Mixture of Gaussians
2020cites this paper
Reducibility and Statistical-Computational Gaps from Secret Leakage
2020cites this paper
Robust Distributed Learning
2020cites this paper
Estimating Principal Components under Adversarial Perturbations
2020influential citation
Learning Halfspaces with Tsybakov Noise
2020cites this paper
Robust Sub-Gaussian Principal Component Analysis and Width-Independent Schatten Packing
2020cites this paper
Efficient Statistics for Sparse Graphical Models from Truncated Samples
2020cites this paper
Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time
2020cites this paper
Online Robust Regression via SGD on the l1 loss
2020cites this paper
Robust linear regression: optimal rates in polynomial time
2020cites this paper
Efficient Parameter Estimation of Truncated Boolean Product Distributions
2020cites this paper
Learning Entangled Single-Sample Gaussians in the Subset-of-Signals Model
2020cites this paper
Optimal Robust Linear Regression in Nearly Linear Time
2020cites this paper
Robust and Heavy-Tailed Mean Estimation Made Simple, via Regret Minimization
2020cites this paper
Rank Aggregation from Pairwise Comparisons in the Presence of Adversarial Corruptions
2020cites this paper
Robust Mean Estimation on Highly Incomplete Data with Arbitrary Outliers
2020cites this paper
A Polynomial Time Algorithm for Learning Halfspaces with Tsybakov Noise
2020cites this paper
Online and Distribution-Free Robustness: Regression and Contextual Bandits with Huber Contamination
2020cites this paper
Adversarial Robust Low Rank Matrix Estimation: Compressed Sensing and Matrix Completion
2020cites this paper
Private and Secure Distributed Learning
2020cites this paper
Optimal Mean Estimation without a Variance
2020cites this paper
Hardness of Learning Halfspaces with Massart Noise
2020cites this paper
Outlier-Robust Clustering of Gaussians and Other Non-Spherical Mixtures
2020cites this paper
Robustness
2020cites this paper
Near-Optimal Statistical Query Hardness of Learning Halfspaces with Massart Noise
2020cites this paper
Outlier-robust estimation of a sparse linear model using 𝓁1-penalized Huber's M-estimator
2019cites this paper
Robust Subspace Recovery with Adversarial Outliers
2019influential citation
List-decodeable Linear Regression
2019cites this paper
Beyond the Worst-Case Analysis of Algorithms
2019influential citation
Recent Advances in Algorithmic High-Dimensional Robust Statistics
2019cites this paper
Robust Dynamic Assortment Optimization in the Presence of Outlier Customers
2019cites this paper
Outlier-Robust High-Dimensional Sparse Estimation via Iterative Filtering
2019cites this paper
Generalized Resilience and Robust Statistics
2019influential citation
Nearly Tight Bounds for Robust Proper Learning of Halfspaces with a Margin
2019cites this paper
Average-Case Lower Bounds for Learning Sparse Mixtures, Robust Estimation and Semirandom Adversaries
2019cites this paper
Efficient Truncated Statistics with Unknown Truncation
2019cites this paper
Robust multivariate mean estimation: The optimality of trimmed mean
2019cites this paper
Genuinely distributed Byzantine machine learning
2019cites this paper
The Limitations of Adversarial Training and the Blind-Spot Attack
2019cites this paper
Globally-convergent Iteratively Reweighted Least Squares for Robust Regression Problems
2019cites this paper
Robust Algorithms for the Secretary Problem
2019cites this paper