On the inconsistency of ℓ1-penalised sparse precision matrix estimation

Otte Heinävaara, Janne Leppä-aho, J. Corander, Antti Honkela

Published 2016 in BMC Bioinformatics

ABSTRACT

Various ℓ1-penalised estimation methods, such as the graphical lasso and CLIME, are widely used for sparse precision matrix estimation and for learning undirected network structure from data. Many of these methods have been shown to be consistent under various quantitative assumptions about the underlying true covariance matrix. Intuitively, these conditions are related to situations where the penalty term will dominate the optimisation. We explore the consistency of ℓ1-based methods for a class of bipartite graphs motivated by the structure of models commonly used for gene regulatory networks. We show that all ℓ1-based methods fail dramatically for models with nearly linear dependencies between the variables. We also study consistency on models derived from real gene expression data. We note that the assumptions needed for consistency never hold even for modest-sized gene networks, and that ℓ1-based methods also become unreliable in practice for larger networks. Our results demonstrate that ℓ1-penalised undirected network structure learning methods are unable to reliably learn many sparse bipartite graph structures, which arise often in gene expression data. Users of such methods should be aware of the methods' consistency criteria and check whether they are likely to be met in their application of interest.
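
The following toy sketch is not taken from the paper. It is one way to picture what "nearly linear dependencies" in a bipartite model can look like. It assumes a hypothetical three-variable model: one regulator x ~ N(0, 1) and two targets y1 = x + e1 and y2 = x + e2 with independent noise of variance `noise_var`. The helper names (`invert`, `toy_covariance`, `l1_norm`) are illustrative only. The code inverts the exact covariance matrix and reports the precision entry between y1 and y2 together with the ℓ1 norm of the true precision matrix.

```python
from fractions import Fraction


def invert(matrix):
    """Invert a small square matrix exactly with Gauss-Jordan elimination."""
    n = len(matrix)
    # Augment the matrix with the identity: [A | I].
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        # Find a row with a non-zero pivot and swap it into place.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    # The right half now holds the inverse.
    return [row[n:] for row in aug]


def toy_covariance(noise_var):
    """Covariance of (x, y1, y2) in the assumed toy model (not from the paper)."""
    s2 = Fraction(noise_var)
    return [[1, 1, 1],
            [1, 1 + s2, 1],
            [1, 1, 1 + s2]]


def l1_norm(matrix):
    """Sum of absolute values of all entries."""
    return sum(abs(v) for row in matrix for v in row)


for noise_var in (Fraction(1), Fraction(1, 10), Fraction(1, 100)):
    precision = invert(toy_covariance(noise_var))
    print(f"noise_var={noise_var}: "
          f"precision(y1, y2)={precision[1][2]}, "
          f"l1 norm of precision={l1_norm(precision)}")
```

In this toy model the true graph is sparse: the precision entry between y1 and y2 is exactly zero for every noise level, even though their covariance is 1. The ℓ1 norm of the true precision matrix equals 1 + 8/`noise_var`, so it grows without bound as the targets become nearly linear functions of the regulator. This is only a plausible illustration of why an ℓ1 penalty might weigh heavily against the true sparse structure in such models; it does not reproduce the paper's analysis.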

PUBLICATION RECORD

Publication year: 2016
Venue: BMC Bioinformatics
Publication date: 2016-03-08
Fields of study: Mathematics, Computer Science, Medicine
Identifiers: DOI 10.1186/s12859-016-1309-x; arXiv 1603.02532; PMID 28105909; PMCID 5249033
Source metadata: Semantic Scholar, PubMed

REFERENCES

Graphical Models (2020), cited by this paper
Learning Gaussian graphical models with fractional marginal pseudo-likelihood (2016), cited by this paper
Graphical Models In Applied Multivariate Statistics (2016), cited by this paper
An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network (2015), cited by this paper
QUIC: quadratic approximation for sparse inverse covariance estimation (2014), cited by this paper
Marginal Pseudo-Likelihood Learning of Markov Network structures (2014), cited by this paper
Comprehensive molecular portraits of human breast tumours (2013), cited by this paper
Fast and adaptive sparse precision matrix estimation in high dimensions (2012), cited by this paper
Comprehensive molecular portraits of human breast tumors (2012), cited by this paper
Calculating Determinants of Block Matrices (2011), cited by this paper
A Constrained ℓ1 Minimization Approach to Sparse Precision Matrix Estimation (2011), influential reference
Scikit-learn: Machine Learning in Python (2011), cited by this paper
Gene Regulatory Networks from Multifactorial Perturbations Using Graphical Lasso: Application to the DREAM4 Challenge (2010), cited by this paper
The Coordinated P53 and Estrogen Receptor Cis-Regulation at an FLT1 Promoter SNP Is Specific to Genotoxic Stress and Estrogenic Compound (2010), cited by this paper
Partial Correlation Estimation by Joint Sparse Regression Models (2008), cited by this paper
High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence (2008), cited by this paper
A note on the Lasso for Gaussian graphical model selection (2008), cited by this paper
Sparse inverse covariance estimation with the graphical lasso (2008), cited by this paper
An Arabidopsis gene network based on the graphical Gaussian model (2007), cited by this paper
Model selection and estimation in the Gaussian graphical model (2007), cited by this paper
On Model Selection Consistency of Lasso (2006), cited by this paper
Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities (2006), cited by this paper
High-dimensional graphs and variable selection with the Lasso (2006), cited by this paper
Bayesian sparse hidden components analysis for transcription regulation networks (2005), cited by this paper
Objective Bayes Factors for Gaussian Directed Acyclic Graphical Models, Quaderni di Dipartimento (2004), cited by this paper
Network component analysis: Reconstruction of regulatory signals in biological systems (2003), cited by this paper
Inverses of 2 × 2 block matrices (2002), cited by this paper
Fundamental patterns underlying gene expression profiles: simplicity from complexity (2000), cited by this paper
Parameter Priors for Directed Acyclic Graphical Models and the Characterization of Several Probability Distributions (1999), cited by this paper
Regression Shrinkage and Selection via the Lasso (1996), cited by this paper
Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data (year unknown), cited by this paper

CITED BY

Rigidity theory in statistical inference (2026), cites this paper
Leveraging Low-Rank Factorizations of Conditional Correlation Matrices in Graph Learning (2025), cites this paper
Asymptotic post-selection inference for regularized graphical models (2025), cites this paper
Signal Estimation and Uncertainties Extraction in Terahertz Time-Domain Spectroscopy (2024), cites this paper
Clusterpath Gaussian Graphical Modeling (2024), cites this paper
Simulation-Based Performance Evaluation of Missing Data Handling in Network Analysis (2024), cites this paper
Maximum likelihood thresholds of Gaussian graphical models and graphical lasso (2023), cites this paper
Learning Graphical Factor Models with Riemannian Optimization (2022), cites this paper
Graph Learning Techniques Using Structured Data for IoT Air Pollution Monitoring Platforms (2021), cites this paper
Structured Graph Learning Via Laplacian Spectral Constraints (2019), cites this paper
A Unified Framework for Structured Graph Learning via Spectral Constraints (2019), cites this paper
Back to the basics: Rethinking partial correlation network methodology (2019), influential citation
A Combined PLS and Negative Binomial Regression Model for Inferring Association Networks from Next-Generation Sequencing Count Data (2018), cites this paper
Inferring the global financial network from high-dimensional time-series of stock returns (2018), influential citation
Lower bounds for two-sample structural change detection in ising and Gaussian models (2017), cites this paper
Selected proceedings of Machine Learning in Systems Biology: MLSB 2016 (2016), cites this paper