Sparse PCA with False Discovery Rate Controlled Variable Selection

Jasin Machkour,A. Breloy,Michael Muma,D. Palomar,Frédéric Pascal

Published 2024 in IEEE International Conference on Acoustics, Speech, and Signal Processing

ABSTRACT

Sparse principal component analysis (PCA) aims at mapping large dimensional data to a linear subspace of lower dimension. By imposing loading vectors to be sparse, it performs the double duty of dimension reduction and variable selection. Sparse PCA algorithms are usually expressed as a trade-off between explained variance and sparsity of the loading vectors (i.e., number of selected variables). As a high explained variance is not necessarily synonymous with relevant information, these methods are prone to select irrelevant variables. To overcome this issue, we propose an alternative formulation of sparse PCA driven by the false discovery rate (FDR). We then leverage the Terminating-Random Experiments (T-Rex) selector to automatically determine an FDR-controlled support of the loading vectors. A major advantage of the resulting T-Rex PCA is that no sparsity parameter tuning is required. Numerical experiments and a stock market data example demonstrate a significant performance improvement.

PUBLICATION RECORD

Publication year
2024
Venue
IEEE International Conference on Acoustics, Speech, and Signal Processing
Publication date
2024-01-16
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1109/ICASSP48485.2024.10448237 arXiv 2401.08375
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks
2023cited by this paper
Robust and Globally Sparse Pca via Majorization-Minimization and Variable Splitting
2023cited by this paper
False Discovery Rate Control for Grouped Variable Selection in High-Dimensional Linear Models Using the T-Knock Filter
2022cited by this paper
The terminating-random experiments selector: Fast high-dimensional variable selection with false discovery rate control
2021influential reference
Majorization-Minimization on the Stiefel Manifold With Application to Robust Sparse PCA
2021cited by this paper
Variable Selection
2019cited by this paper
A Selective Overview of Sparse Principal Component Analysis
2018cited by this paper
Principal component analysis: a review and recent developments
2016cited by this paper
Panning for gold: ‘model‐X’ knockoffs for high dimensional controlled variable selection
2016cited by this paper
Orthogonal Sparse PCA and Covariance Estimation via Procrustes Reformulation
2016cited by this paper
Sparse Principal Component Analysis via Rotation and Truncation
2014cited by this paper
Controlling the false discovery rate via knockoffs
2014cited by this paper
双対平坦空間におけるLeast Angle Regressionと情報量規準
2009cited by this paper
TESTING SIGNIFICANCE OF FEATURES BY LASSOED PRINCIPAL COMPONENTS.
2008cited by this paper
Statistical arbitrage in the US equities market
2008cited by this paper
Sparse Variable PCA Using Geodesic Steepest Descent
2008cited by this paper
Sparse Principal Component Analysis
2006influential reference
Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach
2004cited by this paper
Principal Component Analysis
2003cited by this paper
THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY
2001cited by this paper
Regression Shrinkage and Selection via the Lasso
1996cited by this paper
Controlling the false discovery rate: a practical and powerful approach to multiple testing
1995cited by this paper

CITED BY

Knockoff-Guided Compressive Sensing: A Statistical Machine Learning Framework for Support-Assured Signal Recovery
2025cites this paper
FDR-Controlled Portfolio Optimization for Sparse Financial Index Tracking
2024cites this paper
High-dimensional false discovery rate control for dependent variables
2024cites this paper
FDR-Controlled Sparse Index Tracking with Autoregressive Stock Dependency Models
2024cites this paper
The terminating-random experiments selector: Fast high-dimensional variable selection with false discovery rate control
2021cites this paper