Provable Filter Pruning for Efficient Neural Networks

Lucas Liebenwein,Cenk Baykal,Harry Lang,Dan Feldman,D. Rus

Published 2019 in International Conference on Learning Representations

ABSTRACT

We present a provable, sampling-based approach for generating compact Convolutional Neural Networks (CNNs) by identifying and removing redundant filters from an over-parameterized network. Our algorithm uses a small batch of input data points to assign a saliency score to each filter and constructs an importance sampling distribution where filters that highly affect the output are sampled with correspondingly high probability. In contrast to existing filter pruning approaches, our method is simultaneously data-informed, exhibits provable guarantees on the size and performance of the pruned network, and is widely applicable to varying network architectures and data sets. Our analytical bounds bridge the notions of compressibility and importance of network structures, which gives rise to a fully-automated procedure for identifying and preserving filters in layers that are essential to the network's performance. Our experimental evaluations on popular architectures and data sets show that our algorithm consistently generates sparser and more efficient models than those constructed by existing filter pruning approaches.

PUBLICATION RECORD

Publication year
2019
Venue
International Conference on Learning Representations
Publication date
2019-11-18
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1911.07412
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Dynamic Model Pruning with Feedback
2020cited by this paper
Soft Taylor Pruning for Accelerating Deep Convolutional Neural Networks
2020influential reference
MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning
2019cited by this paper
Learning Filter Basis for Convolutional Neural Network Compression
2019cited by this paper
Revisiting hard thresholding for DNN pruning
2019cited by this paper
The State of Sparsity in Deep Neural Networks
2019cited by this paper
SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks
2019influential reference
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
2018cited by this paper
Stronger generalization bounds for deep nets via a compression approach
2018cited by this paper
Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers
2018cited by this paper
Learning to Prune Filters in Convolutional Neural Networks
2018cited by this paper
Ridge Regression and Provable Deterministic Ridge Leverage Score Sampling
2018cited by this paper
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
2018influential reference
The Lottery Ticket Hypothesis: Training Pruned Neural Networks
2018cited by this paper
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
2018cited by this paper
Rethinking the Value of Network Pruning
2018cited by this paper
SNIP: Single-shot Network Pruning based on Connection Sensitivity
2018influential reference
Learning Steering Bounds for Parallel Autonomous Systems
2018influential reference
AutoPruner: An End-to-End Trainable Filter Pruning Method for Efficient Deep Model Inference
2018cited by this paper
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
2017cited by this paper
On Compressing Deep Models by Low Rank and Sparse Decomposition
2017influential reference
More is Less: A More Complicated Network with Less Inference Complexity
2017cited by this paper
NISP: Pruning Networks Using Neuron Importance Score Propagation
2017influential reference
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
2017influential reference
Compression-aware Training of Deep Networks
2017cited by this paper
Soft Weight-Sharing for Neural Network Compression
2017cited by this paper
Practical Coreset Constructions for Machine Learning
2017cited by this paper
Automatic differentiation in PyTorch
2017cited by this paper
Channel Pruning for Accelerating Very Deep Neural Networks
2017cited by this paper
New Frameworks for Offline and Streaming Coreset Constructions
2016influential reference
Dynamic Network Surgery for Efficient DNNs
2016cited by this paper
Densely Connected Convolutional Networks
2016influential reference
Pruning Filters for Efficient ConvNets
2016influential reference
Wide Residual Networks
2016cited by this paper
Training CNNs with Low-Rank Filters for Efficient Image Classification
2015cited by this paper
Deep Residual Learning for Image Recognition
2015influential reference
Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding
2015influential reference
An Introduction to Matrix Concentration Inequalities
2015cited by this paper
Binary embeddings with structured hashed projections
2015cited by this paper
Compressing Neural Networks with the Hashing Trick
2015cited by this paper
Speeding up Convolutional Neural Networks with Low Rank Expansions
2014cited by this paper
Provable deterministic leverage score sampling
2014cited by this paper
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
A Note on Randomized Element-wise Matrix Sparsification
2014cited by this paper
Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation
2014cited by this paper
Probability in High Dimension
2014cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014influential reference
Matrix Entry-wise Sampling : Simple is Best [ Extended Abstract ]
2013cited by this paper
A unified framework for approximating and clustering data
2011cited by this paper
Feature hashing for large scale multitask learning
2009cited by this paper
Hash Kernels for Structured Data
2009cited by this paper
High dimensional probability : proceedings of the Fourth International Conference
2006cited by this paper
High Dimensional Probability III
2003cited by this paper
Gradient-based learning applied to document recognition
1998influential reference
Optimal Brain Damage
1989cited by this paper

CITED BY

CNN Compression via Channel-Wise Variance-Based Filter Pruning
2026cites this paper
AdaSP: Adaptive Soft Filter Pruning With Layer-Wise Ratios for Model Compression in Low-Altitude UAVs
2026cites this paper
DataS^3: Dataset Subset Selection for Specialization
2025cites this paper
Hessian-driven N:M sparsity and quantization co-optimization for edge device deployment
2025cites this paper
Theoretical Compression Bounds for Wide Multilayer Perceptrons
2025cites this paper
IESSP: Information Extraction-Based Sparse Stripe Pruning Method for Deep Neural Networks
2025cites this paper
Pruning AMR: Efficient Visualization of Implicit Neural Representations via Weight Matrix Analysis
2025cites this paper
ModHiFi: Identifying High Fidelity predictive components for Model Modification
2025cites this paper
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
2025cites this paper
The Quest for Universal Master Key Filters in DS-CNNs
2025cites this paper
Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning
2025cites this paper
Weight-Sharing NAS with Architecture-Agnostic Intermediate Representation
2025influential citation
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
2025influential citation
Resource-Aware Neural Network Pruning Using Constrained Reinforcement Learning and Self-Competition
2025cites this paper
Catalyst: a Novel Regularizer for Structured Pruning with Auxiliary Extension of Parameter Space
2025cites this paper
Flexible Group Count Enables Hassle-Free Structured Pruning
2025cites this paper
A Rescaling-Invariant Lipschitz Bound Based on Path-Metrics for Modern ReLU Network Parameterizations
2024influential citation
DDEP: Evolutionary pruning using distilled dataset
2024cites this paper
REPrune: Channel Pruning via Kernel Representative Selection
2024cites this paper
Reduced storage direct tensor ring decomposition for convolutional neural networks compression
2024cites this paper
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
2024cites this paper
Filter pruning via schatten p-norm
2024cites this paper
Enhanced Network Compression Through Tensor Decompositions and Pruning
2024cites this paper
DisCEdit: Model Editing by Identifying Discriminative Components
2024cites this paper
Mutual Information Preserving Neural Network Pruning
2024influential citation
A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs
2024cites this paper
Markov-PQ: Joint Pruning-Quantization via Learnable Markov Chain
2024cites this paper
Shapley Pruning for Neural Network Compression
2024cites this paper
Co-Exploring Structured Sparsification and Low-Rank Tensor Decomposition for Compact DNNs
2024cites this paper
Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition
2024cites this paper
Archtree: on-the-fly tree-structured exploration for latency-aware pruning of deep neural networks
2023cites this paper
Computing Approximate $\ell_p$ Sensitivities
2023cites this paper
Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
2023cites this paper
SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration
2023cites this paper
Filter Pruning For CNN With Enhanced Linear Representation Redundancy
2023cites this paper
SCoTTi: Save Computation at Training Time with an adaptive framework
2023cites this paper
A Differentiable Framework for End-to-End Learning of Hybrid Structured Compression
2023cites this paper
Neural Network Light Weighting Approach Using Multi-Metric Evaluation of Convolution Kernels
2023cites this paper
A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations
2023cites this paper
Efficient Layer Compression Without Pruning
2023cites this paper
Towards Efficient Convolutional Neural Network for Embedded Hardware via Multi-Dimensional Pruning
2023cites this paper
D-Score: A Synapse-Inspired Approach for Filter Pruning
2023cites this paper
TVSPrune - Pruning Non-discriminative filters via Total Variation separability of intermediate representations without fine tuning
2023cites this paper
Block-wise Pruning for Convolutional Neural Networks
2023cites this paper
Structural Alignment for Network Pruning through Partial Regularization
2023cites this paper
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models
2023cites this paper
CR-SFP: Learning Consistent Representation for Soft Filter Pruning
2023cites this paper
Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
2023cites this paper
Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions
2023cites this paper
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
2023cites this paper
ZipLM: Inference-Aware Structured Pruning of Language Models
2023cites this paper
Iterative clustering pruning for convolutional neural networks
2023cites this paper
EVC: Towards Real-Time Neural Image Compression with Mask Decay
2023cites this paper
Towards Optimal Compression: Joint Pruning and Quantization
2023cites this paper
Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
2023cites this paper
Structured Pruning for Deep Convolutional Neural Networks: A Survey
2023cites this paper
Integrating Fairness and Model Pruning Through Bi-level Optimization
2023cites this paper
Gradient-Free Structured Pruning with Unlabeled Data
2023cites this paper
Provable Data Subset Selection For Efficient Neural Network Training
2023cites this paper
Dynamic Structure Pruning for Compressing CNNs
2023cites this paper
Performance-Aware Approximation of Global Channel Pruning for Multitask CNNs
2023cites this paper
CP3: Channel Pruning Plug-in for Point-Based Networks
2023cites this paper
A Tensor-based Convolutional Neural Network for Small Dataset Classification
2023cites this paper
Progressive Channel-Shrinking Network
2023cites this paper
Boosting Convolutional Neural Networks With Middle Spectrum Grouped Convolution
2023cites this paper
Neural Network Reduction with Guided Regularizers
2023cites this paper
EMNAPE: Efficient Multi-Dimensional Neural Architecture Pruning for EdgeAI
2023cites this paper
RGP: Neural Network Pruning Through Regular Graph With Edges Swapping
2023cites this paper
Differentiable Transportation Pruning
2023cites this paper
Empirical evaluation of filter pruning methods for acceleration of convolutional neural network
2023influential citation
Performance optimizations on U-Net speech enhancement models
2022cites this paper
GhostNets on Heterogeneous Devices via Cheap Operations
2022influential citation
Adaptive Activation-based Structured Pruning
2022influential citation
Algorithm 1 Adaptive Iterative Structured Pruning Algorithm
2022influential citation
Coresets for Data Discretization and Sine Wave Fitting
2022influential citation
Obstacle Aware Sampling for Path Planning
2022cites this paper
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
2022cites this paper
Data-Efficient Structured Pruning via Submodular Optimization
2022influential citation
CHEX: CHannel EXploration for CNN Model Compression
2022cites this paper
Compressing convolutional neural networks with hierarchical Tucker-2 decomposition
2022cites this paper
End-to-End Sensitivity-Based Filter Pruning
2022influential citation
Differentiable Network Pruning via Polarization of Probabilistic Channelwise Soft Masks
2022cites this paper
Robust Learning of Parsimonious Deep Neural Networks
2022influential citation
Revisiting Random Channel Pruning for Neural Network Compression
2022influential citation
Adaptive Neural Network Structure Optimization Algorithm Based on Dynamic Nodes
2022cites this paper
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm
2022cites this paper
SInGE: Sparsity via Integrated Gradients Estimation of Neuron Relevance
2022cites this paper
Dynamic Selection of Perception Models for Robotic Control
2022cites this paper
Trainability Preserving Neural Structured Pruning
2022cites this paper
Global balanced iterative pruning for efficient convolutional neural networks
2022cites this paper
Group and Exclusive Sparse Regularization-based Continual Learning of CNNs
2022cites this paper
Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps
2022cites this paper
CAP: instance complexity-aware network pruning
2022cites this paper
An Enhanced Scheme for Reducing the Complexity of Pointwise Convolutions in CNNs for Image Classification Based on Interleaved Grouped Filters without Divisibility Constraints
2022cites this paper
Pruning Neural Networks via Coresets and Convex Geometry: Towards No Assumptions
2022influential citation
Deep Learning on Home Drone: Searching for the Optimal Architecture
2022cites this paper
CWP: Instance complexity weighted channel-wise soft masks for network pruning
2022cites this paper
SeKron: A Decomposition Method Supporting Many Factorization Structures
2022cites this paper
Accelerating CNN via Dynamic Pattern-based Pruning Network
2022cites this paper
Pruning by Active Attention Manipulation
2022cites this paper