Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks

Çaglar Gülçehre, Kyunghyun Cho, Razvan Pascanu, Yoshua Bengio

Published 2013 in ECML/PKDD

ABSTRACT

In this paper we propose and investigate a novel nonlinear unit, called the Lp unit, for deep neural networks. The proposed Lp unit receives signals from several projections of a subset of units in the layer below and computes a normalized Lp norm. We notice two interesting interpretations of the Lp unit. First, the proposed unit can be understood as a generalization of a number of conventional pooling operators, such as average, root-mean-square and max pooling, that are widely used in, for instance, convolutional neural networks (CNN), HMAX models and neocognitrons. Furthermore, the Lp unit is, to a certain degree, similar to the recently proposed maxout unit [13], which achieved state-of-the-art object recognition results on a number of benchmark datasets. Second, we provide a geometrical interpretation of the activation function, based on which we argue that the Lp unit is more efficient at representing complex, nonlinear separating boundaries. Each Lp unit defines a superelliptic boundary, with its exact shape defined by the order p. We claim that this makes it possible to model arbitrarily shaped, curved boundaries more efficiently by combining a few Lp units of different orders. This insight justifies the need for learning different orders for each unit in the model. We empirically evaluate the proposed Lp units on a number of datasets and show that multilayer perceptrons (MLP) consisting of Lp units achieve state-of-the-art results on a number of benchmark datasets. Furthermore, we evaluate the proposed Lp unit on the recently proposed deep recurrent neural networks (RNN).
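The pooling interpretation in the abstract can be illustrated with a minimal NumPy sketch. It assumes the normalized norm takes the form (mean of |z_i|^p)^(1/p) over the projections z = Wx + b; the names `lp_pool`, `lp_unit`, `W`, `b` and `p` are hypothetical and not taken from the paper, and the order p is fixed here rather than learned.

```python
import numpy as np

def lp_pool(z, p):
    """Normalized Lp norm of a 1-D array z: (mean(|z_i|^p))^(1/p), for p >= 1."""
    a = np.abs(np.asarray(z, dtype=float))
    m = a.max()
    if m == 0.0:
        return 0.0
    # Factor out the largest magnitude so that large p does not overflow.
    return float(m * np.mean((a / m) ** p) ** (1.0 / p))

def lp_unit(x, W, b, p):
    """Hypothetical Lp unit: project x with W and b, then pool the projections."""
    return lp_pool(W @ x + b, p)

z = np.array([1.0, -2.0, 3.0])
mean_abs = lp_pool(z, 1)      # p = 1: average of absolute values
rms = lp_pool(z, 2)           # p = 2: root-mean-square
near_max = lp_pool(z, 200)    # large p: approaches max |z_i|
```

Under this assumed form, p = 1 and p = 2 give the average of absolute values and the root-mean-square, and the value moves toward the largest magnitude as p grows, which matches the abstract's description of the unit as a generalization of average, root-mean-square and max pooling. The level set of `lp_pool` at a fixed value is a superellipse in the space of projections.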

PUBLICATION RECORD

Publication year: 2013
Venue: ECML/PKDD
Publication date: 2013-11-07
Fields of study: Mathematics, Computer Science
Identifiers: DOI 10.1007/978-3-662-44848-9_34; arXiv 1311.1780
External record: Semantic Scholar
Source metadata: Semantic Scholar

REFERENCES

Pylearn2: a machine learning research library
2013, cited by this paper
How to Construct Deep Recurrent Neural Networks
2013, cited by this paper
On Fast Dropout and its Applicability to Recurrent Networks
2013, cited by this paper
Knowledge Matters: Importance of Prior Information for Optimization
2013, influential reference
Revisiting Natural Gradient for Deep Networks
2013, cited by this paper
Learned-norm pooling for deep neural networks
2013, cited by this paper
Maxout Networks
2013, influential reference
Modeling Natural Images Using Gated MRFs
2013, cited by this paper
Deep Learning of Representations
2013, cited by this paper
Improving neural networks by preventing co-adaptation of feature detectors
2012, cited by this paper
On the difficulty of training recurrent neural networks
2012, cited by this paper
High-dimensional sequence transduction
2012, cited by this paper
Multi-column deep neural network for traffic sign classification
2012, cited by this paper
Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription
2012, influential reference
Disentangling Factors of Variation for Facial Expression Recognition
2012, cited by this paper
Deep Neural Networks for Acoustic Modeling in Speech Recognition
2012, influential reference
ADADELTA: An Adaptive Learning Rate Method
2012, cited by this paper
Random Search for Hyper-Parameter Optimization
2012, cited by this paper
Theano: new features and speed improvements
2012, cited by this paper
ImageNet classification with deep convolutional neural networks
2012, cited by this paper
Deep Sparse Rectifier Neural Networks
2011, cited by this paper
The Manifold Tangent Classifier
2011, cited by this paper
Suitability of V1 Energy Models for Object Classification
2011, cited by this paper
Theano: A CPU and GPU Math Compiler in Python
2010, cited by this paper
Neural Networks and Learning Machines
2010, cited by this paper
Rectified Linear Units Improve Restricted Boltzmann Machines
2010, cited by this paper
A Theoretical Analysis of Feature Pooling in Visual Recognition
2010, cited by this paper
Linear spatial pyramid matching using sparse coding for image classification
2009, cited by this paper
What is the best multi-stage architecture for object recognition?
2009, cited by this paper
Application of distributed SVM architectures in classifying forest data cover types
2008, cited by this paper
Complex cell pooling and the statistics of natural images
2007, cited by this paper
Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition
2007, cited by this paper
Influence of cultivation temperature on the ligninolytic activity of selected fungal strains
2006, cited by this paper
Hierarchical models of object recognition in cortex
1999, cited by this paper
Gradient-based learning applied to document recognition
1998, influential reference
Multilayer feedforward networks are universal approximators
1989, cited by this paper
Learning representations by back-propagation errors, nature
1986, cited by this paper
Learning representations of back-propagation errors
1986, cited by this paper
Learning representations by back-propagating errors
1986, cited by this paper
Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position
1980, cited by this paper
Receptive fields and functional architecture of monkey striate cortex
1968, cited by this paper
PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS
1963, cited by this paper
Principles of Neurodynamics. Perceptrons and the Theory of Brain Mechanisms.
1962, cited by this paper

CITED BY

Toward real-time emotion recognition in fog computing-based systems: leveraging interpretable PCA_CNN, YOLO with self-attention mechanism
2026, cites this paper
Adaptive Spatial Goodness Encoding: Advancing and Scaling Forward-Forward Learning Without Backpropagation
2025, cites this paper
Joint Intensity and Spatio-Temporal Representation Learning for Extreme Precipitation Nowcasting
2025, cites this paper
Improvement of Rainfall Estimation Accuracy Using a Convolutional Neural Network with Convolutional Block Attention Model on Surveillance Camera
2025, cites this paper
Self-supervision enhances instance-based multiple instance learning methods in digital pathology: a benchmark study
2025, cites this paper
SUPERB-EP: Evaluating Encoder Pooling Techniques in Self-Supervised Learning Models for Speech Classification
2025, cites this paper
Enhancing Deep Learning Model Using Whale Optimization Algorithm on Brain Tumor MRI
2025, cites this paper
scRSSL: Residual semi‐supervised learning with deep generative models to automatically identify cell types
2025, cites this paper
MultiScale-enhanced detection network (MS-EDN) with dual encoder structure for infrared small target detection
2025, cites this paper
LGCOAMix: Local and Global Context-and-Object-Part-Aware Superpixel-Based Data Augmentation for Deep Visual Recognition
2025, cites this paper
Rethinking the temporal downsampling paradigm for continuous sign language recognition
2025, cites this paper
Is Complete Labeling Necessary? Understanding Active Learning in Longitudinal Medical Imaging
2025, cites this paper
Attention-modulated frequency-aware pooling via spatial guidance
2025, cites this paper
CO2 emission prediction from coal used in power plants: a machine learning-based approach
2024, cites this paper
Efficient LWPooling: Rethinking the Wavelet Pooling for Scene Parsing
2024, cites this paper
AI-Driven DDoS Mitigation at the Edge: Leveraging Machine Learning for Real-Time Threat Detection and Response
2024, cites this paper
PMSFF: Improved Protein Binding Residues Prediction through Multi-Scale Sequence-Based Feature Fusion Strategy
2024, cites this paper
A Short-Long Term Sequence Learning Network for Precipitation Nowcasting
2024, cites this paper
Hybrid deep learning models for multi-ahead river water level forecasting
2024, cites this paper
VIX constant maturity futures trading strategy: A walk-forward machine learning study
2024, cites this paper
Universal Approximation Abilities of a Modular Differentiable Neural Network
2024, influential citation
Exploring Multiple Instance Learning (MIL): A brief survey
2024, cites this paper
Malicious Internet Entity Detection Using Local Graph Inference
2024, cites this paper
An Experimental Study of nmODE in Recognizing Endoscopic Submucosal Dissection Workflow
2023, cites this paper
GCN- and GRU-Based Intelligent Model for Temperature Prediction of Local Heating Surfaces
2023, cites this paper
A General Framework for Robust G-Invariance in G-Equivariant Networks
2023, cites this paper
Enhancing Brain Tumor Diagnosis: Transitioning From Convolutional Neural Network to Involutional Neural Network
2023, cites this paper
Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities
2023, influential citation
Global Features are All You Need for Image Retrieval and Reranking
2023, cites this paper
A novel dual-pooling attention module for UAV vehicle re-identification
2023, cites this paper
Generalised f-Mean Aggregation for Graph Neural Networks
2023, cites this paper
Sequential and Graphical Cross-Domain Recommendations with a Multi-View Hierarchical Transfer Gate
2023, influential citation
Content-Adaptive Downsampling in Convolutional Neural Networks
2023, cites this paper
Teasing out missing reactions in genome-scale metabolic networks through hypergraph learning
2023, cites this paper
FlexPooling with Simple Auxiliary Classifiers in Deep Networks
2023, cites this paper
Predicting cortical oscillations with bidirectional LSTM network: a simulation study
2023, cites this paper
A novel design of learnable pooling algorithm
2023, cites this paper
Deep Learning of Liver Contrast-Enhanced Ultrasound to Predict Microvascular Invasion and Prognosis in Hepatocellular Carcinoma
2022, cites this paper
Using deep learning approaches for coloring silicone maxillofacial prostheses: A comparison of two approaches
2022, cites this paper
Object detection based on cortex hierarchical activation in border sensitive mechanism and classification-GIou joint representation
2022, cites this paper
Regularized Optimal Transport Layers for Generalized Global Pooling Operations
2022, cites this paper
YZR-net : Self-supervised Hidden representations Invariant to Transformations for profanity detection
2022, cites this paper
Towards Unsupervised Subject-Independent Speech-Based Relapse Detection in Patients with Psychosis using Variational Autoencoders
2022, cites this paper
E-Prevention: Advanced Support System for Monitoring and Relapse Prevention in Patients with Psychotic Disorders Analyzing Long-Term Multimodal Data from Wearables and Video Captures
2022, cites this paper
Revisiting Global Pooling through the Lens of Optimal Transport
2022, cites this paper
Self-Attentive Pooling for Efficient Deep Learning
2022, cites this paper
Frequency-Dividing Downsampling Module of the Lifting Scheme for Image Classification
2022, cites this paper
Hyper-flexible Convolutional Neural Networks based on Generalized Lehmer and Power Means
2022, cites this paper
Temporal Lift Pooling for Continuous Sign Language Recognition
2022, cites this paper
A Survey on Hyperlink Prediction
2022, cites this paper
Teasing out Missing Reactions in Genome-scale Metabolic Networks through Deep Learning
2022, cites this paper
Investigating the Value of Subtitles for Improved Movie Recommendations
2022, cites this paper
Consensus Function from an $L_p^q-$norm Regularization Term for its Use as Adaptive Activation Functions in Neural Networks
2022, cites this paper
Joint weakly and fully supervised learning for surface defect segmentation from images
2022, cites this paper
A review of convolutional neural network architectures and their optimizations
2022, cites this paper
Hierarchical Spherical CNNs With Lifting-Based Adaptive Wavelets for Pooling and Unpooling
2022, cites this paper
QGNN: Value Function Factorisation with Graph Neural Networks
2022, cites this paper
An application of Pixel Interval Down-sampling (PID) for dense tiny microorganism counting on environmental microorganism images
2022, cites this paper
Generalizing Aggregation Functions in GNNs: High-Capacity GNNs via Nonlinear Neighborhood Aggregators
2022, cites this paper
Learning strides in convolutional neural networks
2022, cites this paper
Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study
2022, influential citation
FA-LSTM: A Novel Toxic Gas Concentration Prediction Model in Pollutant Environment
2022, cites this paper
Zero-shot reductive paraphrasing for digitally semi-literate
2021, cites this paper
Comparison of Methods Generalizing Max- and Average-Pooling
2021, cites this paper
Review of Image Classification Algorithms Based on Convolutional Neural Networks
2021, cites this paper
Two-Stream Deep Fusion Network Based on VAE and CNN for Synthetic Aperture Radar Target Recognition
2021, cites this paper
AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling
2021, cites this paper
Aircraft Type Recognition in Remote Sensing Images: Bilinear Discriminative Extreme Learning Machine Framework
2021, cites this paper
Time Series Forecasting in a CVD Reactor for Polysilicon Production
2021, cites this paper
Refining activation downsampling with SoftPool
2021, cites this paper
Chebyshev Pooling: An Alternative Layer for the Pooling of CNNs-Based Classifier
2021, cites this paper
Processing Method of Missing Data in Dam Safety Monitoring
2021, cites this paper
Pet Companion System Based on Image Recognition
2021, cites this paper
Learning to Pool in Graph Neural Networks for Extrapolation
2021, influential citation
Deep insight: Convolutional neural network and its applications for COVID-19 prognosis
2021, cites this paper
Z-pooling
2021, cites this paper
Ordinal Pooling
2021, cites this paper
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks
2021, cites this paper
Adaptive wavelet pooling for convolutional neural networks
2021, cites this paper
A New Pooling Approach Based on Zeckendorf’s Theorem for Texture Transfer Information
2021, cites this paper
On the Memory Mechanism of Tensor-Power Recurrent Models
2021, cites this paper
Modeling novel ionenes for electrochemical devices with first principles and machine learning methods
2020, cites this paper
Objects and scenes classification with selective use of central and peripheral image content
2020, cites this paper
Multi Layer Neural Networks as Replacement for Pooling Operations
2020, cites this paper
Attentive Pooling with Learnable Norms for Text Representation
2020, cites this paper
Defect Depth Estimation in Infrared Thermography with Deep Learning
2020, cites this paper
Artificial Intelligence and Security: 6th International Conference, ICAIS 2020, Hohhot, China, July 17–20, 2020, Proceedings, Part I
2020, cites this paper
Compressive Sampling Based Multi-Spectrum Deep Learning for Sub-Nyquist Pacemaker ECG Analysis
2020, cites this paper
Method of Multi-feature Fusion Based on Attention Mechanism in Malicious Software Detection
2020, cites this paper
CNN-based fusion and classification of SAR and Optical data
2020, cites this paper
A Method of Defect Depth Estimation for Simulated Infrared Thermography Data with Deep Learning
2020, cites this paper
Violence recognition using convolutional neural network: A survey
2020, cites this paper
BERT4NILM: A Bidirectional Transformer Model for Non-Intrusive Load Monitoring
2020, cites this paper
Individualized Gait Generation for Rehabilitation Robots Based on Recurrent Neural Networks
2020, influential citation
Algerian Dialect Translation Applied on COVID-19 Social Media Comments
2020, cites this paper
High-order Learning Model via Fractional Tensor Network Decomposition
2020, cites this paper
Protein Secondary Structure Prediction using Recurrent Neural Networks
2020, cites this paper
Deep Convolutional Networks in System Identification
2019, cites this paper
α-Integration Pooling for Convolutional Neural Networks
2019, influential citation
Weakly Supervised Segmentation of Cracks on Solar Cells Using Normalized Lp Norm
2019, cites this paper