Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon,Yang Gao,Trevor Darrell,Joachim Denzler,E. Rodner

Published 2017 in IEEE International Conference on Computer Vision

ABSTRACT

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions made by these approaches. We identify parts of training images having the highest influence on the prediction of a given test image. This allows for justifying decisions to users and also for analyzing the influence of semantic parts. For example, we can show that the higher capacity VGG16 model focuses much more on the bird's head than, e.g., the lower-capacity VGG-M model when recognizing fine-grained bird categories. Both contributions allow us to analyze the difference when moving between average and bilinear pooling. In addition, experiments show that our generalized approach can outperform both across a variety of standard datasets.

PUBLICATION RECORD

Publication year
2017
Venue
IEEE International Conference on Computer Vision
Publication date
2017-05-01
Fields of study
Computer Science
Identifiers
DOI 10.1109/ICCV.2017.531 arXiv 1705.00487
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Chimpanzee Faces in the Wild: Log-Euclidean CNNs for Predicting Identities and Attributes of Primates
2016cited by this paper
Weakly Supervised Fine-Grained Categorization With Part-Based Image Representation
2016cited by this paper
Interferences in Match Kernels
2016cited by this paper
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
2016cited by this paper
Picking Deep Filter Responses for Fine-Grained Image Recognition
2016influential reference
A Probabilistic Collaborative Representation Based Approach for Pattern Classification
2016influential reference
Visual Concept Recognition and Localization via Iterative Introspection
2016influential reference
Deep Residual Learning for Image Recognition
2015cited by this paper
Bilinear CNN Models for Fine-Grained Visual Recognition
2015cited by this paper
Compact Bilinear Pooling
2015influential reference
Visualizing and Understanding Deep Texture Representations
2015cited by this paper
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2015cited by this paper
Learning Deep Features for Discriminative Localization
2015cited by this paper
Fine-grained recognition without part annotations
2015influential reference
On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
2015cited by this paper
Understanding Neural Networks Through Deep Visualization
2015cited by this paper
Neural Activation Constellations: Unsupervised Part Model Discovery with Convolutional Networks
2015cited by this paper
Return of the Devil in the Details: Delving Deep into Convolutional Nets
2014influential reference
Part Detector Discovery in Deep Convolutional Neural Networks
2014cited by this paper
Bird Species Categorization Using Pose Normalized Deep Convolutional Nets
2014cited by this paper
Multi-scale Orderless Pooling of Deep Convolutional Activation Features
2014cited by this paper
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
2014cited by this paper
Revisiting the Fisher vector for fine-grained classification
2014cited by this paper
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014cited by this paper
Visualizing and Understanding Convolutional Networks
2013cited by this paper
Style Finder: Fine-Grained Clothing Style Detection and Retrieval
2013cited by this paper
Fine-Grained Visual Classification of Aircraft
2013cited by this paper
Stochastic Pooling for Regularization of Deep Convolutional Neural Networks
2013cited by this paper
Symbiotic Segmentation and Part Localization for Fine-Grained Categorization
2013influential reference
Semantic Segmentation with Second-Order Pooling
2012cited by this paper
The Caltech-UCSD Birds-200-2011 Dataset
2011influential reference
Human action recognition by learning bases of action attributes and parts
2011cited by this paper
Recognizing indoor scenes
2009cited by this paper
Efficient Match Kernel between Sets of Features for Visual Recognition
2009cited by this paper
Separating Style and Content with Bilinear Models
2000cited by this paper

CITED BY

Elusive Images: Beyond Coarse Analysis for Fine-Grained Recognition
2024cites this paper
Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study
2022cites this paper
Spatial Consistency and Feature Diversity Regularization in Transfer Learning for Fine-Grained Visual Categorization
2022cites this paper
SR-GNN: Spatial Relation-Aware Graph Neural Network for Fine-Grained Image Categorization
2022cites this paper
Beyond Global Average Pooling: Alternative Feature Aggregations for Weakly Supervised Localization
2022cites this paper
Zernike Pooling: Generalizing Average Pooling Using Zernike Moments
2021cites this paper
Exploiting Web Images for Moth Species Classification
2021cites this paper
Fine-Grained Adversarial Semi-Supervised Learning
2021cites this paper
Enhancing Fine-Grained Classification for Low Resolution Images
2021cites this paper
Lightweight Filtering of Noisy Web Data: Augmenting Fine-grained Datasets with Selected Internet Images
2021cites this paper
Re-rank Coarse Classification with Local Region Enhanced Features for Fine-Grained Image Recognition
2021cites this paper
Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition
2020influential citation
End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition
2020cites this paper
Facing the Hard Problems in FGVC
2020cites this paper
The Whole Is More Than Its Parts? From Explicit to Implicit Pose Normalization
2020cites this paper
Multi Layer Neural Networks as Replacement for Pooling Operations
2020cites this paper
Multi-Objective Matrix Normalization for Fine-Grained Visual Recognition
2020influential citation
α-Integration Pooling for Convolutional Neural Networks
2019cites this paper
Spatiotemporal Fusion Networks for Video Action Recognition
2019cites this paper
Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
2019cites this paper
Cross-Category Cross-Semantic Regularization for Fine-Grained Image Recognition
2019cites this paper
CIS-Net: A Novel CNN Model for Spatial Image Steganalysis via Cover Image Suppression
2019cites this paper
Adaptive Bilinear Pooling for Fine-grained Representation Learning
2019cites this paper
On Global Feature Pooling for Fine-grained Visual Categorization
2019influential citation
Question Type Guided Attention in Visual Question Answering
2018cites this paper
DFT-based Transformation Invariant Pooling Layer for Visual Classification
2018cites this paper
Impostor Networks for Fast Fine-Grained Recognition
2018influential citation
Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition
2018cites this paper
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
2018cites this paper
Deep Attentional Structured Representation Learning for Visual Recognition
2018cites this paper
Alpha-Integration Pooling for Convolutional Neural Networks
2018cites this paper
Local Temporal Bilinear Pooling for Fine-Grained Action Parsing
2018cites this paper
Fine-Grained Image Classification With Gaussian Mixture Layer
2018cites this paper
Maximum-Entropy Fine-Grained Classification
2018influential citation
Learning discriminative visual elements using part-based convolutional neural network
2018cites this paper
Surface defect saliency of magnetic tile
2018cites this paper
Improving Fine-Grained Visual Classification using Pairwise Confusion
2017influential citation
Towards Automated Visual Monitoring of Individual Gorillas in the Wild
2017cites this paper
Training with Confusion for Fine-Grained Visual Classification
2017cites this paper
Compact Tensor Pooling for Visual Question Answering
2017cites this paper
Pairwise Confusion for Fine-Grained Visual Classification
2017cites this paper
Deep bilinear features for Her2 scoring in digital pathology
2017cites this paper