Evaluation of output embeddings for fine-grained image classification

Zeynep Akata, Scott E. Reed, D. Walter, Honglak Lee, B. Schiele

Published 2014 in Computer Vision and Pattern Recognition

ABSTRACT

Image classification has advanced significantly in recent years with the availability of large-scale image sets. However, fine-grained classification remains a major challenge due to the annotation cost of large numbers of fine-grained categories. This project shows that compelling classification performance can be achieved on such categories even without labeled training data. Given image and class embeddings, we learn a compatibility function such that matching embeddings are assigned a higher score than mismatching ones; zero-shot classification of an image proceeds by finding the label yielding the highest joint compatibility score. We use state-of-the-art image features and focus on different supervised attributes and unsupervised output embeddings, either derived from hierarchies or learned from unlabeled text corpora. We establish a substantially improved state of the art on the Animals with Attributes and Caltech-UCSD Birds datasets. Most encouragingly, we demonstrate that purely unsupervised output embeddings (learned from Wikipedia and improved with fine-grained text) achieve compelling results, even outperforming the previous supervised state of the art. By combining different output embeddings, we further improve results.
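
As a rough illustration of the scoring rule described in the abstract, the sketch below scores an image embedding against each candidate class embedding and predicts the label with the highest score. The bilinear form of the compatibility function, the matrix `W`, the function names, and all numeric values are assumptions made for this example; the abstract only states that a compatibility function is learned, and the learning step itself is not shown here.

```python
from typing import Dict, Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def compatibility(image_emb: Vector, W: Matrix, class_emb: Vector) -> float:
    """Score an (image, class) pair with an assumed bilinear form x^T W y."""
    return sum(
        image_emb[i] * W[i][j] * class_emb[j]
        for i in range(len(image_emb))
        for j in range(len(class_emb))
    )


def zero_shot_predict(image_emb: Vector, W: Matrix,
                      class_embs: Dict[str, Vector]) -> str:
    """Return the label whose class embedding gets the highest score."""
    return max(
        class_embs,
        key=lambda label: compatibility(image_emb, W, class_embs[label]),
    )


# Toy values, invented purely for illustration (not from the paper).
W = [[1.0, 0.0], [0.0, 1.0], [0.5, -0.5]]  # maps 3-d image space to 2-d class space
unseen_classes = {"zebra": [1.0, 0.0], "whale": [0.0, 1.0]}
image = [0.9, 0.1, 0.4]

prediction = zero_shot_predict(image, W, unseen_classes)
print(prediction)
```

In this toy setup the class embeddings stand in for attribute vectors or text-derived word vectors; because prediction needs only an embedding per label, classes with no training images can still be scored.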

PUBLICATION RECORD

Publication year: 2014
Venue: Computer Vision and Pattern Recognition
Publication date: 2014-09-29
Fields of study: Mathematics, Computer Science
Identifiers: DOI 10.1109/CVPR.2015.7298911; arXiv 1409.8403
Source metadata: Semantic Scholar
