SIFT Meets CNN: A Decade Survey of Instance Retrieval

Published 2016 in IEEE Transactions on Pattern Analysis and Machine Intelligence

ABSTRACT

In the early days, content-based image retrieval (CBIR) was studied with global features. Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively studied for over a decade due to the advantage of SIFT in dealing with image transformations. Recently, image representations based on the convolutional neural network (CNN) have attracted increasing interest in the community and demonstrated impressive performance. Given this time of rapid evolution, this article provides a comprehensive survey of instance retrieval over the last decade. Two broad categories, SIFT-based and CNN-based methods, are presented. For the former, according to the codebook size, we organize the literature into using large/medium-sized/small codebooks. For the latter, we discuss three lines of methods, i.e., using pre-trained or fine-tuned CNN models, and hybrid methods. The first two perform a single-pass of an image to the network, while the last category employs a patch-based feature extraction scheme. This survey presents milestones in modern instance retrieval, reviews a broad selection of previous works in different categories, and provides insights on the connection between SIFT and CNN-based methods. After analyzing and comparing retrieval performance of different categories on several datasets, we discuss promising directions towards generic and specialized instance retrieval.

PUBLICATION RECORD

Publication year
2016
Venue
IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication date
2016-08-05
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1109/TPAMI.2017.2709749 arXiv 1608.01807 PMID 29610107
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

FINDING NEW IDEA FOR BSIFT: TOWARD DATA-INDEPENDENT CODEBOOK FOR LARGE SCALE IMAGE SEARCH
2020cited by this paper
24/7 Place Recognition by View Synthesis
2018cited by this paper
Retrieving Objects by Partitioning
2017cited by this paper
Improving Large-Scale Image Retrieval Through Robust Aggregation of Local Descriptors
2017cited by this paper
Higher-Order Occurrence Pooling for Bags-of-Words: Visual Concept Detection
2017cited by this paper
TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking
2016cited by this paper
Scalable Feature Matching by Dual Cascaded Scalar Quantization for Image Retrieval
2016cited by this paper
CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples
2016cited by this paper
LIFT: Learned Invariant Feature Transform
2016cited by this paper
Efficient Large-Scale Similarity Search Using Matrix Factorization
2016cited by this paper
Good Practice in CNN Feature Transfer
2016influential reference
A Survey on Learning to Hash
2016cited by this paper
Accurate Image Search with Multi-Scale Contextual Evidences
2016cited by this paper
Polysemous Codes
2016cited by this paper
InterActive: Inter-Layer Activeness Propagation
2016cited by this paper
Exploiting Hierarchical Activations of Neural Network for Image Retrieval
2016cited by this paper
Faster R-CNN Features for Instance Search
2016influential reference
Bags of Local Convolutional Features for Scalable Instance Search
2016cited by this paper
A Vote-and-Verify Strategy for Fast Spatial Verification in Image Retrieval
2016cited by this paper
Deep Image Retrieval: Learning Global Representations for Image Search
2016influential reference
Large-scale vehicle re-identification in urban surveillance videos
2016cited by this paper
End-to-End Learning of Deep Visual Representations for Image Retrieval
2016cited by this paper
Group Invariant Deep Representations for Image Instance Retrieval
2016cited by this paper
Fine-residual VLAD for image retrieval
2016cited by this paper
Efficient Diffusion on Region Manifolds: Recovering Small Objects with Compact CNN Representations
2016cited by this paper
Approximate Fisher Kernels of Non-iid Image Models for Image Categorization
2015cited by this paper
Deep semantic ranking based hashing for multi-label image retrieval
2015cited by this paper
Fisher Encoded Convolutional Bag-of-Windows for Efficient Image Retrieval and Social Image Tagging
2015cited by this paper
Fisher vectors meet Neural Networks: A hybrid classification architecture
2015cited by this paper
Neural Codes for Image Retrieval
2015influential reference
Early burst detection for memory-efficient image retrieval
2015cited by this paper
A practical guide to CNNs and Fisher Vectors for image instance retrieval
2015cited by this paper
Particular object retrieval with integral max-pooling of CNN activations
2015influential reference
Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors
2015cited by this paper
Query-Adaptive Logo Search using Shape-Aware Descriptors
2015cited by this paper
Pairwise geometric matching for large-scale object retrieval
2015cited by this paper
Leveraging coupled multi-index for scalable retrieval of mammographic masses
2015cited by this paper
DeepIndex for Accurate and Efficient Image Retrieval
2015cited by this paper
Object level deep feature pooling for compact image representation
2015influential reference
Learning to compare image patches via convolutional neural networks
2015cited by this paper
A survey of recent advances in visual feature detection
2015cited by this paper
Learning visual similarity for product design with convolutional neural networks
2015cited by this paper
Deep Residual Learning for Image Recognition
2015influential reference
Aggregating Local Deep Features for Image Retrieval
2015influential reference
Aggregating Local Deep Features for Image Retrieval
2015cited by this paper
Local Convolutional Features with Unsupervised Training for Image Retrieval
2015cited by this paper
Cross-Dimensional Weighting for Aggregated Deep Convolutional Features
2015influential reference
Hybrid multi-layer deep CNN/aggregator feature for image classification
2015cited by this paper
Image Classification and Retrieval are ONE
2015influential reference
Adaptive Dither Voting for Robust Spatial Verification
2015influential reference
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2015cited by this paper
UvA-DARE ( Digital Academic Repository ) Attributes and Categories for Generic Instance Search from One Example
2015cited by this paper
Multi-scale pyramid pooling for deep convolutional representation
2015cited by this paper
Cross Indexing With Grouplets
2015cited by this paper
Visual Place Recognition with Repetitive Structures
2015cited by this paper
Scalable Person Re-identification: A Benchmark
2015cited by this paper
Deep Learning
2015cited by this paper
Exploiting local features from deep networks for image retrieval
2015cited by this paper
NetVLAD: CNN Architecture for Weakly Supervised Place Recognition
2015cited by this paper
Coloring image search with coupled multi-index
2015cited by this paper
Deep learning of binary hash codes for fast image retrieval
2015cited by this paper
Triangulation Embedding and Democratic Aggregation for Image Search
2014influential reference
Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT
2014cited by this paper
Multimedia search reranking: A literature survey
2014cited by this paper
Revisiting the Fisher vector for fine-grained classification
2014cited by this paper
Hypercolumns for object segmentation and fine-grained localization
2014cited by this paper
Coupled Binary Embedding for Large-Scale Image Retrieval
2014influential reference
Cross-Indexing of Binary SIFT Codes for Large-Scale Image Search
2014influential reference
Packing and Padding: Coupled Multi-index for Accurate Image Retrieval
2014influential reference
Learning Deep Features for Scene Recognition using Places Database
2014influential reference
Factors of Transferability for a Generic ConvNet Representation
2014influential reference
Going deeper with convolutions
2014influential reference
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
2014cited by this paper
Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search
2014cited by this paper
Fully convolutional networks for semantic segmentation
2014cited by this paper
A Comprehensive Study Over VLAD and Product Quantization in Large-Scale Image Retrieval
2014cited by this paper
A Baseline for Visual Instance Retrieval with Deep Convolutional Networks
2014cited by this paper
Scalable Nearest Neighbor Algorithms for High Dimensional Data
2014cited by this paper
Generalized Max Pooling
2014influential reference
Edge Boxes: Locating Object Proposals from Edges
2014cited by this paper
Orientation Covariant Aggregation of Local Descriptors with Embeddings
2014cited by this paper
Visual Instance Retrieval with Deep Convolutional Networks
2014influential reference
DENSE sampling of features for image retrieval
2014cited by this paper
Learning Local Feature Descriptors Using Convex Optimisation
2014cited by this paper
Return of the Devil in the Details: Delving Deep into Convolutional Nets
2014cited by this paper
Visual query expansion with or without geometry: Refining local descriptors by feature aggregation
2014cited by this paper
BM25 With Exponential IDF for Instance Search
2014cited by this paper
Learning Fine-Grained Image Similarity with Deep Ranking
2014cited by this paper
Multi-scale Orderless Pooling of Deep Convolutional Activation Features
2014influential reference
Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval
2014cited by this paper
Feature Coding in Image Classification: A Comprehensive Study
2014cited by this paper
A Group Testing Framework for Similarity Search in High-dimensional Spaces
2014cited by this paper
A Comparison of Dense Region Detectors for Image Search and Fine-Grained Classification
2014cited by this paper
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014influential reference
Hough Pyramid Matching: Speeded-Up Geometry Re-ranking for Large Scale Image Retrieval
2014influential reference
To Aggregate or Not to aggregate: Selective Match Kernels for Image Search
2013cited by this paper
Revisiting the VLAD image representation
2013cited by this paper
Bundle min-hashing for logo recognition
2013influential reference
Query Adaptive Similarity for Large Scale Object Retrieval
2013cited by this paper
Joint Inverted Indexing
2013cited by this paper

CITED BY

DIGM-SWE: Robust Image Steganography Without Embedding for IoT Security
2026cites this paper
CUSIR: Constraint based Unsupervised and Scalable Image retrieval
2026cites this paper
GAFF: Global Attention Feature Flow Network for Optical and SAR Image Registration Under Geometric Transformations
2026cites this paper
A comprehensive survey of content based image retrieval schemes: advancements, challenges, and future directions
2026cites this paper
Multi-model fusion and re-ranking for spatial image retrieval
2026cites this paper
Evaluating fashion compatibility based on mutual reference momentum contrast with weak-positive samples
2026cites this paper
NCNet: Learning to Find Non-Consistent Correspondence Using Learnable Frequency Response Function
2025cites this paper
SignMotionFuse: A Dual-Path Slowfast Network for Continuous Sign Language Recognition
2025cites this paper
LLM4HAR: Generalizable On-device Human Activity Recognition with Pretrained LLMs
2025cites this paper
A Crane Wire Rope Lifting Ratio Detection Method Based on SegFormer
2025cites this paper
3D2 SIFT: Enhancing 3D SIFT deformability by a two-step calculation strategy
2025cites this paper
Dual-consistency framework for cross-domain zero-shot image retrieval via prompt reconstruction and semantic mining
2025cites this paper
Fast and Scalable Mixed Precision Euclidean Distance Calculations Using GPU Tensor Cores
2025cites this paper
HCSSIR: Hash Code Selection-Based Secure Image Retrieval in a Blockchain Environment
2025cites this paper
Analysis of Temperature Rise Image Recognition and Positioning Methods for Power Equipment Under the Internet of Things Environment
2025cites this paper
CIMF-Net: A Change Indicator-Enhanced Multiscale Fusion Network for Remote Sensing Change Detection
2025cites this paper
Robot Localization Using a Learned Keypoint Detector and Descriptor with a Floor Camera and a Feature Rich Industrial Floor
2025cites this paper
SplatCo: Structure-View Collaborative Gaussian Splatting for Detail-Preserving Rendering of Large-Scale Unbounded Scenes
2025cites this paper
License Plate Recognition Under the Dual Challenges of Sand and Light: Dataset Construction and Model Optimization
2025cites this paper
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
2025cites this paper
A segmentation-free method for image retrieval and pattern spotting in historical documents using convolutional features
2025cites this paper
Real-Time Lightweight Vehicle Object Detection via Layer-Adaptive Model Pruning
2025cites this paper
Recurrence Meets Transformers for Universal Multimodal Retrieval
2025cites this paper
RRPN-AM: a ship hull number detection model based on rotating region proposal networks and attention mechanisms
2025cites this paper
Exploiting Hu invariant moments and deep features for image retrieval
2025cites this paper
Fusion of Global and Local Features with Multi-Inverted Indices for Image Retrieval
2025cites this paper
Facial expression recognition based on YOLOv8 deep learning in complex scenes
2025cites this paper
Dare to Plagiarize? Plagiarized Painting Recognition and Retrieval
2025cites this paper
Few-shot multi-scale railway obstacle detection via lightweight linear transformer and precise feature reweighting
2025cites this paper
Close proximity aerial image for precision viticulture. A review
2025cites this paper
Emphasizes-Response Subdues-Disturbing for PEMFC Fault Diagnosis
2025cites this paper
A comprehensive review of object detection with traditional and deep learning methods
2025cites this paper
Self-Supervised Incremental Learning of Object Representations from Arbitrary Image Sets
2025cites this paper
Semantic Consistency And Integrity Network For Cloth-changing Person Re-identification
2025cites this paper
Exploiting Deep Contrast Feature for Image Retrieval
2025cites this paper
AirRoom: Objects Matter in Room Reidentification
2025cites this paper
ILIAS: Instance-Level Image retrieval At Scale
2025cites this paper
Breaking the Frame: Image Retrieval by Visual Overlap Prediction
2024cites this paper
Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification
2024cites this paper
Object-attribute-relation model-based semantic coding for image transmission
2024cites this paper
A Self-Distillation Contrastive Learning Architecture for Global and Local Underwater Terrain Feature Extraction and Matching
2024cites this paper
CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching
2024cites this paper
Length and salient losses co-supported content-based commodity retrieval neural network
2024cites this paper
基于光学显微视觉的精密定位测量综述（特邀）
2024cites this paper
DGMA2-Net: A Difference-Guided Multiscale Aggregation Attention Network for Remote Sensing Change Detection
2024cites this paper
Linking unknown characters via oracle bone inscriptions retrieval
2024influential citation
CLIP-DS: Incorporating DataStd and Similarity Networks into CLIP for Scene Recognition Task
2024cites this paper
Improving Long Term Accuracy of Visual Localization in Urban Environment
2024cites this paper
Local feature matching from detector-based to detector-free: a survey
2024cites this paper
A Self-Aware Digital Memory Framework Powered by Artificial Intelligence
2024cites this paper
Identifying Representative Images for Events Description Using Machine Learning
2024cites this paper
Locating Target Regions for Image Retrieval in an Unsupervised Manner
2024cites this paper
The impact of introducing textual semantics on item instance retrieval with highly similar appearance: An empirical study
2024cites this paper
Retrieving images with missing regions by fusion of content and semantic features
2024cites this paper
Matchable image retrieval for large-scale UAV images: an evaluation of SfM-based reconstruction
2024cites this paper
Manifold information through neighbor embedding projection for image retrieval
2024cites this paper
Filtering with relational similarity
2024cites this paper
Querying For Actions Over Videos
2024cites this paper
MLMQ-IR: Multi-label multi-query image retrieval based on the variance of Hamming distance
2024cites this paper
Scene Image Retrieval Based on Salient Local Feature Aggregation and Geographic Information
2024cites this paper
AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
2024cites this paper
Performance evaluation of attention-deep hashing based medical image retrieval in brain MRI datasets
2024cites this paper
Raw Spectral Filter Array Imaging for Scene Recognition
2024cites this paper
Fully Unsupervised Domain-Agnostic Image Retrieval
2024cites this paper
A Low-illumination Image Enhancement Algorithm Based on Retinex to Improve SIFT Matching in Similar Scenes
2024cites this paper
Semantic image representation for image recognition and retrieval using multilayer variational auto-encoder, InceptionNet and low-level image features
2024cites this paper
Exploiting deep cross-semantic features for image retrieval
2024cites this paper
Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers
2024cites this paper
Split-Check: Boosting Product Recognition via Instance-Level Retrieval
2024cites this paper
Texture Recognition Using InceptionNeXt-based Texture Perception Network
2024cites this paper
Descriptor Feature Map for Classification
2024cites this paper
Breaking the Frame: Visual Place Recognition by Overlap Prediction
2024cites this paper
Deep hashing and attention mechanism-based image retrieval of osteosarcoma scans for diagnosis of bone cancer
2024cites this paper
Precise occlusion-aware and feature-level reconstruction for occluded person re-identification
2024cites this paper
Towards Explainable Visual Vessel Recognition Using Fine-Grained Classification and Image Retrieval
2024cites this paper
MARs: Multi-view Attention Regularizations for Patch-Based Feature Recognition of Space Terrain
2024cites this paper
3D reconstruction of spherical images: a review of techniques, applications, and prospects
2024cites this paper
An Intelligent Memory Framework for Resource Constrained IoT Systems
2024cites this paper
Unifying Building Instance Extraction and Recognition in UAV Images
2024cites this paper
On image transformation for partial discharge source identification in vehicle cable terminals of high‐speed trains
2024cites this paper
Historical Postcards Retrieval through Vision Foundation Models
2024cites this paper
Investigating the diversity and stylization of contemporary user generated visual arts in the complexity entropy plane
2024cites this paper
Implementation of Scale-Invariant Feature Transform Convolutional Neural Network for Detecting Distracted Driver
2024cites this paper
Secure and Traceable Multikey Image Retrieval in Cloud-Assisted Internet of Things
2024cites this paper
A deep learning based interval type-2 fuzzy approach for image retrieval systems
2024cites this paper
A review of cross-modal retrieval for image-text
2024cites this paper
Segmented Hash-Based Privacy-Preserving Image Retrieval Scheme in Cloud-Assisted IoT
2024cites this paper
Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network
2024cites this paper
Image retrieval using compact deep semantic correlation descriptors
2024cites this paper
Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization
2024cites this paper
MoReSo: A DNN Framework Expediting Content-Based Video Image Retrieval (CBVIR)
2024cites this paper
Indicative Vision Transformer for end-to-end zero-shot sketch-based image retrieval
2024cites this paper
OCR and AI Augmented CRM Systems: A Novel Approach to Customer Data Mining and Analysis for Digital Transformation
2023influential citation
Deep Adaptive Quadruplet Hashing With Probability Sampling for Large-Scale Image Retrieval
2023cites this paper
A dynamic programming approach for accurate content-based retrieval of ordinary and nano-scale medical images
2023cites this paper
Feature Contrastive Learning for No-Reference Segmentation Quality Evaluation
2023cites this paper
Deep convolutional feature aggregation for fine-grained cultivar recognition
2023cites this paper
Cross-Modal Retrieval Based on Transformer and Label Embedding
2023cites this paper
Development of a Visual Search Service Effectiveness Scale for Assessing Image Search Effectiveness: A Behavioral and Technological Perspective
2023cites this paper
Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments
2023cites this paper