Benchmarking Filtered Approximate Nearest Neighbor Search Algorithms on Transformer-based Embedding Vectors

Patrick Iff,Paul Bruegger,Marcin Chrapek,Maciej Besta,Torsten Hoefler

Published 2025 in arXiv.org

ABSTRACT

Advances in embedding models for text, image, audio, and video drive progress across multiple domains, including retrieval-augmented generation, recommendation systems, and others. Many of these applications require an efficient method to retrieve items that are close to a given query in the embedding space while satisfying a filter condition based on the item's attributes, a problem known as filtered approximate nearest neighbor search (FANNS). By performing an in-depth literature analysis on FANNS, we identify a key gap in the research landscape: publicly available datasets with embedding vectors from state-of-the-art transformer-based text embedding models that contain abundant real-world attributes covering a broad spectrum of attribute types and value distributions. To fill this gap, we introduce the arxiv-for-fanns dataset of transformer-based embedding vectors for the abstracts of over 2.7 million arXiv papers, enriched with 11 real-world attributes such as authors and categories. We benchmark eleven different FANNS methods on our new dataset to evaluate their performance across different filter types, numbers of retrieved neighbors, dataset scales, and query selectivities. We distill our findings into eight key observations that guide users in selecting the most suitable FANNS method for their specific use cases.

PUBLICATION RECORD

Publication year
2025
Venue
arXiv.org
Publication date
2025-07-29
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2507.21989 arXiv 2507.21989
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Filtered Approximate Nearest Neighbor Search: A Unified Benchmark and Systematic Experimental Study [Experiment, Analysis & Benchmark]
2025influential reference
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems
2025cited by this paper
M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
2024cited by this paper
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
2024cited by this paper
Research on High-Accuracy Indoor Visual Positioning Technology Using an Optimized SE-ResNeXt Architecture
2024cited by this paper
Jasper and Stella: distillation of SOTA embedding models
2024influential reference
Approximate Nearest Neighbor Search with Window Filters
2024influential reference
UNIFY: Unified Index for Range Filtered Approximate Nearest Neighbors Search
2024cited by this paper
CAPS: A Practical Partition Index for Filtered Similarity Search
2023influential reference
Filtered-DiskANN: Graph Algorithms for Approximate Nearest Neighbor Search with Filters
2023influential reference
Retrieval-Augmented Generation for Large Language Models: A Survey
2023cited by this paper
An Efficient and Robust Framework for Approximate Nearest Neighbor Search with Attribute Constraint
2023influential reference
A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge
2023cited by this paper
Generalized Relative Neighborhood Graph (GRNG) for Similarity Search
2022cited by this paper
Navigable Proximity Graph-Driven Native Hybrid Queries with Structured and Unstructured Constraints
2022cited by this paper
Milvus: A Purpose-Built Vector Data Management System
2021cited by this paper
High Dimensional Similarity Search With Satellite System Graph: Efficiency, Scalability, and Unindexed Query Compatibility
2021cited by this paper
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search
2021cited by this paper
New Trends in High-D Vector Similarity Search: AI-driven, Progressive, and Distributed
2021cited by this paper
A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search
2021cited by this paper
RedCaps: web-curated image-text data created by the people, for the people
2021cited by this paper
Beyond Goldfish Memory: Long-Term Open-Domain Conversation
2021cited by this paper
FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search
2021cited by this paper
Meta-Research
2020cited by this paper
PASE: PostgreSQL Ultra-High-Dimensional Approximate Nearest Neighbor Search Extension
2020cited by this paper
I/O Efficient Approximate Nearest Neighbour Search based on Learned Functions
2020cited by this paper
R2LSH: A Nearest Neighbor Search Scheme Based on Two-dimensional Projected Spaces
2020cited by this paper
K-means tree: an optimal clustering tree for unsupervised learning
2020cited by this paper
Multiattribute approximate nearest neighbor search based on navigable small world graph
2020cited by this paper
EI-LSH: An early-termination driven I/O efficient incremental c-approximate nearest neighbor search
2020cited by this paper
SONG: Approximate Nearest Neighbor Search on GPU
2020cited by this paper
Qd-tree: Learning Data Layouts for Big Data Analytics
2020cited by this paper
mmLSH: A Practical and Efficient Technique for Processing Approximate Nearest Neighbor Queries on Multimedia Data
2020cited by this paper
Satellite System Graph: Towards the Efficiency Up-Boundary of Graph-Based Approximate Nearest Neighbor Search
2019cited by this paper
DiskANN : Fast Accurate Billion-point Nearest Neighbor Search on a Single Node
2019cited by this paper
PyTorch-BigGraph: A Large-scale Graph Embedding System
2019cited by this paper
AnalyticDB: Real-time OLAP Database System at Alibaba Cloud
2019cited by this paper
Reconfigurable Inverted Index
2018cited by this paper
Intelligent Probing for Locality Sensitive Hashing: Multi-Probe LSH and Beyond
2017cited by this paper
Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs
2016influential reference
Pruned Bi-directed K-nearest Neighbor Graph for Proximity Search
2016cited by this paper
FANNG: Fast Approximate Nearest Neighbour Graphs
2016cited by this paper
Adaptive bit allocation product quantization
2016cited by this paper
SoundNet: Learning Sound Representations from Unlabeled Video
2016cited by this paper
Deep Supervised Hashing for Fast Image Retrieval
2016cited by this paper
Optimal Data-Dependent Hashing for Approximate Near Neighbors
2015cited by this paper
PQTable: Fast Exact Asymmetric Distance Neighbor Search for Product Quantization Using Hash Tables
2015cited by this paper
A unified deep neural network for speaker and language recognition
2015cited by this paper
Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search
2015cited by this paper
Rank-Based Similarity Search: Reducing the Dimensional Dependence
2015cited by this paper
Cache locality is not enough: High-Performance Nearest Neighbor Search with Product Quantization Fast Scan
2015cited by this paper
Neighbor-Sensitive Hashing
2015cited by this paper
Practical and Optimal LSH for Angular Distance
2015cited by this paper
Approximate nearest neighbor algorithm based on navigable small world graphs
2014cited by this paper
Going deeper with convolutions
2014cited by this paper
Additive Quantization for Extreme Vector Compression
2014cited by this paper
Scalable Nearest Neighbor Algorithms for High Dimensional Data
2014cited by this paper
Locally Optimized Product Quantization for Approximate Nearest Neighbor Search
2014cited by this paper
Optimized Product Quantization for Approximate Nearest Neighbor Search
2013cited by this paper
Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing
2013cited by this paper
Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality
2012cited by this paper
The Inverted Multi-Index
2012cited by this paper
Efficient retrieval of recommendations in a matrix factorization framework
2012cited by this paper
Query-driven iterated neighborhood graph search for large scale indexing
2012cited by this paper
Locality-sensitive hashing scheme based on dynamic collision counting
2012cited by this paper
Efficient k-nearest neighbor graph construction for generic similarity measures
2011cited by this paper
Fast approximate similarity search based on degree-reduced neighborhood graphs
2011cited by this paper
Product Quantization for Nearest Neighbor Search
2011cited by this paper
Full Text Search Engine as Scalable k-Nearest Neighbor Recommendation System
2010cited by this paper
Five Balltree Construction Algorithms
2009cited by this paper
Optimised KD-trees for fast image descriptor matching
2008cited by this paper
Random projection trees and low dimensional manifolds
2008cited by this paper
Spectral Hashing
2008cited by this paper
Cover trees for nearest neighbor
2006cited by this paper
Locality-sensitive hashing scheme based on p-stable distributions
2004cited by this paper
Video Google: a text retrieval approach to object matching in videos
2003cited by this paper
Similarity Search in High Dimensions via Hashing
1999cited by this paper
M-tree: An Efficient Access Method for Similarity Search in Metric Spaces
1997cited by this paper
Shape indexing using approximate nearest-neighbour search in high-dimensional spaces
1997cited by this paper
Computational geometry: algorithms and applications
1997cited by this paper
A B+-tree structure for large quadtrees
1983cited by this paper
Nearest Neighbour Searches and the Curse of Dimensionality
1979cited by this paper
Multidimensional binary search trees used for associative searching
1975cited by this paper
Organization and maintenance of large ordered indices
1970cited by this paper

CITED BY

Filtered Approximate Nearest Neighbor Search Cost Estimation
2026cites this paper
JAG: Joint Attribute Graphs for Filtered Nearest Neighbor Search
2026cites this paper
Filtered Approximate Nearest Neighbor Search in Vector Databases: System Design and Performance Analysis
2026cites this paper
Elastic Index Selection for Label-Hybrid AKNN Search
2025cites this paper