DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection

Johan Edstedt,Georg Bökman,Mårten Wadenbäck,Michael Felsberg

Published 2025 in arXiv.org

ABSTRACT

Keypoints are what enable Structure-from-Motion (SfM) systems to scale to thousands of images. However, designing a keypoint detection objective is a non-trivial task, as SfM is non-differentiable. Typically, an auxiliary objective involving a descriptor is optimized. This however induces a dependency on the descriptor, which is undesirable. In this paper we propose a fully self-supervised and descriptor-free objective for keypoint detection, through reinforcement learning. To ensure training does not degenerate, we leverage a balanced top-K sampling strategy. While this already produces competitive models, we find that two qualitatively different types of detectors emerge, which are only able to detect light and dark keypoints respectively. To remedy this, we train a third detector, DaD, that optimizes the Kullback-Leibler divergence of the pointwise maximum of both light and dark detectors. Our approach significantly improve upon SotA across a range of benchmarks. Code and model weights are publicly available at https://github.com/parskatt/dad

PUBLICATION RECORD

Publication year
2025
Venue
arXiv.org
Publication date
2025-03-10
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2503.07347 arXiv 2503.07347
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed
2024cited by this paper
Global Structure-from-Motion Revisited
2024cited by this paper
Learning Color Equivariant Representations
2024cited by this paper
DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector
2024influential reference
Affine steerers for structured keypoint description
2024cited by this paper
DeDoDe: Detect, Don’t Describe — Describe, Don’t Detect for Local Feature Matching
2023cited by this paper
Detector-Free Structure from Motion
2023cited by this paper
LightGlue: Local Feature Matching at Light Speed
2023cited by this paper
RoMa: Robust Dense Feature Matching
2023cited by this paper
ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation
2023influential reference
Camera Calibration without Camera Access - A Robust Validation Technique for Extended PnP Methods
2023cited by this paper
Steerers: A Framework for Rotation Equivariant Keypoint Descriptors
2023cited by this paper
Color Equivariant Convolutional Networks
2023cited by this paper
S-TREK: Sequential Translation and Rotation Equivariant Keypoints for local feature extraction
2023influential reference
Self-Supervised Equivariant Learning for Oriented Keypoint Detection
2022influential reference
Generalized Differentiable RANSAC
2022cited by this paper
DKM: Dense Kernelized Feature Matching for Geometry Estimation
2022cited by this paper
Decoupling Makes Weakly Supervised Local Feature Better
2022cited by this paper
PDC-Net+: Enhanced Probabilistic Dense Correspondence Network
2021cited by this paper
LoFTR: Detector-Free Local Feature Matching with Transformers
2021cited by this paper
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction
2021cited by this paper
DISK: Learning local features with policy gradient
2020influential reference
Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task
2019cited by this paper
SuperGlue: Learning Feature Matching With Graph Neural Networks
2019cited by this paper
Neural-Guided RANSAC: Learning Where to Sample Model Hypotheses
2019cited by this paper
Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters
2019cited by this paper
Matching Features without Descriptors: Implicitly Matched Interest Points
2018cited by this paper
SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning
2018cited by this paper
SuperPoint: Self-Supervised Interest Point Detection and Description
2017cited by this paper
HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors
2017cited by this paper
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes
2017cited by this paper
Structure-from-Motion Revisited
2016cited by this paper
DSAC — Differentiable RANSAC for Camera Localization
2016cited by this paper
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
Fast Poisson disk sampling in arbitrary dimensions
2007cited by this paper
Distinctive Image Features from Scale-Invariant Keypoints
2004cited by this paper
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
2004cited by this paper
Synaesthesia? A window into perception, thought and language
2001cited by this paper
Stochastic sampling in computer graphics
1986cited by this paper
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
1981cited by this paper

CITED BY

RaCo: Ranking and Covariance for Practical Learned Keypoints
2026influential citation
No Labels, No Look-Ahead: Unsupervised Online Video Stabilization with Classical Priors
2026cites this paper
Toward Free-Form Local Feature Matching
2025cites this paper