A Multilevel Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning

Hao Chen,Takuya Kiyokawa,Zhengtao Hu,Weiwei Wan,Kensuke Harada

Published 2025 in IEEE Transactions on robotics

ABSTRACT

Grasping unknown objects from a single view has remained a challenging topic in robotics due to the uncertainty of partial observation. Recent advances in large-scale models have led to benchmark solutions such as GraspNet-1Billion. However, such learning-based approaches still face a critical limitation in performance robustness for their sensitivity to sensing noise and environmental changes. To address this bottleneck in achieving highly generalized grasping, we abandon the traditional learning framework and introduce a new perspective: similarity matching, where similar known objects are utilized to guide the grasping of unknown target objects. We newly propose a method that robustly achieves unknown-object grasping from a single viewpoint through three key steps: 1) leverage the visual features of the observed object to perform similarity matching with an existing database containing various object models, identifying potential candidates with high similarity; 2) use the candidate models with pre-existing grasping knowledge to plan imitative grasps for the unknown target object; 3) optimize the grasp quality through a local fine-tuning process. To address the uncertainty caused by partial and noisy observation, we propose a multilevel similarity matching framework that integrates semantic, geometric, and dimensional features for comprehensive evaluation. Especially, we introduce a novel point cloud geometric descriptor, the clustered fast point feature histogram descriptor, which facilitates accurate similarity assessment between partial point clouds of observed objects and complete point clouds of database models. In addition, we incorporate the use of large language models, introduce the semioriented bounding box, and develop a novel point cloud registration approach based on plane detection to enhance matching accuracy under single-view conditions. Real-world experiments demonstrate that our proposed method significantly outperforms existing benchmarks in grasping a wide variety of unknown objects in both isolated and cluttered scenarios, showcasing exceptional robustness across varying object types and operating environments.

PUBLICATION RECORD

Publication year
2025
Venue
IEEE Transactions on robotics
Publication date
2025-07-16
Fields of study
Computer Science, Engineering
Identifiers
DOI 10.1109/TRO.2025.3588720 arXiv 2507.11938
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

3DSGrasp: 3D Shape-Completion for Robotic Grasp
2023cited by this paper
Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
2023cited by this paper
Efficient Heatmap-Guided 6-Dof Grasp Detection in Cluttered Scenes
2023cited by this paper
Learning-Free Grasping of Unknown Objects Using Hidden Superquadrics
2023cited by this paper
Segment Anything
2023cited by this paper
SCARP: 3D Shape Completion in ARbitrary Poses for Improved Grasping
2023cited by this paper
Category-Association Based Similarity Matching for Novel Object Pick-and-Place Task
2022influential reference
Deep Learning Approaches to Grasp Synthesis: A Review
2022cited by this paper
Detecting Twenty-thousand Classes using Image-level Supervision
2022cited by this paper
Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations
2021cited by this paper
Robust and Accurate Superquadric Recovery: a Probabilistic Approach
2021cited by this paper
Ontology-Assisted Generalisation of Robot Action Execution Knowledge
2021cited by this paper
Robust grasp detection with incomplete point cloud and complex background
2021cited by this paper
Volumetric Grasping Network: Real-time 6 DOF Grasp Detection in Clutter
2021cited by this paper
GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping
2020cited by this paper
Planning Grasps With Suction Cups and Parallel Grippers Using Superimposed Segmentation of Object Meshes
2020cited by this paper
A robust statistics approach for plane detection in unorganized point clouds
2020cited by this paper
Learning ambidextrous robot grasping policies
2019influential reference
6-DOF GraspNet: Variational Grasp Generation for Object Manipulation
2019cited by this paper
Model-free and learning-free grasping by Local Contact Moment matching
2018cited by this paper
Closing the Loop for Robotic Grasping: A Real-time, Generative Grasp Synthesis Approach
2018cited by this paper
PointNetGPD: Detecting Grasp Configurations from Point Sets
2018cited by this paper
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
2018cited by this paper
Learning 6-DOF Grasping Interaction via Deep Geometry-Aware 3D Representations
2018influential reference
Robotic pick-and-place of novel objects in clutter with multi-affordance grasping and cross-domain image matching
2017cited by this paper
Grasp Pose Detection in Point Clouds
2017cited by this paper
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
2016cited by this paper
Benchmarking in Manipulation Research: Using the Yale-CMU-Berkeley Object and Model Set
2015cited by this paper
GloVe: Global Vectors for Word Representation
2014cited by this paper
Efficient Estimation of Word Representations in Vector Space
2013cited by this paper
Template-based learning of grasp selection
2012cited by this paper
CAD-model recognition and 6DOF pose estimation using 3D cues
2011cited by this paper
Fast Point Feature Histograms (FPFH) for 3D registration
2009cited by this paper
The Columbia grasp database
2009cited by this paper
Robust feature detection and local classification for surfaces based on moment analysis
2004cited by this paper
The ball-pivoting algorithm for surface reconstruction
1999cited by this paper
Object modeling by registration of multiple range images
1991cited by this paper
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
1981cited by this paper

CITED BY

No citing papers are available for this paper.