菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-13
📄 Abstract - SECOND-Grasp: Semantic Contact-guided Dexterous Grasping

Achieving reliable robotic manipulation, such as dexterous grasping, requires a synergy between physically stable interactions and semantic task guidance, yet these objectives are often treated as separate, disjoint goals. In this paper, we investigate how to integrate dexterous grasping techniques, i.e., physically stable grasps for object lifting and language-guided grasp generation, to achieve both physical stability and semantic understanding. To this end, we propose SECOND-Grasp (SEmantic CONtact-guided Dexterous Grasping), a unified framework that enables robotic hands to dynamically adjust grasping strategies based on semantic reasoning while ensuring physical feasibility. We begin by obtaining coarse contact proposals through vision-language reasoning to infer where contacts should occur based on object properties, followed by segmentation to localize these regions across views. To further ensure consistency across multiple viewpoints, we introduce Semantic-Geometric Consistency Refinement (SGCR), which refines initial contact predictions by enforcing semantic consistency across views and removing geometrically invalid regions, yielding reliable 3D contact maps. Then, we derive a feasible hand pose for each contact map via inverse kinematics, generating a supervision signal for policy learning. Our approach, trained on DexGraspNet, consistently outperforms baselines in lifting success rate on both seen and unseen categories, achieving 98.2% and 97.7%, respectively, while also improving intent-aware grasping by 12.8% and 26.2%. We further show promising results on additional datasets and robotic hands, including Shadow Hand and Allegro Hand.

顶级标签: robotics computer vision multi-modal
详细标签: dexterous grasping semantic contact vision-language reasoning inverse kinematics grasp generation 或 搜索:

SECOND-Grasp:语义接触引导的灵巧抓取 / SECOND-Grasp: Semantic Contact-guided Dexterous Grasping


1️⃣ 一句话总结

本文提出了一种名为SECOND-Grasp的统一框架,通过结合视觉语言推理和几何一致性优化,让机器人手在灵巧抓取物体时既能理解语义指令(如“抓杯子柄”),又能保证物理稳定性,从而在多种任务中将抓取成功率提升至98%以上。

源自 arXiv: 2605.13117