GraspLDP:通过潜在扩散实现可泛化的抓取策略 / GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
1️⃣ 一句话总结
这篇论文提出了一种结合抓取先验知识的潜在扩散策略,通过引导机器人动作生成和融入视觉重建目标,显著提升了模仿学习抓取策略的精确度和对不同物体、不同场景的泛化能力。
This paper focuses on enhancing the grasping precision and generalization of manipulation policies learned via imitation learning. Diffusion-based policy learning methods have recently become the mainstream approach for robotic manipulation tasks. As grasping is a critical subtask in manipulation, the ability of imitation-learned policies to execute precise and generalizable grasps merits particular attention. Existing imitation learning techniques for grasping often suffer from imprecise grasp executions, limited spatial generalization, and poor object generalization. To address these challenges, we incorporate grasp prior knowledge into the diffusion policy framework. In particular, we employ a latent diffusion policy to guide action chunk decoding with grasp pose prior, ensuring that generated motion trajectories adhere closely to feasible grasp configurations. Furthermore, we introduce a self-supervised reconstruction objective during diffusion to embed the graspness prior: at each reverse diffusion step, we reconstruct wrist-camera images back-projected the graspness from the intermediate representations. Both simulation and real robot experiments demonstrate that our approach significantly outperforms baseline methods and exhibits strong dynamic grasping capabilities.
GraspLDP:通过潜在扩散实现可泛化的抓取策略 / GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
这篇论文提出了一种结合抓取先验知识的潜在扩散策略,通过引导机器人动作生成和融入视觉重建目标,显著提升了模仿学习抓取策略的精确度和对不同物体、不同场景的泛化能力。
源自 arXiv: 2602.22862