视觉提示引导的统一推动策略 / Visual Prompt Guided Unified Pushing Policy
1️⃣ 一句话总结
这项研究提出了一种新型的机器人推动策略,它通过引入简单的视觉提示来指导机器人进行灵活、多模式的推动操作,从而能够高效地适应各种不同的任务场景,比如桌面整理。
As one of the simplest non-prehensile manipulation skills, pushing has been widely studied as an effective means to rearrange objects. Existing approaches, however, typically rely on multi-step push plans composed of pre-defined pushing primitives with limited application scopes, which restrict their efficiency and versatility across different scenarios. In this work, we propose a unified pushing policy that incorporates a lightweight prompting mechanism into a flow matching policy to guide the generation of reactive, multimodal pushing actions. The visual prompt can be specified by a high-level planner, enabling the reuse of the pushing policy across a wide range of planning problems. Experimental results demonstrate that the proposed unified pushing policy not only outperforms existing baselines but also effectively serves as a low-level primitive within a VLM-guided planning framework to solve table-cleaning tasks efficiently.
视觉提示引导的统一推动策略 / Visual Prompt Guided Unified Pushing Policy
这项研究提出了一种新型的机器人推动策略,它通过引入简单的视觉提示来指导机器人进行灵活、多模式的推动操作,从而能够高效地适应各种不同的任务场景,比如桌面整理。
源自 arXiv: 2602.19193