菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-22
📄 Abstract - Visual Prompt Guided Unified Pushing Policy

As one of the simplest non-prehensile manipulation skills, pushing has been widely studied as an effective means to rearrange objects. Existing approaches, however, typically rely on multi-step push plans composed of pre-defined pushing primitives with limited application scopes, which restrict their efficiency and versatility across different scenarios. In this work, we propose a unified pushing policy that incorporates a lightweight prompting mechanism into a flow matching policy to guide the generation of reactive, multimodal pushing actions. The visual prompt can be specified by a high-level planner, enabling the reuse of the pushing policy across a wide range of planning problems. Experimental results demonstrate that the proposed unified pushing policy not only outperforms existing baselines but also effectively serves as a low-level primitive within a VLM-guided planning framework to solve table-cleaning tasks efficiently.

顶级标签: robotics agents computer vision
详细标签: pushing policy visual prompting flow matching non-prehensile manipulation table-cleaning 或 搜索:

视觉提示引导的统一推动策略 / Visual Prompt Guided Unified Pushing Policy


1️⃣ 一句话总结

这项研究提出了一种新型的机器人推动策略,它通过引入简单的视觉提示来指导机器人进行灵活、多模式的推动操作,从而能够高效地适应各种不同的任务场景,比如桌面整理。

源自 arXiv: 2602.19193