赫拉克勒斯:为通用人形机器人控制搭建精确跟踪与生成式合成之间的桥梁 / Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control
1️⃣ 一句话总结
这篇论文提出了一种名为Heracles的新型控制中间件,它利用状态条件扩散模型,让人形机器人既能精确执行指令动作,又能在遭遇严重干扰时像人类一样自然地生成恢复动作,从而显著提升了机器人的鲁棒性和适应性。
Achieving general-purpose humanoid control requires a delicate balance between the precise execution of commanded motions and the flexible, anthropomorphic adaptability needed to recover from unpredictable environmental perturbations. Current general controllers predominantly formulate motion control as a rigid reference-tracking problem. While effective in nominal conditions, these trackers often exhibit brittle, non-anthropomorphic failure modes under severe disturbances, lacking the generative adaptability inherent to human motor control. To overcome this limitation, we propose Heracles, a novel state-conditioned diffusion middleware that bridges precise motion tracking and generative synthesis. Rather than relying on rigid tracking paradigms or complex explicit mode-switching, Heracles operates as an intermediary layer between high-level reference motions and low-level physics trackers. By conditioning on the robot's real-time state, the diffusion model implicitly adapts its behavior: it approximates an identity map when the state closely aligns with the reference, preserving zero-shot tracking fidelity. Conversely, when encountering significant state deviations, it seamlessly transitions into a generative synthesizer to produce natural, anthropomorphic recovery trajectories. Our framework demonstrates that integrating generative priors into the control loop not only significantly enhances robustness against extreme perturbations but also elevates humanoid control from a rigid tracking paradigm to an open-ended, generative general-purpose architecture.
赫拉克勒斯:为通用人形机器人控制搭建精确跟踪与生成式合成之间的桥梁 / Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control
这篇论文提出了一种名为Heracles的新型控制中间件,它利用状态条件扩散模型,让人形机器人既能精确执行指令动作,又能在遭遇严重干扰时像人类一样自然地生成恢复动作,从而显著提升了机器人的鲁棒性和适应性。
源自 arXiv: 2603.27756