ARC-RL:受《ARC Raiders》启发的强化学习试验场 / ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders
1️⃣ 一句话总结
本文提出了ARC-RL,一个基于《ARC Raiders》游戏生物设计的强化学习测试平台,包含四种独特形态的机器人及其统一奖励函数,并用多种算法验证了该平台在应对多样化形态和动画风格约束时的有效性。
Reinforcement learning for legged locomotion has matured into a stack of multi-component reward functions and physics-engine benchmarks whose morphologies are uniformly derived from real commercial hardware. Game NPCs, however, are bound by stylistic constraints absent from sim-to-real robotics and routinely take the form of creatures with no real-robot counterpart. We introduce ARC-RL, a suite of four MuJoCo continuous-control environments featuring robotic morphologies inspired by the bestiary of ARC Raiders: the 18-DoF tall hexapod Queen, the 12-DoF armoured hexapod Bastion, the 18-DoF compact hexapod Tick, and the 12-DoF quadruped Leaper. All four robots share a unified observation template, action convention, simulation cadence, and a single closed-form multi-component reward function whose only per-morphology variation lives in a small set of weights and parameters. The reward fuses a velocity-tracking tent, a healthy survive bonus, a phase-locked gait-compliance bonus/cost pair, action regularisers, three safety penalties, and a posture anchor; no motion-capture data enters the reward at any point. We additionally provide hand-crafted Central Pattern Generator demonstrators per morphology, which serve both as fixed expert references and as sources of prior data for offline-to-online training. On this playground, we conduct a controlled empirical study comparing standard online algorithms (SAC, SPEQ, SOPE-EO) and methods augmented with prior data (SACfD, SPEQ-O2O, SOPE), and characterise how each paradigm copes with the playground's morphological diversity and animation-style stylistic constraints.
ARC-RL:受《ARC Raiders》启发的强化学习试验场 / ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders
本文提出了ARC-RL,一个基于《ARC Raiders》游戏生物设计的强化学习测试平台,包含四种独特形态的机器人及其统一奖励函数,并用多种算法验证了该平台在应对多样化形态和动画风格约束时的有效性。
源自 arXiv: 2605.19503