RoboBrain 2.5:深度感知与时间建模 / RoboBrain 2.5: Depth in Sight, Time in Mind
1️⃣ 一句话总结
这篇论文提出了新一代机器人AI基础模型RoboBrain 2.5,它通过引入精确的三维空间推理和密集时间价值估计两大能力,让机器人能够更准确地在物理世界中理解和执行复杂的精细操作任务。
We introduce RoboBrain 2.5, a next-generation embodied AI foundation model that advances general perception, spatial reasoning, and temporal modeling through extensive training on high-quality spatiotemporal supervision. Building upon its predecessor, RoboBrain 2.5 introduces two major capability upgrades. Specifically, it unlocks Precise 3D Spatial Reasoning by shifting from 2D pixel-relative grounding to depth-aware coordinate prediction and absolute metric constraint comprehension, generating complete 3D manipulation traces as ordered keypoint sequences under physical constraints. Complementing this spatial precision, the model establishes Dense Temporal Value Estimation that provides dense, step-aware progress prediction and execution state understanding across varying viewpoints, producing stable feedback signals for downstream learning. Together, these upgrades extend the framework toward more physically grounded and execution-aware embodied intelligence for complex, fine-grained manipulation. The code and checkpoints are available at project website: this https URL
RoboBrain 2.5:深度感知与时间建模 / RoboBrain 2.5: Depth in Sight, Time in Mind
这篇论文提出了新一代机器人AI基础模型RoboBrain 2.5,它通过引入精确的三维空间推理和密集时间价值估计两大能力,让机器人能够更准确地在物理世界中理解和执行复杂的精细操作任务。
源自 arXiv: 2601.14352