菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-24
📄 Abstract - UniTeD: Unified Temporal Diffusion for Joint Perception and Planning in Autonomous Driving

Diffusion models have shown strong potential for multi-modal planning in end-to-end autonomous driving. However, most existing methods confine diffusion to the planning module, conditioning on fixed outputs from separate discriminative perception networks. This decoupled design propagates perception errors to the planner, increasing optimization difficulty and reducing robustness. To overcome these limitations, we propose UniTeD, a Unified Temporal Diffusion framework that jointly models perception and planning through iterative denoising in a shared generative space. By enabling bidirectional information exchange, the framework facilitates mutual refinement between tasks and improves robustness via noise-conditioned multi-task training. We further extend this unified diffusion paradigm to a streaming setting by incorporating temporal context. A Temporal Transition Module (TTM) is introduced to resolve the noise-level mismatch between historical and current frames. In addition, we propose an Anchor Refresh Strategy (ARS) to alleviate the training-inference distribution shift commonly observed in sparse diffusion-based end-to-end driving frameworks. Without bells and whistles, UniTeD achieves state-of-the-art performance across multiple benchmarks, surpassing both recent discriminative end-to-end methods and diffusion-based planning approaches.

顶级标签: machine learning robotics multi-modal
详细标签: diffusion model autonomous driving perception planning temporal 或 搜索:

UniTeD:面向自动驾驶联合感知与规划的时序扩散统一框架 / UniTeD: Unified Temporal Diffusion for Joint Perception and Planning in Autonomous Driving


1️⃣ 一句话总结

本论文提出UniTeD框架,通过将感知和规划任务统一到同一个扩散模型中,让两者在迭代去噪过程中互相改善,并加入时序机制解决历史与当前帧的噪声不匹配问题,从而在自动驾驶任务上显著提升性能,超越了现有的传统方法和基于扩散的规划方法。

源自 arXiv: 2606.25736