菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-03
📄 Abstract - AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation

Autonomous driving systems require comprehensive evaluation in safety-critical scenarios to ensure safety and robustness. However, such scenarios are rare and difficult to collect from real-world driving data, necessitating simulation-based synthesis. Yet, existing methods often exhibit limitations in both controllability and realism. From a capability perspective, LLMs excel at controllable generation guided by natural language instructions, while diffusion models are better suited for producing trajectories consistent with realistic driving distributions. Leveraging their complementary strengths, we propose AnchorDrive, a two-stage safety-critical scenario generation framework. In the first stage, we deploy an LLM as a driver agent within a closed-loop simulation, which reasons and iteratively outputs control commands under natural language constraints; a plan assessor reviews these commands and provides corrective feedback, enabling semantically controllable scenario generation. In the second stage, the LLM extracts key anchor points from the first-stage trajectories as guidance objectives, which jointly with other guidance terms steer the diffusion model to regenerate complete trajectories with improved realism while preserving user-specified intent. Experiments on the highD dataset demonstrate that AnchorDrive achieves superior overall performance in criticality, realism, and controllability, validating its effectiveness for generating controllable and realistic safety-critical scenarios.

顶级标签: agents multi-modal model evaluation
详细标签: autonomous driving scenario generation diffusion models llm agents safety-critical simulation 或 搜索:

AnchorDrive:基于锚点引导扩散再生的LLM场景推演用于安全关键场景生成 / AnchorDrive: LLM Scenario Rollout with Anchor-Guided Diffusion Regeneration for Safety-Critical Scenario Generation


1️⃣ 一句话总结

这篇论文提出了一个名为AnchorDrive的两阶段框架,它巧妙地结合了大型语言模型(LLM)的指令控制能力和扩散模型(Diffusion Model)的逼真生成能力,用于高效地生成既符合用户意图又高度逼真的自动驾驶安全测试场景。

源自 arXiv: 2603.02542