菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-07
📄 Abstract - CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control

Traffic signal control (TSC) is a core component of intelligent transportation systems (ITS), aiming to reduce congestion, emissions, and travel time. Recent approaches based on reinforcement learning (RL) and large language models (LLMs) have improved adaptivity, but still suffer from limited interpretability, insufficient interaction data, and weak generalization to heterogeneous intersections. This paper proposes CuraLight, an LLM-centered framework where an RL agent assists the fine-tuning of an LLM-based traffic signal controller. The RL agent explores traffic environments and generates high-quality interaction trajectories, which are converted into prompt-response pairs for imitation fine-tuning. A multi-LLM ensemble deliberation system further evaluates candidate signal timing actions through structured debate, providing preference-aware supervision signals for training. Experiments conducted in SUMO across heterogeneous real-world networks from Jinan, Hangzhou, and Yizhuang demonstrate that CuraLight consistently outperforms state-of-the-art baselines, reducing average travel time by 5.34 percent, average queue length by 5.14 percent, and average waiting time by 7.02 percent. The results highlight the effectiveness of combining RL-assisted exploration with deliberation-based data curation for scalable and interpretable traffic signal control.

顶级标签: llm agents systems
详细标签: traffic signal control reinforcement learning multi-agent debate data curation intelligent transportation systems 或 搜索:

CuraLight:用于大语言模型交通信号控制的辩论引导数据策展框架 / CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control


1️⃣ 一句话总结

这篇论文提出了一个名为CuraLight的新框架,它巧妙地将强化学习的数据探索能力与大语言模型的推理能力相结合,通过让多个语言模型进行‘辩论’来优化决策,从而显著提升了交通信号控制的效率和可解释性,并在多个真实路网测试中取得了优于现有方法的性能。

源自 arXiv: 2604.05663