评估基于强化学习的自适应交通信号控制的鲁棒性 / Evaluating the Robustness of Reinforcement Learning based Adaptive Traffic Signal Control
1️⃣ 一句话总结
本研究开发并测试了一种基于强化学习的交通信号控制算法,该算法采用与真实信号控制器一致的复杂相位结构,并通过分布式训练提升效率,实验表明其在多种交通需求下能显著优于传统感应控制,但模型的鲁棒性高度依赖于训练数据的多样性。
Reinforcement learning (RL) has attracted increasing interest for adaptive traffic signal control due to its model-free ability to learn control policies directly from interaction with the traffic environment. However, several challenges remain before RL-based signal control can be considered ready for field deployment. Many existing studies rely on simplified signal timing structures, robustness of trained models under varying traffic demand conditions remains insufficiently evaluated, and runtime efficiency continues to pose challenges when training RL algorithms in traffic microscopic simulation environments. This study formulates an RL-based signal control algorithm capable of representing a full eight-phase ring-barrier configuration consistent with field signal controllers. The algorithm is trained and evaluated under varying traffic demand conditions and benchmarked against state-of-the-practice actuated signal control (ASC). To assess robustness, experiments are conducted across multiple traffic volumes and origin-destination (O-D) demand patterns with varying levels of structural similarity. To improve training efficiency, a distributed asynchronous training architecture is implemented that enables parallel simulation across multiple computing nodes. Results from a case study intersection show that the proposed RL-based signal control significantly outperforms optimized ASC, reducing average delay by 11-32% across movements. A model trained on a single O-D pattern generalizes well to similar unseen demand patterns but degrades under substantially different demand conditions. In contrast, a model trained on diverse O-D patterns demonstrates strong robustness, consistently outperforming ASC even under highly dissimilar unseen demand scenarios.
评估基于强化学习的自适应交通信号控制的鲁棒性 / Evaluating the Robustness of Reinforcement Learning based Adaptive Traffic Signal Control
本研究开发并测试了一种基于强化学习的交通信号控制算法,该算法采用与真实信号控制器一致的复杂相位结构,并通过分布式训练提升效率,实验表明其在多种交通需求下能显著优于传统感应控制,但模型的鲁棒性高度依赖于训练数据的多样性。
源自 arXiv: 2603.15283