面向多图的可扩展路由的两阶段学习分解方法 / Two-Stage Learned Decomposition for Scalable Routing on Multigraphs
1️⃣ 一句话总结
本文提出了一种名为NEPF的两阶段学习方法,将复杂的车辆路径问题分解为节点排序和边选择两个步骤,通过预编码聚合和非自回归架构显著提升训练与推理效率,在多种多图场景下达到了当前最优或相近的求解质量。
Most neural methods for Vehicle Routing Problems (VRPs) are limited to Euclidean settings or simple graphs. In this work, we instead consider multigraphs, where parallel edges represent distinct travel options with varying trade-offs (e.g., distance vs time). Few methods are designed for such formulations and those that do exist face major scalability issues. We mitigate these scalability issues via a Node-Edge Policy Factorization (NEPF) approach, which splits the routing policy into a node permutation stage and an edge selection stage. To enable the decomposition, we introduce a pre-encoding edge aggregation scheme and a non-autoregressive architecture for the edge stage, as well as a hierarchical reinforcement learning method to train the stages jointly. Our experiments across six VRP variants demonstrate that NEPF matches or outperforms the state-of-the-art in terms of solution quality, while being significantly faster in training and inference.
面向多图的可扩展路由的两阶段学习分解方法 / Two-Stage Learned Decomposition for Scalable Routing on Multigraphs
本文提出了一种名为NEPF的两阶段学习方法,将复杂的车辆路径问题分解为节点排序和边选择两个步骤,通过预编码聚合和非自回归架构显著提升训练与推理效率,在多种多图场景下达到了当前最优或相近的求解质量。
源自 arXiv: 2605.05389