菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-08
📄 Abstract - TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning

Travel planning is a sophisticated decision-making process that requires synthesizing multifaceted information to construct itineraries. However, existing travel planning approaches face several challenges: (1) Pruning candidate points of interest (POIs) while maintaining a high recall rate; (2) A single reasoning path restricts the exploration capability within the feasible solution space for travel planning; (3) Simultaneously optimizing hard constraints and soft constraints remains a significant difficulty. To address these challenges, we propose TourPlanner, a comprehensive framework featuring multi-path reasoning and constraint-gated reinforcement learning. Specifically, we first introduce a Personalized Recall and Spatial Optimization (PReSO) workflow to construct spatially-aware candidate POIs' set. Subsequently, we propose Competitive consensus Chain-of-Thought (CCoT), a multi-path reasoning paradigm that improves the ability of exploring the feasible solution space. To further refine the plan, we integrate a sigmoid-based gating mechanism into the reinforcement learning stage, which dynamically prioritizes soft-constraint satisfaction only after hard constraints are met. Experimental results on travel planning benchmarks demonstrate that TourPlanner achieves state-of-the-art performance, significantly surpassing existing methods in both feasibility and user-preference alignment.

顶级标签: agents reinforcement learning systems
详细标签: travel planning multi-path reasoning constraint optimization reinforcement learning decision-making 或 搜索:

TourPlanner:一种用于旅行规划的、基于约束门控强化学习的竞争共识框架 / TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning


1️⃣ 一句话总结

这篇论文提出了一个名为TourPlanner的新框架,它通过结合多路径推理和一种能优先满足硬约束的强化学习方法,有效解决了旅行规划中平衡地点筛选、方案多样性和各类约束条件的难题,从而生成更可行且符合用户偏好的行程。

源自 arXiv: 2601.04698