菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-16
📄 Abstract - Probe-then-Plan: Environment-Aware Planning for Industrial E-commerce Search

Modern e-commerce search is evolving to resolve complex user intents. While Large Language Models (LLMs) offer strong reasoning, existing LLM-based paradigms face a fundamental blindness-latency dilemma: query rewriting is agnostic to retrieval capabilities and real-time inventory, yielding invalid plans; conversely, deep search agents rely on iterative tool calls and reflection, incurring seconds of latency incompatible with industrial sub-second budgets. To resolve this conflict, we propose Environment-Aware Search Planning (EASP), reformulating search planning as a dynamic reasoning process grounded in environmental reality. EASP introduces a Probe-then-Plan mechanism: a lightweight Retrieval Probe exposes the retrieval snapshot, enabling the Planner to diagnose execution gaps and generate grounded search plans. The methodology comprises three stages: (1) Offline Data Synthesis: A Teacher Agent synthesizes diverse, execution-validated plans by diagnosing the probed environment. (2) Planner Training and Alignment: The Planner is initialized via Supervised Fine-Tuning (SFT) to internalize diagnostic capabilities, then aligned with business outcomes (conversion rate) via Reinforcement Learning (RL). (3) Adaptive Online Serving: A complexity-aware routing mechanism selectively activates planning for complex queries, ensuring optimal resource allocation. Extensive offline evaluations and online A/B testing on this http URL demonstrate that EASP significantly improves relevant recall and achieves substantial lifts in UCVR and GMV. EASP has been successfully deployed in this http URL's AI-Search system.

顶级标签: llm agents systems
详细标签: e-commerce search retrieval-augmented planning industrial deployment latency optimization reinforcement learning alignment 或 搜索:

先探测再规划:面向工业级电商搜索的环境感知规划方法 / Probe-then-Plan: Environment-Aware Planning for Industrial E-commerce Search


1️⃣ 一句话总结

这篇论文提出了一种名为EASP的新方法,通过‘先探测后规划’的机制,让大型语言模型在制定电商搜索策略前先快速了解商品库存等实时环境信息,从而在保证搜索质量的同时,将决策延迟降低到工业应用可接受的亚秒级水平,显著提升了转化率和交易额。

源自 arXiv: 2603.15262