菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-16
📄 Abstract - RecGPT-V2 Technical Report

Large language models (LLMs) have demonstrated remarkable potential in transforming recommender systems from implicit behavioral pattern matching to explicit intent reasoning. While RecGPT-V1 successfully pioneered this paradigm by integrating LLM-based reasoning into user interest mining and item tag prediction, it suffers from four fundamental limitations: (1) computational inefficiency and cognitive redundancy across multiple reasoning routes; (2) insufficient explanation diversity in fixed-template generation; (3) limited generalization under supervised learning paradigms; and (4) simplistic outcome-focused evaluation that fails to match human standards. To address these challenges, we present RecGPT-V2 with four key innovations. First, a Hierarchical Multi-Agent System restructures intent reasoning through coordinated collaboration, eliminating cognitive duplication while enabling diverse intent coverage. Combined with Hybrid Representation Inference that compresses user-behavior contexts, our framework reduces GPU consumption by 60% and improves exclusive recall from 9.39% to 10.99%. Second, a Meta-Prompting framework dynamically generates contextually adaptive prompts, improving explanation diversity by +7.3%. Third, constrained reinforcement learning mitigates multi-reward conflicts, achieving +24.1% improvement in tag prediction and +13.0% in explanation acceptance. Fourth, an Agent-as-a-Judge framework decomposes assessment into multi-step reasoning, improving human preference alignment. Online A/B tests on Taobao demonstrate significant improvements: +2.98% CTR, +3.71% IPV, +2.19% TV, and +11.46% NER. RecGPT-V2 establishes both the technical feasibility and commercial viability of deploying LLM-powered intent reasoning at scale, bridging the gap between cognitive exploration and industrial utility.

顶级标签: llm agents systems
详细标签: recommendation system multi-agent architecture efficient inference reinforcement learning explainable ai 或 搜索:

RecGPT-V2:一种高效、可扩展且与人类对齐的意图驱动推荐系统 / RecGPT-V2 Technical Report


1️⃣ 一句话总结

RecGPT-V2是一个新一代的意图驱动推荐系统框架,它通过分层多智能体系统、原子化实体压缩、元提示、约束强化学习和过程导向的智能体即法官评估等四项核心创新,系统地解决了其前身RecGPT-V1在计算效率、解释多样性、泛化能力和评估对齐方面的局限性,并在在线A/B测试中取得了显著性能提升。


2️⃣ 论文创新点

1. 分层多智能体系统(HMAS)

2. 原子化实体压缩与混合表示适应

3. 元提示框架与动态解释生成

4. 约束强化学习优化

5. 智能体即法官(过程导向的多步评估)


3️⃣ 主要结果与价值

结果亮点

实际价值


4️⃣ 术语表

源自 arXiv: 2512.14503