SAGE:面向生成式推荐的序列级自适应梯度演化 / SAGE: Sequence-level Adaptive Gradient Evolution for Generative Recommendation
1️⃣ 一句话总结
这篇论文提出了一个名为SAGE的优化框架,旨在解决现有生成式推荐系统依赖独立词汇表导致的维护成本高、扩展性差,以及其优化策略对冷门项目更新不足和多样性下降的问题,通过序列级信号解耦和自适应梯度动态调整,有效提升了冷启动效果和推荐多样性。
While works such as OneRec have validated the scaling laws of Large Language Models (LLMs) in recommender systems, they rely on a cumbersome separate vocabulary. This dependency prevents the model architecture from reusing native LLM vocabularies, resulting in high maintenance costs and poor scalability. In response, we aim to efficiently reuse open-source LLM architectures without constructing a separate tokenization vocabulary. Furthermore, we identify that the optimization strategy of OneRec Gradient Bounded Policy Optimization (GBPO),suffers from a "Symmetric Conservatism" problem: its static gradient boundaries structurally suppress the update momentum required for cold-start items and fail to prevent diversity collapse in high-noise this http URL address this issue, we propose SAGE (Sequence-level Adaptive Gradient Evolution), a unified optimization framework tailored for list-wise generative recommendation. SAGE introduces two key innovations:(1) Sequence-level Signal Decoupling: By combining a geometric mean importance ratio with decoupled multi-objective advantages, we eliminate token-level variance and resolve the "Reward Collapse" problem. (2) Asymmetric Adaptive Dynamics: We construct a dynamic gradient manifold that applies a "Boost Factor" to high-potential cold start items to achieve super-linear updates and employs an "Entropy Aware Penalty" to break information cocoons. Theoretical analysis and empirical results demonstrate that SAGE effectively unblocks cold-start traffic and sustains recommendation diversity, all while retaining the numerical stability of GBPO.
SAGE:面向生成式推荐的序列级自适应梯度演化 / SAGE: Sequence-level Adaptive Gradient Evolution for Generative Recommendation
这篇论文提出了一个名为SAGE的优化框架,旨在解决现有生成式推荐系统依赖独立词汇表导致的维护成本高、扩展性差,以及其优化策略对冷门项目更新不足和多样性下降的问题,通过序列级信号解耦和自适应梯度动态调整,有效提升了冷启动效果和推荐多样性。
源自 arXiv: 2601.21452