让智能体掌舵:通过影响力交换实现闭环排序优化 / Let the Agent Steer: Closed-Loop Ranking Optimization via Influence Exchange
1️⃣ 一句话总结
这篇论文提出了一个名为Sortify的AI智能体,它能像自主交易员一样,在推荐系统中自动调整不同排序因素的影响力权重,通过闭环学习直接优化线上业务指标,并在大规模实际应用中显著提升了商品交易总额和广告收入。
Recommendation ranking is fundamentally an influence allocation problem: a sorting formula distributes ranking influence among competing factors, and the business outcome depends on finding the optimal "exchange rates" among them. However, offline proxy metrics systematically misjudge how influence reallocation translates to online impact, with asymmetric bias across metrics that a single calibration factor cannot correct. We present Sortify, the first fully autonomous LLM-driven ranking optimization agent deployed in a large-scale production recommendation system. The agent reframes ranking optimization as continuous influence exchange, closing the full loop from diagnosis to parameter deployment without human intervention. It addresses structural problems through three mechanisms: (1) a dual-channel framework grounded in Savage's Subjective Expected Utility (SEU) that decouples offline-online transfer correction (Belief channel) from constraint penalty adjustment (Preference channel); (2) an LLM meta-controller operating on framework-level parameters rather than low-level search variables; (3) a persistent Memory DB with 7 relational tables for cross-round learning. Its core metric, Influence Share, provides a decomposable measure where all factor contributions sum to exactly 100%. Sortify has been deployed across two markets. In Country A, the agent pushed GMV from -3.6% to +9.2% within 7 rounds with peak orders reaching +12.5%. In Country B, a cold-start deployment achieved +4.15% GMV/UU and +3.58% Ads Revenue in a 7-day A/B test, leading to full production rollout.
让智能体掌舵:通过影响力交换实现闭环排序优化 / Let the Agent Steer: Closed-Loop Ranking Optimization via Influence Exchange
这篇论文提出了一个名为Sortify的AI智能体,它能像自主交易员一样,在推荐系统中自动调整不同排序因素的影响力权重,通过闭环学习直接优化线上业务指标,并在大规模实际应用中显著提升了商品交易总额和广告收入。
源自 arXiv: 2603.27765