RELATE:一种用于广告文本生成的强化学习增强型大语言模型框架 / RELATE: A Reinforcement Learning-Enhanced LLM Framework for Advertising Text Generation
1️⃣ 一句话总结
这篇论文提出了一个名为RELATE的端到端强化学习框架,它将广告文本的生成过程与最终的业务目标(如点击率和转化率)直接对齐,从而生成更有效、更合规的广告文案,并在实际应用中显著提升了广告效果。
In online advertising, advertising text plays a critical role in attracting user engagement and driving advertiser value. Existing industrial systems typically follow a two-stage paradigm, where candidate texts are first generated and subsequently aligned with online performance metrics such as click-through rate(CTR). This separation often leads to misaligned optimization objectives and low funnel efficiency, limiting global optimality. To address these limitations, we propose RELATE, a reinforcement learning-based end-to-end framework that unifies generation and objective alignment within a single model. Instead of decoupling text generation from downstream metric alignment, RELATE integrates performance and compliance objectives directly into the generation process via policy learning. To better capture ultimate advertiser value beyond click-level signals, We incorporate conversion-oriented metrics into the objective and jointly model them with compliance constraints as multi-dimensional rewards, enabling the model to generate high-quality ad texts that improve conversion performance under policy constraints. Extensive experiments on large-scale industrial datasets demonstrate that RELATE consistently outperforms baselines. Furthermore, online deployment on a production advertising platform yields statistically significant improvements in click-through conversion rate(CTCVR) under strict policy constraints, validating the robustness and real-world effectiveness of the proposed framework.
RELATE:一种用于广告文本生成的强化学习增强型大语言模型框架 / RELATE: A Reinforcement Learning-Enhanced LLM Framework for Advertising Text Generation
这篇论文提出了一个名为RELATE的端到端强化学习框架,它将广告文本的生成过程与最终的业务目标(如点击率和转化率)直接对齐,从而生成更有效、更合规的广告文案,并在实际应用中显著提升了广告效果。
源自 arXiv: 2602.11780