菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-26
📄 Abstract - Sequential Regression for Continuous Value Prediction using Residual Quantization

Continuous value prediction plays a crucial role in industrial-scale recommendation systems, including tasks such as predicting users' watch-time and estimating the gross merchandise value (GMV) in e-commerce transactions. However, it remains challenging due to the highly complex and long-tailed nature of the data distributions. Existing generative approaches rely on rigid parametric distribution assumptions, which fundamentally limits their performance when such assumptions misalign with real-world data. Overly simplified forms cannot adequately model real-world complexities, while more intricate assumptions often suffer from poor scalability and generalization. To address these challenges, we propose a residual quantization (RQ)-based sequence learning framework that represents target continuous values as a sum of ordered quantization codes, predicted recursively from coarse to fine granularity with diminishing quantization errors. We introduce a representation learning objective that aligns RQ code embedding space with the ordinal structure of target values, allowing the model to capture continuous representations for quantization codes and further improving prediction accuracy. We perform extensive evaluations on public benchmarks for lifetime value (LTV) and watch-time prediction, alongside a large-scale online experiment for GMV prediction on an industrial short-video recommendation platform. The results consistently show that our approach outperforms state-of-the-art methods, while demonstrating strong generalization across diverse continuous value prediction tasks in recommendation systems.

顶级标签: machine learning model training systems
详细标签: residual quantization continuous value prediction recommendation systems sequential regression representation learning 或 搜索:

基于残差量化的序列回归用于连续值预测 / Sequential Regression for Continuous Value Prediction using Residual Quantization


1️⃣ 一句话总结

这篇论文提出了一种基于残差量化的序列学习新方法,通过将连续值分解为一系列从粗到细的量化码来递归预测,有效解决了推荐系统中用户观看时长、交易额等连续值预测的难题,并在多个实际任务中超越了现有最佳方法。

源自 arXiv: 2602.23012