生成式推荐系统持续适应的高效数据集选择方法 / Efficient Dataset Selection for Continual Adaptation of Generative Recommenders
1️⃣ 一句话总结
这篇论文提出了一种通过智能选择少量关键数据来高效更新推荐系统的方法,使其能持续适应用户行为变化,同时避免了大规模数据重训练带来的计算负担。
Recommendation systems must continuously adapt to evolving user behavior, yet the volume of data generated in large-scale streaming environments makes frequent full retraining impractical. This work investigates how targeted data selection can mitigate performance degradation caused by temporal distributional drift while maintaining scalability. We evaluate a range of representation choices and sampling strategies for curating small but informative subsets of user interaction data. Our results demonstrate that gradient-based representations, coupled with distribution-matching, improve downstream model performance, achieving training efficiency gains while preserving robustness to drift. These findings highlight data curation as a practical mechanism for scalable monitoring and adaptive model updates in production-scale recommendation systems.
生成式推荐系统持续适应的高效数据集选择方法 / Efficient Dataset Selection for Continual Adaptation of Generative Recommenders
这篇论文提出了一种通过智能选择少量关键数据来高效更新推荐系统的方法,使其能持续适应用户行为变化,同时避免了大规模数据重训练带来的计算负担。
源自 arXiv: 2604.07739