菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-15
📄 Abstract - TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds

Recommender systems have historically developed along two largely independent paradigms: feature interaction models for modeling correlations among multi-field categorical features, and sequential models for capturing user behavior dynamics from historical interaction sequences. Although recent trends attempt to bridge these paradigms within shared backbones, we empirically reveal that naive unifying these two branches may lead to a failure mode of Sequential Collapse Propagation (SCP). That is, the interaction with those dimensionally ill non-sequence fields leads to the dimensional collapse of the sequence features. To overcome this challenge, we propose TokenFormer, a unified recommendation architecture with the following innovations. First, we introduce a Bottom-Full-Top-Sliding (BFTS) attention scheme, which applies full self-attention in the lower layers and shrinking-window sliding attention in the upper layers. Second, we introduce a Non-Linear Interaction Representation (NLIR) that applies one-sided non-linear multiplicative transformations to the hidden states. Extensive experiments on public benchmarks and Tencent's advertising platform demonstrate state-of-the-art performance, while detailed analysis confirm that TokenFormer significantly improves dimensional robustness and representation discriminability under unified modeling.

顶级标签: machine learning systems model training
详细标签: recommender systems sequential modeling attention mechanisms unified architecture dimensional collapse 或 搜索:

TokenFormer:统一多字段与序列推荐世界 / TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds


1️⃣ 一句话总结

这篇论文提出了一种名为TokenFormer的新型推荐系统架构,它通过创新的注意力机制和非线性交互表示,成功解决了将传统特征交互模型与序列模型简单结合时会导致序列特征信息丢失的问题,从而在统一建模中实现了更强大且高效的推荐性能。

源自 arXiv: 2604.13737