MoToRec:用于冷启动推荐的稀疏正则化多模态标记化方法 / MoToRec: Sparse-Regularized Multimodal Tokenization for Cold-Start Recommendation
1️⃣ 一句话总结
这篇论文提出了一种名为MoToRec的新方法,它将物品的多模态信息(如图像、文本)转化为一组离散、可解释的语义标记,从而有效解决了推荐系统中新物品因缺乏用户交互数据而难以被准确推荐的‘冷启动’难题。
Graph neural networks (GNNs) have revolutionized recommender systems by effectively modeling complex user-item interactions, yet data sparsity and the item cold-start problem significantly impair performance, particularly for new items with limited or no interaction history. While multimodal content offers a promising solution, existing methods result in suboptimal representations for new items due to noise and entanglement in sparse data. To address this, we transform multimodal recommendation into discrete semantic tokenization. We present Sparse-Regularized Multimodal Tokenization for Cold-Start Recommendation (MoToRec), a framework centered on a sparsely-regularized Residual Quantized Variational Autoencoder (RQ-VAE) that generates a compositional semantic code of discrete, interpretable tokens, promoting disentangled representations. MoToRec's architecture is enhanced by three synergistic components: (1) a sparsely-regularized RQ-VAE that promotes disentangled representations, (2) a novel adaptive rarity amplification that promotes prioritized learning for cold-start items, and (3) a hierarchical multi-source graph encoder for robust signal fusion with collaborative signals. Extensive experiments on three large-scale datasets demonstrate MoToRec's superiority over state-of-the-art methods in both overall and cold-start scenarios. Our work validates that discrete tokenization provides an effective and scalable alternative for mitigating the long-standing cold-start challenge.
MoToRec:用于冷启动推荐的稀疏正则化多模态标记化方法 / MoToRec: Sparse-Regularized Multimodal Tokenization for Cold-Start Recommendation
这篇论文提出了一种名为MoToRec的新方法,它将物品的多模态信息(如图像、文本)转化为一组离散、可解释的语义标记,从而有效解决了推荐系统中新物品因缺乏用户交互数据而难以被准确推荐的‘冷启动’难题。
源自 arXiv: 2602.11062