菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-12
📄 Abstract - From Noise to Order: Learning to Rank via Denoising Diffusion

In information retrieval (IR), learning-to-rank (LTR) methods have traditionally limited themselves to discriminative machine learning approaches that model the probability of the document being relevant to the query given some feature representation of the query-document pair. In this work, we propose an alternative denoising diffusion-based deep generative approach to LTR that instead models the full joint distribution over feature vectors and relevance labels. While in the discriminative setting, an over-parameterized ranking model may find different ways to fit the training data, we hypothesize that candidate solutions that can explain the full data distribution under the generative setting produce more robust ranking models. With this motivation, we propose DiffusionRank that extends TabDiff, an existing denoising diffusion-based generative model for tabular datasets, to create generative equivalents of classical discriminative pointwise and pairwise LTR objectives. Our empirical results demonstrate significant improvements from DiffusionRank models over their discriminative counterparts. Our work points to a rich space for future research exploration on how we can leverage ongoing advancements in deep generative modeling approaches, such as diffusion, for learning-to-rank in IR.

顶级标签: machine learning natural language processing data
详细标签: learning-to-rank diffusion models information retrieval generative modeling tabular data 或 搜索:

从噪声到有序:通过去噪扩散进行排序学习 / From Noise to Order: Learning to Rank via Denoising Diffusion


1️⃣ 一句话总结

这篇论文提出了一种名为DiffusionRank的新方法,它利用去噪扩散生成模型来学习信息检索中的排序任务,相比传统判别式模型,该方法通过建模数据和标签的完整联合分布,能生成更鲁棒的排序结果,并在实验中取得了显著提升。

源自 arXiv: 2602.11453