OpenDecoder:开放大语言模型解码以在RAG中融入文档质量 / OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG
1️⃣ 一句话总结
这篇论文提出了一种名为OpenDecoder的新方法,通过显式评估检索到的文档质量(如相关度、排名等指标)来指导大语言模型生成答案,从而提升检索增强生成系统的鲁棒性和效果。
The development of large language models (LLMs) has achieved superior performance in a range of downstream tasks, including LLM-based retrieval-augmented generation (RAG). The quality of generated content heavily relies on the usefulness of the retrieved information and the capacity of LLMs' internal information processing mechanism to incorporate it in answer generation. It is generally assumed that the retrieved information is relevant to the question. However, the retrieved information may have a variable degree of relevance and usefulness, depending on the question and the document collection. It is important to take into account the relevance of the retrieved information in answer generation. In this paper, we propose OpenDecoder, a new approach that leverages explicit evaluation of the retrieved information as quality indicator features for generation. We aim to build a RAG model that is more robust to varying levels of noisy context. Three types of explicit evaluation information are considered: relevance score, ranking score, and QPP (query performance prediction) score. The experimental results on five benchmark datasets demonstrate the effectiveness and better robustness of OpenDecoder by outperforming various baseline methods. Importantly, this paradigm is flexible to be integrated with the post-training of LLMs for any purposes and incorporated with any type of external indicators.
OpenDecoder:开放大语言模型解码以在RAG中融入文档质量 / OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG
这篇论文提出了一种名为OpenDecoder的新方法,通过显式评估检索到的文档质量(如相关度、排名等指标)来指导大语言模型生成答案,从而提升检索增强生成系统的鲁棒性和效果。
源自 arXiv: 2601.09028