菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-26
📄 Abstract - Separating Semantic Competition from Context Length in RAG Reading

Retrieval-augmented generation (RAG) systems can respond incorrectly even when the correct passage was retrieved. The model must still read the retrieved passages and identify which one contains the answer among others that look relevant. This passage-reading model is called the reader. Does it fail simply because the context is longer or because the other passages genuinely compete with the correct one? We introduce and demonstrate a matched-control protocol for RAG reading: we keep the number and length of passages fixed, but replace hard competitors with less competitive real passages. We apply this control across two compact open models on SQuAD. This replacement partially restores performance, with the strongest effects on F1 and answer inclusion. For Phi-2, this recovers +6.0 EM points, +7.0 answer-inclusion points, and +0.057 F1. For Qwen2.5-1.5B, it recovers +4.5 EM points, +9.0 answer-inclusion points, and +0.068 F1. To track how performance changes as competitors accumulate, we also report retention curves and summarize them with a right-censored half-life when the curves do not cross half-retention. Together, these results show the protocol isolates a competition effect distinct from context length, though the effect is clearer for F1 and answer inclusion than for exact match, and also varies with snippet length.

顶级标签: llm model evaluation
详细标签: retrieval-augmented generation reader model semantic competition context length matched-control protocol 或 搜索:

在RAG阅读理解中区分语义竞争与上下文长度的影响 / Separating Semantic Competition from Context Length in RAG Reading


1️⃣ 一句话总结

本文提出一种新的实验方法,通过在保持上下文长度不变的情况下替换干扰文档,证明检索增强生成(RAG)系统的错误不仅由长上下文引起,更来自语义相似的干扰文档的竞争,并在两个小模型上验证了该方法能显著提升回答准确率。

源自 arXiv: 2605.27294