超越相关性:论检索与RAG信息覆盖范围之间的关系 / Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage
1️⃣ 一句话总结
这项研究通过实验证明,在检索增强生成(RAG)系统中,衡量检索结果信息覆盖范围的指标,能够有效预测最终生成答案的信息完整性,为评估RAG系统性能提供了更直接的早期参考。
Retrieval-augmented generation (RAG) systems combine document retrieval with a generative model to address complex information seeking tasks like report generation. While the relationship between retrieval quality and generation effectiveness seems intuitive, it has not been systematically studied. We investigate whether upstream retrieval metrics can serve as reliable early indicators of the final generated response's information coverage. Through experiments across two text RAG benchmarks (TREC NeuCLIR 2024 and TREC RAG 2024) and one multimodal benchmark (WikiVideo), we analyze 15 text retrieval stacks and 10 multimodal retrieval stacks across four RAG pipelines and multiple evaluation frameworks (Auto-ARGUE and MiRAGE). Our findings demonstrate strong correlations between coverage-based retrieval metrics and nugget coverage in generated responses at both topic and system levels. This relationship holds most strongly when retrieval objectives align with generation goals, though more complex iterative RAG pipelines can partially decouple generation quality from retrieval effectiveness. These findings provide empirical support for using retrieval metrics as proxies for RAG performance.
超越相关性:论检索与RAG信息覆盖范围之间的关系 / Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage
这项研究通过实验证明,在检索增强生成(RAG)系统中,衡量检索结果信息覆盖范围的指标,能够有效预测最终生成答案的信息完整性,为评估RAG系统性能提供了更直接的早期参考。
源自 arXiv: 2603.08819