菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-13
📄 Abstract - Why Retrieval-Augmented Generation Fails: A Graph Perspective

Retrieval-Augmented Generation (RAG) has become a powerful and widely used approach for improving large language models by grounding generation in retrieved evidence. However, RAG systems still produce incorrect answers in many cases. Why RAG fails despite having access to external information remains poorly understood. We present a model-internal study of retrieval-augmented generation that examines how retrieved evidence influences answer generation. Using circuit tracing, we construct attribution graphs that model the flow of information through transformer layers during decoding. These graphs represent interactions among retrieved context, intermediate model activations, and generated tokens, providing a graph, circuit-level view of how external evidence is integrated into the model's reasoning process across multiple question answering benchmarks, we observe consistent structural differences: correct predictions exhibit deeper reasoning paths, more distributed evidence flow, and a more structured pattern of local connectivity, while failed predictions show shallower, fragmented, and overly concentrated evidence flow. Building on these findings, we develop a graph-based error detection framework that uses attribution-graph topology features. Furthermore, we show that attribution graphs enable targeted interventions. By reinforcing question-constrained evidence grounding, we reshape internal routing so that answer generation remains guided by the question, leading to more effective integration of retrieved information and fewer errors.

顶级标签: llm machine learning natural language processing
详细标签: retrieval-augmented generation circuit tracing attribution graphs error detection reasoning analysis 或 搜索:

检索增强生成为何失败:基于图的视角 / Why Retrieval-Augmented Generation Fails: A Graph Perspective


1️⃣ 一句话总结

本文通过构建归因图来可视化检索增强生成(RAG)过程中信息在模型内部的流动路径,发现正确回答通常依赖更深、更分散的结构化证据链,而失败案例中证据流动浅层且集中;基于此,作者提出了一个利用图拓扑特征检测错误并改进模型推理的新方法。

源自 arXiv: 2605.14192