菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-10
📄 Abstract - With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots

Reliable retrieval-augmented generation (RAG) systems depend fundamentally on the retriever's ability to find relevant information. We show that neural retrievers used in RAG systems have blind spots, which we define as the failure to retrieve entities that are relevant to the query, but have low similarity to the query embedding. We investigate the training-induced biases that cause such blind spot entities to be mapped to inaccessible parts of the embedding space, resulting in low retrievability. Using a large-scale dataset constructed from Wikidata relations and first paragraphs of Wikipedia, and our proposed Retrieval Probability Score (RPS), we show that blind spot risk in standard retrievers (e.g., CONTRIEVER, REASONIR) can be predicted pre-index from entity embedding geometry, avoiding expensive retrieval evaluations. To address these blind spots, we introduce ARGUS, a pipeline that enables the retrievability of high-risk (low-RPS) entities through targeted document augmentation from a knowledge base (KB), first paragraphs of Wikipedia, in our case. Extensive experiments on BRIGHT, IMPLIRET, and RAR-B show that ARGUS achieves consistent improvements across all evaluated retrievers (averaging +3.4 nDCG@5 and +4.5 nDCG@10 absolute points), with substantially larger gains in challenging subsets. These results establish that preemptively remedying blind spots is critical for building robust and trustworthy RAG systems (Code and Data).

顶级标签: llm natural language processing model evaluation
详细标签: retrieval-augmented generation retrieval blind spots embedding uncertainty document augmentation retrieval evaluation 或 搜索:

以阿格斯之眼:通过不确定性评分评估检索盲区以检测和补救检索盲点 / With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots


1️⃣ 一句话总结

这篇论文发现并解决了检索增强生成系统中神经检索器存在的“盲点”问题,即会遗漏那些与查询相关但语义相似度低的实体,并提出了一种名为ARGUS的预干预方法,通过有针对性地增强文档来显著提升检索系统的整体性能和可靠性。

源自 arXiv: 2602.09616