菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-05
📄 Abstract - CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness hallucinations, statements that contradict source records, posing direct risks to patient safety. To address this, we present CuraView, a multi-agent framework for sentence-level detection and evidence-grounded explanation of faithfulness hallucinations in discharge summaries. CuraView constructs a GraphRAG-based knowledge graph from patient-level EHRs and implements a closed-loop generation-detection pipeline with sentence-level evidence retrieval and classification spanning four evidence grades from strong support to direct contradiction (E1-E4), yielding structured and interpretable evidence chains. We evaluate CuraView on a subset of 250 patients from the Discharge-Me benchmark, with 50 patients held out for testing. Our fine-tuned Qwen3-14B detection model achieves an F1 of 0.831 on the safety-critical E4 metric (90.9% recall, 76.5% precision) and an F1 of 0.823 on E3+E4, representing a 50.0% relative improvement over the base model and outperforming RAGTruth-style and QAGS-style baselines. These results demonstrate that evidence-chain-based graph retrieval verification substantially improves the factual reliability of clinical documentation, while simultaneously producing reusable annotated datasets for downstream model training and distillation.

顶级标签: medical llm multi-agents
详细标签: hallucination detection graphrag discharge summaries knowledge verification multi-agent framework 或 搜索:

CuraView:基于GraphRAG增强知识验证的医疗幻觉检测多智能体框架 / CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification


1️⃣ 一句话总结

本文提出一个名为CuraView的多智能体框架,通过构建患者电子病历的知识图谱并设计闭环检测流程,精准识别出院小结中由大语言模型编造的内容,并用结构化证据链解释检测结果,显著提升了医疗文档的事实可靠性。

源自 arXiv: 2605.03476