面向藏医药问答的可追溯跨源检索增强生成 / Traceable Cross-Source RAG for Chinese Tibetan Medicine Question Answering
1️⃣ 一句话总结
这篇论文针对藏医药领域多源知识库问答中权威证据检索不足和答案可追溯性差的问题,提出了一个结合知识库路由与对齐图证据融合的系统,有效提升了跨源证据覆盖率和答案的可靠性。
Retrieval-augmented generation (RAG) promises grounded question answering, yet domain settings with multiple heterogeneous knowledge bases (KBs) remain challenging. In Chinese Tibetan medicine, encyclopedia entries are often dense and easy to match, which can dominate retrieval even when classics or clinical papers provide more authoritative evidence. We study a practical setting with three KBs (encyclopedia, classics, and clinical papers) and a 500-query benchmark (cutoff $K{=}5$) covering both single-KB and cross-KB questions. We propose two complementary methods to improve traceability, reduce hallucinations, and enable cross-KB verification. First, DAKS performs KB routing and budgeted retrieval to mitigate density-driven bias and to prioritize authoritative sources when appropriate. Second, we use an alignment graph to guide evidence fusion and coverage-aware packing, improving cross-KB evidence coverage without relying on naive concatenation. All answers are generated by a lightweight generator, \textsc{openPangu-Embedded-7B}. Experiments show consistent gains in routing quality and cross-KB evidence coverage, with the full system achieving the best CrossEv@5 while maintaining strong faithfulness and citation correctness.
面向藏医药问答的可追溯跨源检索增强生成 / Traceable Cross-Source RAG for Chinese Tibetan Medicine Question Answering
这篇论文针对藏医药领域多源知识库问答中权威证据检索不足和答案可追溯性差的问题,提出了一个结合知识库路由与对齐图证据融合的系统,有效提升了跨源证据覆盖率和答案的可靠性。
源自 arXiv: 2602.05195