Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature

📄 Abstract - Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature

Retrieval-Augmented Generation (RAG) systems for biomedical literature are typically evaluated using ranking metrics like Mean Reciprocal Rank (MRR), which measure how well the system identifies the single most relevant chunk. We argue that for full-text scientific documents, this paradigm is incomplete: it rewards retrieval precision while ignoring retrieval breadth -- the ability to surface evidence from across a document's structural sections. We propose GraLC-RAG, a framework that unifies late chunking with graph-aware structural intelligence, introducing structure-aware chunk boundary detection, UMLS knowledge graph infusion, and graph-guided hybrid retrieval. We evaluate six strategies on 2,359 IMRaD-filtered PubMed Central articles using 2,033 cross-section questions and two metric families: standard ranking metrics (MRR, Recall@k) and structural coverage metrics (SecCov@k, CS Recall). Our results expose a sharp divergence: content-similarity methods achieve the highest MRR (0.517) but always retrieve from a single section, while structure-aware methods retrieve from up to 15.6x more sections. Generation experiments show that KG-infused retrieval narrows the answer-quality gap to delta-F1 = 0.009 while maintaining 4.6x section diversity. These findings demonstrate that standard metrics systematically undervalue structural retrieval and that closing the multi-section synthesis gap is a key open problem for biomedical RAG.

面向生物医学文献检索增强生成的图感知延迟分块方法 / Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature

1️⃣ 一句话总结

这篇论文提出了一个名为GraLC-RAG的新框架，它通过结合延迟分块和图感知的结构智能，解决了现有生物医学文献检索系统只关注检索精度而忽视从文档不同结构部分获取广泛证据的问题，从而提升了回答跨章节复杂问题的能力。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要