DTCRS:用于递归摘要的动态树构建方法 / DTCRS: Dynamic Tree Construction for Recursive Summarization
1️⃣ 一句话总结
这篇论文提出了一种名为DTCRS的新方法,它能根据文档结构和问题语义动态构建摘要树,有效减少冗余、提升问答效率,并分析了递归摘要技术对不同类型问题的适用性。
Retrieval-Augmented Generation (RAG) mitigates the hallucination problem of Large Language Models (LLMs) by incorporating external knowledge. Recursive summarization constructs a hierarchical summary tree by clustering text chunks, integrating information from multiple parts of a document to provide evidence for abstractive questions involving multi-step reasoning. However, summary trees often contain a large number of redundant summary nodes, which not only increase construction time but may also negatively impact question answering. Moreover, recursive summarization is not suitable for all types of questions. We introduce DTCRS, a method that dynamically generates summary trees based on document structure and query semantics. DTCRS determines whether a summary tree is necessary by analyzing the question type. It then decomposes the question and uses the embeddings of sub-questions as initial cluster centers, reducing redundant summaries while improving the relevance between summaries and the question. Our approach significantly reduces summary tree construction time and achieves substantial improvements across three QA tasks. Additionally, we investigate the applicability of recursive summarization to different question types, providing valuable insights for future research.
DTCRS:用于递归摘要的动态树构建方法 / DTCRS: Dynamic Tree Construction for Recursive Summarization
这篇论文提出了一种名为DTCRS的新方法,它能根据文档结构和问题语义动态构建摘要树,有效减少冗余、提升问答效率,并分析了递归摘要技术对不同类型问题的适用性。
源自 arXiv: 2604.07012