TA-Mem:面向大语言模型长期对话问答的工具增强自主记忆检索 / TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA
1️⃣ 一句话总结
这篇论文提出了一种名为TA-Mem的新框架,通过让大语言模型像自主使用工具一样灵活检索记忆,有效解决了其在长对话中因记忆有限而难以进行长期推理的问题,并在测试中显著超越了现有方法。
Large Language Model (LLM) has exhibited strong reasoning ability in text-based contexts across various domains, yet the limitation of context window poses challenges for the model on long-range inference tasks and necessitates a memory storage system. While many current storage approaches have been proposed with episodic notes and graph representations of memory, retrieval methods still primarily rely on predefined workflows or static similarity top-k over embeddings. To address this inflexibility, we introduced a novel tool-augmented autonomous memory retrieval framework (TA-Mem), which contains: (1) a memory extraction LLM agent which is prompted to adaptively chuck an input into sub-context based on semantic correlation, and extract information into structured notes, (2) a multi-indexed memory database designed for different types of query methods including both key-based lookup and similarity-based retrieval, (3) a tool-augmented memory retrieval agent which explores the memory autonomously by selecting appropriate tools provided by the database based on the user input, and decides whether to proceed to the next iteration or finalizing the response after reasoning on the fetched memories. The TA-Mem is evaluated on the LoCoMo dataset, achieving significant performance improvements over existing baseline approaches. In addition, an analysis of tool use across different question types also demonstrates the adaptivity of the proposed method.
TA-Mem:面向大语言模型长期对话问答的工具增强自主记忆检索 / TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA
这篇论文提出了一种名为TA-Mem的新框架,通过让大语言模型像自主使用工具一样灵活检索记忆,有效解决了其在长对话中因记忆有限而难以进行长期推理的问题,并在测试中显著超越了现有方法。
源自 arXiv: 2603.09297