菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-08
📄 Abstract - HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues

Long-term memory is critical for dialogue systems that support continuous, sustainable, and personalized interactions. However, existing methods rely on continuous summarization or OpenIE-based graph construction paired with fixed Top-\textit{k} retrieval, leading to limited adaptability across query categories and high computational overhead. In this paper, we propose HingeMem, a boundary-guided long-term memory that operationalizes event segmentation theory to build an interpretable indexing interface via boundary-triggered hyperedges over four elements: person, time, location, and topic. When any such element changes, HingeMem draws a boundary and writes the current segment, thereby reducing redundant operations and preserving salient context. To enable robust and efficient retrieval under diverse information needs, HingeMem introduces query-adaptive retrieval mechanisms that jointly decide (a) \textit{what to retrieve}: determine the query-conditioned routing over the element-indexed memory; (b) \textit{how much to retrieve}: control the retrieval depth based on the estimated query type. Extensive experiments across LLM scales (from 0.6B to production-tier models; \textit{e.g.}, Qwen3-0.6B to Qwen-Flash) on LOCOMO show that HingeMem achieves approximately $20\%$ relative improvement over strong baselines without query categories specification, while reducing computational cost (68\%$\downarrow$ question answering token cost compared to HippoRAG2). Beyond advancing memory modeling, HingeMem's adaptive retrieval makes it a strong fit for web applications requiring efficient and trustworthy memory over extended interactions.

顶级标签: llm agents natural language processing
详细标签: long-term memory dialogue systems retrieval mechanisms event segmentation query-adaptive routing 或 搜索:

HingeMem:基于边界引导的长时记忆与查询自适应检索机制,用于可扩展对话系统 / HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues


1️⃣ 一句话总结

这篇论文提出了一种名为HingeMem的新型对话系统长时记忆框架,它通过检测对话中人物、时间、地点和话题的变化来智能划分记忆片段,并能够根据用户查询动态决定检索哪些记忆以及检索多少内容,从而在显著提升对话质量的同时大幅降低了计算成本。

源自 arXiv: 2604.06845