菜单

🤖 系统
📄 Abstract - Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces

Large Language Models (LLMs) face fundamental challenges in long-context reasoning: many documents exceed their finite context windows, while performance on texts that do fit degrades with sequence length, necessitating their augmentation with external memory frameworks. Current solutions, which have evolved from retrieval using semantic embeddings to more sophisticated structured knowledge graphs representations for improved sense-making and associativity, are tailored for fact-based retrieval and fail to build the space-time-anchored narrative representations required for tracking entities through episodic events. To bridge this gap, we propose the \textbf{Generative Semantic Workspace} (GSW), a neuro-inspired generative memory framework that builds structured, interpretable representations of evolving situations, enabling LLMs to reason over evolving roles, actions, and spatiotemporal contexts. Our framework comprises an \textit{Operator}, which maps incoming observations to intermediate semantic structures, and a \textit{Reconciler}, which integrates these into a persistent workspace that enforces temporal, spatial, and logical coherence. On the Episodic Memory Benchmark (EpBench) \cite{huet_episodic_2025} comprising corpora ranging from 100k to 1M tokens in length, GSW outperforms existing RAG based baselines by up to \textbf{20\%}. Furthermore, GSW is highly efficient, reducing query-time context tokens by \textbf{51\%} compared to the next most token-efficient baseline, reducing inference time costs considerably. More broadly, GSW offers a concrete blueprint for endowing LLMs with human-like episodic memory, paving the way for more capable agents that can reason over long horizons.

顶级标签: llm agents systems
详细标签: episodic memory retrieval-augmented generation long-context reasoning memory frameworks semantic workspaces 或 搜索:

📄 论文总结

超越事实检索:基于生成式语义工作区的RAG情景记忆 / Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces


1️⃣ 一句话总结

这项研究提出了一种名为‘生成式语义工作区’的新型记忆框架,通过模拟人类情景记忆来帮助大型语言模型理解和推理长文本中随时间、空间演变的事件关系,相比现有技术显著提升了长文本处理性能并降低了计算成本。


📄 打开原文 PDF