菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-10
📄 Abstract - Context-Driven Incremental Compression for Multi-Turn Dialogue Generation

Modern conversational agents condition on an ever-growing dialogue history at each turn, incurring redundant attention and encoding costs that grow with conversation length. Naive truncation or summarization degrades fidelity, while existing context compressors lack cross-turn memory sharing or revision, causing information loss and compounding errors in long dialogues. We revisit the context compression under conversational dynamics and empirically present its fragility. To improve both efficiency and robustness, we introduce Context-Driven Incremental Compression (C-DIC), which treats a conversation as interleaved contextual threads and stores revisable per-thread compression states in a single, compact dialogue memory. At each turn, a lightweight retrieve, revise, and write-back loop shares information across turns and updates stale memories, stabilizing long-horizon behavior. In addition, we adapt truncated backpropagation-through-time (TBPTT) to our multi-turn setting, learning cross-turn dependencies without full-history backpropagation. Extensive experiments on long-form dialogue benchmarks demonstrate superior performance and efficiency of C-DIC; notably, C-DIC shows stable inference latency and perplexity over hundreds of dialogue turns, supporting a scalable path to high-quality dialogue modeling.

顶级标签: llm natural language processing systems
详细标签: context compression dialogue generation long-form dialogue multi-turn conversation incremental memory 或 搜索:

上下文驱动的增量压缩:面向多轮对话生成 / Context-Driven Incremental Compression for Multi-Turn Dialogue Generation


1️⃣ 一句话总结

本文提出一种名为C-DIC的新方法,通过将对话拆分成可独立更新的上下文线索并存储在紧凑的记忆中,让AI在长对话中只处理必要的新信息,大幅提升效率并避免遗忘,从而精准生成后续回复。

源自 arXiv: 2606.12411