菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-26
📄 Abstract - AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents

The token-level extractive compressors widely used for general LM context are structurally inappropriate for LLM agents: across 17 (env, backbone, method) cells spanning two independent token-level method families, every cell collapses to mean reward <= 0.05 despite 1.3-13.3x realized compression. We name and characterize this failure mode as action-grammar destruction -- the tokens carrying action semantics (identifiers, brackets, action verbs) are exactly those self-information ranks lowest, so a general-purpose compressor reliably removes them and the environment rejects the residual. The diagnosis points to step-granularity compression. We introduce AGORA, an inference-free step-level compressor combining a structural prompt parser, an always-keep floor for format- and recency-critical content, and a 125M-parameter relevance scorer trained on counterfactual next-action-change labels (~2ms/step, zero per-step LLM toll). Across the compared inference-free and LLM-based methods, AGORA is the only one retaining >= 75% uncompressed performance in 8 of 9 cells (with the lone exception at 73%); a four-way component ablation isolates the structural floor as the dominant quality lever and the learned scorer as the source of 1.0-11.5x adaptive end-to-end compression from a single fixed keep ratio.

顶级标签: llm agents
详细标签: prompt compression action-grammar destruction token-level extractor step-level compressor relevance scoring 或 搜索:

AGORA:面向LLM智能体的无需推理的提示压缩方法——基于适配器基础的观察-动作保留机制 / AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents


1️⃣ 一句话总结

本文指出,现有的通用型提示压缩方法会破坏智能体动作中的关键语法(如标识符、括号等),导致其性能崩溃,因此提出了一种无需额外推理、按步骤压缩的新方法AGORA,通过保留关键格式和最近信息,并利用一个小型评分模型智能取舍内容,从而在几乎不损失性能的前提下实现高效压缩。

源自 arXiv: 2605.26596