菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-03
📄 Abstract - AIP: A Graph Representation for Learning and Governing Agent Skills

Agent Skills today consist largely of free-form prose requiring the agent to read, interpret, and re-derive how to act in every session. This imposes two compounding costs: reduced reliability on implementation-heavy tasks, and difficulty in skill creation and improvement, since editing prose is a fragile process that both humans and agents struggle with, particularly for domain-specific procedural knowledge underrepresented in model training. The Agent Instruction Protocol (AIP) addresses both by modeling a skill as a directed execution graph: discrete steps as nodes backed by deterministic scripts or natural-language descriptions, connected by explicit typed input/output edges, and governed by a schema-validated YAML specification. A compiler meta-skill translates existing human-written skills into this form. The benefits are twofold. First, compiling human-written skills to AIP raised Claude Sonnet's mean task reward from 0.60 to 0.71 and pass rate from 53% to 67% across 27 real agent tasks from SkillsBench - a statistically significant gain (Wilcoxon signed-rank p = 0.011), winning 12 tasks to 2 with 13 ties - often in less wall-clock time. The graph delivers vetted, runnable units to the agent rather than asking it to re-derive code, commands, and tool calls from natural language. Second, on creation and improvement, because each skill is schema-validated, functionally testable, and addressable node-by-node, failures can be diagnosed and repaired precisely. Two authored-skill failures were traced to the script level. After adjusting the AIP spec and recompiling, both recovered with zero regressions (one task going from 0/5 to 5/5), turning skill improvement into a measurable tuning loop rather than a prose rewrite. That same graph structure supports corpus-level governance and skill introspection, and provides a natural action space for reinforcement learning over skills.

顶级标签: agents llm
详细标签: skill representation execution graph agent instruction skill compilation evaluation 或 搜索:

AIP:一种用于学习和管理智能体技能的图形化表示方法 / AIP: A Graph Representation for Learning and Governing Agent Skills


1️⃣ 一句话总结

本文提出一种名为AIP的智能体技能表示方法,将传统自由文本的技能描述转换为结构化的有向执行图,每个步骤由确定性脚本或自然语言节点及明确的输入输出边构成,实验表明该方法能显著提升智能体在真实任务中的成功率(从53%到67%),并使技能调试和改进变成可量化的迭代过程。

源自 arXiv: 2606.04781