菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents

Markdown skill libraries for LLM agents ship as free-form prose, forcing the agent to re-derive both the input schema and the concrete invocation syntax on every retrieval. We observe that this often produces a "confused -> re-retrieve -> still confused" loop in which the agent issues a partially-correct action, receives uninformative environment feedback, and re-retrieves the same prose. We propose Skill-as-Pseudocode (SaP), an automatic conversion of markdown skill libraries into typed pseudocode with deterministic quality control. For each cluster of similar procedural passages drawn from one or more skills, SaP extracts a typed contract and filters it through a four-check deterministic verifier (coverage, binding, replacement, risk). Promoted contracts are inlined into a rewritten skill skeleton together with restored concrete action templates, giving the agent two complementary signals: a typed signature for what the skill does and a concrete template for how to invoke it. On the 134-game ALFWorld unseen split with gpt-4o-mini, pooled across three seeds, SaP wins 82/402 paired games versus 47/402 for the Graph-of-Skills (GoS) baseline (pooled McNemar p = 8.2e-5), at -22.8 +/- 6.4% input tokens and -14.5 +/- 4.1% LLM calls per game.

顶级标签: agents llm
详细标签: skill libraries pseudocode conversion evaluation alfworld 或 搜索:

技能即伪代码:为LLM智能体重构技能库为伪代码形式 / Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents


1️⃣ 一句话总结

该论文提出一种名为“技能即伪代码”(SaP)的方法,自动将LLM智能体使用的自由文本技能库转换为带有类型标注的伪代码,从而避免智能体反复读取模糊描述而陷入“困惑-重读-仍困惑”的低效循环,在ALFWorld游戏任务中显著提升了任务成功率,同时减少了输入资源和调用次数。

源自 arXiv: 2605.27955