菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-15
📄 Abstract - Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Enabling Large Language Models (LLMs) to effectively utilize tools in multi-turn interactions is essential for building capable autonomous agents. However, acquiring diverse and realistic multi-turn tool-use data remains a significant challenge. In this work, we propose a novel text-based paradigm. We observe that textual corpora naturally contain rich, multi-step problem-solving experiences, which can serve as an untapped, scalable, and authentic data source for multi-turn tool-use tasks. Based on this insight, we introduce GEM, a data synthesis pipeline that enables the generation and extraction of multi-turn tool-use trajectories from text corpora through a four-stage process: relevance filtering, workflow & tool extraction, trajectory grounding, and complexity refinement. To reduce the computational cost, we further train a specialized Trajectory Synthesizer via supervised fine-tuning. This model distills the complex generation pipeline into an efficient, end-to-end trajectory generator. Experiments demonstrate that our GEM-32B achieve a 16.5% improvement on the BFCL V3 Multi-turn benchmark. Our models partially surpass the performance of models trained on {\tau} - bench (Airline and Retail) in-domain data, highlighting the superior generalization capability derived from our text-based synthesis paradigm. Notably, our Trajectory Synthesizer matches the quality of the full pipeline while significantly reducing inference latency and costs.

顶级标签: llm agents data
详细标签: tool usage data synthesis multi-turn interaction trajectory generation benchmark 或 搜索:

解锁隐性经验:从文本中合成工具使用轨迹 / Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text


1️⃣ 一句话总结

这篇论文提出了一种名为GEM的新方法,能够从普通的文本资料中自动提取和生成大型语言模型学习使用工具所需的多轮对话数据,从而有效提升了模型使用工具解决问题的能力,并且比传统方法成本更低、效果更好。

源自 arXiv: 2601.10355