菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-01-13
📄 Abstract - User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

The recent paradigm shift toward large reasoning models (LRMs) as autonomous agents has intensified the demand for sophisticated, multi-turn tool-use capabilities. Yet, existing datasets and data-generation approaches are limited by static, predefined toolsets that cannot scale to the complexity of open-ended human-agent collaboration. To address this, we initially developed a framework for automated task-oriented multi-turn dialogue generation at scale, utilizing an LRM-based simulator to dynamically generate high-value, domain-specific tools to solve specified tasks. However, we observe that a purely task-oriented design often results in "solely task-solving" trajectories, where the agent completes the objective with minimal interaction, failing to generate the high turn-count conversations seen in realistic scenarios. To bridge this gap, we shift toward a user-oriented simulation paradigm. By decoupling task generation from a dedicated user simulator that mimics human behavioral rules - such as incremental request-making and turn-by-turn feedback - we facilitate more authentic, extended multi-turn dialogues that reflect the iterative nature of real-world problem solving. Our generation pipeline operates as a versatile, plug-and-play module capable of initiating generation from any state, ensuring high scalability in producing extended tool-use data. Furthermore, by facilitating multiple task completions within a single trajectory, it yields a high-density dataset that reflects the multifaceted demands of real-world human-agent interaction.

顶级标签: llm agents systems
详细标签: tool use dialogue generation multi-turn interaction user simulation data generation 或 搜索:

面向用户的大规模多轮对话生成与工具使用 / User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale


1️⃣ 一句话总结

这篇论文提出了一种新的方法,通过模拟人类用户逐步提出请求和反馈的行为,来生成更真实、回合数更多、工具使用更复杂的多轮对话数据,以解决现有AI助手在开放式人机协作中互动过于简单直接的问题。

源自 arXiv: 2601.08225