METRO:面向非协作对话的专家对话文本策略归纳 / METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues
1️⃣ 一句话总结
这篇论文提出了一种名为METRO的新方法,它利用大语言模型直接从专家对话记录中自动学习策略动作和规划逻辑,从而以低成本、可扩展的方式构建非协作对话智能体,并在实验中取得了优于现有方法的效果。
Developing non-collaborative dialogue agents traditionally requires the manual, unscalable codification of expert strategies. We propose \ours, a method that leverages large language models to autonomously induce both strategy actions and planning logic directly from raw transcripts. METRO formalizes expert knowledge into a Strategy Forest, a hierarchical structure that captures both short-term responses (nodes) and long-term strategic foresight (branches). Experimental results across two benchmarks show that METRO demonstrates promising performance, outperforming existing methods by an average of 9%-10%. Our further analysis not only reveals the success behind METRO (strategic behavioral diversity and foresight), but also demonstrates its robust cross-task transferability. This offers new insights into building non-collaborative agents in a cost-effective and scalable way. Our code is available at this https URL.
METRO:面向非协作对话的专家对话文本策略归纳 / METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues
这篇论文提出了一种名为METRO的新方法,它利用大语言模型直接从专家对话记录中自动学习策略动作和规划逻辑,从而以低成本、可扩展的方式构建非协作对话智能体,并在实验中取得了优于现有方法的效果。
源自 arXiv: 2604.11427