当仅存最终文本时:面向多智能体归因的隐式执行追踪 / When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution
1️⃣ 一句话总结
这篇论文提出了一种名为IET的隐式执行追踪技术,它能在不依赖外部日志的情况下,仅通过分析最终生成的文本内容,就准确识别出多智能体系统中每个智能体的贡献以及它们之间的协作流程,从而解决系统出错时的责任归属问题。
When a multi-agent system produces an incorrect or harmful answer, who is accountable if execution logs and agent identifiers are unavailable? Multi-agent language systems increasingly rely on structured interactions such as delegation and iterative refinement, yet the final output often obscures the underlying interaction topology and agent contributions. We introduce IET (Implicit Execution Tracing), a metadata-independent framework that enables token-level attribution directly from generated text and a simple mechanism for interaction topology reconstruction. During generation, agent-specific keyed signals are embedded into the token distribution, transforming the text into a self-describing execution trace detectable only with a secret key. At detection time, a transition-aware scoring method identifies agent handover points and reconstructs the interaction graph. Experiments show that IET recovers agent segments and coordination structure with high accuracy while preserving generation quality, enabling privacy-preserving auditing for multi-agent language systems.
当仅存最终文本时:面向多智能体归因的隐式执行追踪 / When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution
这篇论文提出了一种名为IET的隐式执行追踪技术,它能在不依赖外部日志的情况下,仅通过分析最终生成的文本内容,就准确识别出多智能体系统中每个智能体的贡献以及它们之间的协作流程,从而解决系统出错时的责任归属问题。
源自 arXiv: 2603.17445