菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-30
📄 Abstract - Trace-Level Analysis of Information Contamination in Multi-Agent Systems

Reasoning over heterogeneous artifacts (PDFs, spreadsheets, slide decks, etc.) increasingly occurs within structured agent workflows that iteratively extract, transform, and reference external information. In these workflows, uncertainty is not merely an input-quality issue: it can redirect decomposition and routing decisions, reshape intermediate state, and produce qualitatively different execution trajectories. We study this phenomenon by treating uncertainty as a controlled variable: we inject structured perturbations into artifact-derived representations, execute fixed workflows under comprehensive logging, and quantify contamination via trace divergence in plans, tool invocations, and intermediate state. Across 614 paired runs on 32 GAIA tasks with three different language models, we find a decoupling: workflows may diverge substantially yet recover correct answers, or remain structurally similar while producing incorrect outputs. We characterize three manifestation types: silent semantic corruption, behavioral detours with recovery, and combined structural disruption and their control-flow signatures (rerouting, extended execution, early termination). We measure operational costs and characterize why commonly used verification guardrails fail to intercept contamination. We contribute (i) a formal taxonomy of contamination manifestations in structured workflows, (ii) a trace-based measurement framework for detecting and localizing contamination across agent interactions, and (iii) empirical evidence with implications for targeted verification, defensive design, and cost control.

顶级标签: agents system evaluation
详细标签: multi-agent information contamination trace analysis workflow robustness uncertainty injection 或 搜索:

多智能体系统中信息污染的痕迹级分析 / Trace-Level Analysis of Information Contamination in Multi-Agent Systems


1️⃣ 一句话总结

本文通过向多智能体工作流中的人工制品(如PDF、表格)注入可控不确定性,系统研究了信息如何“污染”智能体的决策轨迹,发现即使工作流路径严重偏离也能得到正确答案,反之亦然,并据此提出了三种污染类型、一套痕迹测量框架及其对验证防御设计的启示。

源自 arXiv: 2604.27586