一个用于解释多变量生理时间序列的多智能体框架 / A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series
1️⃣ 一句话总结
这篇论文提出了一个名为Vivaldi的多智能体系统,用于解释复杂的生理监测数据,并通过专家评估发现,智能体协作的价值在于有选择地外部化计算和结构,而非一味追求复杂的推理,这对于医疗等安全关键领域可解释AI的设计具有重要参考意义。
Continuous physiological monitoring is central to emergency care, yet deploying trustworthy AI is challenging. While LLMs can translate complex physiological signals into clinical narratives, it is unclear how agentic systems perform relative to zero-shot inference. To address these questions, we present Vivaldi, a role-structured multi-agent system that explains multivariate physiological time series. Due to regulatory constraints that preclude live deployment, we instantiate Vivaldi in a controlled, clinical pilot to a small, highly qualified cohort of emergency medicine experts, whose evaluations reveal a context-dependent picture that contrasts with prevailing assumptions that agentic reasoning uniformly improves performance. Our experiments show that agentic pipelines substantially benefit non-thinking and medically fine-tuned models, improving expert-rated explanation justification and relevance by +6.9 and +9.7 points, respectively. Contrarily, for thinking models, agentic orchestration often degrades explanation quality, including a 14-point drop in relevance, while improving diagnostic precision (ESI F1 +3.6). We also find that explicit tool-based computation is decisive for codifiable clinical metrics, whereas subjective targets, such as pain scores and length of stay, show limited or inconsistent changes. Expert evaluation further indicates that gains in clinical utility depend on visualization conventions, with medically specialized models achieving the most favorable trade-offs between utility and clarity. Together, these findings show that the value of agentic AI lies in the selective externalization of computation and structure rather than in maximal reasoning complexity, and highlight concrete design trade-offs and learned lessons, broadly applicable to explainable AI in safety-critical healthcare settings.
一个用于解释多变量生理时间序列的多智能体框架 / A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series
这篇论文提出了一个名为Vivaldi的多智能体系统,用于解释复杂的生理监测数据,并通过专家评估发现,智能体协作的价值在于有选择地外部化计算和结构,而非一味追求复杂的推理,这对于医疗等安全关键领域可解释AI的设计具有重要参考意义。
源自 arXiv: 2603.04142