菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-05
📄 Abstract - TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains

We introduce TRACE, a cross-domain engineering framework for trustworthy agentic AI in operationally critical domains. TRACE combines a four-layer reference architecture with an explicit classical-ML vs. LLM-validator split (L2a/L2b), a stateful orchestration-and-escalation policy (L3), and bounded human supervision (L4); a metrologically grounded trust-metric suite mapped to GUM/VIM/ISO 17025; and a Model-Parsimony principle quantified by the Computational Parsimony Ratio (CPR). Three instantiations--clinical decision support, industrial multi-domain operations, and a judicial AI assistant--transfer the samearchitecture and metrics across principally different governance contexts. The L2a/L2b separation makes the use of large language models a deliberate design decision rather than an architectural default, with parsimony quantified through CPR. TRACE introduces CPR as a first-class design principle in trustworthy-AI engineering.

顶级标签: llm systems multi-agents
详细标签: trustworthy ai reference architecture metrology parsimony critical domains 或 搜索:

TRACE:面向关键操作领域可信代理式AI系统的计量学工程框架 / TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains


1️⃣ 一句话总结

本文提出了一个名为TRACE的工程框架,通过将经典机器学习与大语言模型明确分离、引入可量化的信任度量和模型精简原则(CPR),为医疗、工业和司法等高风险领域设计可信赖的自主AI系统提供了统一、可迁移的架构与评估方法。

源自 arXiv: 2605.03838