菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-07-01
📄 Abstract - Exploring the Semantic Gap in Agentic Data Systems: A Formative Study of Operationalization Failures in Analytical Workflows

Large language models (LLMs) are increasingly used to generate queries, invoke tools, and construct analytical workflows. Although recent advances have substantially improved workflow generation and execution, the semantic information required to operationalize analytical concepts often lies beyond what is explicitly represented in database schemas and data values. We present a cross-domain formative study of operationalization failures in agent-generated analytical workflows. Across 236 analytical intents spanning finance, human resources, and public safety domains, we identify 153 recurring failures despite successful workflow generation and execution. Our analysis reveals five recurring classes of failures: comparative grounding, process reasoning, quantitative reasoning, role confusion, and policy grounding. These findings suggest a semantic gap between user-level analytical concepts and the information available to workflow-generation systems. More broadly, they raise questions about the admissibility of analytical operations and suggest that future agentic data systems may require richer semantic representations to bridge the gap between analytical intent and executable computation.

顶级标签: llm agents data
详细标签: semantic gap workflow generation analytical workflows operationalization failures llm agents 或 搜索:

探索智能数据系统中的语义鸿沟:分析工作流中操作化失败的形成性研究 / Exploring the Semantic Gap in Agentic Data Systems: A Formative Study of Operationalization Failures in Analytical Workflows


1️⃣ 一句话总结

该研究通过分析金融、人力资源和公共安全领域的236个分析意图,发现即使智能系统能成功生成并执行工作流,仍会出现153种反复出现的操作化失败,揭示了用户分析概念与系统可用信息之间存在语义鸿沟,并提出未来系统需要更丰富的语义表示来弥合这一差距。

源自 arXiv: 2607.00828