菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-17
📄 Abstract - Data Intelligence Agents: Interpreting, Modeling, and Querying Enterprise Data via Autonomous Coding Agents

Production data integration is bottlenecked by repeated, lossy handoffs between data owners, engineers, and analysts who must collaboratively discover, structure, and query enterprise data. We present Data Intelligence Agents (DIA), a system of three agents (Data Interpreter, Schema Creator, and Query Generator) that compresses this workflow by treating autonomous coding agents (ACAs) as a first-class abstraction: rather than emitting text, the agents generate, execute, validate, and repair concrete artifacts, draw on a shared memory for experience reuse, and surface each for review by domain experts. DIA is deployed in production for enterprise customers. We study the Query Generator in depth and evaluate it in fully autonomous mode across seven SQL benchmarks spanning four task categories and four dialects. It matches or surpasses the best published results on all seven, demonstrating that an architecture grounded in execution, built on ACAs and a shared memory, generalizes across the data intelligence workload with adaptation confined to natural-language instructions.

顶级标签: agents system data
详细标签: autonomous coding agents enterprise data sql benchmark shared memory workflow compression 或 搜索:

数据智能体:通过自主编码智能体解释、建模和查询企业数据 / Data Intelligence Agents: Interpreting, Modeling, and Querying Enterprise Data via Autonomous Coding Agents


1️⃣ 一句话总结

本文提出了一种由三个自主编码智能体(数据解释器、模式创建器和查询生成器)组成的系统,通过让智能体自动生成、执行和验证代码来简化企业数据集成流程,并在七项SQL基准测试中取得最优或持平结果,证明了基于执行和共享记忆的架构能有效泛化处理各类数据智能任务。

源自 arXiv: 2606.19319