6arXiv cs.AI (Artificial Intelligence)·2d ago

Data Intelligence Agents (DIA): Autonomous coding agents for enterprise data integration and SQL generation

Researchers present Data Intelligence Agents (DIA), a production-deployed system of three autonomous coding agents (Data Interpreter, Schema Creator, Query Generator) that automate enterprise data integration workflows. Rather than generating text, the agents produce, execute, validate, and repair concrete artifacts (code, schemas, SQL) with shared memory for experience reuse. The Query Generator is evaluated across seven SQL benchmarks spanning four dialects and task categories, matching or surpassing best published results on all seven. The system is deployed in production for enterprise customers, making it a notable applied research contribution.

Evaluation and Benchmarking Enterprise Deployment Patterns Agent and Tool Ecosystem Data Intelligence Agents: Interpreting, Modeling, and Querying Enterprise Data via Autonomous Coding Agents Data Intelligence Agents

Related guides (3)

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: How We Measure AI — and Why It Keeps Getting Harder

Read asBeginner In-depth

Related events (8)

4Simon Willison'S Weblog·1mo ago·source ↗

Datasette Agent

Simon Willison describes a Datasette Agent, an AI agent built on top of the Datasette data exploration tool. The post appears to demonstrate an agent capable of querying and reasoning over SQLite databases via natural language. This represents a practical deployment of LLM-powered tooling for data analysis workflows.

Agent and Tool Ecosystem SQLite Simon Willison Datasette +1 more

3Github Trending·28d ago·source ↗

Databricks AI Dev Kit: Toolkit for Coding Agents

Databricks Field Engineering has published an open-source Python toolkit on GitHub aimed at coding agents, accumulating 1,555 stars. The repository appears to provide utilities and scaffolding for building AI-powered development agents on the Databricks platform. It is a community/field engineering project rather than an official product release.

Enterprise Deployment Patterns Agent and Tool Ecosystem Databricks databricks-solutions/ai-dev-kit

5Hugging Face Blog·1mo ago·source ↗

DABStep: Data Agent Benchmark for Multi-step Reasoning

Hugging Face introduces DABStep, a benchmark designed to evaluate data agents on multi-step reasoning tasks. The benchmark targets agentic systems that must perform complex, sequential data operations rather than single-step queries. It aims to fill a gap in evaluation tooling for realistic data analysis workflows involving tool use and chained reasoning.

Evaluation and Benchmarking Agent and Tool Ecosystem DABStep Hugging Face

5Github Trending·3d ago·source ↗

Microsoft RD-Agent: automated AI-driven R&D for data and model development

Microsoft has released RD-Agent, an open-source Python framework aimed at automating high-value R&D processes in AI, with a focus on data and model development. The project positions AI as the driver of data-driven AI workflows, targeting industrial productivity use cases. With 13,500 GitHub stars, it has attracted meaningful community interest, and a technical report is available.

Enterprise Deployment Patterns Agent and Tool Ecosystem Microsoft RD-Agent

7Openai Blog·1mo ago·source ↗

Inside OpenAI's In-House Data Agent

OpenAI describes the architecture and capabilities of an internal AI data agent built on GPT-5 and Codex, designed to reason over large datasets and return reliable analytical insights within minutes. The system incorporates memory components to handle complex, multi-step data queries at scale. This represents a concrete internal deployment of frontier models in an agentic, tool-using workflow. The post offers a rare look at how OpenAI itself operationalizes its own models for enterprise-style data analysis.

Frontier Model Releases Inference Economics OpenAI OpenAI Data Agent Codex +3 more

5arXiv · cs.CL·10d ago·source ↗

Data2Story: Multi-agent framework for end-to-end data journalism with verifiable claims

Researchers introduce Data2Story (Data Journalist Agent), a multi-agent framework that orchestrates specialized roles to transform raw data into multimodal news articles. A key innovation is an Inspector module that grounds every claim back to data, code, or external references, enabling verifiability. The system also generates interactive multimodal outputs (maps, audio) rather than static text and charts. Evaluation across 18 articles with 53 human participants shows competitive quality versus expert-written pieces, with particular strength in transparency and auditability, though human journalists retain an edge in editorial angle and creative design.

Agent and Tool Ecosystem Multimodal Progress Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Data Journalist Agent

5Latent Space·1mo ago·source ↗

Giving Agents Computers — Ivan Burazin, Daytona

Latent Space interviews Daytona CEO Ivan Burazin about the company's infrastructure for giving AI agents secure compute environments. The discussion covers Daytona's bare metal sandbox architecture, 850K daily runs, 74% month-over-month growth, and their approach to RL-based evaluations for agent workloads. The piece positions Daytona as part of an emerging 'agent cloud' category providing isolated execution environments for autonomous AI systems.

Training Infrastructure Inference Economics Daytona Ivan Burazin bare metal sandboxes +3 more

4Github Trending·1mo ago·source ↗

K-Dense-AI/scientific-agent-skills: Ready-to-Use Agent Skills Library for Research and Engineering

A Python repository providing a collection of pre-built agent skills targeting research, science, engineering, analysis, finance, and writing tasks. The project has accumulated 24,087 stars with a notable single-day gain of 762 stars, indicating significant community traction. No detailed technical documentation is available from the snippet, but the scope suggests a modular agent tooling library.

Agent and Tool Ecosystem K-Dense-AI/scientific-agent-skills K-Dense-AI