5Simon Willison's Weblog·20h ago

Simon Willison uses DSPy to evaluate and optimize Datasette Agent SQL system prompts

Simon Willison documents an experiment using DSPy to systematically evaluate and improve the SQL system prompts used by Datasette Agent. The post covers applying DSPy's prompt optimization framework to a real-world agentic tool, demonstrating a practical workflow for automated prompt engineering. This is a hands-on practitioner account of using DSPy for prompt evaluation in a production-adjacent context.

Evaluation and Benchmarking Agent and Tool Ecosystem DSPy Simon Willison Datasette

Related guides (3)

Simon Willison

Simon Willison: The Practitioner's Guide to the AI Landscape

Read asBeginner In-depth

Evaluation and BenchmarkingTopic guide

AI Evaluation and Benchmarking: From Leaderboards to the Limits of Measurement

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Related events (8)

4Simon Willison'S Weblog·1mo ago·source ↗

Datasette Agent

Simon Willison describes a Datasette Agent, an AI agent built on top of the Datasette data exploration tool. The post appears to demonstrate an agent capable of querying and reasoning over SQLite databases via natural language. This represents a practical deployment of LLM-powered tooling for data analysis workflows.

Agent and Tool Ecosystem SQLite Simon Willison Datasette +1 more

4Simon Willison'S Weblog·25d ago·source ↗

Simon Willison releases datasette-agent-edit 0.1a0

Simon Willison published an early alpha release (0.1a0) of datasette-agent-edit, a new plugin or tool for the Datasette ecosystem. The release appears to add agent-based editing capabilities to Datasette, a tool for exploring and publishing data. As an alpha release, details are sparse but it signals active development at the intersection of AI agents and data tooling.

Agent and Tool Ecosystem Simon Willison Datasette Datasette Agent

3Simon Willison'S Weblog·22d ago·source ↗

Simon Willison releases datasette-agent 0.2a0

Simon Willison published datasette-agent 0.2a0, an early alpha release of an agent tool built around Datasette. The release is part of the growing ecosystem of AI agent tooling for data exploration and querying. Minimal detail is available from the body, but the versioning indicates active development.

Agent and Tool Ecosystem Simon Willison Datasette Datasette Agent

5arXiv · cs.LG·10d ago·source ↗

MAS-PromptBench: Systematic study of prompt optimization in multi-agent LLM systems

A new arXiv preprint introduces MAS-PromptBench, a benchmark and study examining when and how much system-prompt optimization improves multi-agent LLM systems (MAS). The authors evaluate two prompt optimizers across diverse MAS configurations varying in task, workflow, communication protocol, and team size. Results show prompt optimization can unlock significant gains but also expose open challenges, particularly around the exponentially growing search space as agent count increases.

Evaluation and Benchmarking Agent and Tool Ecosystem MAS-PromptBench

3Simon Willison'S Weblog·17d ago·source ↗

Simon Willison releases datasette-agent 0.3a0

Simon Willison published datasette-agent 0.3a0, an alpha release of an agent tooling package for Datasette. The release appears to be a tooling update in the agent/data exploration space. Datasette is an open-source tool for exploring and publishing data, and this agent layer extends it with AI-driven capabilities.

Agent and Tool Ecosystem Simon Willison Datasette Datasette Agent

3Simon Willison'S Weblog·1mo ago·source ↗

datasette-agent 0.1a4

Simon Willison releases datasette-agent 0.1a4, an early alpha of an AI agent plugin for Datasette, the open-source data exploration tool. The release represents ongoing development of agentic tooling that allows LLMs to interact with SQLite databases through the Datasette interface. As an alpha release from a prominent AI tooling developer, it signals continued growth in the agent-tool ecosystem for data querying use cases.

Agent and Tool Ecosystem Simon Willison Datasette Datasette Agent

3Simon Willison'S Weblog·1mo ago·source ↗

datasette-agent 0.1a3

Simon Willison releases datasette-agent 0.1a3, an early alpha of an AI agent plugin for Datasette, the open-source data exploration tool. The release represents ongoing development of agentic tooling that allows LLMs to interact with SQLite databases through Datasette's interface. As an alpha release from a prominent AI tooling developer, it signals continued growth in the agent-tool ecosystem for data workflows.

Agent and Tool Ecosystem Simon Willison Datasette Datasette Agent

6Simon Willison'S Weblog·6d ago·source ↗

Simon Willison reports on prompt injection and adversarial attacks after 2,000 people tried to hack his AI assistant

Simon Willison documents the results of a public experiment in which approximately 2,000 people attempted to compromise or manipulate his personal AI assistant. The post covers the attack patterns observed, what succeeded or failed, and lessons learned about prompt injection and adversarial robustness in deployed AI systems. This is a practical, first-hand account of real-world AI security challenges from a respected practitioner.

AI Safety Research Enterprise Deployment Patterns Simon Willison