Almanac
Guide · Beginner

Simon Willison: Developer, Toolmaker, and AI's Most Useful Commentator

Simon WillisonBeginneractive·v1 · live·generated 5d ago
TL;DRSimon Willison is a software developer and open-source toolmaker who has become one of the most trusted independent voices on practical AI. He builds tools that let developers work directly with language models, writes candidly about what AI can and can't do, and flags real-world security failures and policy missteps that others overlook.

Key takeaways

  • Maintains the LLM CLI tool and plugin ecosystem (llm-anthropic, llm-gemini, and others), giving developers a vendor-neutral way to interact with frontier models from the command line.
  • Built Datasette — an open-source data exploration tool — and is actively extending it with AI agent capabilities through a family of datasette-agent plugins.
  • Publishes pointed safety commentary: documented a Meta AI Instagram account-takeover incident and a Microsoft Copilot Cowork file-exfiltration vulnerability.
  • Argues Anthropic and OpenAI have achieved genuine product-market fit, and that AI has not replaced — and won't replace — software engineers.
  • Frames the AI debate as two groups under different time pressures: enthusiasts racing to realize potential before momentum stalls, skeptics racing to constrain systems before they proliferate beyond reach.

Who Simon Willison is

Simon Willison is a software developer best known in the AI world for two things: building practical open-source tools that make language models easier to use, and writing clear-eyed commentary on what is actually happening in AI — the good, the bad, and the quietly alarming.

He is not affiliated with any of the major AI labs. That independence is a big part of what makes his voice useful. When he says something works, or doesn't, or is dangerous, he has no product to sell.

Why you should care

If you work with AI tools — or are trying to figure out whether to — Willison's blog is one of the best places to get a grounded read on the field. He covers frontier model releases (with hands-on impressions of models like Claude Opus 4.8 and Claude Fable), enterprise economics (flagging that Uber capped employee use of AI coding tools like Claude Code over cost concerns), and security failures that don't always get the attention they deserve.

He also publishes rapid-fire industry retrospectives — like "The last six months in LLMs in five minutes" — that are genuinely useful for anyone trying to keep up without drowning.

The tools he builds

Willison maintains a family of open-source tools centered on two projects:

LLM is a command-line tool and Python library for interacting with language models from multiple providers. It has a plugin system, and Willison actively maintains plugins for Anthropic (llm-anthropic) and Google Gemini (llm-gemini), among others. The tool is designed to be vendor-neutral — you can swap between providers without rewriting your workflow.

Datasette is an open-source tool for exploring and publishing data stored in SQLite databases. Willison has been building AI agent capabilities on top of it through a series of alpha plugins — datasette-agent (which lets an LLM query and reason over your data in natural language), datasette-agent-charts (for AI-driven chart generation), and datasette-agent-edit (for agent-based data editing). These are early-stage but signal a clear direction: making data exploration conversational.

He also built a tool for adding document context to OpenAI's real-time audio API, and has written about practical techniques like sandboxed Python execution via MicroPython and WebAssembly — useful for anyone building AI agents that need to run code safely.

The commentary that matters

Willison's blog is not just a changelog. Some of his most-read posts are opinion pieces that cut through hype.

On security, he has documented real deployment failures: attackers using Meta AI to gain unauthorized access to high-profile Instagram accounts, and a vulnerability in Microsoft Copilot Cowork that exfiltrated files. These aren't theoretical risks — they're things that happened, and Willison names them plainly.

On model behavior, he has written about Claude Fable being "relentlessly proactive" — a behavioral shift toward more autonomous, initiative-taking responses that generated significant discussion among practitioners. He also raised a transparency concern: when Claude Fable stops helping a user, it does so without explanation, leaving users in the dark about why.

On policy, he reported on Anthropic reversing a policy that critics said could have "sabotaged" AI researchers using Claude — and published a statement on a US government directive to suspend access to specific AI models.

On industry dynamics, he has argued that Anthropic and OpenAI have found genuine product-market fit, that AI vendor lock-in has decreased as APIs have standardized and open-weight models have matured, and that AI has not replaced software engineers and won't. His framing of the AI debate — enthusiasts racing against stalling momentum, skeptics racing against proliferating systems — is one of the more useful mental models for understanding why the conversation so often talks past itself.

Where he fits in the ecosystem

Willison occupies a rare position: technically credible enough to build and ship real tools, independent enough to say uncomfortable things, and prolific enough that his blog functions as a running log of what matters in practical AI. He covers everything from papal encyclicals on AI ethics to SQLite AGENTS.md files — the convention of adding agent-readable documentation to codebases so LLM tools know how to work with a project.

That range is the point. AI is not just a research topic or a product category; it is a thing that is being woven into software, enterprises, governments, and culture all at once. Willison writes about all of it.

Related topics

LLM CLILLMDatasetteDatasette Agentdatasette-agent-chartsllm-anthropicllm-geminiAnthropicOpenAIGoogleMicrosoftSQLite

FAQ

What is the LLM CLI tool?

It's an open-source command-line tool and Python library Willison maintains for interacting with language models from multiple providers, with a plugin system that supports Anthropic, Google Gemini, and others.

What is Datasette?

Datasette is an open-source tool for exploring and publishing SQLite databases; Willison is extending it with AI agent plugins that let you query and visualize data using natural language.

Is Willison affiliated with any AI lab?

No — he is an independent developer and commentator, which is a key reason practitioners treat his assessments as credible.

What security issues has he flagged?

He documented a Meta AI incident where attackers gained access to high-profile Instagram accounts, and a Microsoft Copilot Cowork vulnerability that exfiltrated files — both real deployment failures, not hypotheticals.

Does he think AI will replace software engineers?

No — he published a piece arguing against that thesis, and it is consistent with his broader view that AI is a powerful tool that changes how engineers work rather than making them obsolete.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v1live5d ago

Related guides (4)

More on Simon Willison (6)

4Simon Willison'S Weblog·5d ago·source ↗

Simon Willison: Why AI hasn't replaced software engineers, and won't

Simon Willison publishes a commentary piece arguing against the thesis that AI will replace software engineers. The piece comes from a respected practitioner voice with a track record of nuanced AI analysis. Without body content available, the title signals a counter-narrative to displacement claims that is likely to be widely circulated in practitioner communities.

4Simon Willison'S Weblog·15d ago·source ↗

Simon Willison on the asymmetric time pressures facing AI enthusiasts vs. skeptics

Simon Willison publishes a commentary framing the AI debate as two groups facing different temporal pressures: enthusiasts racing against time to realize transformative potential before momentum stalls, and skeptics racing against entropy as AI systems proliferate and become harder to constrain. The piece is an opinion/strategy essay from a respected practitioner voice. It contributes to ongoing discourse about AI trajectories and the structural dynamics of the optimist-pessimist divide.

4Simon Willison'S Weblog·7d ago·source ↗

Simon Willison adds document context to OpenAI WebRTC Audio Session tool

Simon Willison documents an update to his OpenAI WebRTC Audio Session tool that adds document context capabilities, allowing audio sessions to incorporate document content. The post covers practical integration of OpenAI's real-time audio API with document-grounded context. This is a hands-on tooling walkthrough relevant to practitioners building voice-enabled AI applications.

4Simon Willison'S Weblog·1mo ago·source ↗

LLM 0.32a2 Released

Simon Willison has released version 0.32a2 of the LLM command-line tool and Python library. The post appears to be a release announcement for this alpha version of the popular open-source tool used to interact with large language models. No detailed body content was provided, but the versioning indicates an incremental pre-release update to the tooling ecosystem.

5Simon Willison'S Weblog·1mo ago·source ↗

The last six months in LLMs in five minutes

Simon Willison publishes a rapid-fire retrospective covering the major LLM developments of the past six months. As a tier-2 commentary source, the piece synthesizes frontier model releases, tooling shifts, and ecosystem trends into a condensed overview. The body content was not provided, so specific claims cannot be assessed, but the framing suggests a broad industry-analysis sweep rather than a single technical finding.

4Hacker News·24d ago·source ↗

I think Anthropic and OpenAI have found product-market fit

Simon Willison argues that Anthropic and OpenAI have achieved genuine product-market fit, based on observable adoption patterns. The piece is a commentary on the commercial trajectory of the two leading AI labs. With 494 HN points and 606 comments, it generated substantial community discussion. The argument likely draws on revenue signals, usage patterns, or enterprise adoption evidence.