Almanac
← Events
5Simon Willison's Weblog·8h ago

Simon Willison covers Claude Sonnet 5 release

Simon Willison published a commentary post on Claude Sonnet 5, covering what is new in the release. The post is from a tier-2 source and represents secondary analysis of Anthropic's model update. The body content was not provided, so specific claims cannot be assessed, but the subject is a notable mid-tier model release from Anthropic.

Related guides (3)

Related events (8)

7Hacker News·8h ago·source ↗

Anthropic releases Claude Sonnet 5

Anthropic has released Claude Sonnet 5, a new mid-tier model in their Claude lineup. The announcement comes via the official Anthropic news page and generated significant community engagement on Hacker News with 714 points and 386 comments. As a new named model release from a frontier lab, this is a notable update to the Claude model family.

5Simon Willison'S Weblog·21d ago·source ↗

Simon Willison's initial impressions of Claude Fable 5

Simon Willison shares initial impressions of Claude Fable 5, a new Anthropic model. The body of the post is not available in the provided content, but the title indicates a hands-on evaluation or commentary from a prominent AI practitioner. As a tier-2 commentary source on what appears to be a new frontier model release, this is worth indexing for the model tracking thread.

8Anthropic News·29d ago·source ↗

Anthropic Releases Claude Sonnet 4.6 with 1M Token Context, Improved Computer Use, and Coding Capabilities

Anthropic has released Claude Sonnet 4.6, positioned as a major upgrade over Sonnet 4.5 with improvements across coding, computer use, long-context reasoning, and agent planning. The model features a 1M token context window in beta and is now the default on claude.ai Free and Pro plans at unchanged pricing ($3/$15 per million tokens). Notably, users preferred Sonnet 4.6 over the prior Opus 4.5 frontier model 59% of the time in coding tasks, and the model shows significant gains on OSWorld computer-use benchmarks alongside improved prompt injection resistance. Safety evaluations found no major alignment concerns and rated it as safe or safer than prior Claude models.

9Anthropic News·29d ago·source ↗

Anthropic Releases Claude Sonnet 4.5: Top Coding and Computer-Use Model with Agent SDK

Anthropic has released Claude Sonnet 4.5, claiming it is the best coding model and strongest model for building complex agents, with a 61.4% score on OSWorld (up from 42.2% for Sonnet 4) and state-of-the-art performance on SWE-bench Verified. The release is accompanied by major product upgrades including checkpoints in Claude Code, a native VS Code extension, a Claude Agent SDK giving developers access to the same infrastructure powering Claude Code, and new context editing and memory tools in the Claude API. Pricing is unchanged from Sonnet 4 at $3/$15 per million input/output tokens. Early enterprise customers including Cursor, GitHub Copilot, Devin, Canva, and Figma report significant gains in coding, agentic, and long-context tasks.

6Latent Space·2h ago·source ↗

AINews: Claude Sonnet 5 release and Fable 5 preview coverage

Latent Space's AINews digest covers the release of Claude Sonnet 5 and previews Fable 5, suggesting both are significant near-term developments in the AI landscape. The newsletter aggregates community and industry signals around these releases. The brief body ('Everything is open again!') suggests a theme around open-weights or open-access model availability.

8Anthropic News·28d ago·source ↗

Introducing Claude 3.5 Sonnet

Anthropic launches Claude 3.5 Sonnet, the first model in its Claude 3.5 family, claiming it outperforms Claude 3 Opus and competitor models on GPQA, MMLU, and HumanEval benchmarks while operating at twice the speed and mid-tier pricing ($3/$15 per million tokens). The model features a 200K context window, improved vision capabilities, and an internal agentic coding evaluation score of 64% versus 38% for Opus. Alongside the model, Anthropic introduces Artifacts on Claude.ai, a dedicated workspace for real-time editing of AI-generated content. The model was pre-deployment evaluated by the UK AI Safety Institute and assessed at ASL-2.

4Simon Willison'S Weblog·1mo ago·source ↗

Claude Opus 4.8: "a modest but tangible improvement"

Simon Willison offers commentary on Claude Opus 4.8, characterizing it as a modest but tangible improvement over its predecessor. The post appears to be a brief evaluation or first-impressions piece from a well-known developer and AI commentator. No detailed benchmark data or technical specifics are visible in the provided body text.

5Don'T Worry About The Vase·28d ago·source ↗

Zvi Mowshowitz analyzes Claude Opus 4.8 capabilities and community reactions

Zvi Mowshowitz (Don't Worry About the Vase) publishes a roundup and analysis of Claude Opus 4.8, aggregating capability observations and community reactions to the new model. The post synthesizes multiple data points to characterize the model's strengths and weaknesses. This is a secondary commentary piece following what appears to be a recent Anthropic model release.