7Anthropic News·16d ago

Anthropic launches Claude 2 with 100K context window and improved coding, reasoning, and safety

Anthropic released Claude 2, featuring a 100K token context window, improved performance on coding (71.2% on Codex HumanEval, up from 56.0%), math (88.0% on GSM8k), and legal reasoning (76.5% on the Bar exam multiple choice section). The model is available via API at the same price as Claude 1.3 and through a new public beta at claude.ai for US and UK users. Safety improvements include a 2x reduction in harmful outputs on internal red-team evaluations compared to Claude 1.3. Early API partners include Jasper and Sourcegraph.

Long Context Evolution Frontier Model Releases AI Safety Research claude.ai Claude Sourcegraph Codex HumanEval Claude 3.5 Jasper Cody GSM8K Anthropic

Related guides (3)

Claude

Claude: Anthropic's AI Assistant Built for Safety and Scale

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Long Context EvolutionTopic guide

Long Context Evolution: From Bigger Windows to Smarter Memory

Read asBeginner In-depth

Related events (8)

9Anthropic News·19d ago·source ↗

Claude Opus 4.6 Released with 1M Token Context, Agentic Coding Advances, and State-of-the-Art Benchmarks

Anthropic has released Claude Opus 4.6, its most capable model to date, featuring a 1M token context window in beta, improved agentic coding and planning capabilities, and adaptive thinking with developer-controlled effort levels. The model claims top scores on Terminal-Bench 2.0, Humanity's Last Exam, GDPval-AA, and BrowseComp, outperforming OpenAI's GPT-5.2 by 144 Elo points on GDPval-AA. New product features include agent teams in Claude Code, context compaction for long-running tasks, and Claude in PowerPoint (research preview). Pricing remains unchanged at $5/$25 per million input/output tokens.

Long Context Evolution Frontier Model Releases GPT-5.2 Claude Opus 4.6 adaptive thinking +13 more

7Anthropic News·16d ago·source ↗

Anthropic releases Claude 2.1 with 200K context window, reduced hallucinations, and tool use beta

Anthropic released Claude 2.1, featuring an industry-first 200,000-token context window (roughly 500 pages), a claimed 2x reduction in hallucination rates versus Claude 2.0, and a new beta tool-use capability allowing Claude to orchestrate across developer-defined APIs and functions. The release also introduces system prompts and a revamped developer Workbench console. Claude 2.1 is available via API and powers claude.ai for both free and Pro tiers, with the 200K context window reserved for Pro users.

Long Context Evolution Frontier Model Releases claude.ai Claude Workbench +2 more

8Anthropic News·19d ago·source ↗

Anthropic Releases Claude Sonnet 4.6 with 1M Token Context, Improved Computer Use, and Coding Capabilities

Anthropic has released Claude Sonnet 4.6, positioned as a major upgrade over Sonnet 4.5 with improvements across coding, computer use, long-context reasoning, and agent planning. The model features a 1M token context window in beta and is now the default on claude.ai Free and Pro plans at unchanged pricing ($3/$15 per million tokens). Notably, users preferred Sonnet 4.6 over the prior Opus 4.5 frontier model 59% of the time in coding tasks, and the model shows significant gains on OSWorld computer-use benchmarks alongside improved prompt injection resistance. Safety evaluations found no major alignment concerns and rated it as safe or safer than prior Claude models.

Long Context Evolution Frontier Model Releases claude.ai Claude Sonnet 4 Claude Opus 4.6 +11 more

8Anthropic News·18d ago·source ↗

Introducing Claude 3.5 Sonnet

Anthropic launches Claude 3.5 Sonnet, the first model in its Claude 3.5 family, claiming it outperforms Claude 3 Opus and competitor models on GPQA, MMLU, and HumanEval benchmarks while operating at twice the speed and mid-tier pricing ($3/$15 per million tokens). The model features a 200K context window, improved vision capabilities, and an internal agentic coding evaluation score of 64% versus 38% for Opus. Alongside the model, Anthropic introduces Artifacts on Claude.ai, a dedicated workspace for real-time editing of AI-generated content. The model was pre-deployment evaluated by the UK AI Safety Institute and assessed at ASL-2.

Long Context Evolution Frontier Model Releases claude.ai Thorn Amazon Bedrock +16 more

5Anthropic News·17d ago·source ↗

Anthropic releases Claude Instant 1.2 with improved math, coding, and safety

Anthropic released Claude Instant 1.2, an updated version of its faster, lower-cost model tier, now available via API. The release incorporates capabilities from Claude 2 and shows measurable benchmark gains: 58.7% on Codex (vs 52.8% for 1.1) and 86.7% on GSM8K (vs 80.9% for 1.1). Safety improvements include reduced hallucination and greater jailbreak resistance as measured by automated red-teaming.

Frontier Model Releases Inference Economics Claude Codex GSM8K +2 more

9Anthropic News·19d ago·source ↗

Anthropic Releases Claude Opus 4.5 with State-of-the-Art Coding, Agent, and Computer Use Capabilities

Anthropic has released Claude Opus 4.5, positioning it as the best model in the world for coding, agentic workflows, and computer use, with pricing reduced to $5/$25 per million input/output tokens. The model demonstrates significant token efficiency gains—up to 65% fewer tokens than prior models on equivalent tasks—alongside improvements in long-horizon autonomous task execution, multi-step reasoning, and self-improving agent behavior. The release is accompanied by updates to Claude Code, the Claude Developer Platform, and integrations with Excel, Chrome, and desktop environments. Early partner feedback from GitHub Copilot, Cursor, Notion, Warp, and others reports measurable benchmark improvements and new use cases previously out of reach.

Frontier Model Releases Evaluation and Benchmarking Notion Claude Opus 4.6 Lovable +12 more

7Anthropic News·17d ago·source ↗

Anthropic launches Claude publicly with two model tiers after closed alpha

Anthropic announced the public launch of Claude on March 14, 2023, following a closed alpha with partners including Notion, Quora, and DuckDuckGo. The release introduced two model variants — Claude (high-performance) and Claude Instant (lighter and faster) — accessible via chat interface and API. Early partners reported Claude produced fewer harmful outputs and was more steerable than competing models, with deployments spanning education, legal tech, productivity, and search.

Frontier Model Releases Enterprise Deployment Patterns Quora Notion Poe +7 more

9Anthropic News·19d ago·source ↗

Claude 3.7 Sonnet and Claude Code: Anthropic's First Hybrid Reasoning Model and Agentic Coding Tool

Anthropic has released Claude 3.7 Sonnet, described as their most capable model to date and the first hybrid reasoning model on the market, capable of operating in both standard and extended thinking modes within a single unified model. The model achieves state-of-the-art results on SWE-bench Verified and TAU-bench, with particular strength in coding and front-end web development. Alongside the model, Anthropic is launching Claude Code in limited research preview, a command-line agentic coding tool that can read/edit files, run tests, and push to GitHub. Pricing remains unchanged at $3/M input and $15/M output tokens, with availability across Claude.ai plans, Amazon Bedrock, and Google Cloud Vertex AI.

Frontier Model Releases Evaluation and Benchmarking Canva Amazon Bedrock GitHub +14 more