Almanac
model

Claude 3 Sonnet

modelactiveprovisionalclaude-3-sonnet-773e6c40·8 events·first seen 15d ago

Aliases: Claude 3 Sonnet

Co-occurring entities

More like this (12)

Recent events (8)

9Anthropic News·13d ago·source ↗

Anthropic launches Claude 3 model family: Haiku, Sonnet, and Opus

Anthropic announced the Claude 3 model family on March 4, 2024, comprising three models — Haiku, Sonnet, and Opus — in ascending capability order. Claude 3 Opus claims top performance on major benchmarks including MMLU, GPQA, and GSM8K, with near-perfect recall on long-context evaluations (200K context window, 99%+ NIAH accuracy) and new multimodal vision capabilities. The release also highlights reduced unnecessary refusals, a twofold accuracy improvement over Claude 2.1, and Constitutional AI-based safety tuning. Opus and Sonnet launched immediately via claude.ai and the Claude API across 159 countries, with Haiku to follow.

7Anthropic News·13d ago·source ↗

Anthropic demonstrates feature steering in Claude 3 Sonnet via interpretability research

Anthropic released a 24-hour public demo called 'Golden Gate Claude' to illustrate findings from a major interpretability paper on Claude 3 Sonnet. The research identifies millions of internal 'features' — neuron combinations that activate for specific concepts — and shows these can be surgically amplified or suppressed to alter model behavior without prompting or fine-tuning. The Golden Gate Bridge feature was amplified as a demonstration, causing the model to reference the bridge in nearly all responses. Anthropic argues this mechanistic control over internal activations has direct implications for AI safety, including the ability to modulate safety-relevant features like those tied to deception or dangerous code.

5Anthropic News·13d ago·source ↗

Claude 3 Haiku and Sonnet reach general availability on Google Cloud Vertex AI

Anthropic announced general availability of Claude 3 Haiku and Claude 3 Sonnet on Google Cloud's Vertex AI platform, with Claude 3 Opus to follow in coming weeks. The deployment gives enterprise customers access to Claude models within their existing Google Cloud environment, with associated data governance and security benefits. Quora's Poe app is cited as an early adopter, reporting millions of daily messages exchanged via Claude-based bots.

7Anthropic News·13d ago·source ↗

Anthropic makes Claude 3 Haiku and Sonnet available to US Intelligence Community and AWS GovCloud

Anthropic has made Claude 3 Haiku and Claude 3 Sonnet available via AWS Marketplace for the US Intelligence Community and AWS GovCloud, marking a significant expansion into government deployment. The company has crafted contractual exceptions to its general Usage Policy to permit legally authorized foreign intelligence analysis, including combating human trafficking and identifying covert influence campaigns, while maintaining restrictions on disinformation, weapons design, and malicious cyber operations. The deployment is currently limited to ASL-2 models under Anthropic's Responsible Scaling Policy. Anthropic also notes prior pre-release access to Claude 3.5 Sonnet was provided to the UK AI Safety Institute for pre-deployment testing.

7The Batch·4d ago·source ↗

Study finds state media in training data causes LLMs to reflect government propaganda in native languages

Researchers from University of Oregon, Purdue, UCSD, NYU, and Princeton found that state-controlled media is heavily overrepresented in web-scraped training datasets, causing Claude 3 Sonnet and GPT-4o to express significantly more favorable attitudes toward authoritarian governments when prompted in those governments' native languages. Chinese state media accounts for over 40x more documents in CulturaX than Chinese Wikipedia, and both models reproduced state-media strings at 3-5% rates. When prompted in Chinese, both models favored China's government roughly 68-75% of the time versus English prompts on the same topics, with the effect scaling with a country's World Press Freedom Index ranking.

8Anthropic News·15d ago·source ↗

Introducing Claude 3.5 Sonnet

Anthropic launches Claude 3.5 Sonnet, the first model in its Claude 3.5 family, claiming it outperforms Claude 3 Opus and competitor models on GPQA, MMLU, and HumanEval benchmarks while operating at twice the speed and mid-tier pricing ($3/$15 per million tokens). The model features a 200K context window, improved vision capabilities, and an internal agentic coding evaluation score of 64% versus 38% for Opus. Alongside the model, Anthropic introduces Artifacts on Claude.ai, a dedicated workspace for real-time editing of AI-generated content. The model was pre-deployment evaluated by the UK AI Safety Institute and assessed at ASL-2.

3Anthropic News·13d ago·source ↗

Anthropic launches Claude in Canada with full product suite

Anthropic expanded Claude's availability to Canada as of June 5, 2024, offering access to Claude.ai, the iOS app, the API, and the Team plan. Canadian users can subscribe to Claude Pro at CA$28/month for access to the Claude 3 model family (Opus, Sonnet, Haiku) with 5x usage limits. The expansion is a geographic rollout with no new technical capabilities announced.

6Anthropic News·12d ago·source ↗

Anthropic releases Claude 3 Haiku, fastest and most affordable model in the Claude 3 family

Anthropic released Claude 3 Haiku, the fastest and most cost-efficient model in the Claude 3 lineup, processing 21K tokens per second for prompts under 32K tokens. The model is positioned for enterprise workloads requiring high throughput and low cost, with pricing enabling analysis of 400 Supreme Court cases or 2,500 images for one dollar. Haiku is available via the Claude API, Claude Pro on claude.ai, and Amazon Bedrock, with Google Cloud Vertex AI support forthcoming.