Events
US government orders Anthropic to suspend access to Fable 5 and Mythos 5 citing national security jailbreak concerns
The US government issued an export control directive requiring Anthropic to immediately disable Fable 5 and Mythos 5 for all foreign nationals, effectively forcing a full customer suspension to ensure compliance. The government cited awareness of a jailbreak method, but Anthropic disputes the severity, stating the demonstrated technique is a narrow, non-universal jailbreak that produces results already achievable by other publicly available models including GPT-5.5. Anthropic is complying with the directive while publicly disagreeing with the standard applied, arguing that requiring perfect jailbreak resistance would halt all frontier model deployments industry-wide. This is a significant regulatory and safety governance flashpoint involving government authority over commercial AI model access.
Anthropic launches Claude Mythos 5 and Claude Fable 5; Andrew Ng introduces OpenCoworker desktop agent
Anthropic released Claude Mythos 5 and Claude Fable 5, two variants of the same frontier model that set new state-of-the-art results across software engineering, knowledge work, cybersecurity, and agentic coding benchmarks. Claude Fable 5 is the general-availability version with safety classifiers that restrict responses on security, biology, chemistry, and cutting-edge AI topics, priced at $10/$50 per million input/output tokens; Mythos 5 is restricted to selected partners via Project Glasswing. Separately, Andrew Ng and collaborators released OpenCoworker, a free open-source desktop agent harness built on top of aisuite, designed to give users privacy-preserving agentic workflows with their own API keys or local models. The newsletter also contextualizes the broader shift toward LLM-driven agent harnesses as frontier models have become capable enough to reliably drive next-action decisions.
Anthropic releases Claude Mythos 5 and Claude Fable 5 with unprecedented capability restrictions and safety tiers
Anthropic launched Claude Mythos 5, a restricted-access model capable of cracking previously secure software, and Claude Fable 5, a general-use version with novel safety classifiers that block or degrade responses on cybersecurity, biology, chemistry, and AI-development topics. Both models set new state-of-the-art results across software engineering, agentic coding, knowledge work, and scientific reasoning benchmarks, and are priced at roughly half the cost of the prior Claude Mythos Preview. Claude Fable 5 initially included undisclosed capability degradation for AI-development prompts — applied silently via prompt modification or steering vectors — which sparked controversy before Anthropic modified the policy. The release represents a significant escalation in both frontier capability and the operational complexity of safety-tiered model deployment.
MaxProof achieves gold-medal-level performance on IMO 2025 and USAMO 2026 via population-level test-time scaling
MiniMax introduces MaxProof, a test-time scaling framework for competition-level mathematical proof built on their MiniMax-M3 model. The system trains three capabilities — proof generation, verification, and critique-conditioned repair — then at inference time runs tournament selection over a population of candidate proofs. MaxProof scores 35/42 on IMO 2025 and 36/42 on USAMO 2026, exceeding the human gold-medal threshold on both competitions.
ABC-Bench: Agentic biosecurity benchmark finds LLM agents surpass median expert humans on dual-use biology tasks
Researchers introduce ABC-Bench, a benchmark evaluating LLM agents on biosecurity-relevant biology tasks including liquid-handling robot programming, DNA fragment design, and evasion of DNA synthesis screening. All tested agents outperformed the median expert human baseline across all three tasks. Wet-lab validation confirmed that OpenAI's o4-mini-high produced scripts that successfully assembled DNA on an OpenTrons robot. The results highlight a meaningful shift in the biosecurity risk landscape as AI agents acquire practical wet-lab-adjacent capabilities.
Anthropic releases system card for Claude Fable 5 and Claude Mythos 5
Anthropic has published a system card PDF for two new models, Claude Fable 5 and Claude Mythos 5, surfaced via Hacker News with 211 points. The system card is a primary safety and capability disclosure document accompanying a model release. The naming convention suggests these are new frontier-tier models from Anthropic, distinct from the existing Claude Opus/Sonnet/Haiku naming scheme.
Anthropic releases Claude Fable 5
Anthropic has released Claude Fable 5, a new model in the Claude family, announced via their official news channel. The Hacker News discussion generated substantial engagement with 1,468 points and 1,156 comments, indicating significant community interest. No detailed capability claims or benchmark results are available from this item alone.
Goedel-Architect achieves state-of-the-art formal theorem proving with blueprint-based agentic framework
Goedel-Architect is an agentic framework for formal theorem proving in Lean 4 that uses blueprint generation — a dependency graph of definitions and lemmas — rather than recursive decomposition, enabling parallel lemma closure and global refinement. Built on DeepSeek-V4-Flash (284B-A13B), it achieves 99.2% pass@1 on MiniF2F-test and 75.6% on PutnamBench, scaling to 100% on MiniF2F, 88.8% on PutnamBench, and 4/6 on IMO 2025 when seeded with natural-language proofs. The authors claim state-of-the-art performance for an open-source pipeline at up to 500x lower cost than comparable systems.
Amazon invests up to $4 billion in Anthropic, becomes primary cloud provider
Anthropic announced a strategic investment of up to $4 billion from Amazon, with AWS becoming Anthropic's primary cloud provider for mission-critical workloads. The deal includes access to AWS Trainium and Inferentia chips for model training and deployment, expanded Claude availability on Amazon Bedrock with enterprise fine-tuning capabilities, and joint collaboration on future Trainium and Inferentia chip development. Amazon takes a minority stake while Anthropic's governance structure, including its Long Term Benefit Trust, remains unchanged.
Anthropic and AWS expand partnership with $4B investment and Trainium hardware collaboration
Anthropic announced an expanded partnership with Amazon Web Services, including a new $4 billion investment that brings Amazon's total stake to $8 billion, while establishing AWS as Anthropic's primary cloud and training partner. The collaboration includes deep hardware-software co-development on AWS Trainium accelerators, with Anthropic engineers writing low-level kernels and contributing to the AWS Neuron software stack to optimize model training from the silicon up. Claude on Amazon Bedrock is described as core infrastructure for tens of thousands of enterprises, with named deployments at Pfizer, Intuit, Perplexity, and the European Parliament. The deal also extends Claude's availability to AWS GovCloud and classified cloud regions for government customers.
Anthropic launches Claude 3 model family: Haiku, Sonnet, and Opus
Anthropic announced the Claude 3 model family on March 4, 2024, comprising three models — Haiku, Sonnet, and Opus — in ascending capability order. Claude 3 Opus claims top performance on major benchmarks including MMLU, GPQA, and GSM8K, with near-perfect recall on long-context evaluations (200K context window, 99%+ NIAH accuracy) and new multimodal vision capabilities. The release also highlights reduced unnecessary refusals, a twofold accuracy improvement over Claude 2.1, and Constitutional AI-based safety tuning. Opus and Sonnet launched immediately via claude.ai and the Claude API across 159 countries, with Haiku to follow.
Anthropic publishes Responsible Scaling Policy with AI Safety Level framework
Anthropic released its Responsible Scaling Policy (RSP), a formal framework of technical and organizational protocols for managing catastrophic risks from increasingly capable AI systems. The policy introduces AI Safety Levels (ASL-1 through ASL-5+), modeled on US biosafety level standards, requiring progressively stricter safety, security, and operational standards as models become more capable. Current Claude models are classified as ASL-2; ASL-3 triggers stricter deployment constraints including adversarial red-teaming requirements. The policy has been approved by Anthropic's board and is intended as a template for industry-wide adoption.
U.S. Department of War bans Anthropic, contracts OpenAI for classified AI systems after standoff over safety restrictions
The U.S. Department of War designated Anthropic a supply-chain risk to national security after the company refused to remove restrictions on Claude's use for domestic surveillance and autonomous weapons, effectively banning it from military and contractor use. OpenAI signed a contract allowing use of its models 'for all lawful purposes' with ambiguous carve-outs for surveillance and autonomous weapons, which Altman later called rushed and renegotiated. The standoff culminated in a Trump Truth Social post threatening civil and criminal consequences against Anthropic, followed by Hegseth's formal designation. The episode marks a significant precedent: the supply-chain risk designation, previously applied only to foreign companies, was used against a U.S. AI lab over its own usage policies.
Anthropic Frontier Red Team reports early-warning signs of rapid AI progress in cybersecurity and biosecurity capabilities
Anthropic's Frontier Red Team published findings from a year of safety evaluations across four model releases, documenting rapid capability gains in dual-use domains. In cybersecurity, Claude 3.7 Sonnet now solves roughly a third of Cybench CTF challenges (up from ~5% a year ago), and with the Incalmo toolset was able to replicate a large-scale network attack in realistic cyber range environments. In biosecurity, Claude has moved from underperforming virology experts to exceeding them on the VCT benchmark within one year, and exceeds human expert baselines on cloning workflows. Anthropic assesses current models as showing 'early warning' signs but not yet crossing thresholds of substantially elevated national security risk.
Anthropic expands Claude context window from 9K to 100K tokens
Anthropic announced a roughly 10x expansion of Claude's context window, from 9K to 100K tokens (~75,000 words), available via API. The capability enables processing of hundreds of pages of documents, full codebases, or hours of transcribed audio in under a minute. Anthropic positions this as superior to vector search for complex multi-document synthesis tasks, and partner AssemblyAI demonstrated the feature on a 58K-word podcast transcript.
GPT-5.4 released with tool search, computer use, and frontier benchmark performance
OpenAI released GPT-5.4 in Thinking and Pro variants, featuring an expanded context window (up to 1.05M input tokens), native computer use, tool search capabilities, and adjustable reasoning levels. In independent testing by Artificial Analysis, GPT-5.4 Pro at xhigh reasoning achieved state-of-the-art on GDP-Val-AA, BrowseComp, Terminal-Bench-Hard, SWE-Bench-Pro, and MCP Atlas, while trailing Gemini 3.1 Pro Preview on MMMU-Pro and Humanity's Last Exam. Pricing is set at the top of the market ($30/$180 per million input/output tokens for Pro), and the release also powers Codex, OpenAI's competitor to Claude Code. The item is reported via The Batch (tier 2 commentary) and includes additional context on Andrew Ng's chub CLI tool for agent documentation sharing.
Anthropic introduces computer use capability, upgraded Claude 3.5 Sonnet, and Claude 3.5 Haiku
Anthropic announced three major developments: an upgraded Claude 3.5 Sonnet with significant coding improvements (SWE-bench Verified rising from 33.4% to 49.0%, surpassing all publicly available models including reasoning models), a new Claude 3.5 Haiku that matches Claude 3 Opus performance at Haiku-tier speed, and a public beta of 'computer use' — a capability allowing Claude to control computers by viewing screens, moving cursors, clicking, and typing. Computer use is available via the Anthropic API, Amazon Bedrock, and Google Cloud Vertex AI, with early adopters including Replit, The Browser Company, and Cognition. Both safety institutes (US AISI and UK AISI) conducted pre-deployment testing, and the model was assessed as remaining within ASL-2 under Anthropic's Responsible Scaling Policy.
Anthropic maps 832 AI-enabled cyberattacks, finds MITRE ATT&CK framework inadequate for agentic threats
Anthropic's Frontier Red Team analyzed 832 accounts banned for malicious cyber activity between March 2025 and March 2026, mapping their techniques against the MITRE ATT&CK framework. Key findings: medium-or-higher-risk actors grew from 33% to 56% across the study period; AI use is shifting from initial-access techniques toward post-compromise operations like lateral movement and privilege escalation; and traditional risk signals (technique count, platform used) no longer reliably distinguish threat levels. The report concludes that MITRE ATT&CK lacks coverage for agentic orchestration behaviors—where AI chains attack stages autonomously with minimal human input—which characterize the highest-risk actors, including a state-sponsored espionage operation disrupted in November 2025.
Iran strikes AWS data centers in Middle East; Claude/Palantir MSS used in U.S.-Iran war targeting
Iranian drone strikes damaged at least three AWS data centers in Bahrain and the UAE in early March 2026, disrupting cloud services across the region and threatening billions in Gulf AI infrastructure investment. The attacks coincided with revelations that Anthropic's Claude, integrated with Palantir's Maven Smart System, was used to accelerate U.S. military targeting in Iran — reportedly compressing a 12-hour targeting process to under one minute and helping select over 1,000 targets in the first 24 hours of operations. A subsequent investigation found U.S. forces likely struck a school killing 170+ people, with stale target data potentially a contributing factor. The episode marks a significant escalation in AI-enabled warfare and the first known targeting of commercial cloud infrastructure during active conflict.
Anthropic activates ASL-3 safety protections for Claude Opus 4 launch
Anthropic has activated its AI Safety Level 3 (ASL-3) Deployment and Security Standards in conjunction with launching Claude Opus 4, marking the first time any Anthropic model has been deployed under ASL-3 rather than the baseline ASL-2. The activation is described as precautionary: Anthropic has not conclusively determined that Opus 4 crosses the ASL-3 capability threshold, but cannot rule it out due to continued improvements in CBRN-related knowledge. ASL-3 measures include Constitutional Classifiers to block end-to-end CBRN weapon development workflows and enhanced model-weight security against sophisticated non-state attackers. Claude Sonnet 4 was evaluated and cleared for ASL-2, and ASL-4 was ruled out for Opus 4.
Anthropic expands Project Glasswing to 150 new organizations across critical infrastructure sectors
Anthropic is expanding Project Glasswing, its AI-assisted cybersecurity initiative, from ~50 initial partners to approximately 150 additional organizations spanning power, water, healthcare, communications, and hardware sectors across 15+ countries. Partners use Claude Mythos Preview to scan codebases for vulnerabilities, with the initial cohort already identifying more than 10,000 high- or critical-severity security flaws. Anthropic also announced Claude Security, a product using Claude Opus 4.8 for codebase scanning and patch suggestions, and is releasing internal vulnerability-finding tools to trusted security teams. The company warns that Mythos-class cyber capabilities will be widely available within 6–12 months and frames Project Glasswing as a proactive effort to help defenders adapt before that threshold is reached.
Anthropic raises Series E at $61.5B post-money valuation
Anthropic has closed a $3.5 billion Series E round at a $61.5 billion post-money valuation, led by Lightspeed Venture Partners with participation from Bessemer, Cisco, Fidelity, General Catalyst, Salesforce Ventures, and others. Proceeds will fund next-generation AI system development, expanded compute capacity, mechanistic interpretability and alignment research, and international expansion. The raise follows the launch of Claude 3.7 Sonnet and Claude Code, with Anthropic citing strong enterprise adoption across customers including Cursor, Zoom, Snowflake, Pfizer, and Amazon's Alexa+.
OpenAI and Amazon Partner to Build Stateful Runtime Environment for AI Agents on AWS
OpenAI and Amazon Web Services announced a partnership to build a stateful runtime environment for AI agents, designed to manage agent working states including memories, tool connections, and user permissions, running on Amazon Bedrock. The deal includes a $15 billion Amazon investment in OpenAI (with up to $35 billion more contingent on conditions), a $100 billion expansion of compute commitments using Amazon Trainium chips over 8 years, and makes AWS the exclusive third-party cloud provider for OpenAI Frontier. The arrangement exploits a legal distinction between stateful runtime environments and stateless APIs, allowing OpenAI to work with AWS while Microsoft retains exclusive rights to host OpenAI's stateless API calls. This marks a significant loosening of OpenAI's exclusive cloud relationship with Microsoft, mirroring a parallel diversification trend with Anthropic across cloud providers.
Introducing Claude 3.5 Sonnet
Anthropic launches Claude 3.5 Sonnet, the first model in its Claude 3.5 family, claiming it outperforms Claude 3 Opus and competitor models on GPQA, MMLU, and HumanEval benchmarks while operating at twice the speed and mid-tier pricing ($3/$15 per million tokens). The model features a 200K context window, improved vision capabilities, and an internal agentic coding evaluation score of 64% versus 38% for Opus. Alongside the model, Anthropic introduces Artifacts on Claude.ai, a dedicated workspace for real-time editing of AI-generated content. The model was pre-deployment evaluated by the UK AI Safety Institute and assessed at ASL-2.
Anthropic Releases Computer Use Capability for Claude 3.5 Sonnet
Anthropic has launched a public beta of computer use for Claude 3.5 Sonnet, enabling the model to control a computer by interpreting screenshots and issuing pixel-level cursor and keyboard commands. The model achieves 14.9% on the OSWorld benchmark, roughly double the next-best AI model's 7.7%, though well below human-level performance of 70-75%. Anthropic trained the model on a small set of simple software tools and found it generalized rapidly to broader computer interaction. Safety analysis confirmed the capability remains at AI Safety Level 2, with prompt injection identified as a primary near-term risk.