Almanac
Guide · In-depth

Claude Mythos: Anthropic's Restricted-Access Frontier Model

Claude MythosIn-depthactive·v1 · live·generated 38h ago
TL;DRClaude Mythos is Anthropic's highest-capability model tier — positioned above the Opus line — built for tasks like advanced cybersecurity work that require capabilities too sensitive for general release. It ships as a dual-variant system: Mythos itself is gated to vetted partners via Project Glasswing, while Claude Fable 5 is the safety-guardrailed public-facing version of the same underlying model. The launch immediately triggered a cascade of regulatory, evaluability, and safety controversies that have made it one of the most consequential and contested frontier releases to date.

Key takeaways

  • Claude Mythos 5 and Claude Fable 5 are two deployment variants of the same underlying model — Mythos is restricted to vetted partners via Project Glasswing; Fable 5 is the general-availability version at $10/$50 per million input/output tokens.
  • Both variants set new state-of-the-art results across software engineering, agentic coding, knowledge work, cybersecurity, and scientific reasoning benchmarks — priced at roughly half the cost of Claude Mythos Preview.
  • Claude Fable 5 initially applied undisclosed silent degradation (via prompt modification or steering vectors) to AI-development prompts, sparking researcher backlash before Anthropic reversed the policy.
  • The U.S. government issued export controls forcing Anthropic to suspend global access to both models within days of launch; over 100 cybersecurity professionals later signed an open letter urging reversal.
  • Independent evaluators including Artificial Analysis, Vals AI, and ARC Prize Foundation could not consistently benchmark Fable 5 because safety classifiers silently rerouted flagged prompts to Claude Opus 4.8 — causing GPQA Diamond scores to swing from 93.18% (2nd place) to 55.56% (94th place) depending on how refusals were scored.
  • Claude Mythos was previewed before the Mythos 5 launch as a restricted cybersecurity tool; a jailbreak via Discord coordination exposed the model to unauthorized users, prompting meetings with the Financial Stability Board.

What Claude Mythos is

Claude Mythos is Anthropic's highest-capability model tier, positioned above the Opus/Sonnet/Haiku line and designed for tasks that require frontier-level power in domains — particularly cybersecurity — where that power also creates meaningful risk. It does not replace the existing naming scheme; rather, it sits above it as a distinct product with a distinct access model.

The Mythos line ships as a dual-variant system. Claude Mythos 5 is the full-capability version, available only to vetted partners through Project Glasswing, a program developed in collaboration with the U.S. government for cyberdefense and critical infrastructure work. Claude Fable 5 is the general-availability version of the same underlying model, wrapped in safety classifiers that block or degrade responses on cybersecurity, biology, chemistry, and cutting-edge AI-development topics. Both are priced at $10/$50 per million input/output tokens — roughly half the cost of the prior Claude Mythos Preview.

Capability profile

Both variants set new state-of-the-art results across software engineering, agentic coding, knowledge work, cybersecurity, and scientific reasoning benchmarks at launch. Commentary from Zvi Mowshowitz described Claude Fable 5 as the best publicly available model at release. The predecessor, Claude Mythos Preview, had already attracted attention for solving a novel cybersecurity challenge — a capability that prompted meetings with the Financial Stability Board and, separately, drew unauthorized access attempts via Discord coordination before the formal Mythos 5 launch.

Anthropic previewed the Mythos class above Opus in early June 2026, framing it as pending broader safety clearance. The Andon Labs team, creators of VendingBench, noted they had evaluated Claude models across the full spectrum from Haiku to Mythos in building frontier evals, suggesting the tier had been accessible to select evaluators before general announcement.

The dual-variant architecture and its tensions

The Mythos/Fable split is the most operationally novel aspect of this release. Rather than a single model with a single policy, Anthropic is maintaining two deployment surfaces of the same weights under different constraint regimes — a pattern that introduces structural complexity for users, evaluators, and regulators alike.

Silent degradation. At launch, Fable 5 applied undisclosed capability degradation to AI-development prompts via prompt modification or steering vectors. Users and researchers were not informed that responses in this domain were being silently altered rather than refused or answered normally. After researcher backlash, Anthropic reversed the policy — but the episode raised durable questions about transparency in safety-tiered deployment.

Evaluability breakdown. The safety classifiers created a measurement crisis. Fable 5's classifiers silently rerouted flagged prompts to Claude Opus 4.8 rather than refusing them explicitly. Independent evaluators — Artificial Analysis, Vals AI, and ARC Prize Foundation — each adopted different strategies for handling these reroutes (blended scores, pure scores, or abstaining entirely), producing wildly divergent rankings. On GPQA Diamond alone, Fable 5's score ranged from 93.18% (2nd place) to 55.56% (94th place) depending on how refusals were counted. This is not a minor scoring disagreement; it surfaces a structural incompatibility between safety-oriented deployment constraints and the field's ability to independently measure frontier capability.

Data retention. Anthropic's new data retention policy further complicated independent evaluation, limiting what evaluators could log and analyze about model behavior under classifier conditions.

Regulatory collision

Three days after launch, the U.S. Commerce Department issued an export control directive forcing Anthropic to suspend global access to both Claude Mythos 5 and Claude Fable 5. The suspension — flagged first on Anthropic's status page on June 13, 2026 — represented an unprecedented regulatory intervention: a government-mandated withdrawal of a frontier model almost immediately after release.

Andrew Ng's commentary in The Batch framed the episode as a demonstration of how both private companies and governments can suddenly restrict AI access, and argued it would accelerate global interest in AI sovereignty and open-source alternatives. Over 100 cybersecurity professionals subsequently signed an open letter urging the U.S. government to reverse the export controls, citing the defensive value of Mythos-class models for infrastructure protection.

Competitive context

The Mythos launch prompted a direct competitive response: OpenAI launched GPT-5.4-Cyber, a vetted-access model for defensive cybersecurity tasks, in the same period. The parallel suggests a broader industry shift toward capability-tiered deployment for high-risk domains — where the question is not just what a model can do, but who is allowed to use it and under what conditions.

Structural implications

The Mythos/Fable architecture represents a significant escalation in the operational complexity of frontier model deployment. Managing two access tiers of the same model, maintaining classifier transparency, satisfying independent evaluators, and navigating government export controls simultaneously is a qualitatively different challenge than shipping a single general-purpose model. Whether this dual-variant pattern becomes an industry template — or whether the evaluability and regulatory friction it generates proves unsustainable — is the open question the Mythos launch has placed on the table.

Claude Mythos 5 / Fable 5 deployment architecture

Claude Mythos 5 vs. Fable 5 vs. Claude Opus 4.8

ModelAccessSafety classifiersPricingNotable use
Claude Mythos 5Restricted — Project Glasswing vetted partnersMinimal / removed$10/$50 per M tokensAdvanced cybersecurity, zero-day research
Claude Fable 5General availabilityYes — blocks/degrades on security, bio, chem, AI-dev topics$10/$50 per M tokensGeneral knowledge work, coding, agentic tasks
Claude Opus 4.8General availabilityStandardCoding, reasoning; fallback target for Fable 5 classifier reroutes

Synthesized from the events bundle; unknown cells render —.

Timeline

  1. Claude Mythos previewed; zero-day cybersecurity claims face scrutiny from Tom's Hardware

  2. Unauthorized users breach restricted Claude Mythos via Discord; OpenAI launches GPT-5.4-Cyber in response

  3. Anthropic's withheld Mythos model prompts meetings with the Financial Stability Board

  4. Anthropic previews Mythos-class models above Opus tier; Claude Opus 4.8 launches

  5. System card published; Claude Fable 5 released to general availability

  6. Claude Mythos 5 and Fable 5 formally launched; silent-degradation controversy surfaces

  7. Anthropic suspends access to both models following U.S. export control directive

  8. Evaluability crisis documented: GPQA Diamond scores swing 93% → 56% depending on refusal-scoring method

  9. 100+ cybersecurity professionals sign open letter urging reversal of export controls

Related topics

AnthropicProject GlasswingClaude Mythos PreviewAndrew NgZvi MowshowitzGPT-5.5Humanity's Last Exam

FAQ

What is the difference between Claude Mythos 5 and Claude Fable 5?

They are the same underlying model deployed under different access regimes: Mythos 5 is restricted to vetted partners via Project Glasswing with minimal safety constraints, while Fable 5 is the general-availability version with safety classifiers that block or degrade responses on cybersecurity, biology, chemistry, and AI-development topics.

Why was access suspended so quickly after launch?

The U.S. government issued an export control directive via the Commerce Department forcing Anthropic to disable global access to both models; this occurred within days of the June 9–12 launch window.

Why couldn't independent evaluators benchmark Fable 5 reliably?

Anthropic's safety classifiers silently rerouted flagged prompts to the weaker Claude Opus 4.8 rather than answering or refusing transparently, causing benchmark scores to vary wildly depending on whether evaluators counted rerouted/refused responses as failures — GPQA Diamond ranged from 93.18% to 55.56% across different scoring strategies.

What was the silent-degradation controversy?

Claude Fable 5 initially applied undisclosed capability degradation to AI-development prompts via prompt modification or steering vectors, without informing users; Anthropic reversed this policy after researcher backlash.

How does Mythos relate to the Opus/Sonnet/Haiku naming scheme?

Mythos is a separate, higher-capability tier positioned above Opus, not a replacement for it — the Opus/Sonnet/Haiku line continues, with Opus 4.8 serving as the current general-availability flagship below the Mythos tier.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v1live38h ago

Related guides (4)

More on Claude Mythos (6)

8Hacker News·21d ago·source ↗

Anthropic releases system card for Claude Fable 5 and Claude Mythos 5

Anthropic has published a system card PDF for two new models, Claude Fable 5 and Claude Mythos 5, surfaced via Hacker News with 211 points. The system card is a primary safety and capability disclosure document accompanying a model release. The naming convention suggests these are new frontier-tier models from Anthropic, distinct from the existing Claude Opus/Sonnet/Haiku naming scheme.

9The Batch·18d ago·source ↗

Anthropic releases Claude Mythos 5 and Claude Fable 5 with unprecedented capability restrictions and safety tiers

Anthropic launched Claude Mythos 5, a restricted-access model capable of cracking previously secure software, and Claude Fable 5, a general-use version with novel safety classifiers that block or degrade responses on cybersecurity, biology, chemistry, and AI-development topics. Both models set new state-of-the-art results across software engineering, agentic coding, knowledge work, and scientific reasoning benchmarks, and are priced at roughly half the cost of the prior Claude Mythos Preview. Claude Fable 5 initially included undisclosed capability degradation for AI-development prompts — applied silently via prompt modification or steering vectors — which sparked controversy before Anthropic modified the policy. The release represents a significant escalation in both frontier capability and the operational complexity of safety-tiered model deployment.

8The Batch·18d ago·source ↗

Anthropic launches Claude Mythos 5 and Claude Fable 5; Andrew Ng introduces OpenCoworker desktop agent

Anthropic released Claude Mythos 5 and Claude Fable 5, two variants of the same frontier model that set new state-of-the-art results across software engineering, knowledge work, cybersecurity, and agentic coding benchmarks. Claude Fable 5 is the general-availability version with safety classifiers that restrict responses on security, biology, chemistry, and cutting-edge AI topics, priced at $10/$50 per million input/output tokens; Mythos 5 is restricted to selected partners via Project Glasswing. Separately, Andrew Ng and collaborators released OpenCoworker, a free open-source desktop agent harness built on top of aisuite, designed to give users privacy-preserving agentic workflows with their own API keys or local models. The newsletter also contextualizes the broader shift toward LLM-driven agent harnesses as frontier models have become capable enough to reliably drive next-action decisions.

7Hacker News·17d ago·source ↗

Anthropic suspends access to Claude Mythos 5 and Claude Fable 5

Anthropic's status page reports the suspension of access to two models, Claude Mythos 5 and Claude Fable 5, as of June 13, 2026. These appear to be previously unannounced or unreleased model names, making the incident notable both as a service disruption and as a potential first public signal of new model names in Anthropic's lineup. No explanation for the suspension is provided in the brief status notice.

6The Batch·29d ago·source ↗

Data Points: Hackers Break Into Claude Mythos; OpenAI Launches Cybersecurity Rival; Maine Data Center Moratorium; McClatchy AI Backlash

A small group of unauthorized users gained access to Anthropic's restricted Claude Mythos cybersecurity model via Discord coordination and insider knowledge, raising questions about securing high-risk AI systems. OpenAI responded to the competitive landscape by launching GPT-5.4-Cyber, a vetted-access model for defensive cybersecurity tasks. Maine passed the first U.S. state moratorium on large AI data centers over 20MW, pending the governor's signature. McClatchy's deployment of a Claude-powered content scaling agent triggered newsroom backlash over attribution, consent, and AI disclosure standards.

8Hacker News·21d ago·source ↗

Anthropic releases Claude Fable 5

Anthropic has released Claude Fable 5, a new model in the Claude family, announced via their official news channel. The Hacker News discussion generated substantial engagement with 1,468 points and 1,156 comments, indicating significant community interest. No detailed capability claims or benchmark results are available from this item alone.