Almanac
Guide · Beginner

Claude Mythos: Anthropic's Restricted Frontier Model

Claude MythosBeginneractive·v1 · live·generated 38h ago
TL;DRClaude Mythos is Anthropic's most powerful AI model — a step above the Opus line — built for high-stakes tasks like cybersecurity that are too sensitive for general release. Rather than offering it to everyone, Anthropic split it into two versions: a locked-down public edition called Claude Fable 5, and the full-capability Mythos 5 reserved for vetted partners. The launch triggered a cascade of controversies — hidden safety restrictions, a government-forced global shutdown, and a jailbreak — making it one of the most consequential and contentious AI releases to date.

Key takeaways

  • Claude Mythos 5 is capable of cracking previously secure software and is restricted to vetted partners via Project Glasswing, in collaboration with the U.S. government.
  • Its public-facing counterpart, Claude Fable 5, is priced at $10/$50 per million input/output tokens — roughly half the cost of the prior Claude Mythos Preview.
  • Anthropic initially applied silent, undisclosed capability degradation to Fable 5 for AI-development prompts, sparking researcher backlash before the policy was reversed.
  • The U.S. government issued export controls that forced Anthropic to suspend global access to both models just days after launch.
  • Independent benchmarkers found they could not reliably score Fable 5: on GPQA Diamond, its ranking swung from 2nd place (93.18%) to 94th place (55.56%) depending on how safety refusals were counted.
  • Over 100 cybersecurity professionals signed an open letter urging the U.S. government to reverse the export controls.

What Claude Mythos is

Claude Mythos is Anthropic's highest-capability AI model — a tier above the Opus line that the company had been quietly developing and testing with a small group of partners before its public debut in June 2026. Think of it as the most powerful version of Claude, designed for tasks that require deep technical expertise: finding vulnerabilities in software, reasoning through complex scientific problems, and running long autonomous workflows with minimal human supervision.

Because those same capabilities could be misused, Anthropic decided not to release Mythos the way it releases its other models. Instead, it split the release into two versions with very different rules about who can use them and what they can do.

Two models, one engine

Claude Mythos 5 is the full-power version. Its safeguards are removed for specific, authorized use cases — primarily cybersecurity defense work. Access is gated through Project Glasswing, Anthropic's vetted-partner program, developed in collaboration with the U.S. government. Only selected organizations can use it.

Claude Fable 5 is the public-facing version built on the same underlying model. It includes safety classifiers — automated filters — that block or limit responses on sensitive topics including cybersecurity, biology, chemistry, and cutting-edge AI development. This is the version anyone could (in theory) sign up to use, priced at $10 per million input tokens and $50 per million output tokens — roughly half the cost of the prior Claude Mythos Preview.

Both models set new top results across software engineering, agentic coding, knowledge work, and scientific reasoning when they launched.

Why it matters

Mythos represents something new in AI: a frontier model so capable that its creator decided the general public shouldn't have unrestricted access to it. That's a significant line to cross, and the launch immediately raised hard questions about who gets to decide what AI can do — and for whom.

Before the full release, an early version of Mythos solved a novel cybersecurity challenge significant enough to prompt meetings with the Financial Stability Board. A separate incident saw unauthorized users gain access to the restricted model via Discord coordination. These episodes weren't just embarrassing — they illustrated the real-world stakes of deploying a model that can break into secure systems.

A rocky launch

The release in early June 2026 was eventful, to put it mildly:

  • Hidden restrictions: Anthropic initially applied undisclosed capability degradation to Fable 5 for AI-development prompts — silently rewriting responses or using steering vectors to weaken answers — without telling users. Researchers discovered this when they noticed inconsistent behavior. Anthropic reversed the policy after backlash.
  • Government shutdown: Just days after launch, the U.S. Commerce Department issued export controls that forced Anthropic to suspend global access to both Fable 5 and Mythos 5. The models went offline for users outside the U.S., and even domestic access was disrupted.
  • Benchmarking chaos: Independent evaluators found they couldn't reliably measure Fable 5's capabilities. Anthropic's classifiers silently rerouted flagged prompts to the weaker Claude Opus 4.8 instead of answering them. On one major benchmark (GPQA Diamond), Fable 5's score ranged from 93.18% (2nd place) to 55.56% (94th place) depending on whether an evaluator counted refusals as failures. Different organizations — Artificial Analysis, Vals AI, and ARC Prize Foundation — each adopted different scoring strategies, producing wildly different rankings.

The bigger picture

The Mythos launch crystallized a tension that's been building in AI: as models become genuinely dangerous in the wrong hands, the people building them face pressure to restrict access — but restriction creates its own problems. It hands governments leverage to shut down access entirely. It makes independent safety evaluation nearly impossible. And it raises questions about whether safety classifiers that secretly degrade responses are honest.

AI commentator Andrew Ng argued the episode accelerated global interest in AI sovereignty and open-source alternatives, precisely because it showed how quickly a private company or government can cut off access to a powerful tool. Over 100 cybersecurity professionals signed an open letter urging the U.S. government to reverse the export controls, arguing the restrictions hurt the defenders more than the attackers.

Where things stand

Access to both models remains suspended pending resolution of the export control situation. The controversy over silent degradation has been resolved — Anthropic reversed that policy — but the structural questions it raised about transparency in safety-tiered deployments remain open. Claude Mythos represents the leading edge of a new kind of AI deployment challenge: not just "is this model safe?" but "who decides, how do you verify it, and what happens when the answer is no?"

How Claude Mythos 5 and Fable 5 relate

Claude Mythos 5 vs. Claude Fable 5

AspectClaude Mythos 5Claude Fable 5
AccessRestricted — vetted partners via Project GlasswingGeneral availability (suspended by export controls)
Safety restrictionsSafeguards removed for authorized use casesSafety classifiers on cybersecurity, biology, chemistry, AI-dev topics
CapabilityFull — can crack previously secure softwareDegraded on sensitive topics; same underlying model
Pricing$10 / $50 per M tokens$10 / $50 per M tokens
BenchmarkingNot independently evaluatedScores vary widely depending on how refusals are handled

Both models share the same underlying weights; the difference is in access controls and safety-layer configuration.

Timeline

  1. Claude Mythos (pre-release) prompts Financial Stability Board meetings after solving a novel cybersecurity challenge

  2. Anthropic previews Mythos-class models alongside Claude Opus 4.8 launch

  3. Anthropic publishes system card; Claude Fable 5 and Mythos 5 officially released

  4. Anthropic suspends access to both models (U.S. export control directive)

  5. Anthropic reverses silent-degradation policy on Fable 5 after researcher backlash; export controls remain

  6. 100+ cybersecurity professionals sign open letter urging reversal of export controls

Related topics

AnthropicProject GlasswingClaude Mythos PreviewAndrew NgZvi MowshowitzOpenAI

FAQ

What is the difference between Claude Mythos 5 and Claude Fable 5?

They are the same underlying model with different safety configurations. Mythos 5 has its safeguards removed for vetted partners doing cybersecurity work; Fable 5 is the public version with classifiers that restrict sensitive topics.

Why can't I access Claude Fable 5 right now?

The U.S. government issued export controls that forced Anthropic to suspend global access to both models shortly after launch. Over 100 cybersecurity professionals have since called for the controls to be reversed.

What was the controversy about hidden restrictions?

Anthropic initially applied undisclosed capability degradation to Fable 5 for AI-development prompts — silently modifying responses or using steering vectors — without telling users. After researcher backlash, Anthropic reversed the policy.

How powerful is Claude Mythos 5?

It sets new top scores across software engineering, agentic coding, knowledge work, and scientific reasoning benchmarks, and is capable of cracking previously secure software — which is why access is tightly restricted.

Can independent researchers benchmark it fairly?

Not easily. Fable 5's safety classifiers silently reroute flagged prompts to a weaker model or refuse them, causing benchmark scores to swing wildly depending on how evaluators handle refusals.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v1live38h ago

Related guides (4)

More on Claude Mythos (6)

8Hacker News·21d ago·source ↗

Anthropic releases system card for Claude Fable 5 and Claude Mythos 5

Anthropic has published a system card PDF for two new models, Claude Fable 5 and Claude Mythos 5, surfaced via Hacker News with 211 points. The system card is a primary safety and capability disclosure document accompanying a model release. The naming convention suggests these are new frontier-tier models from Anthropic, distinct from the existing Claude Opus/Sonnet/Haiku naming scheme.

9The Batch·18d ago·source ↗

Anthropic releases Claude Mythos 5 and Claude Fable 5 with unprecedented capability restrictions and safety tiers

Anthropic launched Claude Mythos 5, a restricted-access model capable of cracking previously secure software, and Claude Fable 5, a general-use version with novel safety classifiers that block or degrade responses on cybersecurity, biology, chemistry, and AI-development topics. Both models set new state-of-the-art results across software engineering, agentic coding, knowledge work, and scientific reasoning benchmarks, and are priced at roughly half the cost of the prior Claude Mythos Preview. Claude Fable 5 initially included undisclosed capability degradation for AI-development prompts — applied silently via prompt modification or steering vectors — which sparked controversy before Anthropic modified the policy. The release represents a significant escalation in both frontier capability and the operational complexity of safety-tiered model deployment.

8The Batch·18d ago·source ↗

Anthropic launches Claude Mythos 5 and Claude Fable 5; Andrew Ng introduces OpenCoworker desktop agent

Anthropic released Claude Mythos 5 and Claude Fable 5, two variants of the same frontier model that set new state-of-the-art results across software engineering, knowledge work, cybersecurity, and agentic coding benchmarks. Claude Fable 5 is the general-availability version with safety classifiers that restrict responses on security, biology, chemistry, and cutting-edge AI topics, priced at $10/$50 per million input/output tokens; Mythos 5 is restricted to selected partners via Project Glasswing. Separately, Andrew Ng and collaborators released OpenCoworker, a free open-source desktop agent harness built on top of aisuite, designed to give users privacy-preserving agentic workflows with their own API keys or local models. The newsletter also contextualizes the broader shift toward LLM-driven agent harnesses as frontier models have become capable enough to reliably drive next-action decisions.

7Hacker News·17d ago·source ↗

Anthropic suspends access to Claude Mythos 5 and Claude Fable 5

Anthropic's status page reports the suspension of access to two models, Claude Mythos 5 and Claude Fable 5, as of June 13, 2026. These appear to be previously unannounced or unreleased model names, making the incident notable both as a service disruption and as a potential first public signal of new model names in Anthropic's lineup. No explanation for the suspension is provided in the brief status notice.

6The Batch·29d ago·source ↗

Data Points: Hackers Break Into Claude Mythos; OpenAI Launches Cybersecurity Rival; Maine Data Center Moratorium; McClatchy AI Backlash

A small group of unauthorized users gained access to Anthropic's restricted Claude Mythos cybersecurity model via Discord coordination and insider knowledge, raising questions about securing high-risk AI systems. OpenAI responded to the competitive landscape by launching GPT-5.4-Cyber, a vetted-access model for defensive cybersecurity tasks. Maine passed the first U.S. state moratorium on large AI data centers over 20MW, pending the governor's signature. McClatchy's deployment of a Claude-powered content scaling agent triggered newsroom backlash over attribution, consent, and AI disclosure standards.

8Hacker News·21d ago·source ↗

Anthropic releases Claude Fable 5

Anthropic has released Claude Fable 5, a new model in the Claude family, announced via their official news channel. The Hacker News discussion generated substantial engagement with 1,468 points and 1,156 comments, indicating significant community interest. No detailed capability claims or benchmark results are available from this item alone.