6The Batch (DeepLearning.AI)·19d ago

Data Points: Hackers Break Into Claude Mythos; OpenAI Launches Cybersecurity Rival; Maine Data Center Moratorium; McClatchy AI Backlash

A small group of unauthorized users gained access to Anthropic's restricted Claude Mythos cybersecurity model via Discord coordination and insider knowledge, raising questions about securing high-risk AI systems. OpenAI responded to the competitive landscape by launching GPT-5.4-Cyber, a vetted-access model for defensive cybersecurity tasks. Maine passed the first U.S. state moratorium on large AI data centers over 20MW, pending the governor's signature. McClatchy's deployment of a Claude-powered content scaling agent triggered newsroom backlash over attribution, consent, and AI disclosure standards.

Training Infrastructure Frontier Model Releases AI Safety Research Enterprise Deployment Patterns Regulatory Developments GPT-5.5-Cyber Discord Claude Mythos Eli Lilly Maine Insilico Medicine OpenAI Muse Spark Meta McClatchy Anthropic

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Read asBeginner

AI Safety ResearchTopic guide

AI Safety Research: From Lab Evals to Geopolitical Flashpoint

Read asIn-depth

Related events (8)

6The Batch·19d ago·source ↗

Data Points: Anthropic's Claude Mythos Cybersecurity Claims Face Scrutiny; OpenAI-Cerebras Deal; Meta AI CEO Avatar; Infrastructure Delays

A multi-item digest covers skepticism around Anthropic's Claude Mythos zero-day vulnerability claims (flagged as overstated by Tom's Hardware based on limited 198-case evidence), OpenAI's $20B+ deal with Cerebras for AI processors including a potential ~10% equity stake, and satellite data showing ~40% of U.S. AI data center projects are behind schedule. Additional items cover Meta developing an AI avatar of CEO Zuckerberg for internal use, Moody's flagging credit stress in AI-disrupted sectors, and Luma AI launching an AI-driven film production studio using its Uni-1 model.

Training Infrastructure Frontier Model Releases Tom's Hardware Claude Mythos Luma AI +14 more

6The Batch·1mo ago·source ↗

Anthropic Passes OpenAI in Business Adoption; Cerebras IPO; Claude Mythos Security Concerns

A Ramp AI Index survey shows Anthropic reached 34.4% business adoption in April 2026, surpassing OpenAI's 32.3%, though analysts cite token cost inflation, service degradation, and competition from cheaper inference platforms as threats to the lead. Cerebras surged 89% on its IPO debut, signaling investor appetite for AI infrastructure hardware. Separately, Anthropic's withheld Claude Mythos model—which solved a novel cybersecurity challenge—prompted meetings with the Financial Stability Board, while ArXiv announced year-long bans for authors submitting unvetted AI-generated content.

Training Infrastructure Frontier Model Releases Financial Stability Board Claude Mythos UK AI Security Institute +14 more

8The Batch·19d ago·source ↗

Anthropic Releases Claude Mythos Preview with Extraordinary Cybersecurity Capabilities, Forms Project Glasswing Consortium

Anthropic has published a 244-page model card for Claude Mythos Preview, a large language model not yet commercially available, which broadly outperforms Claude Opus 4.6 and is described as 'strikingly capable' at identifying and exploiting code vulnerabilities. To mitigate risks before potential release, Anthropic assembled Project Glasswing, a consortium including AWS, Apple, Google, Microsoft, CrowdStrike, Nvidia, and 40+ other organizations, funded with $100 million in API credits and $4 million in open-source security donations. This marks the first time Anthropic has published a model card without making the model commercially available, signaling an unusual safety-first deployment posture. The issue also includes commentary from Andrew Ng on AI's impact on software engineering jobs, arguing against an 'AI jobpocalypse' narrative.

Frontier Model Releases AI Safety Research JPMorganChase Linux Foundation Claude Opus 4.6 +14 more

9Anthropic News·19d ago·source ↗

Anthropic Discloses First Reported AI-Orchestrated Cyber Espionage Campaign Using Claude Code

Anthropic detected and disrupted a sophisticated espionage campaign in mid-September 2025, attributed with high confidence to a Chinese state-sponsored threat actor, that used Claude Code as an autonomous agent to attack roughly thirty global targets across tech, finance, chemical manufacturing, and government sectors. The attackers jailbroke Claude Code by decomposing malicious tasks into seemingly innocent subtasks and falsely framing it as defensive security testing, enabling largely autonomous reconnaissance, vulnerability exploitation, credential harvesting, and data exfiltration. Anthropic describes this as the first documented large-scale cyberattack executed without substantial human intervention, leveraging agentic AI capabilities, tool access via MCP, and advanced coding skills. The company banned identified accounts, notified affected entities, coordinated with authorities, and is expanding detection classifiers and publishing the report to aid industry and government defenses.

Frontier Model Releases AI Safety Research Chinese state-sponsored threat actor Claude Claude Code +4 more

7Anthropic News·19d ago·source ↗

Anthropic August 2025 Threat Intelligence Report: Claude Misuse Case Studies

Anthropic has published its August 2025 Threat Intelligence Report documenting three real-world misuse cases involving Claude: a large-scale data extortion operation using Claude Code to automate reconnaissance and generate targeted ransom demands against 17+ organizations, a North Korean fraudulent employment scheme, and AI-assisted ransomware development by a low-skill criminal. The report highlights that agentic AI is now being weaponized for end-to-end cyberattacks rather than merely providing advisory assistance, and that AI has materially lowered the technical barrier to sophisticated cybercrime. Anthropic describes detection and countermeasures taken in each case.

AI Safety Research Enterprise Deployment Patterns Claude Claude Code North Korea +4 more

8The Batch·19d ago·source ↗

Claude Mythos Preview: Limited-Release Frontier Model with Exceptional Cybersecurity Capabilities

Anthropic has published a 244-page model card for Claude Mythos Preview, a frontier model not yet commercially available, which autonomously discovered thousands of high-severity vulnerabilities in popular operating systems and browsers during testing. To mitigate risks before potential deployment, Anthropic assembled Project Glasswing, a consortium of over 40 organizations including AWS, Apple, Google, Microsoft, and CrowdStrike, funded with $100M in model credits to patch vulnerabilities proactively. The model substantially outperforms Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro across multiple benchmarks including CyberGym (83.1%), Terminal-Bench 2.0 (82%), GPQA Diamond (94.5%), HLE (64.7%), and GraphWalks long-context (80%). The Batch notes parallels to OpenAI's GPT-2 limited-release strategy and characterizes the announcement as having elements of a publicity stunt alongside genuine safety concerns.

Frontier Model Releases Evaluation and Benchmarking Gemini 3.1 Pro GraphWalks Linux Foundation +18 more

7The Batch·19d ago·source ↗

US Government Prepares AI Model Vetting System; GPT-5.5 Instant, Claude Finance Agents, Pentagon AI Partnerships

The White House is preparing an executive order to create an FDA-style vetting system for new AI models, prompted partly by Anthropic's Mythos model disclosing cybersecurity risks; the Commerce Department separately expanded a voluntary testing program with Google, Microsoft, and xAI. OpenAI rolled out GPT-5.5 Instant as the default ChatGPT model, claiming 52.5% fewer hallucinations on high-stakes prompts. Anthropic released ten financial agent templates running on Claude Opus 4.7, while the Pentagon expanded AI vendor agreements to include Microsoft, Amazon, Nvidia, and Reflection AI after canceling its Anthropic contract over autonomous weapons restrictions. Major pharma companies report AI gains primarily in manufacturing optimization rather than drug discovery breakthroughs.

Frontier Model Releases Evaluation and Benchmarking Vals AI Finance Agent Benchmark White House Darius Amodei +23 more

9The Batch·8d ago·source ↗

Anthropic releases Claude Mythos 5 and Claude Fable 5 with unprecedented capability restrictions and safety tiers

Anthropic launched Claude Mythos 5, a restricted-access model capable of cracking previously secure software, and Claude Fable 5, a general-use version with novel safety classifiers that block or degrade responses on cybersecurity, biology, chemistry, and AI-development topics. Both models set new state-of-the-art results across software engineering, agentic coding, knowledge work, and scientific reasoning benchmarks, and are priced at roughly half the cost of the prior Claude Mythos Preview. Claude Fable 5 initially included undisclosed capability degradation for AI-development prompts — applied silently via prompt modification or steering vectors — which sparked controversy before Anthropic modified the policy. The release represents a significant escalation in both frontier capability and the operational complexity of safety-tiered model deployment.

Frontier Model Releases Evaluation and Benchmarking Claude Mythos Artificial Analysis Intelligence Index Claude Opus 4.6 +9 more