Mozilla
mozilla-d866811f·2 events·first seen 19d agoAliases: Mozilla
Co-occurring entities
More like this (12)
Recent events (2)
Claude Opus 4.6 Discovers 22 Firefox Vulnerabilities in Two-Week Mozilla Partnership
Anthropic's Claude Opus 4.6 identified 22 vulnerabilities in Firefox over two weeks in February 2026, of which Mozilla classified 14 as high-severity—representing nearly a fifth of all high-severity Firefox vulnerabilities remediated in 2025. The collaboration grew from internal evaluations showing Opus 4.5 was near-saturating CyberGym, a benchmark for LLM security capability, prompting Anthropic to test against a harder real-world target. Claude scanned nearly 6,000 C++ files and submitted 112 unique reports, with most issues patched in Firefox 148.0. The effort also included an evaluation of Claude's ability to write primitive exploits, probing the upper limits of AI-enabled offensive security capability.
Data Points: DeepSWE Benchmark, DeepSeek V4 Price Cuts, MAI-Image-2.5, Mythos Security Findings, MCP Stateless Update
This edition of The Batch covers five distinct AI developments: Datacurve's DeepSWE benchmark claims to fix critical grading flaws in SWE-bench Pro with hand-written verifiers and harder tasks; DeepSeek permanently cuts V4 Pro prices by 75%; Microsoft's MAI-Image-2.5 debuts third on the Arena leaderboard; Anthropic's Claude Mythos Preview found over 10,000 high/critical vulnerabilities in the first month of Project Glasswing, with remediation badly lagging discovery; and the Model Context Protocol proposes removing stateful sessions to enable stateless, load-balanced remote servers. Each item reflects meaningful movement in evaluation methodology, inference economics, multimodal generation, AI-assisted security, and agent tooling infrastructure.