Anthropic Endorses California SB 53 AI Safety Disclosure Bill
Anthropic has announced its endorsement of California Senate Bill 53, which would require large frontier AI developers to publish safety frameworks, release transparency reports before deploying powerful models, report critical safety incidents within 15 days, and provide whistleblower protections. The bill, authored by Senator Scott Wiener and informed by the Joint California Policy Working Group, takes a disclosure-based approach rather than prescriptive technical mandates, drawing lessons from the failed SB 1047. Anthropic frames the bill as formalizing practices already followed by major labs including Google DeepMind, OpenAI, and Microsoft, while creating a level playing field that prevents competitive pressure from eroding voluntary safety programs. Anthropic notes the bill's compute-based threshold (10^26 FLOPS) is an acceptable starting point but calls for future refinement as AI capabilities advance.
Related guides (4)
Related events (8)
Anthropic responds to California Governor Newsom's AI working group draft report
Anthropic published a formal response to the California Governor's Working Group on AI Frontier Models draft report, endorsing its emphasis on transparency and evidence-based policy. Anthropic argues that light-touch mandatory disclosure of safety and security practices would be beneficial without impeding innovation, noting that current voluntary practices are uneven across frontier labs. The response also references Anthropic's Responsible Scaling Policy and Economic Index as examples of existing transparency efforts, and signals urgency given Anthropic's view that powerful AI systems may arrive as early as end of 2026.
Anthropic Publishes Frontier Compliance Framework for California's SB 53 Transparency in Frontier AI Act
Anthropic has released its Frontier Compliance Framework (FCF) in advance of California's SB 53 taking effect on January 1, 2026, which establishes the first mandatory frontier AI safety and transparency requirements in the US. The FCF covers risk assessment and mitigation for cyber, CBRN, and AI autonomy/control risks, tiered capability evaluation, model weight protection, and incident response. Anthropic frames the FCF as an evolution of its existing Responsible Scaling Policy, which will continue as a voluntary safety policy beyond regulatory minimums. The company also calls for a federal AI transparency framework with analogous requirements applied only to the largest frontier developers.
Anthropic proposes federal AI transparency framework with mandatory Secure Development Frameworks and system cards
Anthropic published a policy proposal calling for a targeted AI transparency framework applicable at federal, state, or international levels, targeting only the largest frontier AI developers (suggested thresholds: ~$100M annual revenue or ~$1B R&D/capex). The framework would require covered developers to publicly disclose a Secure Development Framework covering CBRN and misalignment risks, publish system cards at deployment, self-certify compliance, and face legal liability for false statements. The proposal is explicitly lightweight and flexible, designed to avoid prescriptive standards while creating accountability mechanisms and whistleblower protections during the period before comprehensive safety standards are established.
Anthropic launches initiative to fund third-party AI safety evaluations
Anthropic announced a funded initiative to source third-party evaluations measuring advanced AI capabilities and safety risks, with priority areas including cybersecurity, CBRN threats, model autonomy, national security risks, social manipulation, and misalignment. The initiative is tied to Anthropic's Responsible Scaling Policy and AI Safety Level (ASL) framework, aiming to address a gap between demand and supply of high-quality safety-relevant evals. Proposals are solicited via an application form, with Anthropic framing the effort as benefiting the broader AI safety ecosystem rather than just internal use.
Anthropic Responds to White House AI Action Plan, Calls for Transparency Standards and Export Controls
Anthropic published a policy response to the White House's 'Winning the Race: America's AI Action Plan,' endorsing its focus on AI infrastructure, federal adoption, and safety research while urging additional steps on export controls and mandatory AI development transparency standards. The company highlighted alignment between the plan and its prior OSTP submissions, and noted its proactive activation of ASL-3 protections with Claude Opus 4 as evidence that safety and innovation are compatible. Anthropic called for a single national standard for frontier model transparency rather than a state-by-state patchwork, and encouraged continued investment in NIST's CAISI for evaluating frontier models on national security risks including CBRN capabilities.
Anthropic submits AI accountability recommendations to NTIA, covering evals, red teaming, and pre-registration
Anthropic submitted a formal response to the NTIA's Request for Comment on AI Accountability, outlining a multi-part policy framework for governing advanced AI systems. Key recommendations include increased government funding for evaluation research, mandatory disclosure of evaluation methods, pre-registration of large training runs with national governments, mandated external red teaming before model release, and antitrust guidance to enable industry safety collaboration. The submission reflects Anthropic's core policy positions and advocates for risk-tiered oversight proportional to model capabilities.
Anthropic publishes foundational 'Core Views on AI Safety' position paper
Anthropic released a detailed position paper outlining their core views on AI safety, arguing that transformative AI could arrive within a decade driven by predictable scaling laws, and that no one currently knows how to train powerful AI systems to robustly behave well. The document explains Anthropic's founding rationale and research strategy, highlighting four priority areas: scaling supervision, mechanistic interpretability, process-oriented learning, and understanding AI generalization. Originally published March 2023, this represents Anthropic's canonical public statement of their safety philosophy and strategic priorities.
Anthropic commits to signing the EU General-Purpose AI Code of Practice
Anthropic announced its intention to sign the EU's General-Purpose AI Code of Practice, citing alignment with its existing Responsible Scaling Policy on transparency, safety, and accountability. The company frames the Code's mandatory Safety and Security Frameworks—including CBRN risk assessment—as complementary to its own internal standards. Anthropic also signals continued collaboration with the EU AI Office and third-party bodies like the Frontier Model Forum to keep standards adaptive as the technology evolves.



