Anthropic commits to signing the EU General-Purpose AI Code of Practice
Anthropic announced its intention to sign the EU's General-Purpose AI Code of Practice, citing alignment with its existing Responsible Scaling Policy on transparency, safety, and accountability. The company frames the Code's mandatory Safety and Security Frameworks—including CBRN risk assessment—as complementary to its own internal standards. Anthropic also signals continued collaboration with the EU AI Office and third-party bodies like the Frontier Model Forum to keep standards adaptive as the technology evolves.
Related guides (3)
Related events (8)
OpenAI Joins EU AI Code of Practice
OpenAI has signed onto the EU Code of Practice, a voluntary framework under the EU AI Act designed to establish responsible AI development standards. The move signals OpenAI's formal engagement with European regulatory structures while simultaneously positioning the company as a partner to European governments on AI infrastructure and economic development. This is part of broader industry efforts to shape how the EU AI Act's general-purpose AI provisions are implemented.
Anthropic achieves ISO/IEC 42001:2023 certification for AI management systems
Anthropic has received accredited certification under ISO/IEC 42001:2023, the first international standard for AI governance and management systems, issued by Schellman Compliance LLC. The certification covers Anthropic's policies, testing, monitoring, transparency measures, and oversight structures for responsible AI development. Anthropic claims to be among the first frontier AI labs to achieve this certification, positioning it as external validation of their safety commitments alongside existing frameworks like their Responsible Scaling Policy and Constitutional AI.
Anthropic policy recap: US Executive Order, G7 Code of Conduct, and Bletchley Park AI Safety Summit
Anthropic published a policy commentary summarizing three major AI governance events from late October/early November 2023: the US Executive Order on AI, the G7 International Code of Conduct for advanced AI developers, and the UK-hosted Bletchley Park AI Safety Summit. The post covers Anthropic's positions on each, including support for NIST capacity-building, the G7 Code of Conduct, and the newly announced UK and US AI Safety Institutes. Dario Amodei presented Anthropic's Responsible Scaling Policy at Bletchley as a potential regulatory prototype, and the 28-country Bletchley Declaration notably included China among its signatories.
Anthropic publishes framework for safe and trustworthy agent development
Anthropic released a formal framework for responsible agent development, articulating principles around human oversight, transparency, value alignment, and privacy for autonomous AI agents. The document draws on Claude Code as a reference implementation and cites enterprise deployments at Trellix and Block as real-world examples. The framework is positioned as a contribution to emerging industry standards for agentic AI systems, acknowledging open technical challenges in value alignment measurement and oversight calibration.
Anthropic Responds to White House AI Action Plan, Calls for Transparency Standards and Export Controls
Anthropic published a policy response to the White House's 'Winning the Race: America's AI Action Plan,' endorsing its focus on AI infrastructure, federal adoption, and safety research while urging additional steps on export controls and mandatory AI development transparency standards. The company highlighted alignment between the plan and its prior OSTP submissions, and noted its proactive activation of ASL-3 protections with Claude Opus 4 as evidence that safety and innovation are compatible. Anthropic called for a single national standard for frontier model transparency rather than a state-by-state patchwork, and encouraged continued investment in NIST's CAISI for evaluating frontier models on national security risks including CBRN capabilities.
Anthropic launches initiative to fund third-party AI safety evaluations
Anthropic announced a funded initiative to source third-party evaluations measuring advanced AI capabilities and safety risks, with priority areas including cybersecurity, CBRN threats, model autonomy, national security risks, social manipulation, and misalignment. The initiative is tied to Anthropic's Responsible Scaling Policy and AI Safety Level (ASL) framework, aiming to address a gap between demand and supply of high-quality safety-relevant evals. Proposals are solicited via an application form, with Anthropic framing the effort as benefiting the broader AI safety ecosystem rather than just internal use.
Dario Amodei's AI Safety Summit remarks detail Anthropic's Responsible Scaling Policy and ASL framework
Dario Amodei delivered prepared remarks at the UK AI Safety Summit (November 2023) explaining Anthropic's Responsible Scaling Policy (RSP), which was the first such policy published by a major AI lab. The RSP introduces AI Safety Levels (ASL-1 through ASL-4), modeled on biosafety level frameworks, with capability thresholds triggering mandatory safeguards before further training or deployment. Key implementation lessons include deep executive involvement, integrating RSP requirements into product roadmaps, and formal accountability through Anthropic's board and Long Term Benefit Trust. The remarks outline specific ASL-3 requirements around CBRN misuse prevention and security, and preview ASL-4 criteria involving near-human autonomy or becoming a primary source of global security threats.
Anthropic joins Thorn-led child safety principles initiative for generative AI
Anthropic has signed onto a set of child safety principles organized by Thorn and All Tech Is Human, alongside other leading AI companies, committing to specific mitigations across model development, deployment, and maintenance. The commitments include avoiding CSAM-contaminated training data, red-teaming models for AI-generated CSAM, detecting and reporting abusive content, and reporting to NCMEC. The initiative formalizes a 'Safety by Design' framework for preventing generative AI misuse against children.


