technique
AI Safety Level Standards
techniqueactiveprovisional
ai-safety-level-standards-0b35cccd·1 events·first seen 13d agoAliases: AI Safety Level Standards
Co-occurring entities
More like this (12)
AI Safety Level (ASL)Japan AI Safety InstituteAI Liability DirectiveUK Artificial Intelligence Safety InstituteAI Risk Management FrameworkAI Safety FundUK AI Safety SummitAustralia AI Safety InstituteVoluntary AI Safety CommitmentsUS Cyber and AI Safety InstituteAI biosecurity risk assessmentjoint safety evaluation
Recent events (1)
Anthropic publishes major update to Responsible Scaling Policy with new capability thresholds and ASL standards
Anthropic released a significant revision to its Responsible Scaling Policy (RSP), its risk governance framework for managing catastrophic risks from frontier AI. The update introduces two explicit capability thresholds—autonomous AI R&D and CBRN weapons uplift—that trigger mandatory upgrades to AI Safety Level (ASL) standards, with current models operating under ASL-2. New elements include safety-case-inspired documentation processes, internal governance stress-testing, and external expert input mechanisms, drawing on risk management practices from high-consequence industries like biosafety.