AI Safety Level Standards

technique · active · provisional · ai-safety-level-standards-0b35cccd · 1 event · first seen 13d ago

Aliases: AI Safety Level Standards

Recent events (1)

7 · Anthropic News · 13d ago

Anthropic publishes major update to Responsible Scaling Policy with new capability thresholds and ASL standards

Anthropic released a significant revision to its Responsible Scaling Policy (RSP), the risk governance framework it uses to manage catastrophic risks from frontier AI. The update defines two explicit capability thresholds: autonomous AI R&D, and chemical, biological, radiological, and nuclear (CBRN) weapons uplift. Crossing either threshold triggers a mandatory upgrade to a higher AI Safety Level (ASL) standard; current models operate under ASL-2. New elements include documentation processes inspired by safety cases, stress-testing of internal governance, and mechanisms for external expert input. The approach draws on risk management practices from high-consequence fields such as biosafety.