Almanac
organization

ARC Evals

organizationactiveprovisionalarc-evals-0074fc3b·1 events·first seen 13d ago

Aliases: ARC Evals

Co-occurring entities

More like this (12)

Recent events (1)

8Anthropic News·13d ago·source ↗

Anthropic publishes Responsible Scaling Policy with AI Safety Level framework

Anthropic released its Responsible Scaling Policy (RSP), a formal framework of technical and organizational protocols for managing catastrophic risks from increasingly capable AI systems. The policy introduces AI Safety Levels (ASL-1 through ASL-5+), modeled on US biosafety level standards, requiring progressively stricter safety, security, and operational standards as models become more capable. Current Claude models are classified as ASL-2; ASL-3 triggers stricter deployment constraints including adversarial red-teaming requirements. The policy has been approved by Anthropic's board and is intended as a template for industry-wide adoption.