Entity · organization

ARC Evals

organizationactivearc-evals-0074fc3b·1 events·first seen Jun 3, 2026

Aliases: ARC Evals

Co-occurring entities

Claude Responsible Scaling Policy Long-Term Benefit Trust Anthropic

More like this (12)

G-Eval L-Eval Community Evals T-Eval OpenAI Evals ProActEval STAGE-Eval AIriskEval-edu Demo ParaEval ValueEval Every Eval Ever HypoEval

Recent events (1)

8Anthropic News·Jun 3, 2026·source ↗

Anthropic publishes Responsible Scaling Policy with AI Safety Level framework

Anthropic released its Responsible Scaling Policy (RSP), a formal framework of technical and organizational protocols for managing catastrophic risks from increasingly capable AI systems. The policy introduces AI Safety Levels (ASL-1 through ASL-5+), modeled on US biosafety level standards, requiring progressively stricter safety, security, and operational standards as models become more capable. Current Claude models are classified as ASL-2; ASL-3 triggers stricter deployment constraints including adversarial red-teaming requirements. The policy has been approved by Anthropic's board and is intended as a template for industry-wide adoption.

Frontier Model Releases AI Safety Research ARC Evals Claude Responsible Scaling Policy +3 more