Almanac
benchmark

Google-Proof Q&A

benchmarkactiveprovisionalgoogle-proof-q-a-dd20bb27·1 events·first seen 13d ago

Aliases: Google-Proof Q&A

Co-occurring entities

More like this (12)

Recent events (1)

7Anthropic News·13d ago·source ↗

Anthropic launches initiative to fund third-party AI safety evaluations

Anthropic announced a funded initiative to source third-party evaluations measuring advanced AI capabilities and safety risks, with priority areas including cybersecurity, CBRN threats, model autonomy, national security risks, social manipulation, and misalignment. The initiative is tied to Anthropic's Responsible Scaling Policy and AI Safety Level (ASL) framework, aiming to address a gap between demand and supply of high-quality safety-relevant evals. Proposals are solicited via an application form, with Anthropic framing the effort as benefiting the broader AI safety ecosystem rather than just internal use.