Almanac
other

third-party AI evaluations

otheractiveprovisionalthird-party-ai-evaluations-a09ca5f1·1 events·first seen 18d ago

Aliases: third-party AI evaluations

Co-occurring entities

More like this (12)

Recent events (1)

6Openai Blog·18d ago·source ↗

A shared playbook for trustworthy third party evaluations

OpenAI has published guidance outlining a shared framework for conducting trustworthy third-party evaluations of frontier AI systems. The playbook covers methodology for assessing model capabilities, safeguards, and evaluation validity. This represents OpenAI's attempt to standardize and legitimize external auditing practices for frontier models.