Entity · other

joint safety evaluation

other · active · joint-safety-evaluation-9de9ba7b · 1 event · first seen May 20, 2026

Aliases: joint safety evaluation

More like this (12)

Safety & Preparedness Report · AI-assisted human evaluation · Safe Exploration Benchmark · Safety Gym · Japan AI Safety Institute · AI Safety Level Standards · biological risk evaluation · Community Evals · OpenAI Safety & Security Committee · G-Eval · SAST · AI Safety Level (ASL)

Recent events (1)

OpenAI Blog · May 20, 2026

OpenAI and Anthropic Share Findings from Joint Safety Evaluation

OpenAI and Anthropic conducted a first-of-its-kind cross-lab safety evaluation, each testing the other's frontier models across dimensions including misalignment, instruction following, hallucinations, and jailbreak resistance. The findings highlight both progress and ongoing challenges in AI safety, and the collaboration establishes a potential template for future cross-organizational evaluations.

Frontier Model Releases · Evaluation and Benchmarking · joint safety evaluation · OpenAI · Anthropic