Almanac
benchmark

Red-Teaming Resistance Leaderboard

benchmarkactivered-teaming-resistance-leaderboard-9282f64c·1 events·first seen 28d ago

Aliases: Red-Teaming Resistance Leaderboard

Co-occurring entities

More like this (12)

Recent events (1)

5Hugging Face Blog·28d ago·source ↗

Introducing the Red-Teaming Resistance Leaderboard

Hugging Face and Haize Labs have launched a Red-Teaming Resistance Leaderboard to systematically benchmark how well AI models resist adversarial prompting and jailbreak attempts. The leaderboard provides a standardized evaluation framework for comparing model robustness against red-teaming attacks. This fills a gap in the evaluation ecosystem where safety and adversarial robustness metrics have been less formalized than capability benchmarks.