Red-Teaming Resistance Leaderboard
benchmark · active
red-teaming-resistance-leaderboard-9282f64c · 1 event · first seen 28d ago
Aliases: Red-Teaming Resistance Leaderboard
Recent events (1)
Introducing the Red-Teaming Resistance Leaderboard
Hugging Face and Haize Labs have launched a Red-Teaming Resistance Leaderboard to systematically benchmark how well AI models resist adversarial prompting and jailbreak attempts. The leaderboard provides a standardized evaluation framework for comparing model robustness against red-teaming attacks. It fills a gap in the evaluation ecosystem, where safety and adversarial-robustness metrics have been less formalized than capability benchmarks.