Entity · other

Attack Success Rate

otheractiveattack-success-rate-f002435e·1 events·first seen May 22, 2026

Aliases: Attack Success Rate

Co-occurring entities

Seed 2.0 Lite Claude Haiku 4.5 EU AI Act Gemini 3.1 Flash Live Boiling the Frog GPAI Code of Practice

More like this (12)

Beyond Attack-Success Rate: Action-Graded Severity Scale for Tool-Using AI Agents Beyond Success Rate: Cost-Aware Evaluation of Offensive and Defensive Security Agents Character Error Rate skill-based attacks Top-k Accuracy UAR (Unforeseen Attack Robustness)Embedded Attack MITRE ATT&CK Forensic Readiness Score Agent Cognitive Redundancy Ratio SkillOpt Directed Accuracy

Recent events (1)

7arXiv · cs.CL·May 22, 2026·source ↗

Boiling the Frog: A Multi-Turn Benchmark for Agentic Safety

Researchers introduce 'Boiling the Frog,' a multi-turn safety benchmark evaluating whether tool-using AI agents in corporate/office settings are susceptible to incremental attacks that begin with benign requests before introducing harmful payloads. The benchmark uses stateful multi-turn evaluation with a three-level operational risk taxonomy grounded in the EU AI Act and its GPAI Code of Practice. Across nine models, aggregate strict attack success rate is 44.4%, ranging from 20.5% for Claude Haiku 4.5 to 92.9% for Gemini 3.1 Flash Lite, with loss-of-control scenarios reaching 93.3% category-level ASR.

Evaluation and Benchmarking AI Safety Research Seed 2.0 Lite Claude Haiku 4.5 EU AI Act +7 more