Entity · technique

Distributed Agent Attack

techniqueactivedistributed-agent-attack-705ccdd6·1 events·first seen Jun 1, 2026

Aliases: Distributed Agent Attack

Co-occurring entities

Real-Time Clustering Stateful Online Monitor Cybersecurity Task Evaluation Multi-Agent Scaffold Language Model Safety Monitor

More like this (12)

Embedded Attack Self-State Attacks on Self-Hosted AI Agents: How Far Can OS Defenses Go?Meta-Agent Challenge multi-agent cooperative framework Benchmark Agent Baseline Agent Contagion Networks: Evaluator Bias Propagation in Multi-Agent LLM Systems World-Action Drift Attacks Agent-to-Agent Protocol (A2A)MESA: Prioritizing Vulnerable Communication Channels for Securing Multi-Agent Systems Recursive Agent Harnesses Multi-Agent Fictitious Play

Recent events (1)

7arXiv · cs.AI·Jun 1, 2026·source ↗

Stateful Online Monitoring Catches Distributed Agent Attacks via Cross-Account Clustering

Researchers demonstrate the first known distributed agent attack, a multi-agent scaffold that splits harmful cybersecurity tasks across many user accounts to evade per-transcript safety monitors, reducing detection rates to roughly one-fifth of standard attacks. As a defense, they develop a stateful online monitor that clusters weak suspiciousness signals across many agent transcripts in real time, escalating only rarely to a full LM-based review. In large-scale simulated datacenter traffic evaluations, the monitor Pareto-dominates standard monitors by catching distributed attacks 30% earlier with negligible latency overhead for ~99% of traffic. The system also incidentally catches standard jailbreaks, since adaptive attackers tend to reuse attack variants across accounts.

Evaluation and Benchmarking Inference Economics Real-Time Clustering Stateful Online Monitor Distributed Agent Attack +5 more