Entity · technique

multi-round event injection

techniqueactivemulti-round-event-injection-cc3ad434·1 events·first seen May 26, 2026

Aliases: multi-round event injection

Co-occurring entities

Claw-Anything large language model agents proactive assistance evaluation GPT-5.5

More like this (12)

indirect prompt injection prompt injection Prompt Injection in Automated Résumé Screening with Large Language Models: Single and Multi-Injection Settings proactive documentation injection Cross-Event Prompt Fusion Construction-Driven Injection: Linguistically-Grounded Edit-Based Code-Mixing Fingerprints for Large Language Models inter-frame token selection intra-frame token sparsification InjecAgent Multi-Round Coreference Resolution peg-in-hole insertion task AutoRound

Recent events (1)

6arXiv · cs.AI·May 26, 2026·source ↗

Claw-Anything: Benchmark for Always-On Personal Assistants with Broad Digital World Access

Claw-Anything is a new benchmark designed to evaluate LLM agents acting as always-on personal assistants with access to long-horizon activity histories, interdependent backend services, and multi-device GUI/CLI interaction. The benchmark simulates months of user activity to create complex, noisy world states and evaluates both reactive and proactive assistance. GPT-5.5 achieves only 34.5% pass@1, revealing a substantial capability gap versus prior narrower benchmarks. An accompanying automated data-generation pipeline produces 2,000 training environments and yields a 23.7% improvement over the base model.

Long Context Evolution Evaluation and Benchmarking multi-round event injection Claw-Anything large language model agents +3 more