Entity · technique

multi-agent systems

techniqueactivemulti-agent-systems-6c39c13a·3 events·first seen May 19, 2026

Aliases: multi-agent systems, multi-agent LLM systems

Co-occurring entities

Code as Agent Harness Executable Operational Cognition HarnessMutation KV Cache representation-level sensitive information leakage LCGuard adversarial training latent communication embodied agents large language models multi-agent coordination GUI/OS automation execution-based verification

More like this (12)

multi-agent architecture multi-agent coordination multi-agent systematizer multi-agent cooperative framework multimodal agents AI Agents Hide-and-Seek Multi-Agent Environment When Do Multi-Agent Systems Help? An Information Bottleneck Perspective Agent-S memory-augmented agent frameworks GUI Agents OpenAI Managed Agents

Recent events (3)

5arXiv · cs.AI·May 27, 2026·source ↗

Governed Evolution of Agent Runtimes through Executable Operational Cognition

This paper proposes a framework for governed runtime evolution in multi-agent systems, formalizing agent-generated code artifacts as persistent runtime capabilities rather than transient outputs. It introduces HarnessMutation, a lifecycle-aware mechanism for runtime adaptation operating under explicit validation, traceability, evaluation, and rollback constraints. The framework models agent self-modification as a bounded, observable, and auditable process over persistent operational memory, building on prior 'Code as Agent Harness' work.

AI Safety Research Agent and Tool Ecosystem Executable Operational Cognition Code as Agent Harness multi-agent systems +1 more

6arXiv · cs.AI·May 22, 2026·source ↗

LCGuard: Adversarial Training Framework for Safe KV Cache Sharing in Multi-Agent LLM Systems

LCGuard introduces a framework for preventing sensitive information leakage when multi-agent LLM systems share KV caches as a latent communication channel. The approach formalizes leakage operationally via reconstruction: a shared cache artifact is deemed unsafe if an adversarial decoder can recover sensitive inputs from it. An adversarial training loop pits a reconstructor against LCGuard's representation-level transformations, which aim to preserve task-relevant semantics while suppressing recoverable sensitive content. Empirical results across multiple model families and multi-agent benchmarks show reduced reconstruction-based leakage and attack success rates with competitive task performance.

Inference Economics AI Safety Research KV Cache representation-level sensitive information leakage LCGuard +4 more

6arXiv · cs.CL·May 19, 2026·source ↗

Code as Agent Harness: A Survey of Code as Operational Substrate for Agentic AI Systems

This survey paper introduces the concept of 'code as agent harness,' framing code not merely as output but as the operational infrastructure for LLM-based agents—covering reasoning, action, environment modeling, and execution-based verification. The authors organize the analysis across three layers: harness interface, harness mechanisms (planning, memory, tool use, feedback control), and scaling to multi-agent systems. Applications span coding assistants, GUI/OS automation, embodied agents, scientific discovery, and enterprise workflows. Open challenges include evaluation beyond task success, verification under incomplete feedback, and human oversight for safety-critical actions.

Evaluation and Benchmarking AI Safety Research embodied agents large language models Code as Agent Harness +6 more