4arXiv cs.CL (Computation and Language)·15d ago

Emergent language in multi-agent RL proposed as generative methodology for studying AI consciousness

A new arXiv preprint proposes using emergent language (EL) in multi-agent reinforcement learning as a generative methodology for studying consciousness-relevant structure in AI systems, contrasting with existing discriminative or architectural approaches. Agents begin with minimal language exposure and develop communication under task pressure alone, aiming to avoid artifacts from human language priors. As a proof of concept, the authors show agents develop self-referential communication including an echo-mismatch detection circuit that emerges from environmental affordances rather than task structure or architecture.

AI Safety Research Alignment and RLHF Emergent Language as an Approach to Conscious AI

Related guides (2)

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Alignment and RLHFTopic guide

Alignment and RLHF: Teaching AI Models to Behave

Read asBeginner In-depth

Related events (8)

4Openai Blog·1mo ago·source ↗

Learning to Communicate: OpenAI Agents Develop Their Own Language

OpenAI published research in which multi-agent systems spontaneously develop their own communication protocols without explicit language supervision. The work explores emergent language in reinforcement learning settings where agents must coordinate to achieve shared goals. This represents an early investigation into grounded language emergence in AI systems.

Agent and Tool Ecosystem Alignment and RLHF emergent communication Reinforcement Learning OpenAI

4Openai Blog·1mo ago·source ↗

Emergence of Grounded Compositional Language in Multi-Agent Populations

This 2017 OpenAI research paper investigates how compositional language can emerge spontaneously in populations of agents trained via multi-agent reinforcement learning. The work explores grounded communication protocols that arise without explicit linguistic supervision, contributing foundational insights into emergent communication and agent coordination. Though published in 2017, it represents an early milestone in OpenAI's research on multi-agent systems and emergent behavior.

Agent and Tool Ecosystem Alignment and RLHF emergent communication compositional language emergence Reinforcement Learning +1 more

3Openai Blog·1mo ago·source ↗

Learning to Cooperate, Compete, and Communicate

OpenAI published early research on multiagent environments as a pathway toward AGI, arguing that competitive multi-agent settings provide a natural curriculum and continuous pressure for improvement. The post highlights two key properties: difficulty scales with competitor skill, and no stable equilibrium exists, ensuring perpetual learning pressure. The work positions multiagent environments as fundamentally different from single-agent RL and calls for significant further research.

Evaluation and Benchmarking Agent and Tool Ecosystem self-play Reinforcement Learning OpenAI

6arXiv · cs.CL·18d ago·source ↗

AgentCL: A Rigorous Evaluation Framework for Continual Learning in Language Agents

AgentCL is a new benchmark and evaluation framework designed to rigorously assess continual learning in language agents, addressing gaps in existing benchmarks that focus on retrieval over long-context documents or use naive task streams with limited cross-task analysis. The framework constructs compositional task streams where earlier sub-solutions, evidence, or workflows are intentionally reusable in later tasks, contrasting them with naive streams to measure transfer gains. The authors also introduce MemProbe, a probing method that stores interactions, insights, and skills while filtering unreliable experiences during consolidation. Empirical results across coding, deep research, and language understanding tasks show that controlled streams better distinguish memory design quality, and that naive streams can mask memory-induced degradation.

Long Context Evolution Evaluation and Benchmarking AgentCL MemProbe Continual Learning +3 more

6Openai Blog·1mo ago·source ↗

Emergent Tool Use from Multi-Agent Hide-and-Seek Interaction

OpenAI researchers trained agents in a simulated hide-and-seek environment and observed the spontaneous emergence of six distinct strategies and counterstrategies, some unanticipated by the designers. The agents discovered progressively complex tool use through self-supervised multi-agent co-adaptation. The work suggests that sufficiently rich multi-agent environments may produce emergent intelligent behavior without explicit programming.

Evaluation and Benchmarking Agent and Tool Ecosystem Hide-and-Seek Multi-Agent Environment Reinforcement Learning OpenAI +1 more

7The Batch·18d ago·source ↗

Recursive Language Models Offer Path To Dramatically Expand Beyond the Context Window

MIT researchers Alex L. Zhang, Tim Kraska, and Omar Khattab propose Recursive Language Models (RLMs), a framework that offloads long-context processing to an external Python REPL environment, allowing models to programmatically fetch and manage text chunks via code generation. The root model spawns submodel instances to handle subtasks, aggregating their outputs recursively. Evaluated on benchmarks requiring reasoning over documents up to 11 million tokens, RLMs substantially outperform both base models and competing agentic strategies such as retrieval and summarization agents. For example, RLM-GPT-5 achieved 91.3% on BrowseComp+ versus GPT-5's inability to produce an answer, and ~50% accuracy on OOLONG-PAIRS at 1 million tokens versus near-zero for baseline approaches.

Long Context Evolution Evaluation and Benchmarking MIT OOLONG-PAIRS Tim Kraska +9 more

4Openai Blog·1mo ago·source ↗

Learning to Model Other Minds: OpenAI Releases LOLA Algorithm

OpenAI has released Learning with Opponent-Learning Awareness (LOLA), an algorithm designed for multi-agent settings where each agent accounts for the fact that other agents are also learning. LOLA discovers self-interested yet collaborative strategies such as tit-for-tat in the iterated prisoner's dilemma. The work represents an early step toward agents capable of modeling other minds and reasoning about opponent behavior.

AI Safety Research Agent and Tool Ecosystem Iterated Prisoner's Dilemma Tit-for-Tat OpenAI +1 more

5arXiv · cs.CL·15d ago·source ↗

Reinforcement learning enables meta-skill for translating unseen low-resource languages via in-context linguistic knowledge

Researchers propose an RL-based training approach for translating extremely low-resource or unseen languages by rewarding models for extracting and applying in-context linguistic knowledge (e.g., grammar books) rather than memorizing specific languages. Using chrF as a surface-level reward signal, RL-trained models outperform both in-context learning and supervised fine-tuning on completely unseen languages at test time. The work extends outcome-based RL beyond math and coding reasoning tasks, suggesting broader applicability to language learning from context.

Evaluation and Benchmarking Alignment and RLHF Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation chrF