4arXiv cs.CL (Computation and Language)·4d ago

Conceptual framework for analyzing dialogue dynamics in human-AI and multi-agent collaborative problem-solving

A new arXiv preprint proposes a hierarchical two-layer coding scheme for analyzing dialogue in collaborative problem-solving, integrating cognitive and metacognitive dimensions. The framework is validated across nine datasets spanning multiple domains and is positioned to apply to both human-AI and multi-agent collaboration contexts. A key finding is that metacognitive regulation is a strong discriminator of deeper collaboration quality.

Evaluation and Benchmarking Agent and Tool Ecosystem Bridging Talk and Thought: Understanding Dialogue Dynamics Across Collaborative Problem-Solving Contexts

Related guides (2)

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: How We Measure AI — and Why It Keeps Getting Harder

Read asBeginner In-depth

Related events (8)

3arXiv · cs.CL·7d ago·source ↗

Taxonomy of conceptual alignment in human-robot dialogue with dialogue act schema

A preprint introduces a taxonomy for characterizing conceptual alignment in human-robot interaction, framing it as a bidirectional, co-constructive process rather than a unidirectional one. The authors define what triggers alignment initiation and what levels of conceptual understanding are involved, and provide a dialogue act schema as an operational tool for analyzing alignment moves. The work aims to give researchers and designers a structured foundation for comparing and building conceptual alignment systems in HRI.

Agent and Tool Ecosystem A Taxonomy of Conceptual Alignment in Human-Robot Dialogue

4arXiv · cs.CL·12d ago·source ↗

HACD-H: Formal theory of social intelligence emergence in long-term human-AI interaction

Researchers propose the Human-AI Coevolution Dynamics Framework (HACD-H), a formal dynamical model treating long-term human-AI interaction as a self-organizing social cognitive system. The framework unifies emotional adaptation, relational organization, social memory, and personality consistency, introducing concepts like relational attractors, trust basins, and social cognitive energy. Empirical evaluation on a ~14,700-turn conversational dataset finds that social intelligence correlates negatively with social cognitive energy (r = -0.391) and that interaction trajectories show progressive energy reduction and phase-transition-like developmental patterns. The work argues social intelligence emerges from coevolution over time rather than from isolated conversational capabilities.

Alignment and RLHF Human-AI Coevolution Dynamics Framework Human-AI Coevolution Dynamics: A Formal Theory of Social Intelligence Emergence Through Long-Term Interaction

4arXiv · cs.CL·13h ago·source ↗

SIMAX framework generates annotated synthetic clinician-patient dialogues for AI communication coding evaluation

Researchers introduce SIMAX, a framework for generating controlled, annotated synthetic clinician-patient dialogues to support development and evaluation of AI-driven clinical communication coding systems. The framework produces dialogues with reference behavioral annotations using two codebooks (Global and WISER), generating 3,388 simulated dialogues across three medical specialties with varied personas and accent conditions. Evaluation shows reasonable speech naturalness and high transcription fidelity, with downstream testing revealing the framework can expose sensitivity gaps in communication coding systems. The work addresses a data scarcity bottleneck in deploying ambient AI scribes in clinical settings.

Evaluation and Benchmarking SIMAX UTMOS WISER Codebook +1 more

4arXiv · cs.AI·14d ago·source ↗

Causal DAG model for when AI systems should engage Theory of Mind in conflict scenarios

A new arXiv preprint proposes a structural causal model (formalized as a directed acyclic graph) that treats Theory of Mind as a conditionally activated mechanism rather than an always-on capacity in AI systems. The model specifies exogenous situational and agent-level conditions, five endogenous mediators, and three causal pathways (tractability, reasoning-depth, enabling-cause) leading to an epistemic accuracy outcome. The work targets human-machine teaming in conflict contexts, offering a resource-rational decision procedure for when AI should engage social reasoning. Simulation validation and ethical considerations for conflict-optimized mentalizing are discussed.

AI Safety Research Agent and Tool Ecosystem A Causal Model of Theory of Mind in Conflict for Artificial Intelligence

4arXiv · cs.CL·25d ago·source ↗

Emergent language in multi-agent RL proposed as generative methodology for studying AI consciousness

A new arXiv preprint proposes using emergent language (EL) in multi-agent reinforcement learning as a generative methodology for studying consciousness-relevant structure in AI systems, contrasting with existing discriminative or architectural approaches. Agents begin with minimal language exposure and develop communication under task pressure alone, aiming to avoid artifacts from human language priors. As a proof of concept, the authors show agents develop self-referential communication including an echo-mismatch detection circuit that emerges from environmental affordances rather than task structure or architecture.

AI Safety Research Alignment and RLHF Emergent Language as an Approach to Conscious AI

4arXiv · cs.CL·19d ago·source ↗

C-DIC: Context-Driven Incremental Compression for efficient long-horizon multi-turn dialogue

A new arXiv preprint introduces Context-Driven Incremental Compression (C-DIC), a method for managing growing dialogue history in conversational agents by treating conversations as interleaved contextual threads with revisable per-thread compression states stored in a compact dialogue memory. A retrieve-revise-write-back loop shares information across turns and updates stale memories, while truncated backpropagation-through-time (TBPTT) is adapted to learn cross-turn dependencies. Experiments on long-form dialogue benchmarks show stable inference latency and perplexity over hundreds of turns, addressing compounding errors seen in existing context compressors.

Long Context Evolution Inference Economics Context-Driven Incremental Compression Context-Driven Incremental Compression for Multi-Turn Dialogue Generation Context-Driven Incremental Compression for Multi-Turn Dialogue Generation

3Hugging Face Blog·1mo ago·source ↗

What Makes a Dialog Agent Useful?

A Hugging Face blog post from January 2023 examining the properties that make dialog agents useful, likely covering aspects such as instruction-following, helpfulness, and alignment techniques. Published in the context of growing interest in ChatGPT and RLHF-trained conversational models, the post reflects the community's effort to understand and replicate capable dialog systems. As a tier-2 commentary piece, it offers analytical framing rather than new empirical results.

Agent and Tool Ecosystem Alignment and RLHF ChatGPT Reinforcement Learning from Human Feedback Hugging Face

4arXiv · cs.CL·7d ago·source ↗

CURIOBOT framework uses LLM tutoring dialogues to operationalize curiosity-driven learning interventions

Researchers introduce CURIOBOT, a conversational tutoring framework that implements Berlyne's collative variables (novelty, complexity, conflict, uncertainty) as adaptive linguistic interventions via LLMs. Across 270 tutoring conversations, curiosity-oriented prompting strategies produced up to 2.4x more exploratory conversational turns under fixed time budgets. The study also introduces a learner-centered evaluation framework measuring exploratory questioning, conversational agency, and productive struggle. Results suggest curiosity functions as a partially independent interaction-level mechanism, and that LLM-mediated dialogue can serve as a scalable experimental platform for studying language's effect on cognition.

Agent and Tool Ecosystem CURIOBOT