Conceptual framework for analyzing dialogue dynamics in human-AI and multi-agent collaborative problem-solving
A new arXiv preprint proposes a hierarchical two-layer coding scheme for analyzing dialogue in collaborative problem-solving, integrating cognitive and metacognitive dimensions. The framework is validated across nine datasets spanning multiple domains and is positioned to apply to both human-AI and multi-agent collaboration contexts. A key finding is that metacognitive regulation is a strong discriminator of deeper collaboration quality.
Related guides (2)
Related events (8)
Taxonomy of conceptual alignment in human-robot dialogue with dialogue act schema
A preprint introduces a taxonomy for characterizing conceptual alignment in human-robot interaction, framing it as a bidirectional, co-constructive process rather than a unidirectional one. The authors define what triggers alignment initiation and what levels of conceptual understanding are involved, and provide a dialogue act schema as an operational tool for analyzing alignment moves. The work aims to give researchers and designers a structured foundation for comparing and building conceptual alignment systems in HRI.
HACD-H: Formal theory of social intelligence emergence in long-term human-AI interaction
Researchers propose the Human-AI Coevolution Dynamics Framework (HACD-H), a formal dynamical model treating long-term human-AI interaction as a self-organizing social cognitive system. The framework unifies emotional adaptation, relational organization, social memory, and personality consistency, introducing concepts like relational attractors, trust basins, and social cognitive energy. Empirical evaluation on a ~14,700-turn conversational dataset finds that social intelligence correlates negatively with social cognitive energy (r = -0.391) and that interaction trajectories show progressive energy reduction and phase-transition-like developmental patterns. The work argues social intelligence emerges from coevolution over time rather than from isolated conversational capabilities.
SIMAX framework generates annotated synthetic clinician-patient dialogues for AI communication coding evaluation
Researchers introduce SIMAX, a framework for generating controlled, annotated synthetic clinician-patient dialogues to support development and evaluation of AI-driven clinical communication coding systems. The framework produces dialogues with reference behavioral annotations using two codebooks (Global and WISER), generating 3,388 simulated dialogues across three medical specialties with varied personas and accent conditions. Evaluation shows reasonable speech naturalness and high transcription fidelity, with downstream testing revealing the framework can expose sensitivity gaps in communication coding systems. The work addresses a data scarcity bottleneck in deploying ambient AI scribes in clinical settings.
Causal DAG model for when AI systems should engage Theory of Mind in conflict scenarios
A new arXiv preprint proposes a structural causal model (formalized as a directed acyclic graph) that treats Theory of Mind as a conditionally activated mechanism rather than an always-on capacity in AI systems. The model specifies exogenous situational and agent-level conditions, five endogenous mediators, and three causal pathways (tractability, reasoning-depth, enabling-cause) leading to an epistemic accuracy outcome. The work targets human-machine teaming in conflict contexts, offering a resource-rational decision procedure for when AI should engage social reasoning. Simulation validation and ethical considerations for conflict-optimized mentalizing are discussed.
Emergent language in multi-agent RL proposed as generative methodology for studying AI consciousness
A new arXiv preprint proposes using emergent language (EL) in multi-agent reinforcement learning as a generative methodology for studying consciousness-relevant structure in AI systems, contrasting with existing discriminative or architectural approaches. Agents begin with minimal language exposure and develop communication under task pressure alone, aiming to avoid artifacts from human language priors. As a proof of concept, the authors show agents develop self-referential communication including an echo-mismatch detection circuit that emerges from environmental affordances rather than task structure or architecture.
C-DIC: Context-Driven Incremental Compression for efficient long-horizon multi-turn dialogue
A new arXiv preprint introduces Context-Driven Incremental Compression (C-DIC), a method for managing growing dialogue history in conversational agents by treating conversations as interleaved contextual threads with revisable per-thread compression states stored in a compact dialogue memory. A retrieve-revise-write-back loop shares information across turns and updates stale memories, while truncated backpropagation-through-time (TBPTT) is adapted to learn cross-turn dependencies. Experiments on long-form dialogue benchmarks show stable inference latency and perplexity over hundreds of turns, addressing compounding errors seen in existing context compressors.
What Makes a Dialog Agent Useful?
A Hugging Face blog post from January 2023 examining the properties that make dialog agents useful, likely covering aspects such as instruction-following, helpfulness, and alignment techniques. Published in the context of growing interest in ChatGPT and RLHF-trained conversational models, the post reflects the community's effort to understand and replicate capable dialog systems. As a tier-2 commentary piece, it offers analytical framing rather than new empirical results.
CURIOBOT framework uses LLM tutoring dialogues to operationalize curiosity-driven learning interventions
Researchers introduce CURIOBOT, a conversational tutoring framework that implements Berlyne's collative variables (novelty, complexity, conflict, uncertainty) as adaptive linguistic interventions via LLMs. Across 270 tutoring conversations, curiosity-oriented prompting strategies produced up to 2.4x more exploratory conversational turns under fixed time budgets. The study also introduces a learner-centered evaluation framework measuring exploratory questioning, conversational agency, and productive struggle. Results suggest curiosity functions as a partially independent interaction-level mechanism, and that LLM-mediated dialogue can serve as a scalable experimental platform for studying language's effect on cognition.

