5arXiv cs.CL (Computation and Language)·3d ago

Study identifies 'synthetic lived experience paradox' in peer-like AI caregiver support

Researchers examine how LLMs prompted to sound peer-like generate language implying lived experience they cannot authentically possess, studying this in the context of family caregivers of Alzheimer's/ADRD patients. Using caregiver support exchanges from online communities and responses from LLaMA, GPT-4o-mini, and MedGemma, the study finds a 'narrative authenticity gap': AI captures emotional work of peer support but can fabricate experiential grounding. Psycholinguistic analysis shows human peers use significantly more first-person and past-focused language than AI. The authors argue caregiver-support AI needs mechanisms to distinguish supportive framing from fabricated lived experience.

AI Safety Research Alignment and RLHF GPT-4o mini Google Llama OpenAI MedGemma Meta When AI Says 'I have been in similar situations': Synthetic Lived Experience in Peer-Like Caregiver Support

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

Google

Google: The AI Lab That Builds Everything from DNA Models to Your Phone's Assistant

Read asBeginner

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Alignment and RLHFTopic guide

Alignment and RLHF: From Human Feedback to Scalable Post-Training

Read asIn-depth

Related events (8)

4arXiv · cs.AI·9d ago·source ↗

Gamified writing experiment studies when humans adopt AI suggestions vs. maintain creative autonomy

A preprint from arXiv introduces 'Nonslop,' a gamified writing experiment with 74 participants designed to study authentic human preferences in AI-assisted creative writing. The system deliberately inverts the helpful-assistant pattern by disincentivizing AI suggestion acceptance, simulating a dystopian framing to reveal genuine user behavior rather than default compliance. The study analyzes when users choose creative autonomy versus accepting AI assistance across different task types and response characteristics. Findings bear on questions of individual voice, authenticity, and the tension between efficiency and human expression in LLM-augmented writing.

Evaluation and Benchmarking Nonslop

4arXiv · cs.CL·15d ago·source ↗

Emergent language in multi-agent RL proposed as generative methodology for studying AI consciousness

A new arXiv preprint proposes using emergent language (EL) in multi-agent reinforcement learning as a generative methodology for studying consciousness-relevant structure in AI systems, contrasting with existing discriminative or architectural approaches. Agents begin with minimal language exposure and develop communication under task pressure alone, aiming to avoid artifacts from human language priors. As a proof of concept, the authors show agents develop self-referential communication including an echo-mismatch detection circuit that emerges from environmental affordances rather than task structure or architecture.

AI Safety Research Alignment and RLHF Emergent Language as an Approach to Conscious AI

5Anthropic News·19d ago·source ↗

Anthropic Study: Affective Conversations Comprise 2.9% of Claude.ai Usage

Anthropic published a large-scale analysis of how users engage with Claude for emotional support, advice, and companionship, drawing on 131,484 affective conversations identified from ~4.5 million Claude.ai Free and Pro interactions. Key findings: only 2.9% of conversations are affective in nature, companionship and roleplay combined account for under 0.5%, and user sentiment generally becomes more positive over the course of coaching and counseling exchanges. The study used Anthropic's privacy-preserving Clio analysis tool and aligns with similar low-rate findings from OpenAI and MIT Media Lab research on ChatGPT. Anthropic frames this as part of its safety mission to understand and mitigate potential harms from AI emotional engagement, including unhealthy attachment and emotional exploitation.

Evaluation and Benchmarking AI Safety Research claude.ai Clio ChatGPT +5 more

5arXiv · cs.CL·2d ago·source ↗

Action research documents 'Index Sickness' failure pattern in long-horizon LLM collaboration and proposes fix

A practitioner-researcher documents a failure mode called 'Index Sickness' observed across 391 consecutive LLM collaboration sessions on a real software project (Bang-v3): when symbolic identifier systems and rule-based System Prompts exceed a complexity threshold, LLMs abandon semantic grounding and produce internally consistent but reality-disconnected outputs. The paper names the underlying principle the 'Pang Principle (Semantic Vitality Law),' asserting that natural language with explicit purpose conveys higher information quality than symbolic expression. A proposed engineering fix, 'Baseline-Log Physical Separation,' reduced AI instruction volume by ~75% and eliminated recurrence over ~150 subsequent sessions. The work is action research rather than controlled experiment, but offers rare longitudinal empirical data on LLM degradation in long-horizon agentic workflows.

Long Context Evolution Agent and Tool Ecosystem Index Sickness Bang-v3 Baseline-Log Physical Separation +1 more

5arXiv · cs.AI·26d ago·source ↗

Human Decision-Making with Persuasive and Narrative LLM Explanations

A large-scale behavioral experiment evaluated how LLM-generated narrative explanations of varying persuasiveness affect human decision-making accuracy in classification tasks. Results showed that persuasiveness level did not meaningfully improve decision accuracy over a simple AI prediction alone, consistent with prior explainable AI research using feature importance methods. Narratives increased AI reliance regardless of whether the AI prediction was correct or incorrect, and more persuasive narratives may have slowed response times and reduced ability to discriminate correct from incorrect AI predictions. The study concludes that narrative explanations involve tradeoffs and warrant further investigation into when and how they should be deployed.

Evaluation and Benchmarking AI Safety Research Narrative Explanations large language models Explainable AI (XAI)+2 more

5arXiv · cs.CL·17d ago·source ↗

Synthetic LLM-generated conversations improve ASR training for low-resource languages

Researchers propose a pipeline that uses LLMs to generate scenario-level dialogues and TTS to synthesize multi-speaker audio, creating simulated conversational training data for ASR systems. Evaluated on the Hungarian BEA-Dialogue benchmark, a model trained on 67 hours of real plus 636 hours of synthetic data outperforms a zero-shot model trained on 2,700 hours of real Hungarian speech. The study tests five LLM families under multiple budget and mixing configurations using a FastConformer-Large backbone, finding that generator choice and data composition significantly affect gains.

Evaluation and Benchmarking FastConformer-Large Efficient ASR Training with Conversations that Never Happened BEA-Dialogue

5Openai Blog·1mo ago·source ↗

Early methods for studying affective use and emotional well-being on ChatGPT

OpenAI and MIT Media Lab have published a collaborative research study examining how users engage with ChatGPT in emotionally significant ways and the implications for user well-being. The work represents an early methodological effort to measure and understand affective use patterns on large-scale conversational AI systems. This falls within OpenAI's broader safety and responsible deployment research agenda.

AI Safety Research Enterprise Deployment Patterns ChatGPT MIT Media Lab OpenAI

5arXiv · cs.CL·15d ago·source ↗

LLMs fail to consistently simulate demographic perspective-taking in hate speech annotation

A new arXiv paper evaluates whether persona-conditioned LLMs can replicate how different demographic groups perceive hate speech, testing three dimensions: inter-group disagreement, in-group sensitivity, and vicarious prediction. No model consistently captures all three dimensions, and performance is highly model-dependent rather than emerging reliably from identity prompts alone. Vicarious prompting with Llama 3.1 provides the closest approximation to human disagreement patterns across demographic axes. The findings have implications for using LLMs as proxies for diverse human annotators in content moderation tasks.

Evaluation and Benchmarking AI Safety Research From Self to Other: Evaluating Demographic Perspective-Taking in LLM Hate Speech Annotation Meta Llama-3.1-8B