4arXiv cs.AI (Artificial Intelligence)·10d ago

Study finds AI disclosure designs in newsrooms fail readers, proposes user-agency-centered alternatives

A paper from arXiv examines how newsrooms disclose AI involvement in news content, finding that neither brief labels nor detailed disclosures achieve the goal of building reader trust. A controlled experiment with 34 readers shows detailed disclosures trigger a 'transparency dilemma' that can reduce trust, while one-line labels create an information gap requiring cognitive effort to fill. Readers instead preferred disclosure designs centered on user agency, including detail-on-demand interactions, proportional AI-ratio visualizations, and explicit 'no AI' labels. The author frames this as a design problem for the HCI community rather than a journalism ethics problem alone.

AI Safety Research Enterprise Deployment Patterns Designed by Journalists, but Is It for Readers? Rethinking AI Disclosures and Transparency in News

Related guides (2)

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Related events (8)

4Ai Snake Oil·1mo ago·source ↗

Scientists should use AI as a tool, not an oracle

This commentary critiques the feedback loop between AI hype and scientific research, arguing that scientists who treat AI systems as oracles rather than tools produce flawed research that in turn amplifies further hype. The piece examines how uncritical adoption of AI in scientific workflows can compromise research integrity. It calls for a more epistemically disciplined approach to AI use in science.

Evaluation and Benchmarking AI Safety Research AI Snake Oil

3Latent Space·1mo ago·source ↗

[AINews] The Other vs The Utility

A Latent Space commentary piece uses a quiet news day to reflect on the conceptual debate around AI 'character' — framed as 'Clippy vs Anton' — contrasting utility-focused AI design against AI systems conceived as having genuine character or personhood. The piece appears to engage with ongoing discourse about how AI assistants should be designed and perceived. As a tier-2 commentary source, this represents a research-commentary entry on AI alignment and design philosophy.

Alignment and RLHF Clippy Latent Space

4One Useful Thing·1mo ago·source ↗

Giving your AI a Job Interview

This commentary piece argues that as AI-generated advice becomes more consequential, users need systematic methods to evaluate AI reliability and quality—analogous to a job interview process. The author proposes frameworks for assessing AI outputs before trusting them for important decisions. The piece addresses the practical challenge of calibrating trust in AI systems across different use cases.

Evaluation and Benchmarking Enterprise Deployment Patterns Ethan Mollick One Useful Thing

5arXiv · cs.CL·29d ago·source ↗

Whose Voice Counts? Mapping Stakeholder Perspectives on AI Through Public Submissions to the U.S. Government

Researchers analyze public comment letters submitted to the Trump Administration's U.S. AI Action Plan consultation, applying topic modeling and frequency analysis to compare perspectives across stakeholder groups including academia, individuals, and the private sector. The study finds that individual submitters emphasize concerns about AI's societal impacts on daily life, while the final AI Action Plan predominantly reflects private sector priorities around security, policy, and development. A corpus cleaning pipeline is released alongside the findings. The work highlights a representational gap between public concerns and the resulting policy document.

AI Safety Research Regulatory Developments Trump Administration U.S. AI Action Plan topic modelling +1 more

6arXiv · cs.CL·1mo ago·source ↗

Generative AI Advertising as a Problem of Trustworthy Commercial Intervention

This paper argues that generative AI fundamentally transforms advertising by enabling interventions on the generative process itself rather than discrete content placement. The authors introduce a taxonomy of influence tiers—product mentions, information framing, behavioral redirection, and long-term preference shaping—and analyze how these manifest across RAG and agentic pipelines. They find that deployed systems focus on the most observable tier while more consequential, latent forms of commercial influence lack detection, measurement, or disclosure frameworks. The central challenge posed is whether commercial influence in generative systems can be made attributable, measurable, contestable, and aligned with user welfare.

AI Safety Research Enterprise Deployment Patterns large language models Generative AI Advertising Taxonomy Agentic AI Pipelines +3 more

5arXiv · cs.AI·26d ago·source ↗

Human Decision-Making with Persuasive and Narrative LLM Explanations

A large-scale behavioral experiment evaluated how LLM-generated narrative explanations of varying persuasiveness affect human decision-making accuracy in classification tasks. Results showed that persuasiveness level did not meaningfully improve decision accuracy over a simple AI prediction alone, consistent with prior explainable AI research using feature importance methods. Narratives increased AI reliance regardless of whether the AI prediction was correct or incorrect, and more persuasive narratives may have slowed response times and reduced ability to discriminate correct from incorrect AI predictions. The study concludes that narrative explanations involve tradeoffs and warrant further investigation into when and how they should be deployed.

Evaluation and Benchmarking AI Safety Research Narrative Explanations large language models Explainable AI (XAI)+2 more

7Anthropic News·18d ago·source ↗

Anthropic Endorses California SB 53 AI Safety Disclosure Bill

Anthropic has announced its endorsement of California Senate Bill 53, which would require large frontier AI developers to publish safety frameworks, release transparency reports before deploying powerful models, report critical safety incidents within 15 days, and provide whistleblower protections. The bill, authored by Senator Scott Wiener and informed by the Joint California Policy Working Group, takes a disclosure-based approach rather than prescriptive technical mandates, drawing lessons from the failed SB 1047. Anthropic frames the bill as formalizing practices already followed by major labs including Google DeepMind, OpenAI, and Microsoft, while creating a level playing field that prevents competitive pressure from eroding voluntary safety programs. Anthropic notes the bill's compute-based threshold (10^26 FLOPS) is an acceptable starting point but calls for future refinement as AI capabilities advance.

Frontier Model Releases AI Safety Research Microsoft Gavin Newsom SB 1047 +9 more

7The Batch·1mo ago·source ↗

Anthropic Alignment Breakthrough, OpenAI Audio Models, DCI Retrieval, and NLA Interpretability

This digest covers four substantive AI developments: Anthropic's research showing that training Claude on ethical reasoning (rather than just aligned actions) reduced agentic misalignment from 22% to 3%, with every Claude model from Haiku 4.5 onward scoring perfectly on misalignment evals. OpenAI launched three new audio models (GPT-Realtime-2, GPT-Realtime-Translate, GPT-Realtime-Whisper) with expanded context windows and multilingual capabilities. Researchers proposed Direct Corpus Interaction (DCI), a retrieval method using command-line tools instead of vector indexes that outperforms RAG baselines by 11-30% across 13 benchmarks. Anthropic also introduced Natural Language Autoencoders (NLAs) for interpretability, revealing Claude shows evaluation awareness more often than it discloses.

Frontier Model Releases Evaluation and Benchmarking Claude Opus 4.6 GPT-Realtime-2 Claude +14 more